•        889

hOcr2Pdf.NET is a .NET library to convert .hocr html into searchable pdfs using HtmlAgilityPack and iTextSharp. Currently supports Tesseract hocr files and Cuneiform hocr files. It is written in C#.




Related Projects


An open source C# PDF library

pdfocr - Adds text to PDF files using the cuneiform OCR software

pdfocr adds an OCR text layer to scanned PDF files, allowing them to be searched. It currently depends on Ruby 1.8.7 or above, and uses ocropus, cuneiform, or tesseract for performing OCR. For more details, see the manpage.

PDF Little Signer

PDF Little Signer is a .NET3.5 library for self signing PDF document. It's very easy to use. It uses iTextSharp.


PdfReport is a code first reporting engine, which is built on top of the iTextSharp and EPPlus libraries.


PdfEbookCutter is a program to cut PDF Documents into smaller pages, to display them in a better way on ebook readers. It displays a sketch of the pages in a graphical editor, and saves the cropped pages in a new file. It's developed in C# with iTextSharp


As an Open Source library it is provided for free with source code and without any warranty or support. Key features:- 1. Fully-featured PDF viewer with embedded font support, zooming, JBIG2 support, advanced PDF search, bookmarks, thumbnails, Layers support and more… 2. Released under user friendly open source LGPL license with full source code for use in both commercial and Open Source projects. 3. In development for over 10 years and used in corporate software globally. 4. Upgrade route

Pdf Form Tool

Pdf Form Tool demonstrates how the iTextSharp library could be used to fill PDF forms. The input data is provided as a csv file. The application will generate a separate pdf based on a specific template for each record from the csv. The application is developed in VB.Net 10

PDF Form Bubble Up

Bubble Up takes PDF Forms stored in SharePoint document libraries and "bubbles up" the data in the PDF Form to the library. This means the data that had been trapped in the PDF Form can now be used in document library views, workflows, etc. It's developed in C#.


Just a simple project to take a pdf form (like the one you fill out) and make it into an object so its a bit easier to work with. Uses iTextSharp for most of the heavy pdf-ing

Silverlight Export/Print to PDF

Demonstration project of how to print Silverlight 3 UI elements to PDF or PNG to a file on the local disk. Uses Silverlight 3, VS 2010 Beta 2, and the iTextsharp library.

command line PDF signature tool

Command line tool to sign pdf files using a signing certificate in pkcs12 format. code is in c# language.

PDF Template using iTextSharp, Uses an XML doc as the template.

This Project is useful for generating Sales Order, Invoice etc etc


DoddleReport adds automatic reporting (HTML / PDF / Excel / etc) for any LINQ Query, IEnumerable, DataTable or SharePoint List. Reports support custom styling, a fluid API, and maximum extensibility to easily add support for new Report Sources and ReportWriters.

HtmlAgilityPackContrib - Logical extension to HtmlAgilityPack

HtmlAgilityPackContrib - A logical extension to HtmlAgilityPack to parse HTML using jQuery like methods inspired by jSoup


A port of the PDF-library iTextSharp for Microsoft Silverlight.

ITextSharp Sample

????IText Sharp?????????????PDF???????flash,????

FDFToolkit .NET

FDFToolkit .NET makes it easier to read, write, create, populate, merge FDF, XDP, XFDF, and XML data with Acrobat and LiveCycle PDF forms. FDFToolkit.net utilizes iTextSharp 4.x technologies, under MPL license.

jbig2enc - Image Compression Library

JBIG2 encodes bi-level (1 bpp) images using a number of clever tricks to get better compression than G4. It can compress multipage documents. It generates JBIG2 files, or fragments for embedding in PDFs.


Graphical interface for Cuneiform OCR