hOcr2Pdf.NET

  •        830

hOcr2Pdf.NET is a .NET library to convert .hocr html into searchable pdfs using HtmlAgilityPack and iTextSharp. Currently supports Tesseract hocr files and Cuneiform hocr files. It is written in C#.

http://hocrtopdf.codeplex.com/

Tags
Implementation
License
Platform

   




Related Projects

iTextSharp


An open source C# PDF library

PDF Little Signer


PDF Little Signer is a .NET3.5 library for self signing PDF document. It's very easy to use. It uses iTextSharp.

pdfocr - Adds text to PDF files using the cuneiform OCR software


Adds text to PDF files using the cuneiform OCR software

PdfReport


PdfReport is a code first reporting engine, which is built on top of the iTextSharp and EPPlus libraries.

pdfHackery - project to test hacking on itextsharp for splitting out a pdf


project to test hacking on itextsharp for splitting out a pdf



iTextSharpWrapper - Wrapper for the iTextSharp PDF library.


Wrapper for the iTextSharp PDF library.

PdfEbookCutter


PdfEbookCutter is a program to cut PDF Documents into smaller pages, to display them in a better way on ebook readers. It displays a sketch of the pages in a graphical editor, and saves the cropped pages in a new file. It's developed in C# with iTextSharp

JPedal


As an Open Source library it is provided for free with source code and without any warranty or support. Key features:- 1. Fully-featured PDF viewer with embedded font support, zooming, JBIG2 support, advanced PDF search, bookmarks, thumbnails, Layers support and more… 2. Released under user friendly open source LGPL license with full source code for use in both commercial and Open Source projects. 3. In development for over 10 years and used in corporate software globally. 4. Upgrade route

Pdf Form Tool


Pdf Form Tool demonstrates how the iTextSharp library could be used to fill PDF forms. The input data is provided as a csv file. The application will generate a separate pdf based on a specific template for each record from the csv. The application is developed in VB.Net 10

PDF Form Bubble Up


Bubble Up takes PDF Forms stored in SharePoint document libraries and "bubbles up" the data in the PDF Form to the library. This means the data that had been trapped in the PDF Form can now be used in document library views, workflows, etc. It's developed in C#.

PDF Form ORM


Just a simple project to take a pdf form (like the one you fill out) and make it into an object so its a bit easier to work with. Uses iTextSharp for most of the heavy pdf-ing

Silverlight Export/Print to PDF


Demonstration project of how to print Silverlight 3 UI elements to PDF or PNG to a file on the local disk. Uses Silverlight 3, VS 2010 Beta 2, and the iTextsharp library.

command line PDF signature tool


Command line tool to sign pdf files using a signing certificate in pkcs12 format. code is in c# language.

PDF Template using iTextSharp, Uses an XML doc as the template.


This Project is useful for generating Sales Order, Invoice etc etc

iTextSharp - iTextSharp Repo from http://sourceforge.net/projects/itextsharp/


iTextSharp Repo from http://sourceforge.net/projects/itextsharp/

DoddleReport


DoddleReport adds automatic reporting (HTML / PDF / Excel / etc) for any LINQ Query, IEnumerable, DataTable or SharePoint List. Reports support custom styling, a fluid API, and maximum extensibility to easily add support for new Report Sources and ReportWriters.

jbig2.js - Under development JavaScript implementation of the JBIG2 specification.


Under development JavaScript implementation of the JBIG2 specification.

cuneiform - create usable stats from cuneiform xml data


create usable stats from cuneiform xml data

cuneiform - Cuneiform is an multi-language OCR system.


Cuneiform is an multi-language OCR system.