gImageReader is a simple Gtk/Qt front-end to tesseract-ocr. The steps for compiling gImageReader from source are documented in the wiki.
qt ocr pdf-document c-plus-plus tesseract-ocr gtk hocr-documents hocr scannerhOcr2Pdf.NET is a .NET library to convert .hocr html into searchable pdfs using HtmlAgilityPack and iTextSharp. Currently supports Tesseract hocr files and Cuneiform hocr files. It is written in C#.
compressed-pdf cuneiform hocr htmlagilitypack itextsharp jbig2
We have large collection of open source products. Follow the tags from
Tag Cloud >>
Open source products are scattered around the web. Please provide information
about the open source projects you own / you use.
Add Projects.