CTPN - Detecting Text in Natural Image with Connectionist Text Proposal Network

  •        139

These demo codes (with our trained model) are for text-line detection (without side-refinement part). You need a GPU. If you use CUDNN, about 1.5GB free memory is required. If you don't use CUDNN, you will need about 5GB free memory, and the testing time will slightly increase. Therefore, we strongly recommend to use CUDNN.

http://textdet.com
https://github.com/tianzhi0549/CTPN

Tags
Implementation
License
Platform

   




Related Projects

tesseract-ocr-for-php - A wrapper to work with Tesseract OCR inside PHP.

  •    PHP

A wrapper to work with Tesseract OCR inside PHP. ‼️ This library depends on Tesseract OCR, version 3.03 or later.

EasyOCR - Java OCR 识别组件(基于Tesseract OCR 引擎)。能自动完成图片清理、识别 CAPTCHA 验证码图片内容的一体化工作。Java Image cleanup, OCR recognition component (based Tesseract OCR engine, automatically cleanup image and identification CAPTCHA verification code picture content)

  •    

EasyOCR is a Java language using OCR recognition engine (based Tesseract). By means of a few simple API, the Java language can be used to complete the picture content identification work. And integrated image cleanup, recognition CAPTCHA image, bill notes and other content integration efforts. EasyOCR engine supports plugin programming, ETD templates support, provide a graphical ETD template design tools (EasyTemplateDesigner GUI). EasyOCR not only provide services for consumers, but mainly oriented to provide localized development SDK integration with C/S, B/S and Android mobile terminal native integration projects.

pdfocr - Adds text to PDF files using the cuneiform OCR software

  •    Ruby

pdfocr adds an OCR text layer to scanned PDF files, allowing them to be searched. It currently depends on Ruby 1.8.7 or above, and uses ocropus, cuneiform, or tesseract for performing OCR. For more details, see the manpage.

pytesseract - A Python wrapper for Google Tesseract

  •    Python

Python-tesseract is an optical character recognition (OCR) tool for python. That is, it will recognize and "read" the text embedded in images. Python-tesseract is a wrapper for Google's Tesseract-OCR Engine. It is also useful as a stand-alone invocation script to tesseract, as it can read all image types supported by the Python Imaging Library, including jpeg, png, gif, bmp, tiff, and others, whereas tesseract-ocr by default only supports tiff and bmp. Additionally, if used as a script, Python-tesseract will print the recognized text instead of writing it to a file.


OCRmyPDF - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

  •    Python

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched or copy-pasted. For details: please consult the documentation.

JavaOCR

  •    Java

Java OCR is an Optical Character Recognition algorithm based on a mean squared recognizer. This tool also includes utilities to trace and extract characters.

GOCR

  •    C

GOCR is an OCR (Optical Character Recognition) program, developed under the GNU Public License. It converts scanned images of text back to text files. Joerg Schulenburg started the program, and now leads a team of developers.

Tessnet2

  •    CSharp

A .NET 2.0 Open Source OCR assembly using Tesseract engine.

ambar - :mag: Ambar: Document Search System

  •    

Ambar is an open-source document search and management system with automated crawling, OCR, tagging and instant full-text search.There are two editions available: Community and Enterprise. Enterprise Edition is a full featured document search and management system that can handle terabytes of data.

Terese OCR verifier

  •    C++

Terese is a tool for proofreading OCR text. Terese tries to map the text back to the scanned image, and visually shows the differences. See the homepage for further details.

CaptuocrToy - A tool to capture screenshot and recognize text by online ocr apis

  •    Swift

A tool to capture screenshot and recognize text by online ocr apis . You should install Xcode Command Line Tools and cocoapod first.

sonus - :speech_balloon: /so.nus/ STT (speech to text) for Node with offline hotword detection

  •    Javascript

Sonus lets you quickly and easily add a VUI (Voice User Interface) to any hardware or software project. Just like Alexa, Google Now, and Siri, Sonus is always listening offline for a customizable hotword. Once that hotword is detected your speech is streamed to the cloud recognition service of your choice - then you get the results. Generally, running npm install should suffice. This module however, requires you to install SoX.

node-tesseract - A simple wrapper for the Tesseract OCR package

  •    Javascript

There is a hard dependency on the Tesseract project. You can find installation instructions for various platforms on the project site. For Homebrew users, the installation is quick and easy.

iPhone-OCR-Tesseract-and-OpenCV - Simple academic project made using OpenCV and Tesseract

  •    Objective-C

This is a sample project created by me (@PablosPoject) and @_AJ_R for academic purpose. It use the OpenCV framework and tutorial made by BloodAxe(https://github.com/BloodAxe) and some other utilities class made by Aptogo (https://github.com/aptogo). It also uses the Tesseract OCR engine to read the text processed with openCV. I also build a simple user interface that permit to take a photo or choose one from library, and also permit to apply to the image every single step in the image processing, or to apply directly all the processing.

crnn - Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.

  •    Lua

This software implements the Convolutional Recurrent Neural Network (CRNN), a combination of CNN, RNN and CTC loss for image-based sequence recognition tasks, such as scene text recognition and OCR. For details, please refer to our paper http://arxiv.org/abs/1507.05717. UPDATE Mar 14, 2017 A Docker file has been added to the project. Thanks to @varun-suresh.

pyocr - A Python wrapper for Tesseract and Cuneiform

  •    Python

PyOCR is an optical character recognition (OCR) tool wrapper for python. That is, it helps using various OCR tools from a Python program.It has been tested only on GNU/Linux systems. It should also work on similar systems (*BSD, etc). It may or may not work on Windows, MacOSX, etc.

gosseract - Go package for OCR (Optical Character Recognition), by using Tesseract C++ library

  •    Go

Golang OCR package, by using Tesseract C++ library. Check Dockerfile for more detail of installation, or you can just try by docker run -it --rm otiai10/gosseract.

open-ocr - Run your own OCR-as-a-Service using Tesseract and Docker

  •    Go

OpenOCR makes it simple to host your own OCR REST API. The heavy lifting OCR work is handled by Tesseract OCR.





We have large collection of open source products. Follow the tags from Tag Cloud >>


Open source products are scattered around the web. Please provide information about the open source projects you own / you use. Add Projects.