JavaOCR

Java OCR is an Optical Character Recognition algorithm based on a mean squared recognizer. This tool also includes utilities to trace and extract characters.

References:



http://javaocr.sourceforge.net/

Bookmark and Share          15039



comments powered by Disqus


Related Products

GOCR

GOCR is an OCR (Optical Character Recognition) program, developed under the GNU Public License. It converts scanned images of text back to text files. Joerg Schulenburg started the program, and now leads a team of developers.

Read more

OCRopus

OCRopus :- The open source document analysis and OCR system featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities.

Read more

Tesseract-ocr

The Tesseract OCR engine was one of the top 3 engines in the 1995 UNLV Accuracy test. Between 1995 and 2006 it had little work done on it, but it is probably one of the most accurate open source OCR engines available. The source code will read a binary, grey or color image and output text. A tiff reader is built in that will read uncompressed TIFF images, or libtiff can be added to read compressed images.

Read more

Tessnet2

A .NET 2.0 Open Source OCR assembly using Tesseract engine.

Read more

VietOCR

Provides optical character recognition (OCR) solutions for Vietnamese language.

Read more

Hocr-tools - Tools for manipulating and evaluating the hOCR format for representing multi-lingual OC

AbouthOCR is a format for representing OCR output, including layout information, character confidences, bounding boxes, and style information. It embeds this information invisibly in standard HTML. By building on standard HTML, it automatically inherits well-defined support for most scripts, languages, and common layout options. Furthermore, unlike previous OCR formats, the recognized text and OCR-related information co-exist in the same file and survives editing and manipulation. hOCR markup is

Read more

Lime-ocr - A simple, free OCR software for Windows using tesseract-ocr engine

Optical character recognition, usually abbreviated to OCR, is the mechanical or electronic translation of images of handwritten, typewritten or printed text (usually captured by a scanner) into machine-editable text. It is used to convert paper books and documents into electronic files. Lime OCR is build with tessearact-ocr which is an OCR Engine that was developed at HP Labs between 1985 and 1995, and now at Google. Lime OCR was initially developed for internal use of Lime Consultants, and now

Read more

Libautocaptcha - Automatic CAPTCHA decoding library for Java

InformationCAPTCHAWhat is a CAPTCHA? See this good article on Wikipedia. Automatic decodinglibautocaptcha aim is to provide automatic CAPTCHA decoding for Java programs. Using pre-processing, segmentation and classification, this library is able to solve some of the easiest visual CAPTCHAs. Creditslibautocaptcha would like to thank developers of JavaOCR for their work on image processing and character recognition. DisclaimerThe user assumes all risks associated with the use of libautocaptcha. Th

Read more

Lector - An interface to tesseract ocr

LectorA graphical ocr solution for GNU/Linux based on Python, Qt4 and tessaract OCR. Author: Davide Setti IntroductionLector can help you to scan your tons of paper and create text document! Lector lets you select areas on which you want to do OCR (Optical Character Recognition). Then you can run tesseract-ocr simply clicking a button. The resulting text can be proofread, formatted and edited directly in Lector. Features: scanning (available only on Linux) OCR via tesseract (with support for mor

Read more

Ocrlib - Open Source Orient text OCR and translation library

Dear friends! For a few years our group has been developing OCR (optical character recognition) and translation system with Open Source code for Asian languages. The key features of the OCR system include: 1. Stream OCR processing During the first stage of the project, we recognized 300 000 pages of Tibetan Canon in Tibetan for TBRC Digital Library (www.tbrc.org) We used MacPro server that has processed all 280 volumes with one OCR set. 2. Tibetan spell checker and online dictionary on 250000 wo

Read more

Related Tags
Browse projects by tags.

Follow feeds Follow bestopensource on Twitter Follow bestopensource on Facebook


Open source products are scattered around the web. Please provide information about the open source projects you own / you use. Add Projects.

Do you provide Consulting, Training, Support for any open source products. Register your business

Tag Cloud >>