+ 1

What is Tesseract OCR?

Tesseract was originally developed at Hewlett-Packard Laboratories Bristol and at Hewlett-Packard Co, Greeley Colorado between 1985 and 1994, with some more changes made in 1996 to port to Windows, and some C++izing in 1998. In 2005 Tesseract was open sourced by HP. Since 2006 it is developed by Google.
Tesseract OCR is a tool in the Image Analysis API category of a tech stack.
Tesseract OCR is an open source tool with 58.6K GitHub stars and 9.1K GitHub forks. Here’s a link to Tesseract OCR's open source repository on GitHub

Who uses Tesseract OCR?

16 companies reportedly use Tesseract OCR in their tech stacks, including Shelf, The Paperless Project, and X-Ray.

76 developers on StackShare have stated that they use Tesseract OCR.
Pros of Tesseract OCR
Building training set is easy
Very lightweight library
Decisions about Tesseract OCR

Here are some stack decisions, common use cases and reviews by companies and developers who chose Tesseract OCR in their tech stack.

Aicha Mahfoudh
Needs advice
Tesseract OCRTesseract OCR

Can I use both TensorFlow and Tesseract OCR to create a model that detects text out of a document pdf

See more

Tesseract OCR Alternatives & Comparisons

What are some alternatives to Tesseract OCR?
TensorFlow is an open source software library for numerical computation using data flow graphs. Nodes in the graph represent mathematical operations, while the graph edges represent the multidimensional data arrays (tensors) communicated between them. The flexible architecture allows you to deploy computation to one or more CPUs or GPUs in a desktop, server, or mobile device with a single API.
OpenCV was designed for computational efficiency and with a strong focus on real-time applications. Written in optimized C/C++, the library can take advantage of multi-core processing. Enabled with OpenCL, it can take advantage of the hardware acceleration of the underlying heterogeneous compute platform.
JavaScript is most known as the scripting language for Web pages, but used in many non-browser environments as well such as node.js or Apache CouchDB. It is a prototype-based, multi-paradigm scripting language that is dynamic,and supports object-oriented, imperative, and functional programming styles.
Git is a free and open source distributed version control system designed to handle everything from small to very large projects with speed and efficiency.
GitHub is the best place to share code with friends, co-workers, classmates, and complete strangers. Over three million people use GitHub to build amazing things together.
See all alternatives

Tesseract OCR's Followers
277 developers follow Tesseract OCR to keep up with related blogs and decisions.