Tesseract OCR

A Story by
Tesseract Open Source OCR Engine

What is Tesseract OCR?

Tesseract was originally developed at Hewlett-Packard Laboratories in Bristol and at Hewlett-Packard Co. in Greeley, Colorado, between 1985 and 1994. Further changes followed in 1996 to port it to Windows, and parts of the code were converted to C++ in 1998. HP open sourced Tesseract in 2005. Since 2006 it has been developed by Google.
Tesseract OCR is a tool in the Image Analysis API category of a tech stack.
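As a rough illustration of what using the tool looks like in practice, the sketch below calls the `tesseract` command-line program from Python. It assumes the `tesseract` executable is installed and on the PATH; the helper names `build_command` and `run_ocr` and the file name `scan.png` are hypothetical and chosen only for this example.

```python
import shutil
import subprocess


def build_command(image_path, lang="eng"):
    """Build the argument list for one Tesseract CLI call.

    Passing "stdout" as the output base asks Tesseract to print the
    recognized text instead of writing it to a file, and -l selects
    the language model.
    """
    return ["tesseract", image_path, "stdout", "-l", lang]


def run_ocr(image_path, lang="eng"):
    """Run Tesseract on an image and return the recognized text.

    Assumes the tesseract executable is installed; raises a
    RuntimeError if it cannot be found on the PATH.
    """
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    result = subprocess.run(
        build_command(image_path, lang),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if Tesseract reports failure
    )
    return result.stdout


# Example usage (hypothetical file name):
# text = run_ocr("scan.png")
```

Keeping command construction separate from execution makes the call easy to inspect or log before running it on a real image.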

Who is using it?

16 companies use Tesseract OCR in their tech stacks, including Shelf, The Paperless Project, and X-Ray.

Shelf

The Paperless Project

X-Ray

Data Engineering

backend

Rubyroid-Labs-Tech-Stack

AI

ESCHR

Services

Frischergehts.net GmbH

DLabs.AI

Easy2Parts GmbH

Why developers like Tesseract OCR

Building a training set is easy
Very lightweight library