
Commit b15fbf1

Merge pull request #1941 from Shreeshrii/master
Update man page and readme reg two OCR engines in Tesseract 4
2 parents 1beeeee + d160067 commit b15fbf1

2 files changed: 12 additions, 0 deletions

README.md

Lines changed: 6 additions & 0 deletions
@@ -12,6 +12,12 @@
 ## About
 
 This package contains an **OCR engine** - `libtesseract` and a **command line program** - `tesseract`.
+Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused
+on line recognition, but also still supports the legacy Tesseract OCR engine of
+Tesseract 3 which works by recognizing character patterns. Compatibility with
+Tesseract 3 is enabled by using the Legacy OCR Engine mode (--oem 0).
+It also needs traineddata files which support the legacy engine, for example
+those from the tessdata repository.
 
 The lead developer is Ray Smith. The maintainer is Zdenko Podobny.
 For a list of contributors see [AUTHORS](https://github.com/tesseract-ocr/tesseract/blob/master/AUTHORS)

doc/tesseract.1.asc

Lines changed: 6 additions & 0 deletions
@@ -17,6 +17,12 @@ between 1985 and 1995. In 1995, this engine was among the top 3 evaluated by
 UNLV. It was open-sourced by HP and UNLV in 2005, and has been developed
 at Google since then.
 
+Tesseract 4 adds a new neural net (LSTM) based OCR engine which is focused
+on line recognition, but also still supports the legacy Tesseract OCR engine of
+Tesseract 3 which works by recognizing character patterns. Compatibility with
+Tesseract 3 is enabled by --oem 0. It also needs traineddata files which support
+the legacy engine, for example those from the tessdata repository.
+
 
 IN/OUT ARGUMENTS
 ----------------
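
The paragraph added to both files says that Tesseract 3 compatibility needs two things together: the legacy engine mode (`--oem 0`) and traineddata files that support the legacy engine. A minimal Python sketch of assembling such a command line follows. The helper name `build_tesseract_command`, the constants, and the file paths are hypothetical and used only for illustration; `--oem`, `-l`, and `--tessdata-dir` are options of the Tesseract 4 command line program. The sketch only builds the argument list and does not run `tesseract`.

```python
from typing import List, Optional

# Engine modes accepted by `--oem` in Tesseract 4.
OEM_LEGACY = 0  # legacy Tesseract 3 engine (character pattern recognition)
OEM_LSTM = 1    # neural net (LSTM) engine (line recognition)


def build_tesseract_command(
    image: str,
    output_base: str,
    oem: int = OEM_LSTM,
    tessdata_dir: Optional[str] = None,
    lang: str = "eng",
) -> List[str]:
    """Return an argument list for the `tesseract` command line program.

    Hypothetical helper: it only assembles arguments, it does not check
    that the traineddata in `tessdata_dir` supports the chosen engine.
    """
    if oem not in (0, 1, 2, 3):
        raise ValueError("oem must be 0, 1, 2 or 3")
    cmd = ["tesseract", image, output_base, "--oem", str(oem), "-l", lang]
    if tessdata_dir is not None:
        # For --oem 0 this should point at traineddata that supports the
        # legacy engine, for example files from the tessdata repository.
        cmd += ["--tessdata-dir", tessdata_dir]
    return cmd


if __name__ == "__main__":
    # Assumed paths, for illustration only.
    legacy = build_tesseract_command(
        "page.png", "out", oem=OEM_LEGACY, tessdata_dir="/opt/tessdata"
    )
    print(" ".join(legacy))
```

The resulting list can be passed to `subprocess.run` on a machine where `tesseract` is installed. Keeping the engine mode and the traineddata directory in one call makes it harder to request the legacy engine while pointing at traineddata that lacks legacy support.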
