Releases: tesseract-ocr/tesseract
Releases · tesseract-ocr/tesseract
5.3.0-rc1
What's Changed
- Fix memory issues in ScrollView::MessageReceiver by @p12tic in #3872
- autotools: Add rule for svpaint executable by @stweil in #3873
- Replace call of exit function by return statement in main function by @stweil in #3878
- Fix the build on CodeQL/Analyze by @arseniy-sonar in #3888
- CI: Remove Ubuntu 18.04 by @amitdo in #3902
- configure.ac: fix build on aarch64_be by @ffontaine in #3907
- SW CI: Add paths filter by @amitdo in #3908
- Create .mailmap by @amitdo in #3910
- Fix tesseract.pc from cmake to match autotools by @jeroen in #3930
- Update README.md by @nicholasz2510 in #3935
- Fixed 2 errors by @Gitoffthelawn in #3938
- fix issue #3940 - remove colormap before thresholding by @zdenop in #3942
- Update upload-artifact action by @rettinghaus in #3949
- Update checkout action to version 3 by @rettinghaus in #3948
- Fix Markdownlint by @Saibamen in #3950
- Fix broken links in CONTRIBUTING.md by @doraeric in #3951
- pdfrenderer.cpp: Ignore non-text blocks by @amitdo in #3959
- lstm.train: allow .box from .raw.png too by @bertsky in #3962
- Fix a number of performance issues (reported by Coverity Scan) by @stweil in #3967
- Fix training tools for legacy engine (issue #3925) by @stweil in #3970
- Fix function tesseract::WriteFeature (issue #3925) by @stweil in #3972
- Modernize function ObjectCache::DeleteUnusedObjects (fix issue with s… by @stweil in #3978
- More fixes for issue #3925 by @stweil in #3977
New Contributors
- @p12tic made their first contribution in #3872
- @arseniy-sonar made their first contribution in #3888
- @nicholasz2510 made their first contribution in #3935
- @rettinghaus made their first contribution in #3949
- @Saibamen made their first contribution in #3950
- @doraeric made their first contribution in #3951
Full Changelog: 5.2.0...5.3.0-rc1
5.2.0
This is a new minor version of Tesseract 5.
- Improvements and fixes for continuous integration,
autoconf and cmake builds. - Set /Os for some 32 bit MS compilers (fixes #3769).
- Improve comments and other documentation.
- Add initial support for Intel AVX512F.
- Fix for very large PDF files on 32 bit hosts (fixes #3805).
- Fix NEON detection on FreeBSD.
- Fix regression with UZN files (fixes #3837).
- Fix calling delete[] for memory allocated by malloc in C API.
- Add an API function to init tesseract with traineddata from memory
(fixes #3691). - Replace direct access to Leptonica internal data structures by
function calls and support latest releases of Leptonica. - Replace std::regex by std::string functions (fixes issue #3830).
- Use compiled-in TESSDATA_PREFIX also on Windows (fixes #3767).
- Add new parameter 'invert_threshold', change the default threshold
from 0.5 to 0.7 and mark parameter 'tessedit_do_invert' as deprecated.
See also list of all changes.
5.1.0
This is a new minor version of Tesseract 5.
- Handle image and line regions in output formats ALTO, hOCR and text.
- New parameter curl_timeout for curl_easy_setop.
- Build fixes and improvements.
- Catch nullptr in PageIterator::Orientation to improve robustness.
- Remove unused code.
See also list of all changes.
5.0.1
This is a bug fix release of Tesseract 5.0.
- Add SPDX-License-Identifier to public include files.
- Support redirections when running OCR on a URL.
- Lots of fixes and improvements for cmake builds.
Distributions should use the autoconf build. - Fix broken msys2 build with gcc 11.
- Fix parameter certainty_scale (was duplicated).
- Fix some compiler warnings and clean code.
- Correctly detect amd64 and i386 on FreeBSD.
- Add libarchive and libcurl in continuous integration actions.
- Update submodule googletest to release v1.11.0.
See also list of all changes.
5.0.0
This is the final stable release of Tesseract 5.0.0.
- Limit BCER to interval [0,1]
- Improved build process
- Cleaned code
See also list of all changes.
5.0.0-rc3
This is the third release candidate of Tesseract 5.0.0.
- Improve training messages
- Add RowAttributes getter to PageIterator
See also list of all changes.
4.1.3
5.0.0-rc2
This is the second release candidate of Tesseract 5.0.0.
- Fix regression for OCR with more than one model file
- Bug fixes
- Optimizations
See also list of all changes.
4.1.2
5.0.0-rc1
This is the first release candidate of Tesseract 5.0.0.
- Enable fast float32 LSTM by default
- Switch to NFC normalisation everywhere
- Remove banner message
- Disable music staff detection and removal
- Add new command line option --loglevel
- Bug fixes
See also list of all changes.