5.4.0
Release date: 2024-06-06 21:35:20
Latest release of tesseract-ocr/tesseract: 5.4.1 (2024-06-12 02:25:20)
What's Changed
This release provides an improved PDF renderer, adds a new PAGE XML renderer, extends the API to retrieve the text angle/gradient, and includes many smaller updates to code and documentation:
- Update appveyor.yml - Url has changed by @softwaretirol in https://github.com/tesseract-ocr/tesseract/pull/4188
- Fix grey result of indexed PNG in pdfrenderer. by @sjbronner in https://github.com/tesseract-ocr/tesseract/pull/4189
- Fix some typos by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4191
- normstrngs: add more hyphens and quotes by @bertsky in https://github.com/tesseract-ocr/tesseract/pull/4195
- Rename frk -> deu_latf (ISO 639-3, ISO 15924) by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4202
- Fix some performance issues which were reported by Coverity Scan by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4204
- Remove broken Dockerfile by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4205
- PAGE XML renderer / export by @JKamlah in https://github.com/tesseract-ocr/tesseract/pull/4214
- Remove unsupported OpenCL code and related API functions by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4220
- facilitate vectorization for generic build by @heshpdx in https://github.com/tesseract-ocr/tesseract/pull/4223
- Support training without lstmf files by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4215
- Simplify GridSearch<...> variables using typedef equivalents by @sadra-barikbin in https://github.com/tesseract-ocr/tesseract/pull/4226
- Use std::min and std::max for min & max operations in makerow.cpp::most_overlapping_row() by @sadra-barikbin in https://github.com/tesseract-ocr/tesseract/pull/4229
- Fix a few typos in comments by @sadra-barikbin in https://github.com/tesseract-ocr/tesseract/pull/4227
- Remove an unused variable in paragraphs.cpp::DetectParagraphs() by @sadra-barikbin in https://github.com/tesseract-ocr/tesseract/pull/4228
- A few refactors in some files by @sadra-barikbin in https://github.com/tesseract-ocr/tesseract/pull/4225
- Fix <Unicode> output and issues reported by Coverity Scan for PAGE XML renderer by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4234
- Update documentation by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4235
- Fix some issues which were reported by GitHub code scanning by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4236
- Improve CCUtil::main_setup (fixes issue #4230) by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4239
- Allow for text angle/gradient to be retrieved by @Balearica in https://github.com/tesseract-ocr/tesseract/pull/4070
- Fix setup of datadir on installations with Conda (issue #4230) by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4240
- Fix FP exception in Wordrec::angle_change (issue #4242) by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4243
- Use AM_CPPFLAGS also for compilation of all sources by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4244
- Fix some compiler warnings by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4245
- Remove unused xmlns:xlink from ALTO renderer by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4241
- Fix some compiler warnings by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4246
- Fixes #4247: remove unnecessary nullptr checks by @hribz in https://github.com/tesseract-ocr/tesseract/pull/4248
- Avoid redundant conversion from std::string to char * to std::string by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4249
- Replace strcpy and strncpy by new inline helper function by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4250
- Make function Network::spec pure virtual by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4253
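
One entry above replaces strcpy and strncpy with a new inline helper function. These notes do not show that helper, so the following is only a rough sketch of the general idea, assuming the goal is to copy a std::string into a freshly allocated, NUL-terminated buffer without computing lengths by hand at each call site. The name CopyToNewBuffer and the use of std::unique_ptr are illustrative assumptions, not Tesseract's actual code.

```cpp
#include <cstddef>
#include <memory>
#include <string>

// Hypothetical helper (NOT the function added in the pull request):
// copy a std::string into a newly allocated, NUL-terminated char buffer.
// Keeping the length handling in one place avoids the off-by-one and
// missing-terminator mistakes that scattered strcpy/strncpy calls invite.
inline std::unique_ptr<char[]> CopyToNewBuffer(const std::string &from) {
  const std::size_t length = from.length();
  // One extra byte for the terminating NUL.
  std::unique_ptr<char[]> target(new char[length + 1]);
  // std::string::copy writes exactly `length` characters and does not
  // append a terminator, so it is added explicitly below.
  from.copy(target.get(), length);
  target[length] = '\0';
  return target;
}
```

A caller would write something like `auto text = CopyToNewBuffer(result);` and pass `text.get()` wherever a `char *` is expected. The smart pointer is a choice made for this sketch so that the buffer is released automatically. An API that hands ownership to the caller would return a raw pointer instead.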
New Contributors
- @softwaretirol made their first contribution in https://github.com/tesseract-ocr/tesseract/pull/4188
- @sjbronner made their first contribution in https://github.com/tesseract-ocr/tesseract/pull/4189
- @JKamlah made their first contribution in https://github.com/tesseract-ocr/tesseract/pull/4214
- @heshpdx made their first contribution in https://github.com/tesseract-ocr/tesseract/pull/4223
- @Balearica made their first contribution in https://github.com/tesseract-ocr/tesseract/pull/4070
- @hribz made their first contribution in https://github.com/tesseract-ocr/tesseract/pull/4248
Full Changelog: https://github.com/tesseract-ocr/tesseract/compare/5.3.4...5.4.0