5.4.0-rc1
版本发布时间: 2024-05-12 21:46:44
tesseract-ocr/tesseract最新发布版本:5.4.1(2024-06-12 02:25:20)
What's Changed
This releases provides an improved PDF renderer, adds a new PAGE XML renderer, extends the API to retrieve the text angle/gradient and has lots of smaller updates for code and documentation:
- Update appveyor.yml - Url has changed by @softwaretirol in https://github.com/tesseract-ocr/tesseract/pull/4188
- Fix grey result of indexed PNG in pdfrenderer. by @sjbronner in https://github.com/tesseract-ocr/tesseract/pull/4189
- Fix some typos by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4191
- normstrngs: add more hyphens and quotes by @bertsky in https://github.com/tesseract-ocr/tesseract/pull/4195
- Rename frk -> deu_latf (ISO 639-3, ISO 15924) by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4202
- Fix some performance issues which were reported by Coverity Scan by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4204
- Remove broken Dockerfile by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4205
- PAGE XML renderer / export by @jkamlah in https://github.com/tesseract-ocr/tesseract/pull/4214
- Remove unsupported OpenCL code and related API functions by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4220
- facilitate vectorization for generic build by @heshpdx in https://github.com/tesseract-ocr/tesseract/pull/4223
- Support training without lstmf files by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4215
- Simplify
GridSearch<...>
variables using typedef equivalents by @sadra-barikbin in https://github.com/tesseract-ocr/tesseract/pull/4226 - Use
std::min
andstd::max
for min & max operations inmakerow.cpp::most_overlapping_row()
by @sadra-barikbin in https://github.com/tesseract-ocr/tesseract/pull/4229 - Fix a few typos in comments by @sadra-barikbin in https://github.com/tesseract-ocr/tesseract/pull/4227
- Remove an unused variable in
paragraphs.cpp::DetectParagraphs()
by @sadra-barikbin in https://github.com/tesseract-ocr/tesseract/pull/4228 - A few refactors in some files by @sadra-barikbin in https://github.com/tesseract-ocr/tesseract/pull/4225
- Fix <Unicode> output and issues reported by Coverity Scan for PAGE XML renderer by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4234
- Update documentation by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4235
- Fix some issues which were reported by GitHub code scanning by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4236
- Improve CCUtil::main_setup (fixes issue #4230) by @stweil in https://github.com/tesseract-ocr/tesseract/pull/4239
- Allow for text angle/gradient to be retrieved by @Balearica in https://github.com/tesseract-ocr/tesseract/pull/4070
New Contributors
- @softwaretirol made their first contribution in https://github.com/tesseract-ocr/tesseract/pull/4188
- @sjbronner made their first contribution in https://github.com/tesseract-ocr/tesseract/pull/4189
- @JKamlah made their first contribution in https://github.com/tesseract-ocr/tesseract/pull/4214
- @heshpdx made their first contribution in https://github.com/tesseract-ocr/tesseract/pull/4223
- @Balearica made their first contribution in https://github.com/tesseract-ocr/tesseract/pull/4070
Full Changelog: https://github.com/tesseract-ocr/tesseract/compare/5.3.4...5.4.0-rc1