magic_pdf-0.9.1-released
版本发布时间: 2024-11-06 12:07:21
opendatalab/MinerU最新发布版本:magic_pdf-0.10.5-released(2024-12-02 14:16:59)
What's Changed
- Feat/tune docs by @icecraft in https://github.com/opendatalab/MinerU/pull/833
- fix(ocr_mkcontent): improve content handling for different languages and equation types by @myhloli in https://github.com/opendatalab/MinerU/pull/839
- feat(list): improve list detection algorithm & fix(list): improve list identification accuracy by @myhloli in https://github.com/opendatalab/MinerU/pull/843
- docs(tutorial): update magic-pdf command with output directory by @myhloli in https://github.com/opendatalab/MinerU/pull/844
- feat(para_split_v3): improve list identification with block aspect ratio by @myhloli in https://github.com/opendatalab/MinerU/pull/845
- fix(dict2md): improve text concatenation logic by @myhloli in https://github.com/opendatalab/MinerU/pull/847
- Update pdf_extract_kit.py by @CiaranYoung in https://github.com/opendatalab/MinerU/pull/853
- feat(table): upgrade StructEqTable model and integrate into PDF Extract Kit by @myhloli in https://github.com/opendatalab/MinerU/pull/854
- feat(model): add HTML minification to StructTableModel by @myhloli in https://github.com/opendatalab/MinerU/pull/855
- chore: add .gitattributes to configure file linguist attributes by @myhloli in https://github.com/opendatalab/MinerU/pull/856
- fix(merge_text): add ligature replacement functionality #305 #241 by @myhloli in https://github.com/opendatalab/MinerU/pull/857
- chore: add CSS and SCSS files to linguist-vendored- Update .gitattributes to mark CSS and SCSS files as vendored by @myhloli in https://github.com/opendatalab/MinerU/pull/858
- docs(README): update Colab demo link by @myhloli in https://github.com/opendatalab/MinerU/pull/860
- fix(table): improve table image processing by @myhloli in https://github.com/opendatalab/MinerU/pull/866
- docs(faq): add troubleshooting for illegal instruction error on Linux servers by @myhloli in https://github.com/opendatalab/MinerU/pull/867
- feat: mineru_demo接口文档替换为链接 by @LollipopsAndWine in https://github.com/opendatalab/MinerU/pull/871
- test(table): improve HTML validation for table extraction by @myhloli in https://github.com/opendatalab/MinerU/pull/874
- docs: update arXiv paper link in README files by @myhloli in https://github.com/opendatalab/MinerU/pull/875
- docs(README): update changelog for v0.9.1 release by @myhloli in https://github.com/opendatalab/MinerU/pull/877
New Contributors
- @CiaranYoung made their first contribution in https://github.com/opendatalab/MinerU/pull/853
Full Changelog: https://github.com/opendatalab/MinerU/compare/magic_pdf-0.9.0-released...magic_pdf-0.9.1-released
1、 magic_pdf-0.9.1-py3-none-any.whl 1.09MB