0.15.12
版本发布时间: 2024-09-13 22:39:58
Unstructured-IO/unstructured最新发布版本:0.15.12(2024-09-13 22:39:58)
0.15.12
Enhancements
-
Improve
pdfminer
element processing Implemented splitting ofpdfminer
elements (groups of text chunks) into smaller bounding boxes (text lines). This prevents loss of information from the object detection model and facilitates more effective removal of duplicatedpdfminer
text.