0.15.6
版本发布时间: 2024-08-20 20:47:04
Unstructured-IO/unstructured最新发布版本:0.15.12(2024-09-13 22:39:58)
0.15.6
Enhancements
Features
Fixes
-
Bump to NLTK 3.9.x Bumps to the latest
nltk
version to resolve CVE. -
Update CI for
ingest-test-fixture-update-pr
to resolve NLTK model download errors. -
Synchronized text and html on
TableChunk
splits. When aTable
element is divided during chunking to fit the chunking window,TableChunk.text
corresponds exactly with the table text inTableChunk.metadata.text_as_html
,.text_as_html
is always parseable HTML, and the table is split on even row boundaries whenever possible.