0.14.5
版本发布时间: 2024-06-10 21:50:07
Unstructured-IO/unstructured最新发布版本:0.15.12(2024-09-13 22:39:58)
0.14.5
Enhancements
-
Filtering for tar extraction Adds tar filtering to the compression module for connectors to avoid decompression malicious content in
.tar.gz
files. This was added to the Pythontarfile
lib in Python 3.12. The change only applies when using Python 3.12 and above. -
Use
python-oxmsg
forpartition_msg()
. Outlook MSG emails are now partitioned using thepython-oxmsg
package which resolves some shortcomings of the prior MSG parser.
Features
Fixes
-
8-bit string Outlook MSG files are parsed.
partition_msg()
is now able to parse non-unicode Outlook MSG emails. -
Attachments to Outlook MSG files are extracted intact.
partition_msg()
is now able to extract attachments without corruption.