MarkItDown: Python Tool for Converting Files and Office Documents to Markdown

from blog Daring Fireball, | ↗ original
Nifty new convert-to-Markdown library from a small indie development shop named Microsoft: The MarkItDown library is a utility tool for converting various files to Markdown (e.g., for indexing, text analysis, etc.) It presently supports: PDF (.pdf) PowerPoint (.pptx) Word (.docx) Excel (.xlsx) Images (EXIF metadata, and OCR) Audio (EXIF...