Extracting Text From PDF Using Go

from blog Blog on hjr265.me, | ↗ original
TL;DR: Read all text values from PDF, ordered by Y-position. Make a state machine. The link to the complete code is at the end of the article. When it comes to extracting text from PDF, you will likely face one of these two scenarios: The PDF you generate or export from some software where all the text is still text. The PDF you scan from a...