I'm working on a new component that displays PDFs in their original format and structure and highlights the text that is currently being played by the TTS engine. This is for my app (https://with.audio), which already supports PDF parsing and TTS of PDF files. WithAudio currently converts the input PDF to Markdown and performs TTS and synchronized text highlighting on the Markdown content. I want to do this on the original rendered PDF content itself.
Initial results are promising. The approach is to extract the text, figure out which lines belong to the same paragraph, and then try to map those back to their original positions in the PDF...
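
To make the idea concrete, here is a rough sketch of the paragraph-grouping step. It is not WithAudio's actual code. It assumes the text lines have already been extracted with bounding boxes, that the page is single-column, and that coordinates grow downward from the top of the page. The names `Line`, `Paragraph`, `group_paragraphs`, and the `gap_factor` threshold are all made up for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Line:
    """One extracted text line with its bounding box (hypothetical shape)."""
    text: str
    x0: float
    top: float
    x1: float
    bottom: float


@dataclass
class Paragraph:
    lines: list = field(default_factory=list)

    @property
    def text(self):
        # Text that would be handed to the TTS engine.
        return " ".join(line.text.strip() for line in self.lines)

    @property
    def boxes(self):
        # Rectangles to highlight on the rendered page while this text plays.
        return [(l.x0, l.top, l.x1, l.bottom) for l in self.lines]


def group_paragraphs(lines, gap_factor=0.6):
    """Group lines into paragraphs using the vertical gap between them.

    A line joins the current paragraph when the gap to the previous line is
    at most gap_factor times the previous line's height. The threshold is an
    assumption and would need tuning on real documents.
    """
    paragraphs = []
    for line in sorted(lines, key=lambda l: l.top):
        if paragraphs:
            prev = paragraphs[-1].lines[-1]
            gap = line.top - prev.bottom
            height = prev.bottom - prev.top
            if gap <= gap_factor * height:
                paragraphs[-1].lines.append(line)
                continue
        paragraphs.append(Paragraph(lines=[line]))
    return paragraphs
```

Because each paragraph keeps the boxes of its lines, the text sent to TTS stays tied to positions on the page. Multi-column layouts, headers, footers, and tables would need more than a simple gap check.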