
yoeven • today at 7:10 AM

It can. Try prompting the model to use both object detection and text extraction. We found that with pure text extraction it does very well at word- and sentence-level bounds, since the text acts as the anchor. When you treat it as an object detection problem instead, it sees that chunk of text as a single segment, which lets you extract it as one column bound. Give that a try.
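
If you only have the word-level boxes from the text-extraction mode, a rough fallback is to merge them into one enclosing column bound yourself. This is just a sketch and not how the model does it internally: the `Box` type, the `column_bound` helper, and the `(x0, y0, x1, y1)` pixel layout are all assumptions for illustration.

```python
from typing import Iterable, NamedTuple


class Box(NamedTuple):
    """Axis-aligned box; (x0, y0) is top-left, (x1, y1) is bottom-right (assumed layout)."""
    x0: float
    y0: float
    x1: float
    y1: float


def column_bound(word_boxes: Iterable[Box]) -> Box:
    """Return the smallest box that encloses every word-level box."""
    boxes = list(word_boxes)
    if not boxes:
        raise ValueError("need at least one word box")
    return Box(
        min(b.x0 for b in boxes),
        min(b.y0 for b in boxes),
        max(b.x1 for b in boxes),
        max(b.y1 for b in boxes),
    )


# Hypothetical word boxes for two lines of text in the same column.
words = [Box(10, 10, 50, 20), Box(55, 10, 90, 20), Box(10, 25, 70, 35)]
print(column_bound(words))
```

This only works if you already know which words belong to the same column, which is exactly the grouping the object detection framing gives you for free.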
