By coincidence, I've looked yesterday a small documentary [1] about the people tagging all those invoices to train theses models. For 120 €/month they are reading about 1000 to 4000 invoices per day and check and tag them for AI training.
[1] https://www.arte.tv/en/videos/126831-000-A/arte-reportage/
OCR based invoice recognition has been a solved problem for well over a decade. Source: I've consulted for a company doing that. No exploitation. No LLMs. Just clever engineering.
In my neck of the woods, B2B invoices are now required to be delivered over the Peppol network in UBL format, which further improves reliability.
Doesn't necessarily eliminate the need for an accountant, because the chosen UBL standard has lots of room for interpretation and ambiguity, and it's impossible to uniformly decide how process an invoice based on the invoice alone (e.g. is this deductible? is this even a business expense at all? which ledger should this go in? etc).
AI: Actual Indians^WMalagasy
Were they sore about it?
Or don’t tell me, if it’s well worth the 24min watch
> For 120 €/month they are reading about 1000 to 4000 invoices per day and check and tag them for AI training.
AGI will solve poverty, btw. Any second now. Just need 500 bil more bro.
Reminds me of openai paying Kenyans $2/hr to flag violent and toxic stuff for them and a bunch of people ending up with ptsd