Hacker News

hommelix today at 5:01 AM

By coincidence, I watched a short documentary yesterday [1] about the people tagging all those invoices to train these models. For 120 €/month they read about 1000 to 4000 invoices per day, checking and tagging them for AI training.

[1] https://www.arte.tv/en/videos/126831-000-A/arte-reportage/


Replies

cantalopes today at 5:55 AM

Reminds me of OpenAI paying Kenyans $2/hr to flag violent and toxic stuff for them, and a bunch of people ending up with PTSD.

elric today at 8:45 AM

OCR-based invoice recognition has been a solved problem for well over a decade. Source: I've consulted for a company doing that. No exploitation. No LLMs. Just clever engineering.

In my neck of the woods, B2B invoices are now required to be delivered over the Peppol network in UBL format, which further improves reliability.

That doesn't necessarily eliminate the need for an accountant, because the chosen UBL standard has lots of room for interpretation and ambiguity, and it's impossible to uniformly decide how to process an invoice based on the invoice alone (e.g. is this deductible? is this even a business expense at all? which ledger should this go in? etc).
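
To make that concrete, here is a minimal sketch in Python of reading a UBL 2.1 invoice with only the standard library. The sample document, its values, and the `parse_invoice` helper are all made up for illustration; a real Peppol invoice carries many more fields and would need validation. The structured fields come out reliably, but nothing in the document says which ledger the amount belongs in or whether it is deductible.

```python
import xml.etree.ElementTree as ET
from decimal import Decimal

# Standard UBL 2.1 namespaces.
NS = {
    "cac": "urn:oasis:names:specification:ubl:schema:xsd:CommonAggregateComponents-2",
    "cbc": "urn:oasis:names:specification:ubl:schema:xsd:CommonBasicComponents-2",
}

# Hypothetical, heavily trimmed invoice; all values are invented.
SAMPLE = """\
<Invoice xmlns="urn:oasis:names:specification:ubl:schema:xsd:Invoice-2"
         xmlns:cac="urn:oasis:names:specification:ubl:schema:xsd:CommonAggregateComponents-2"
         xmlns:cbc="urn:oasis:names:specification:ubl:schema:xsd:CommonBasicComponents-2">
  <cbc:ID>INV-001</cbc:ID>
  <cbc:IssueDate>2024-01-15</cbc:IssueDate>
  <cac:AccountingSupplierParty>
    <cac:Party>
      <cac:PartyName><cbc:Name>Example Supplier</cbc:Name></cac:PartyName>
    </cac:Party>
  </cac:AccountingSupplierParty>
  <cac:LegalMonetaryTotal>
    <cbc:PayableAmount currencyID="EUR">121.00</cbc:PayableAmount>
  </cac:LegalMonetaryTotal>
  <cac:InvoiceLine>
    <cbc:ID>1</cbc:ID>
    <cac:Item><cbc:Name>Laptop bag</cbc:Name></cac:Item>
  </cac:InvoiceLine>
</Invoice>
"""


def parse_invoice(xml_text):
    """Extract a few header fields and the line item names (hypothetical helper)."""
    root = ET.fromstring(xml_text)
    payable = root.find("cac:LegalMonetaryTotal/cbc:PayableAmount", NS)
    return {
        "id": root.findtext("cbc:ID", namespaces=NS),
        "issue_date": root.findtext("cbc:IssueDate", namespaces=NS),
        "supplier": root.findtext(
            "cac:AccountingSupplierParty/cac:Party/cac:PartyName/cbc:Name",
            namespaces=NS,
        ),
        "amount": Decimal(payable.text),
        "currency": payable.get("currencyID"),
        "lines": [
            item.findtext("cbc:Name", namespaces=NS)
            for item in root.findall("cac:InvoiceLine/cac:Item", NS)
        ],
    }


invoice = parse_invoice(SAMPLE)
print(invoice["supplier"], invoice["amount"], invoice["currency"], invoice["lines"])
```

Whether "Laptop bag" is a business expense, and which ledger it goes in, is exactly the part no field in the invoice answers.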

sph today at 8:50 AM

AI: Actual Indians^WMalagasy

Barbing today at 5:40 AM

Were they sore about it?

Or don’t tell me, if it’s well worth the 24min watch

wiseowise today at 6:42 AM

> For 120 €/month they are reading about 1000 to 4000 invoices per day and check and tag them for AI training.

AGI will solve poverty, btw. Any second now. Just need 500 bil more bro.