Our pre-LLM system does better than that, but any improvement would free up our labor hours for more lucrative work.
I am left wondering: if it is such a critical task, how would even a 1% error rate reduce the need for human review of all outputs?