logoalt Hacker News

zephyrwhimsyyesterday at 9:10 PM1 replyview on HN

Input quality is almost always the actual bottleneck. Teams spend months tuning retrieval while feeding HTML boilerplate into their vector stores.


Replies

hrmtst93837today at 8:38 AM

Plenty of folks obsess over retrieval but daily quries are word salad and the input is still garbage.