logoalt Hacker News

NiloCKtoday at 3:24 PM0 repliesview on HN

This is wonderful.

Having models attempt an SVG letter S remains one of my personal/informal LLM benchmarks. They are still pretty bad at it.