Hacker News

dnautics yesterday at 3:52 PM

Don't assume. Empirically, they are not. (As of this post, Feb 2026; this may change in the future, yadda yadda.)

See: autocodebench

https://github.com/Tencent-Hunyuan/AutoCodeBenchmark/tree/ma...


Replies

Towaway69 yesterday at 4:10 PM

Reading that made me wonder how much of it is related to Elixir being very similar in syntax to Ruby. Do LLMs really differentiate between the two?

Specific studies, such as the one quoted, are a long way from original real-world problems.
