ACCount37 • yesterday at 7:24 PM • 1 reply

That claim keeps getting contradicted hard by other parties, who say Mythos beats 5.5 resoundingly on both autonomous search and discovery and the creation of complex exploit chains.

There might be a harness difference, but also, this CTF-type benchmark might not capture the capability difference fully.

Replies

nimchimpsky • yesterday at 11:29 PM

[dead]
