logoalt Hacker News

steve1977yesterday at 6:02 PM2 repliesview on HN

The question of course is, did it get the car wash question right because it is "the car wash question" or because it could actually infer why the car needed to be there?


Replies

embedding-shapeyesterday at 6:15 PM

Wasn't that "twoot" (or whatever Mastodon calls them) made just a week ago? Unlikely to have been in the training dataset of a model becoming available for public use today, unless Google made some serious advancements on the training front.

jama211yesterday at 6:17 PM

Shouldn’t be too hard to come up with a new unique reasoning question