Hacker News

coalhouse, today at 11:14 AM

Anything that compares proprietary models will be badly miscalibrated and may not be indicative. There have been too many model changes, in both the chat products and the API, where providers didn't say a word before it got too noticeable.