Hacker News

spindump8930, today at 6:53 PM

The article seems quite editorialized, shifting between describing "large-scale AI models" and "neural network-based approaches".

The underlying paper itself is more precise, comparing against LUAR, a 2021 method based on BERT-style embeddings (i.e. a model with 82M parameters, which is 0.2% the size of e.g. the recent OS Gemma models). I don't fault the authors of the paper at all for this; their method is interesting and more interpretable! But you can check the publication history: their paper was originally uploaded in 2024 (https://arxiv.org/abs/2403.08462).

A good example of why some folks are bearish on journals.

"AI bad" seems to sell in some circles, and while there are many level-headed criticisms to be made of current AI fads, I don't think this qualifies.


Replies

throwanem, today at 7:21 PM

Are you prepared to demonstrate a superior result with models newer than those available when the research was done? Can you suggest a candidate experiment design to test your hypothesis?