
esafak today at 1:20 AM | 5 replies

Except learning to reason is a far cry from curve fitting. Our brains have more than five parameters.


Replies

voxelghost today at 2:11 AM

After a quick skim, my understanding is that this is more like a very compressed diff vector applied to a multi-billion-parameter model, so that the model can be 'retrained' to reason (score) better on a specific topic, e.g. math, which is what the paper used.

sdenton4 today at 4:08 AM

It's the statistics equivalent of 'no one needs more than 640 KB of RAM'.

ekuck today at 2:10 AM

speak for yourself!

est today at 1:54 AM

Reasoning capability might just be some specific combination of mirror neurons.

Even some advanced math usually involves applying patterns found elsewhere to new topics.

measurablefunc today at 2:30 AM

I agree, I don't think gradient descent is going to work in the long run for the kind of luxurious & automated communist utopia the technocrats are promising everyone.