logoalt Hacker News

red75primetoday at 7:01 AM1 replyview on HN

The latest models are mostly LMMs (large multimodal models). If a model builds an internal representation that integrates all the modalities we are dealing with (robotics even provides tactile inputs), it becomes harder and harder to imagine why those representations should be qualitatively different.


Replies

qseratoday at 8:11 AM

It can't, simply because the textual description of a concept is different from the concept itself.

show 1 reply