The latest models are mostly LMMs (large multimodal models). If a model builds an internal representation that integrates all the modalities we deal with (robotics even adds tactile input), it becomes harder and harder to see why those representations should be qualitatively different from our own.
It can't, simply because the textual description of a concept differs from the concept itself.