Agree. I'm unclear what's the highlight of this post. Is the multimodality of the model (t...

snickmy • today at 3:30 PM • 1 reply • view on HN

Agree. I'm unclear what's the highlight of this post. Is the multimodality of the model (that can replace computer vision), is it the reasoning part, is it the overall wrapper that makes it very easy to develop on top?

Replies

PunchTornado • today at 6:07 PM

It's the fact that is not task specific.

alt Hacker News

Replies