nrrbtrbbrb • yesterday at 4:02 AM • 2 replies

> There are LLMs for image generation,

That part isn’t handled by an LLM

> voice generation,

That part isn’t handled by an LLM

> video generation

That part isn’t handled by an LLM

Replies

notepad0x90 • yesterday at 3:15 PM

What is it handled by? I'm honestly curious; there are models specifically labeled for those tasks.

famouswaffles • yesterday at 7:19 AM

Yes, it can be, and often is. Advanced voice mode in ChatGPT and the voice mode in Gemini are LLMs. So is the image gen in both ChatGPT and Gemini (Nano Banana).
