simonw • yesterday at 12:15 PM

This isn't an image model. It's a text model, but text models can output SVG, so you can ask them to generate a difficult image and see how well they do.

Replies

cedws • yesterday at 12:41 PM

>Multimodal by design: Gemma 3n natively supports image, audio, video, and text inputs and text outputs.

But I understood your point: Simon asked it to output SVG (text) instead of a raster image, so it's more difficult.
