Hacker News

pilooch · 06/26/2025

This model is fully compatible with anything previously done with gemma3. I just passed it to one of my VLM fine-tuning scripts and it started without issues (HF Transformers code). On a single GPU with LoRA, the E4B model takes 18 GB of VRAM at batch size 1, where gemma-4B took 21 GB. Nice one from DeepMind; the gemma3 family tops the open-weights VLMs.
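For readers who want a concrete picture, here is a minimal sketch of the kind of LoRA setup the comment describes, using HF Transformers with PEFT. The commenter's actual script is not shown; the model id, LoRA rank, and target modules below are illustrative assumptions, not details from the comment.

```python
import torch
from transformers import AutoModelForImageTextToText
from peft import LoraConfig, get_peft_model

# Assumed HF id; the E2B variant (see the fix below) would follow
# the same naming scheme.
model_id = "google/gemma-3n-E4B-it"

# Load in bfloat16 so the frozen base weights fit on a single GPU.
model = AutoModelForImageTextToText.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

# LoRA freezes the base model and trains only small adapter matrices,
# which is why VRAM stays modest even for a multi-billion-parameter VLM.
lora_config = LoraConfig(
    r=16,            # assumed rank; the comment doesn't give one
    lora_alpha=32,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # common attention targets
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # adapters are a tiny fraction of total params
```

From here, training at batch size 1 (e.g. via `Trainer` with `per_device_train_batch_size=1`) would produce memory usage in the ballpark the comment reports, though exact numbers depend on sequence length, optimizer, and gradient checkpointing.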


Replies

pilooch · 06/27/2025

Fix: it's the E2B model, not the E4B.