Hacker News

netdur · yesterday at 5:16 PM

Had a good run with Gemma 4 E2B Unsloth 4Q: https://youtube.com/shorts/XLsAnz5aAAI

The E4B model doesn’t fit on my phone’s TPU, so it swaps to RAM. The QAT version means more accuracy, which is good!


Replies

ComputerGuru · yesterday at 11:30 PM

How were you getting anything useful out of that? We found the (unquantized!) E2B model to be completely useless at even the simplest real-world classification tasks.

prism56 · yesterday at 8:34 PM

How do you know it swaps to RAM rather than running on the TPU?

Would be interested in testing this on my Pixel.
