logoalt Hacker News

potus_kushnertoday at 7:20 AM1 replyview on HN

@cafkafk got a recommendation for a good model that fits into 64GB and leaves a couple GB free for other tasks ?


Replies

cafkafktoday at 7:35 AM

Honestly, at this point you're probably looking at a smaller model, for the Gemma series I'd go with Gemma 4 E4B with drafters, but that's just a hunch from using it on my laptop (where I do have a RTX 4060 M and 96gb ram).

So you'd change the invocation slightly here, but a lot of things you can potentially reuse.

That said, the Gemma 4 E4B models have so far in my experience been... not great when it comes to long context, but they are very passable for basic tasks, and even seem surprisingly okay at tool calls.

show 2 replies