logoalt Hacker News

macwhispereryesterday at 8:09 PM0 repliesview on HN

I run the latest 20b-30b models on a MacBook Air... running inference with an MoE (25 tps) for like 2 hours is like 10% battery.. (look me up on huggingface to download my models)

also you gotta realize frontier models have massive "system prompts" that clog up the context window with garbage.

being able to write your own system prompts gives you a MASSIVE edge..