Even a 1 GB model is prohibitively big for phones if you want mass adoption.
The 1B model works on iPhones[0].
See my other comments. anemll appears to use less memory.
[0] https://huggingface.co/anemll/anemll-llama-3.2-1B-iOSv2.0
We'll just have to await next week's oai-agi1-0.4b-a0.1b-iq1_xs.gguf