So, only Americans can use data against others?
By the way, I'm running a 400B model on my computer with 72GB VRAM: Qwen3.5-397B-A17B-GGUF/UD-Q4_K_XL, getting 13 t/s. Subjectively, I feel it runs at the level of Anthropic Claude, just slower.
Question for you: does that 13 t/s hold up even with high context/token counts?
I know Apple marketing says 'look at our 20 t/s', but they sent fewer than 40 tokens.