Hacker News

habosa, today at 3:57 AM

Can you ELI5 why this is so slow for local inference but so fast with hosted models?