logoalt Hacker News

techsystemsyesterday at 9:41 AM0 repliesview on HN

How does the context length scaling at 256K tokens compare to Llama's 1M in terms of performance? How are the contexts treated differently?