a-t-c-g • yesterday at 3:55 AM • 1 reply

On benchmarks: we have internal ones that test reasoning fine-tuned models vs. frontier models + prompts.

For some use cases the fine-tuned model ranges from parity performance at 1/20th the cost up to exceeding the frontier model at 1/10th the cost. The trade-off is, of course, narrow applicability.

objektif • yesterday at 12:51 PM

How can I learn more about these models? Are they open source?
