logoalt Hacker News

bjtyesterday at 9:41 PM1 replyview on HN

"frontier level" is doing a lot of work there, but the idea would be to only feed it earlier sources.

There are people working on this.

e.g. https://github.com/haykgrigo3/TimeCapsuleLLM


Replies

lovecgyesterday at 9:59 PM

The problem is the amount of data with that cutoff is really minuscule to produce anything powerful. You might be able to generate a lot of 1700s sounding data, you’d have to be careful not to introduce newer concepts or ways of thinking in that synthetic data though. A lot of modern texts talk about rates of change and the like in ways that are probably influenced by preexisting knowledge of calculus.

show 1 reply