logoalt Hacker News

colechristensentoday at 4:34 PM1 replyview on HN

No, they're actually training weights based on context before compaction. Context is context, this is splitting the model into persistent weights and malleable ones which are periodically updated.


Replies

delis-thumbs-7etoday at 4:39 PM

Wouldn’t that be extremely computationaly expensive considering how resource incentive training is?

show 1 reply