logoalt Hacker News

rush86999yesterday at 7:22 PM1 replyview on HN

Only solution is to train the issue for the next time.

Architecturally focusing on Episodic memory with feedback system.

This training is retrieved next time when something similar happens


Replies

atarusyesterday at 7:41 PM

Training is an overkill at this point imo. I have seen agents work quite well with a feedback loop, some tools and prompt optimisation. Are you doing fine-tuning on the models when you say training?

show 1 reply