logoalt Hacker News

cyanydeezyesterday at 7:43 PM1 replyview on HN

If you want to get out ahead of what's coming, it'll be small models that bootstrap the harness rather than anything else.


Replies

robbrown451yesterday at 7:51 PM

I used to think that, but ended up going the other direction, partly because I don't have the wherewithall to build a model but then I realized, with existing models that can take more than a tiny amount of context, you can just let any model bootstrap itself with a good prompt sent by the system.

There's a ton of other tricks to it, but mostly keeping the protocol simple for the AI so it can concentrate on coding logic and not stuff like managing BS boilerplate, dependencies, etc. (for instance I make extensive use of things like abstract syntax tree library to help with surgical edits from the LLM)

That said, I would be very open to collaborating with someone who builds such small models, I don't think the system strictly needs it, but it also could have some extra power if it had it.

show 2 replies