logoalt Hacker News

fumeux_fumetoday at 3:22 PM1 replyview on HN

Seeing half of an AR LLM's output tokens go to generating a predefined json schema bothers me so much. I would love to have an option to use diffusion for infilling.


Replies

jmalickitoday at 3:48 PM

One trick I learned for this was to use csv for LLM I/I and translate json <-> csv at the boundary layer

show 1 reply