Hacker News

clemvonstengel, last Monday at 5:13 PM

I really liked the point that ctrl-c can only be labelled retrocausally. I do think that with enough past context you should be able to know what was copied - in some sense the past does encode the future - but an agentic decision is precisely the kind where the future is more informative than the past for reconstructing that decision.

It does make me wonder if you should have the inverse dynamics model split into specifically retrocausal and causal parts. You kind of do this already with the inverse and forward dynamics models, but the idea of a model that knows only about the future, trained in a feedback loop with a model that knows only about the past, is kind of interesting.

I think you could just do a clever masking regime in your diffusion model to achieve the same effect without a whole architecture change.
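To make that concrete, here is a rough sketch of what I mean by a masking regime. It assumes a transformer-style denoiser where a boolean attention mask decides which timesteps each position can see; the function name `attention_mask` and the mode names are made up for illustration and are not from the original work.

```python
def attention_mask(n, mode):
    """Build an n x n boolean mask for a sequence of n timesteps.

    mask[i][j] is True when position i is allowed to attend to position j.
    The mode names are hypothetical labels for this sketch.
    """
    if mode == "causal":
        # Past-only view: position i sees timesteps 0..i.
        return [[j <= i for j in range(n)] for i in range(n)]
    if mode == "retrocausal":
        # Future-only view: position i sees timesteps i..n-1.
        return [[j >= i for j in range(n)] for i in range(n)]
    if mode == "full":
        # Bidirectional view: every position sees every timestep.
        return [[True] * n for _ in range(n)]
    raise ValueError(f"unknown mode: {mode}")


def show(mask):
    """Render a mask as text, with 'x' for visible and '.' for hidden."""
    return "\n".join("".join("x" if v else "." for v in row) for row in mask)


if __name__ == "__main__":
    for mode in ("causal", "retrocausal", "full"):
        print(mode)
        print(show(attention_mask(4, mode)))
```

Picking a different mode per training batch (or per head) would give you past-only and future-only views from the same network, which is roughly the effect of two separate models without changing the architecture.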


Replies

g413n, last Monday at 5:29 PM

Yeah, we actually had some wacky ideas with CTC + a reverse-causal mask, but diffusion does just make it all a bit simpler.