greesil • today at 3:47 AM • 2 replies

It's a comment. On Hacker News. Not the RL subreddit, or whatever. I'm just amazed at the jargon. I'm sure it's useful, but one could just call it model output.

Replies

esafak • today at 5:18 AM

https://en.wikipedia.org/wiki/Reinforcement_learning#Policy

antonvs • today at 4:54 AM

> one could just call it model output.

That would be incorrect. My other reply attempts to address this.
