Hacker News

ivanovm, yesterday at 5:24 PM

The RL environment (instruction, stateful container, reward function) is the training data product being bought.
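
To make the three parts concrete, here is a rough sketch of how such an environment might be bundled. Everything in it is an assumption for illustration: `Container` is an in-memory stand-in for a real stateful container, and `Environment`, `reward`, and the `answer.txt` task are hypothetical names, not any vendor's actual format.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict


@dataclass
class Container:
    """Hypothetical stand-in for a stateful container: an in-memory file map."""
    files: Dict[str, str] = field(default_factory=dict)

    def write(self, path: str, content: str) -> None:
        # An agent's actions mutate this state over the course of an episode.
        self.files[path] = content


@dataclass
class Environment:
    """The three-part bundle: instruction, stateful container, reward function."""
    instruction: str
    container: Container
    reward_fn: Callable[[Container], float]

    def score(self) -> float:
        # The reward is computed from the container's final state.
        return self.reward_fn(self.container)


def reward(c: Container) -> float:
    # Hypothetical check: 1.0 if the task's expected file content is present.
    return 1.0 if c.files.get("answer.txt") == "42" else 0.0


env = Environment("Write 42 to answer.txt", Container(), reward)
before = env.score()                      # nothing has been written yet
env.container.write("answer.txt", "42")   # simulate the agent acting
after = env.score()                       # state now satisfies the check
```

The point of the sketch is that the reward function inspects state rather than text output, which is why the container has to ship with the instruction.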