logoalt Hacker News

ClaireBookwormlast Monday at 5:23 PM2 repliesview on HN

What sort of fine tuning data was needed to allow the model to self-drive? One hour of video of someone driving, or extra labeling?


Replies

nee1rlast Monday at 5:27 PM

i actually drove the car (with arrow keys) around south park for around ~45 minutes as finetuning data, no extra labelling other than that. think the car line graph is super cool because you actually see the videegame prior working

g413nlast Monday at 5:29 PM

relevant note is that we finetuned by having the human also use arrow keys which keeps it in-distribution but also slower to collect