I worked in this field since long before LLMs. Nobody outside of the field really cared about GPT2, and even insiders knew the "too dangerous" part was a PR gag at best and the first dig of the moat at worst. After all, they released smaller versions of it along with detailed instructions on training it in the paper, so anyone with a lot of compute and a bunch of internet scrapers could try to recreate it. But basically noone did, even though it would have only cost ~50k back then (and less than 3k today). A few normal users started to take notice with GPT 3, but even then it was super limited. Even instructGPT didn't cause real shockwaves, despite being very close to the final product. Only ChatGPT/3.5 finally lit the fuse and people suddenly cared about having this too.
I worked in this field since long before LLMs. Nobody outside of the field really cared about GPT2, and even insiders knew the "too dangerous" part was a PR gag at best and the first dig of the moat at worst. After all, they released smaller versions of it along with detailed instructions on training it in the paper, so anyone with a lot of compute and a bunch of internet scrapers could try to recreate it. But basically noone did, even though it would have only cost ~50k back then (and less than 3k today). A few normal users started to take notice with GPT 3, but even then it was super limited. Even instructGPT didn't cause real shockwaves, despite being very close to the final product. Only ChatGPT/3.5 finally lit the fuse and people suddenly cared about having this too.