incredibly impressive demos. I wonder how the training data for these models look like?
is it separate batches of special "skills" that are added post training? how can they guarantee the models won't eventually lose a skill?