Reminds me of my years working on digital forensics software. I was working at a smaller scale, but the idea was similar: extract, carve, and pull as many raw files as possible, run them through various processing threads and pipelines, then categorize them and produce some sort of report. I guess in this case, the last step is getting it all buttoned up for training. I also have to imagine some of it goes through some level of human review. Anyone wanting to make a worthwhile model is better off letting humans describe things; my understanding is that the outputs become drastically better. Sure, training can find all the patterns, but the wording used to describe them makes a difference if you can get just enough detail.
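
For anyone who hasn't seen the carve, categorize, report flow, here is a rough Python sketch of the idea. It is a toy, not how any real forensic tool or training pipeline works: the `SIGNATURES` table, `carve`, and `report` are names I made up for illustration, and it only locates header magic bytes rather than recovering whole files (real carving also needs footers or length fields).

```python
from collections import Counter

# Hypothetical signature table: well-known header magic bytes -> category label.
SIGNATURES = {
    b"\xff\xd8\xff": "jpeg",
    b"\x89PNG\r\n\x1a\n": "png",
    b"%PDF-": "pdf",
}


def carve(blob: bytes):
    """Yield (offset, category) for every known signature found in blob."""
    for magic, category in SIGNATURES.items():
        start = blob.find(magic)
        while start != -1:
            yield start, category
            # Keep scanning past this hit for further occurrences.
            start = blob.find(magic, start + 1)


def report(blob: bytes) -> dict:
    """Count carved hits per category (the 'some sort of report' step)."""
    return dict(Counter(category for _, category in carve(blob)))


# Made-up raw data: two JPEG headers and one PDF header buried in junk bytes.
sample = b"junk" + b"\xff\xd8\xff" + b"data" + b"%PDF-1.4" + b"\x00" + b"\xff\xd8\xff"
summary = report(sample)
```

In a real setup each stage would presumably be its own worker or queue, and the human review step would sit between categorizing and the final output.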