I have a hard time believing that AI can be used to label AI-generated videos without there being a significant number of false positives/negatives. I think back to ZeroGPT and it labeling the Declaration of Independence as AI-generated.
Considering YouTube's current level of detection of absolutely anything, this will likely be an absolute disaster with no way of appealing to a real human.
In theory you're right, in practice you're not.
We don't need the metaphysical solution to the problem of detecting AI videos for the rest of time. Certainly, it's fairly easy to make something that mostly works most of the time. Enough to be very, very useful.
I don't know how YouTube's detection will work, but if it were based solely on watermarks, there would be many false negatives, but there shouldn't be false positives.
If it simply costs more time and money to generate videos that pass the filter I'm all for it. The time and money cost of creating videos has tended to zero so there is a lot of low quality stuff now.
It's not just from AI either. Video creation used to require a fancy camera and a above average internet connection. Now the whole world has that so we're seeing a lot of low quality profit seeking content on any platform where there is money to be made. There was a GitHub repo with 100s of low quality PRs because people thought it would boost their job prospects.
I mean, between SynthID and C2PA don't you already have labels and watermarks that covering a lot of major players like Google, Adobe, ElevenLabs, NIVIDEA? No real concern about false negatives there.
As for false positive, the most straightforward path seems to be to let stuff slide unless you are really sure. Maybe that slightly rewards players like Kling because they keep the invisible watermarks for their own use, and that of the CCP,but not third parties. NBD.
It's not like catching everything is that important. YouTube isn't claiming this is perfect. And I don't know that anyone need this to be perfect. It's not like even the best photorealistic video creation tools don't have plenty of tells anyway.
This doesn't seem like ZeroGPT at all. Having a flag or not having flag on a YouTube short is low stakes. Its not like it's being sold as a solution for something high stakes like academic grading.
videos and images have non-ai methods for detecting if its ai, its easier than text. There are some image artifacts in AI that can be found statistically.
[dead]
All they need to do is "randomly" label 99,999 of every 100,000 as AI and they'll be right 99.999% of the time.
Cryptographically verifiable provenance and chain of custody is going to be necessary to get to the human only stuff, before long, but the good AI stuff will be better. Just a matter of time, at this point.
There's a reason why they led with simply labeling author self-reported AI videos as AI, and then casually mentioned they'll also try to detect AI videos automatically. They're not confident in it working reliably and want people to have low expectations. This is probably realistic. Using AI to detect AI is not reliable. Detecting AI videos is likely to become an arms race and will require an ongoing commitment of resources.
This is better than nothing, but it's not going to provide immunity against AI fakes trending and having impact before they're identified as AI.