Hacker News

vardump, today at 8:20 AM

Is it possible to accomplish tagging with local AI instead of Gemini?


Replies

icyfox, today at 7:27 PM

As far as I've seen, local OSS video understanding models just aren't there yet. I briefly looked at facial recognition models, but a good amount of the signal was actually in the video's audio rather than the raw video frames. It depends on the accuracy you're looking for at the end of the day.