Is it possible to accomplish tagging with local AI instead of Gemini?

vardump • today at 8:20 AM • 1 reply

Replies

As far as I've seen, local open-source video understanding models aren't there yet. I briefly looked at facial recognition models, but a good amount of the signal was in the video's audio rather than the raw video frames. It depends on the accuracy you need in the end.
