logoalt Hacker News

jeroenhdtoday at 12:29 PM1 replyview on HN

Their video is completely different from what Gemini does now. It analyses mouse movements, like circling around things, underlining things with the mouse, pointing at things to indicate where they need to go. It's a lot like the interfaces you might see in sci-fi movies, where generic gestures are understood within context in a way that modern computers can't handle.


Replies

bel8today at 1:40 PM

> circling around things, underlining things with the mouse

Do we use the same Android Gemini assistant?

Because the one I use does that and it has object detection smart enough to be intuitive. It usually gets it right when I point something on the screen. And when it doesn't, I can circle around the thing or just click again.

This Instagram post for example, it automatically highlighted the entire person, but I wanted to know about the shoes. I then clicked once on the shoes and it knew exactly what I wanted and gave me the info in about 2 seconds:

https://imgur.com/a/lHUeciy

This is useful to non tech savvy folks. Not just to us hackers.

show 1 reply