You can get the user's head position using WorldTrackingProvider, that's enough for xeyes to follow you across the room.
There's even a sample app close enough
https://developer.apple.com/documentation/visionOS/placing-e...