mpvi [1] is the video control part. I have only used it a little bit but it is incredibly good. Control the playback completely from Emacs and quickly make timestamped org notes.
I don't know what the other parts are. Curious to learn!
[1]: https://github.com/lorniu/mpvi