logoalt Hacker News

itissidyesterday at 8:39 PM1 replyview on HN

Congrats on the launch. I've been fooling around with using my pipecat MCP(https://github.com/pipecat-ai/pipecat-mcp-server) with WebRTC. The WebRTC is hooked into a Webapp interface and this allows me to "talk" to different containers(projects) on my truenas.

I have just a list of chat sessions on the web app on all my projects. The webapp is modified to launch claude code daemons (borrowed from humanlayer/codelayer) and exposes the outbound STT from the WebRTC into a chat session.

- MCP Auth is via auth0

- Webapp itself is gated by a Bearer token.

This itself gets me pretty far. I am not sure what more this is offering?

My TTS/STT models are local by Kyutai and the voice agent's LLM between STT and TTS is used to determine some basic context: e.g. what project directories, mcp servers to select and what skills to use for launching the daemons.


Replies

kmansm27yesterday at 9:56 PM

This sounds solid, similar stuff to what we do! Sounds like this setup gets you most of the way there. We also have a mobile app + notifications. And I haven't tried using a coding voice agent via MCP, I'll try that out soon!

show 1 reply