logoalt Hacker News

mahimaiyesterday at 10:03 PM2 repliesview on HN

I built this because Voice AI is moving fast and I couldn't find a single place that walks a developer from "what is a voice agent" through to production telephony, evals, and the FCC/EU AI Act stuff you actually need to know before shipping.

Every citations are verified and active, tagged Beginner/Intermediate/ Advanced, and grouped so you can read it top-to-bottom:

foundations →

pick a framework (LiveKit or Pipecat) →

swap in STT/TTS/LLM components →

add WebRTC or SIP →

evals and production. →

What's intentionally not in there: - Vendor-locked tutorials presented as neutral guides (flagged where authors have commercial interests) - Anything that hasn't been updated in the last 12 months

Sections I'd most welcome feedback on: the open-source TTS picks (this moves weekly), the evaluation tooling (newest category), and whether the suggested 5-week learning path is realistic.

PRs and harsh critique both welcome.


Replies

nitwit-setoday at 5:48 AM

Useful collection of links - thanks. The suggested learning path section: reads AI generated. It won’t take anyone five weeks to work this all out - likely a weekend would suffice (coming from someone who has implemented most variations of voice including SIP).

My personal take on learning this stuff: ask Claude Code to build a greenfield project that does what you want and then actually read the code it produced and really try to understand what it’s doing.

fenomastoday at 2:35 AM

(vouched - just fyi this comment was dead, no idea why)

show 1 reply