logoalt Hacker News

omcnoetoday at 8:20 AM0 repliesview on HN

An eventual goal is likely to allow interacting with the LLM directly via audio tokens in input/output skipping tts and stt completely.