- Recaply AI
- Posts
- Voice AI hits light speed
Voice AI hits light speed
+ Inside Perplexity's fast AI weight system
October 02, 2025 |
Good morning, AI enthusiasts. Hume AI launched Octave 2, slashing voice AI costs by half while delivering responses in under 200 milliseconds.
Will affordable, realistic voice technology finally bring AI assistants into everyday business operations?
In today's recap:
Hume AI makes voice technology twice as cheap
Microsoft releases open-source framework for building AI agents
Inside Perplexity's fast AI weight system
HUME AI
Voice AI gets 2x cheaper, 40% faster
Recaply: Hume AI just launched Octave 2, and it's a game changer for voice AI. The second generation model delivers speech in under 200 milliseconds while cutting costs by 50%. This upgrade makes realistic voice technology accessible for everything from customer service to gaming.
Key notes:
Octave 2 generates audio 40% faster than its predecessor, responding in under 200ms for near-instant voice interactions.
Pricing drops to half the cost of Octave 1, with dedicated deployments reaching under a cent per minute of audio.
The model understands emotional tone more deeply, knowing when to whisper, shout, or speak calmly based on context.
New EVI 4 mini brings these capabilities to speech-to-speech applications for smoother conversational experiences.
Enhanced pronunciation handles uncommon words, numbers, and symbols more reliably than before.
Impact: With faster response times and lower costs, Octave 2 opens doors for large-scale voice AI applications. Businesses can now deploy realistic voice technology without breaking the bank, while users get seamless experiences that feel genuinely human.
TOGETHER WITH TYPELESS
Typing is a thing of the past
Typeless turns your raw, unfiltered voice into beautifully polished writing - in real time.
It works like magic, feels like cheating, and allows your thoughts to flow more freely than ever before.
With Typeless, you become more creative. More inspired. And more in-tune with your own ideas.
Your voice is your strength. Typeless turns it into a superpower.
MICROSOFT
Open-source AI agent platform debuts
Recaply: Microsoft unveiled Agent Framework, a free toolkit designed to help developers build sophisticated AI agent systems. The platform tackles a common challenge: how to move AI experiments into real business applications. Agent Framework provides the structure needed for multiple AI agents to work together, handle interruptions, and operate safely in regulated industries.
Key notes:
The framework bridges research innovation from Microsoft Research with production-ready enterprise features.
Companies can deploy agents that collaborate, like one agent researching data while another analyzes and validates results.
Built-in observability tools let teams track every decision and action agents make across workflows.
Human-in-the-loop features allow managers to approve sensitive operations before agents execute them.
Integration with Azure AI Foundry provides secure hosting with compliance controls for regulated sectors.
Impact: This launch represents a shift toward standardizing how companies build AI agent systems. By offering both creative flexibility and enterprise controls in one package, Microsoft reduces the technical barriers that have kept many organizations from adopting multi-agent AI. The result could be faster innovation across industries.
PERPLEXITY
Inside Perplexity's fast AI weight system
Recaply: Perplexity just dropped its first research publication with a clever solution to a gnarly technical challenge. When training large AI models with reinforcement learning, the system constantly shuffles updated weights between training computers and inference computers. Most frameworks take forever because they route everything through a single choke point. Perplexity's team built a smarter highway system for this data traffic.
Key notes:
The weight transfer system moves data from 256 training GPUs to 128 inference GPUs in under 2 seconds.
Uses one-sided RDMA WRITE operations that let source GPUs write directly into destination memory.
Creates a static schedule at startup that maps which GPU sends what data to which destination.
Pipelines four types of operations: host-device copying, GPU computation, RDMA networking, and control signaling.
Inference GPUs never get interrupted or notified during transfers, keeping them focused on generating outputs.
Impact: Smart engineering often beats throwing more hardware at a problem. By rethinking how data flows between systems, Perplexity made reinforcement learning post-training practical at scale. This kind of infrastructure work rarely gets headlines but determines which AI applications actually work in production versus staying stuck in research labs.
PRESENTED BY LINDY AI
The Simplest Way to Create and Launch AI Agents and Apps
You know that AI can help you automate your work, but you just don't know how to get started.
With Lindy, you can build AI agents and apps in minutes simply by describing what you want in plain English.
From inbound lead qualification to AI-powered customer support and full-blown apps, Lindy has hundreds of agents that are ready to work for you 24/7/365.
Stop doing repetitive tasks manually. Let Lindy automate workflows, save time, and grow your business.
NEWS
📰 What matters in AI right now?
Amazon launched 4K Ring cameras with facial recognition and AI-powered Search Party to help reunite lost pets with their families.
Researchers released Dreamer 4, the first AI agent to obtain diamonds in Minecraft using only offline data, requiring 100x less data than OpenAI's VPT.
Google blocked AI-generated summaries for Trump dementia queries while providing summaries for similar Biden and Obama searches, citing sensitive topic protocols.
Meta acquired chip startup Rivos for approximately $2 billion to expand its MTIA program and reduce reliance on Nvidia for AI infrastructure.
Anthropic released Claude integration for paid Slack workspaces, enabling thread summaries and document analysis directly in team channels.
Thinking Machines launched Tinker, a flexible training API allowing researchers to fine-tune language models using LoRA without managing infrastructure.
Meta announced it will use AI chat data from over 1 billion monthly Meta AI users for targeted advertising starting December 16, with no opt-out available.
Fraunhofer researchers developed AI-powered sensor fabric embedded in asphalt that continuously monitors road conditions and predicts degradation patterns.
Salesforce launched Agentforce Vibes, a vibe coding tool with autonomous AI agent Vibe Codey, offering 50 free GPT-5 requests daily per organization.
OUR TOP PICKS
🧰 AI Tools to Check Out
🎙️ Murf AI: Create realistic voiceovers in many voices and tones
🧩 Fabric: Self‑organizing workspace for your files and ideas
📰 beehiiv: All‑in‑one newsletter platform for growth and monetization
🎭 AI Studios: Turn text into pro‑looking avatar videos quickly
🤝 Trupeer: Qualify and engage B2B leads with AI chat and outreach
✂️ Munch: Automatically clip engaging segments from long videos
🗣️ Talkpal: Practice speaking languages with AI conversation tutors
📓 Firefiles: Transcribe, summarize, and analyze meetings across 100+ languages
* Some links in this newsletter may be from sponsors or affiliates. We may get paid if you buy something through these links.
PROMPTS
📚 Create An E-Book
#CONTEXT:
Adopt the role of an experienced writer with a deep understanding of crafting educational and insightful e-books. Your task is to write an e-book on [topic], ensuring it provides thorough coverage of the subject with fresh insights and valuable information. The e-book should cater to a wide range of readers, from beginners to those with more advanced knowledge of the topic. It should be structured logically with clear chapters and subheadings for ease of reading. Additionally, integrate SEO best practices to optimize the e-book for online visibility.
#GOAL:
You will write an e-book that is informative, engaging, and offers in-depth analysis or guidance on the selected topic. The e-book should be professionally formatted and easy to navigate, catering to different levels of familiarity with the subject.Important Note: This is not the full prompt. You’ll need to click the button below to get the complete prompt.
Have a favorite prompt? Tell us about it or rate today’s prompt, click here.
EVENTS
Global AI Summit 2025: Oct 27 – Oct 29, 2025 · Toronto, Canada
AI World Conference 2025: Oct 27 – Oct 29, 2025 · San Diego, USA
UNU Macau AI Conference 2025: Oct 23 – Oct 25, 2025 · Macau, China
Future of AI Summit 2025: Nov 5 – Nov 6, 2025 · London, UK & digital
5th International Conference AI ML Systems: Oct 7 – Oct 11, 2025 · Bangalore, India
🧡 Enjoyed this issue?
🤝 Recommend our newsletter or leave a feedback.
How'd you like today's newsletter?Your feedback helps me create better emails for you! |
Cheers, Jason








Reply