• The AI Pulse
  • Posts
  • šŸ¤– OpenAIā€™s Three New Audio Models to Build Voice Agents

šŸ¤– OpenAIā€™s Three New Audio Models to Build Voice Agents

PLUS: New Metric Discovers Mooreā€™s Law for AI Agents

Welcome back AI enthusiasts!

In todayā€™s Daily Report:

  • āš™ļøOpenAIā€™s Three New Audio Models to Build Voice Agents

  • šŸ“ˆNew Metric Discovers Mooreā€™s Law for AI Agents

  • šŸ› Trending Tools

  • šŸ„ŖBrief Bites

  • šŸ’°Funding Frontlines

  • šŸ’¼Whoā€™s Hiring?

Read Time: 3 minutes

šŸ—žRECENT NEWS

OPENAI

āš™ļøOpenAIā€™s Three New Audio Models to Build Voice Agents

Image Source: OpenAI Developers (i.e., @OpenAIDevs on X)/ā€œThree new state-of-the-art audio models in the APIā€/Screenshot

OpenAI introduced a new suite of three Audio Models for developers worldwide to build, develop, and deploy Voice Agents.

Key Details:
  1. gpt-4o-transcribe: A speech-to-text AI model powered GPT-4o (ā€œoā€ for ā€œomniā€) to transcribe audio.

  2. gpt-4o-mini-transcribe: A faster, more efficient, and lighter-weight speech-to-text AI model that captures nuances in speech like pauses or fillers.

  3. gpt-4o-mini-tts: A text-to-speech AI model powered by GPT-4o mini (ā€œoā€ for ā€œomniā€) that converts text to natural-sounding spoken audio. It also adapts to requests of speaking styles in prompts like: ā€œUse a pirate voice!ā€

Why Itā€™s Important:
  • OpenAI also launched OpenAI.fm, an interactive demo that enables developers to try gpt-4o-mini-tts for free.

  • This new suite of three Audio Models integrates with Agents SDK, which allows developers to build Multi-Agent Systems (MAS) where multiple voice-enabled AI Agents work collectively to perform complex tasks like generating natural-sounding spoken audio.

AI RESEARCH

šŸ“ˆNew Metric Discovers Mooreā€™s Law for AI Agents

Image Source: Model Evaluation & Threat Research (METR)/METR Founder, CEO Beth Barnes/ā€œMeasuring AI Ability to Complete Long Tasksā€/Screenshot

Model Evaluation & Threat Research (METR) just proposed ā€œ50%-task-completion time horizon,ā€ a new metric for measuring AI Performance based on the length of tasks AI Agents can complete.

Key Details:
  • Despite the rapid progress in AI Benchmarks, their real-world meaning remains unclear. Theyā€™re designed to measure domain-specific skills by relying on curated datasets and controlled environments, which donā€™t reflect the chaos and complexity of real-world tasks.

  • This new metric compares the length of time it takes a skilled human to complete a real-world task with the length of time it takes an AI Agent to complete the same real-world task with 50% accuracy.

  • For example, Anthropicā€™s Claude 3.7 Sonnet has a ā€œ50%-task-completion time horizonā€ of 59 minutes. This statement means it can successfully complete a real-world task with 50% accuracy that takes a skilled human nearly an hour to complete.

Why Itā€™s Important:
  • This new metric has been ā€œexponentially increasing over the past 6 years, with a doubling time of around 7 months.ā€ This trend suggests that, in under 5 years, AI Agents will be able to autonomously complete real-world tasks that take skilled humans weeks to complete.

  • AI Experts are referring to this trend as Mooreā€™s Law for AI Performance. In 1965, Gordon E. Moore, the Co-Founder of Intel, made a bold observation that the number of transistors on a microchip doubles roughly every two years.

šŸ©ŗ PULSE CHECK

Will AI Agents autonomously replace Data Analyst within the next 10 years?

Vote Below to View Live Results

Login or Subscribe to participate in polls.

šŸ› TRENDING TOOLS

šŸ•øļøReworkd effortlessly extracts web data at scale.

šŸ’¬PromptimizeAI makes you an expert prompt engineer.

šŸ›’20paths makes interactive product demos that convert.

šŸ¤Fellow gives support before, during, and after every meeting.

āœ‚ļøWuri turns your ideas into stunning videos with AI-powered editing.

šŸ”®Browse our always Up-To-Date AI Tools Database.

šŸ„ŖBRIEF BITES

Anthropic announced that you can now use ā€œClaudeā€ to search the internet for up-to-date information relevant to your prompts.

Apple recently shuffled around AI Executive Ranks in an effort to get back on track with integrating Apple Intelligence into Siri.

OpenAI released ā€œo1-pro,ā€ charging developers a whopping $150 per Million Input Tokens and $600 per Million Output Tokens.

Perplexity AI CEO Aravind Srinivas unveiled that heā€™s upgrading ā€œDeep Researchā€ to think longer, use code execution, and render in-line charts.

šŸ’°FUNDING FRONTLINES

  • BuildOps secures a $127M Series C to build Mission Control for Contractors.

  • Halliday lands a $20M Series A to build AI Agents that operate safely on Blockchain.

  • Browser Use raises a $17M Seed Round to make navigating websites easier for AI Agents.

šŸ’¼WHOā€™S HIRING?

  • SoundCloud (New York, NY): Marketing Analytics Intern, Summer 2025

  • Meta (Los Angeles, CA): Data Scientist, Product Analytics, Entry-Level

  • Coinbase (San Francisco, CA): Technology Risk Analyst, Mid-Level

  • Red Hat (Boston, MA): Senior MLOps Engineer, AI Inference, Senior-Level

šŸ“’FINAL NOTE

FEEDBACK

How would you rate todayā€™s email?

It helps us improve the content for you!

Login or Subscribe to participate in polls.

ā¤ļøTAIP Review of The Day

ā€œI legit look forward to reading this every morning!ā€

-Enya (1ļøāƒ£ šŸ‘Nailed it!)
REFER & EARN

šŸŽ‰Your Friends Learn, You Earn!

You currently have 0 referrals, only 1 away from receiving āš™ļøUltimate Prompt Engineering Guide.

Reply

or to participate.