The AI Pulse
OpenAI’s New Cloud-Based Software Engineering Agent
PLUS: LLMs Get Lost In Multi-Turn Conversations

Welcome back, AI enthusiasts!
In today’s Daily Report:
OpenAI’s New Cloud-Based Software Engineering Agent
LLMs Get Lost In Multi-Turn Conversations
Trending Tools
Brief Bites
Funding Frontlines
Who’s Hiring?
Read Time: 3 minutes
RECENT NEWS
OPENAI
OpenAI’s New Cloud-Based Software Engineering Agent
OpenAI recently introduced “Codex,” a cloud-based Software Engineering Agent that autonomously analyzes, writes, and implements code.
Key Details:
It’s powered by codex-1, a tailored version of OpenAI o3 that’s optimized for complex software engineering workflows.
OpenAI o3 is a reasoning model designed to handle complex, multi-step problems. It uses Chain-of-Thought (CoT) techniques to break such a problem into manageable sub-problems, solves each one, and then combines the results into a complete solution.
Pro, Team, and Enterprise users can access “Codex” today through the sidebar within ChatGPT. To assign “Codex” a new coding task, you provide an input and click “Code.” To ask “Codex” a question about your codebase instead, you click “Ask.”
Why It’s Important:
What makes “Codex” special is that it introduces parallelism: the ability to manage multiple parts of a complex software engineering workflow at the same time.
Just look at your web browser right now. How many tabs do you have open? Now, what’s happening in all those tabs? Each browser tab is static. You can’t have one tab open for writing code and another tab open for writing a Slack message at the same time.
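To make the idea of working on several tasks at once more concrete, here is a minimal Python sketch. It is not how “Codex” works internally; it only illustrates starting multiple independent jobs together and collecting all the results. The task names, the `run_task` helper, and the sleep durations are hypothetical stand-ins for real work.

```python
import asyncio


async def run_task(name, seconds):
    # Hypothetical stand-in for a coding task; sleeping simulates the work.
    await asyncio.sleep(seconds)
    return f"{name}: done"


async def run_all(tasks):
    # Start every task together; gather returns results in input order.
    return await asyncio.gather(*(run_task(name, secs) for name, secs in tasks))


tasks = [
    ("fix failing test", 0.02),
    ("write docs", 0.01),
    ("refactor module", 0.03),
]
results = asyncio.run(run_all(tasks))
print(results)
```

Because the three tasks overlap, the total wait is roughly the longest single task rather than the sum of all three.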
AI RESEARCH
LLMs Get Lost In Multi-Turn Conversations
Microsoft and Salesforce found that even the most capable LLMs significantly underperform during multi-turn conversations, often getting “lost” in all the dialogue.
Key Details:
Multi-turn conversations occur when user instructions are given in multiple stages rather than all at once. In other words, it’s a back-and-forth dialogue that unfolds over several inputs, or “turns.”
They examined OpenAI’s GPT-4.1, Anthropic’s Claude 3.7 Sonnet, and Google’s Gemini 2.5 Pro across six conversational tasks, analyzing over 200,000 simulated back-and-forth dialogues.
They discovered that each LLM’s performance dropped by an average of 39% across all six conversational tasks when inputs were split over multiple turns.
In contrast, the same LLMs achieved a 90% success rate across all six conversational tasks in single-turn conversations, where user instructions are given as a single, complete input or “prompt.”
Why It’s Important:
Understanding that even the most capable LLMs struggle significantly with multi-turn conversations highlights the need for clear, concise, and cohesive inputs.
By providing a single, well-structured “prompt,” you reduce the chance that the LLM you’re using gets “lost” in all the dialogue. So, aim to condense your “prompts” to maximize the effectiveness of the outputs you receive.
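One way to put this advice into practice is to gather the instructions you would otherwise send over several turns and merge them into a single prompt before sending anything. The Python sketch below is only an illustration under that assumption: the `consolidate_turns` helper and the “Task” / “Requirements” layout are hypothetical choices, not a format from the study or from any LLM provider.

```python
def consolidate_turns(turns):
    """Merge instructions given over several turns into one prompt string."""
    # Drop empty turns and surrounding whitespace.
    cleaned = [t.strip() for t in turns if t.strip()]
    if not cleaned:
        raise ValueError("no instructions to consolidate")

    # Treat the first turn as the main task and the rest as requirements.
    task, *details = cleaned
    lines = [f"Task: {task}"]
    if details:
        lines.append("Requirements:")
        lines.extend(f"{i}. {d}" for i, d in enumerate(details, start=1))
    return "\n".join(lines)


turns = [
    "Write a function that sorts a list of users.",
    "Sort by last name.",
    "  ",
    "Break ties by first name.",
]
prompt = consolidate_turns(turns)
print(prompt)
```

The resulting string can be sent as one complete input, so the model sees every requirement at once instead of piecing them together across turns.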
TRENDING TOOLS
Scottie builds any AI Agent in 5 minutes.
syft. creates news impossibly tailored to you.
Bugster is a Software Testing Agent for busy developers.
URL to Any converts URLs into shortened links or QR codes for free.
matterai.dev generates code without bugs, latencies, or vulnerabilities.
Browse our Always Up-To-Date AI Toolkit.
BRIEF BITES
Y Combinator Startup Firecrawl has set aside a $1 million budget to hire three AI Agents as employees.
NVIDIA CEO Jensen Huang explained that if he were a student today, the first thing he’d do is “learn how to interact with AI.”
Tech Billionaire Elon Musk’s AI chatbot Grok said it was “skeptical” about the Holocaust death toll, then blamed a “programming error.”
Poe just examined Spring 2025 AI Model Usage Trends, revealing major shifts in user preference across AI Models for text, image, audio, video, code, and reasoning use cases.
FUNDING FRONTLINES
Somite raises a Series A of over $47M to revolutionize cell therapy with AI.
Moonvalley lands a $53M Series B to craft high-definition generative videos with AI.
Granola secures a $43M Series B for an AI-based notepad that autonomously takes notes on your behalf.
WHO’S HIRING?
Haize Labs (New York City, NY): Software Engineer Intern, Fall 2025
NVIDIA (Santa Clara, CA): Firmware Engineer, Entry-Level
Recidiviz (New York City, NY): Policy Data Analyst, Mid-Level
Postman (San Francisco, CA): Senior Data Analyst, Senior-Level