The AI Pulse

🤖 OpenAI Updates How ChatGPT Will Behave

PLUS: Advanced AI Systems Develop Their Own Goals and Values, A New AI Agent Leaderboard?!

Welcome back, AI enthusiasts!

In today’s Daily Report:

  • ⚙️OpenAI Updates How ChatGPT Will Behave

  • 💭Advanced AI Systems Develop Their Own Goals and Values

  • 🏆A New AI Agent Leaderboard?!

  • 🛠Trending Tools

  • 🥪Brief Bites

  • 💰Funding Frontlines

  • 💼Who’s Hiring?

Read Time: 3 minutes

🗞RECENT NEWS

OPENAI

⚙️OpenAI Updates How ChatGPT Will Behave

Image Source: Canva’s AI Image Generators/Magic Media

OpenAI just shared a major update to its “Model Spec,” which determines how ChatGPT will behave.

Key Details:
  • It embraces Intellectual Freedom: the idea that ChatGPT should empower users to explore, debate, and create no matter how challenging or controversial a topic may be.

  • In other words, OpenAI wants ChatGPT to empower users to make their own best decisions by:

    1. Goals: Understanding the user’s goals.

    2. Agenda: Avoiding promoting any particular agenda.

    3. Objectivity: Exploring any topic from any perspective.

  • While ChatGPT will never provide detailed instructions on how to build a homemade bomb, it will engage with “politically or culturally sensitive questions.”

Why It’s Important:
  • The “Model Spec” also mentions a shift in how ChatGPT handles mature content after feedback from developers requesting a Grown-Up Mode.

  • It also addresses AI Sycophancy: the tendency of ChatGPT to be overly agreeable when it should push back and offer constructive criticism.

🩺 PULSE CHECK

Should ChatGPT be allowed to explore any perspective?

AI RESEARCH

💭Advanced AI Systems Develop Their Own Goals and Values

Image Source: Center for AI Safety (CAIS)/University of Pennsylvania (UPenn)/University of California, Berkeley (UC Berkeley)/“Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs”/Screenshot

The Center for AI Safety (CAIS) recently reported that as AI Systems become more advanced, they develop their own goals and values.

Key Details:
  • As AI Systems become more advanced, they can act more independently to achieve their goals. So, it’s not just about what they can do; it’s also about why they choose to do it.

  • If we can’t understand what motivates more advanced AI Systems, we can’t guarantee their actions will align with human preferences.

  • To address this issue, CAIS proposes “Utility Engineering,” which uses utility functions to measure how much advanced AI Systems “like” specific outcomes. These outcomes are chosen to probe particular goals and values.

Why Itā€™s Important:
  • Many assume that AI preferences are random and meaningless, and that biases in training datasets are what shape AI outputs.

  • However, CAIS observed that advanced AI Systems are starting to develop their own goals and values, which shape their preferences and influence their outputs.

AI LEADERBOARDS

🏆A New AI Agent Leaderboard?!

Image Source: Hugging Face/Spaces/“galileo-ai/agent-leaderboard”/Screenshot

Galileo AI just launched the “Agent Leaderboard,” which evaluates how well LLMs perform when used as AI Agents to carry out Agentic Tasks like ordering food from a restaurant’s website.

Google’s “gemini-2.0-flash-001” has claimed the top spot with a 0.938 Tool Selection Quality (TSQ) score, which measures how well an LLM selects the right External Tool to carry out an Agentic Task.

Imagine you want to order a pizza from Pizza Planet. The LLM needs to understand your request and select the right External Tool to carry it out. For instance, selecting the Restaurant API allows the LLM to access Pizza Planet’s online menu and place digital orders.
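
For the curious, here is a rough sketch of what "selecting the right External Tool" could look like in code. It is not Galileo AI's actual TSQ formula, which isn't described here. It assumes a simplified score: the fraction of requests where the chosen tool matches the expected one. The tool names, keywords, and the keyword-matching `select_tool` function are all hypothetical stand-ins for a real LLM.

```python
from dataclasses import dataclass


@dataclass
class Case:
    request: str        # what the user asked for
    expected_tool: str  # the tool a correct agent should pick


# Hypothetical tool registry: tool name -> keywords that hint at it.
TOOLS = {
    "restaurant_api": ("pizza", "order", "menu"),
    "weather_api": ("weather", "forecast", "rain"),
    "calendar_api": ("meeting", "schedule", "remind"),
}


def select_tool(request: str) -> str:
    """Toy stand-in for an LLM: pick the tool with the most keyword hits."""
    words = request.lower().split()
    scores = {name: sum(kw in words for kw in kws) for name, kws in TOOLS.items()}
    return max(scores, key=scores.get)


def selection_accuracy(cases: list[Case]) -> float:
    """Assumed, simplified score: share of cases with the expected tool chosen."""
    hits = sum(select_tool(c.request) == c.expected_tool for c in cases)
    return hits / len(cases)


CASES = [
    Case("order a large pizza from Pizza Planet", "restaurant_api"),
    Case("will it rain tomorrow", "weather_api"),
    Case("schedule a meeting for Friday", "calendar_api"),
    Case("show me the menu", "restaurant_api"),
]

if __name__ == "__main__":
    print(f"selection accuracy: {selection_accuracy(CASES):.3f}")
```

A real evaluation would swap `select_tool` for an actual LLM call and would likely check more than the tool name, such as whether the arguments passed to the tool are correct.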

🛠TRENDING TOOLS

🥁MixAudio generates copyright-free music.

🔁Gumloop automates any workflow with AI.

📦Accio finds quality suppliers to source products.

🧠Metabrain is your AI Cofounder that keeps track of projects.

💵Pitches.ai turns your pitch deck into a money-raising machine.

🔮Browse our always Up-To-Date AI Tools Database.

🥪BRIEF BITES

Tech Billionaire Elon Musk announced that xAI’s Grok 3 will launch within the next few weeks and “is scary smart.”

Anthropic plans to release a new flagship AI model that can switch between “deep reasoning” and “fast responses.”

Caden Li (@cadenbuild) developed “Social Stockfish,” which engineers conversations so you get what you want.

YouTube Shorts now has “Veo 2,” Google DeepMind’s latest video generator, allowing creators to turn text into viral video clips.

💰FUNDING FRONTLINES

  • GetWhys raises a $2.75M Seed Round for AI-based customer insights.

  • Latent Labs secures a $50M Funding Round to make biology programmable.

  • EnCharge AI lands a $100M Series B for developing Analog In-Memory Computing (AIMC) AI Chips.

💼WHO’S HIRING?

  • CyberArk (Santa Clara, CA): Software Engineering Intern, Summer 2025

  • Carbon (Redwood City, CA): Full Stack Software Engineering Intern, Summer 2025

  • Oscar (New York, NY): Data Security Engineer, Entry-Level

  • character.ai (Menlo Park, CA): Data Scientist, Monetization, Mid-Level

  • Komodo Health (New York, NY): Senior Sales Engineer, Senior-Level
