🤖 Chinese AI Lab Shocks Silicon Valley

PLUS: Scale AI’s New “HLE” Benchmark, Meta CEO Mark Zuckerberg’s $65 Billion AI Infrastructure Plan

Welcome back AI enthusiasts!

In today’s Daily Report:

  • 🇨🇳Chinese AI Lab Shocks Silicon Valley

  • 📊Scale AI’s New “HLE” Benchmark

  • 👷Meta CEO Mark Zuckerberg’s $65 Billion AI Infrastructure Plan

  • 🛠Trending Tools

  • 💰Funding Frontlines

  • 💼Who’s Hiring?

Read Time: 3 minutes

🗞RECENT NEWS

AI TRENDS

🇨🇳Chinese AI Lab Shocks Silicon Valley

Image Source: Canva’s AI Image Generators/Magic Media

Chinese AI Lab DeepSeek recently launched “DeepSeek-R1,” a Reasoning Engine that outperforms “OpenAI o1” at a fraction of the cost.

Key Details:
  • DeepSeek-R1” mimics a human’s decision-making capabilities and problem-solving capacity by deploying a Chain-of-Thought (CoT) technique that breaks down complex tasks into manageable steps.

  • It was also trained using Reinforcement Learning (RL), which mimics the “trial-and-error” process humans use to learn, where decisions that lead to desired outcomes are reinforced.

  • CoT and RL enable “DeepSeek-R1” to recognize and correct mistakes when generating responses to prompts.

  • DeepSeek spent $5.58 million to train “DeepSeek-R1” compared to the hundreds of millions of dollars spent to train “OpenAI o1.”

  • DeepSeek-R1” achieved 97.3% accuracy on the Math Word Problem Solving (MATH-500) benchmark, which covers 12,500 algebra and geometry problems. In comparison, “OpenAI o1” achieved 96.4% accuracy.

Why It’s Important:
  • President Joe Biden introduced several export controls to prevent China from acquiring advanced AI chips to power AI projects.

  • This lack of access to computational resources (e.g., memory and processing power) led to technical breakthroughs. “Necessity is the mother of invention,” said Perplexity AI CEO Aravind Srinivas.

AI BENCHMARKS

📊Scale AI’s New “HLE” Benchmark

Image Source: Scale AI, Center for AI Safety (CAIS)/“Humanity’s Last Exam (HLE)”/Screenshot

Scale AI created a new benchmark called “Humanity’s Last Exam (HLE)” to measure how advanced AI models perform across dozens of subjects.

Key Details:
  • Benchmarks are essential for tracking the capabilities of LLMs. However, the capabilities of LLMs are outpacing the difficulty of benchmarks.

  • For context, LLMs now achieve over 90% accuracy on popular benchmarks like Massive Multitask Language Understanding (MMLU).

  • HLE” consists of 3,000 multiple-choice and short-answer questions across dozens of subjects, including mathematics, humanities, and the natural sciences.

Why It’s Important:
  • Advanced AI models like “DeepSeek-R1” achieved less than 10% accuracy on the “HLE” benchmark.

  • “We got math, physics, and biology professors to craft the hardest questions they could possibly imagine,” said Scale AI CEO Alexandr Wang.

META

👷Meta CEO Mark Zuckerberg’s $65 Billion AI Infrastructure Plan

Image Source: Facebook/Meta CEO Mark Zuckerberg/“This will be a defining year for AI.”/Screenshot/(4).jpg

In a Facebook post on Friday, Meta CEO Mark Zuckerberg said his company plans to invest up to $65 billion in 2025 to Acquire AI Talent and Build Data Centers that support over a Gigawatt (GW) of compute, or roughly the amount of power consumed by 750,000 homes. Zuckerberg also added that Meta AI is on pace to reach over 1 billion users this year.

This announcement comes as other AI firms pour billions of dollars into their AI infrastructure plans. For instance, OpenAI recently announced “The Stargate Project,” a new company set to spend $500 billion over the next four years building AI infrastructure in America. It’s supported by Oracle, SoftBank Group Corp., and MGX.

🩺 PULSE CHECK

Is Meta over-investing in AI infrastructure?

Vote Below to View Live Results

Login or Subscribe to participate in polls.

🛠TRENDING TOOLS

🧙‍♀️Spline turns images into 3D worlds.

💬UPDF AI enables you to chat with any PDF.

⚙️TRAE is an AI-powered coding buddy that fixes errors.

🍨Snoops.co turns Reddit discussion into business opportunities.

🕸️ChatPerk creates a tailored AI assistant that integrates into your website.

🔮Browse our always Up-To-Date AI Tools Database.

💰FUNDING FRONTLINES

  • Package.ai secures a $14M Series A to modernize retail operations with AI.

  • Vivident raises a $1.5M Seed Round to launch an AI-powered virtual character platform.

  • Maki lands a $28.6M Series A to deploy a conversational AI Agent that redefines talent acquisition.

💼WHO’S HIRING?

  • Array Labs (Palo Alto, CA): Software Engineering Intern, Summer 2025

  • Scale AI (New York, NY): Research Intern, Post-Training, Summer 2025

  • Amazon (Los Gatos, CA): SDE Intern, Embedded Systems, Summer 2025

  • NVIDIA (Santa Clara, CA): Marketing Documentation and Data Privacy Intern, Summer 2025

  • Rivian (Palo Alto, CA): Software Engineering Intern, Autonomy, Perception, Summer 2025

🤖PROMPT OF THE DAY

BUSINESS LAW

👨‍⚖️Legal Considerations for Startups

Identify the critical legal considerations for [Startup] with [Product or Service] in [Industry] with [Target Audience] and [Key Stakeholders]. Focus on choosing the right business structure (e.g., Partnerships or Corporations) and intellectual property protection (e.g., Patents or Trademarks).

Startup = [Insert Here]

Product or Service = [Insert Here]

Industry = [Insert Here]

Target Audience = [Insert Here]

Key Stakeholders = [Insert Here]

📒FINAL NOTE

FEEDBACK

How would you rate today’s email?

It helps us improve the content for you!

Login or Subscribe to participate in polls.

❤️TAIP Review of The Day

“It’s my first time reading, and I read it all. Great stuff!🥑”

-Ishaan (1️⃣ 👍Nailed it!)
REFER & EARN

🎉Your Friends Learn, You Earn!

You currently have 0 referrals, only 1 away from receiving ⚙️Ultimate Prompt Engineering Guide.

Refer 3 friends to learn how to 👷‍♀️Build Custom Versions of OpenAI’s ChatGPT.

Reply

or to participate.