• The AI Pulse
  • Posts
  • 🤖 Tech Giants Use YouTube Videos Without Consent?!

🤖 Tech Giants Use YouTube Videos Without Consent?!

PLUS: OpenAI Founding Member Creates AI-Native School, Microsoft’s New SpreadsheetLLM

Welcome back AI enthusiasts!

In today’s AI Report:

  • 😳Tech Giants Use YouTube Videos Without Consent?!

  • 🎓OpenAI Founding Member Creates AI-Native School

  • 📊Microsoft’s New SpreadsheetLLM

  • 🛠5 Trending Tools

  • 💰Venture Capital Updates

  • 💼Who’s Hiring?

Read Time: 3 minutes

🗞RECENT NEWS

TECH GIANTS

😳Tech Giants Use YouTube Videos Without Consent?!

Image Source: Michael Tran/AFP/Getty Images/CNBC Screenshot

Tech Giants like Apple, Anthropic, Nvidia, and Salesforce used thousands of YouTube videos to train AI models without the creator’s consent.

Key Details:
  • An investigation by Proof News found that Tech Giants used subtitles from 173,536 YouTube videos across 48,000 channels to curate datasets for AI model training.

  • Tech Giants purchased scraped content from these YouTube megastars:

    1. Marques Brownlee: 19.1 Million Subscribers, Seven Videos Taken

    2. Jacksepticeye: 30.7 Million Subscribers, 377 Videos Taken

    3. PewDiePie: 111 Million Subscribers, 337 Videos Taken

    4. MrBeast: 303 Million Subscribers, Two Videos Taken

  • Tech Giants purchased the scraped content from EleutherAI, a company that claims to be a “non-profit AI research lab” that helps developers and researchers train AI models.

  • EleutherAI curated and organized the scraped content into datasets called “Pile.”

  • “Pile’s” datasets are accessible and open to anyone on the internet with enough space and computing power to access them.

Why It’s Important:
  • Tech Giants technically avoid “fault” because they’re not directly scraping the content to curate datasets. EleutherAI would be at “fault” for breaking YouTube’s Terms of Service.

  • Apple’s OpenELM, an efficient open-source language model family designed to run locally on iPhones, MacBooks, and iMacs, used “Pile” datasets for training.

🚨Search what YouTube videos were used to train AI models here.

🩺 PULSE CHECK

Should Tech Giants compensate YouTube’s content creators for helping them train AI models?

Vote Below to View Live Results

Login or Subscribe to participate in polls.

ANDREJ KARPATHY

🎓OpenAI Founding Member Creates AI-Native School

Image Source: Canva AI Image Generator

Andrej Karpathy, former Senior Director of AI at Tesla and founding OpenAI member, created an AI-native school with course materials supported and scaled by an AI Teaching Assistant.

Key Details:
  • Karpathy’s new venture, Eureka Labs, is an AI-native school that aims to provide a “Teacher + AI Symbiosis” environment where human expert written course materials are supported and scaled with an AI Teaching Assistant.

  • The first class will be the “World’s Obviously Best AI Course,” or LLM101n. The course will help students train their own AI model.

  • Karpathy wants Eureka Labs to be “a proper, self-sustaining business,” but he also doesn’t “want to gatekeep educational content.”

Why It’s Important:
  • Karpathy believes the ideal experience for learning anything new is under the guidance of subject matter experts who are “deeply passionate, great at teaching, infinitely patient, and fluent in all of the world’s languages.”

  • An AI Teaching Assistant can handle repetitive tasks like grading, explaining solutions, and offering personalized problems, freeing human experts to focus on more complex aspects of education like critical thinking.

AI RESEARCH

📊Microsoft’s New SpreadsheetLLM

Microsoft researchers introduced SpreadsheetLLM and SheetCompressor, new AI frameworks for encoding spreadsheets for Large Language Models (LLMS).

In other words, SpreadsheetLLM and SheetCompressor are experimental LLMs tailored to process, analyze, and execute information on spreadsheets like Microsoft Excel and Google Sheets.

Spreadsheets are characterized by their extensive two-dimensional grids, flexible layouts, and varied formatting options, which pose significant challenges for LLMs.

SpreadsheetLLM translates data in spreadsheets into a format that AI models can understand, allowing LLMs to analyze the data and help users write formulas or generate reports.

SheetCompressor identifies the most essential unique data points to create a more compact spreadsheet version.

SheetCompressor enables SpreadsheetLLM to work with much larger spreadsheets while requiring less computational power, making it more efficient.

🛠TRENDING TOOLS

🖍Crayon tracks competitors to help you win deals.

🕸Elementor hosts, builds, and grows your dream website.

PollThePeople effortlessly conducts consumer research.

📈ExplodingTopics surfaces rapidly growing topics before they take off.

🏁SequensAI generates content at scale to boost your lead pipeline effectively.

🔮Browse our always Up-To-Date AI Tools Database.

💰VENTURE CAPITAL UPDATES

  • Exa secures a $17M Series A to build Google for AIs.

  • Huma raises an $80M Series D to turn text into healthcare apps with GenAI.

  • Acree AI lands a $24M Series A to scale Small Language Models (SLMs) for enterprise-specific domains.

💼WHO’S HIRING?

  • GuidewireRX (Remote): Machine Learning (ML) Intern, Fall 2024

  • Netflix (Los Angeles, CA): Machine Learning (ML) Intern, Research, Fall 2024

  • Jane Street (New York, NY): Software Engineer Intern, Summer 2025

  • Akuna Capital (Chicago, IL): Software Engineer Intern, Python, Summer 2025

  • NVIDIA (Santa Clara, CA): Developer Technology Engineer, AI, New Grad

🤖PROMPT OF THE DAY

PRICING

🛒Pricing Strategies

Imagine you’re launching [Product/Service] for [Business] in [Industry]. Outline three potential pricing frameworks you could adopt, considering market competitiveness, perceived value, and long-term sustainability.

How would you balance initial user acquisition with maximizing revenue?

Product/Service = [Insert Here]

Business = [Insert Here]

Industry = [Insert Here]

📒FINAL NOTE

If you found this useful, follow us on Twitter or provide honest feedback below. It helps us improve our content.

How was today’s newsletter?

❤️TAIP Review of the Day

“I look forward to reading The AI Pulse every day. I find it informative, educational, and bite-sized. Kudos to the team!”

-Audrey (⭐️⭐️⭐️⭐️⭐️Nailed it!)
REFER & EARN

🎉Your Friends Learn, You Earn!

You currently have 0 referrals, only 1 away from receiving ⚙️Ultimate Prompt Engineering Guide.

Refer 5 friends to enter 🎰July’s $200 Gift Card Giveaway.

Reply

or to participate.