- The AI Pulse
- Posts
- š¤ Appleās New MM1 AI Model
š¤ Appleās New MM1 AI Model
PLUS: Elon Muskās xAI Open-Sources Grok-1, VLOGGER Method Synthesizes Humans From Audio

Welcome back AI enthusiasts!
In todayās AI Report:
šAppleās New MM1 AI Model
šElon Muskās xAI Open-Sources Grok-1
šVLOGGER Method Synthesizes Humans From Audio
š 5 Trending Tools
š°Venture Capital Updates
š¼Whoās Hiring?
Read Time: 3 minutes
šRECENT NEWS
APPLE
šAppleās New MM1 AI Model

Image Source: Catalin Abagiu/ACC Marketing P
Apple researchers published a report showcasing MM1, a family of multimodal models that combine visual and language inputs to enable advanced capabilities.
Key Details:
The MM1 model comprises dense models and Mixture-of-Experts (MOE) variants: several smaller models that are mini-experts on a specific task.
The MOE variants tackle their part of a userās prompt and combine the results to offer a refined and complete output.
The MM1 model was trained on a diverse dataset of image-caption pairs, interleaved image-text documents, and text-only data to understand and generate language based on a mix of visual and linguistic cues.
Apple claims the MM1 model sets a new standard in āAIās ability to perform tasks such as image captioning, visual question answering, and natural language inference.ā
Why Itās Important:
Apple researchers highlighted the MM1 modelās exceptional in-context learning abilities with its 30 billion parameter configuration. This version exhibits remarkable capabilities for multi-step reasoning over multiple images using āchain-of-thoughtā prompting.
āChain-of-thoughtā prompting is a technique that enables the MM1 model to perform complex, open-ended problem-solving based on minimal examples.
šØAccess the full report here.
ELON MUSK
šElon Muskās xAI Open-Sources Grok-1

Image Source: Marla Aufmuth/TED
Elon Muskās xAI open-sourced the base code of Grok-1, allowing users to analyze the weights and architecture of the massive 314 billion parameter language model.
Key Details:
xAIās Grok-1 is a conversational chatbot with a ārebellious streakā and āwittyā personality that leverages real-time data from X to answer user queries.
Grok-1 is a 314 billion parameter Mixture of Experts (MOE) model with 25% of the weights active on a given token (i.e., a unit of data processed by algorithms to generate text outputs).
The released model is the raw, pre-trained checkpoint from October 2023 and isnāt fine-tuned for any specific task.
xAI published Grok-1 on GitHub and Hugging Face, allowing developers to access the language model scripts and provide solutions to improve them.
Perplexity CEO Aravind Srinivas posted on X that he plans to fine-tune Grok-1 for conversational search and optimize the inference.
Why Itās Important:
Many companies have open-sourced several AI models, including Metaās Llama, Mistralās AI Large, and Googleās Gemma 7B.
By open-sourcing xAIās Grok-1, Musk is putting public pressure on OpenAI to revert to permitting developers to access the companyās language models.
Muskās decision to open-source xAIās Grok-1 follows his lawsuit accusing OpenAI of transforming into a āclose-sourced de facto subsidiaryā for Microsoft.
š©ŗ PULSE CHECK
Are you on Team Elon Musk or Team OpenAI?Vote Below to View Live Results |
AI RESEARCH
šVLOGGER Method Synthesizes Humans From Audio
Google researchers developed VLOGGER, an AI model that generates photorealistic talking avatars from a single image and audio clip.
VLOGGERās human-to-3D-motion diffusion model generates controllable avatars that capture facial expressions and emotions.
The model was trained on a large multimedia dataset containing 800,000 speeches with filtered categories for each body part and facial emotion.
Promising use cases include dubbing videos in other languages or creating avatars for video games with unique personalities, dialogue, and relationships, making the virtual world feel more lifelike.
š TRENDING TOOLS
šŖµWudpecker crafts tailored notes for your Zoom, Google Meet, and Microsoft Teams meetings.
š°Tailor organizes chaotic feeds into daily summaries of what matters most to you.
švetted is a shopping agent that identifies the best quality products for your budget.
š¦ŗGalileo builds and evaluates GenAI apps faster.
ā³Userdoc scopes your software projects in minutes with defined user types, features, and roadmaps.
š®Browse our always Up-To-Date AI Tools Database.
š°VENTURE CAPITAL UPDATES
š¼WHOāS HIRING?
Live Nation (Remote): Data Engineer Intern, Summer 2024
PostEra (Remote): Machine Learning Intern, Summer 2024
Tesla (Palo Alto, CA): Software Engineer, Energy Engineering Intern, Fall 2024
Simbe (San Francisco, CA): Robotics Software Engineer Intern, Fall 2024
Mistral AI (San Francisco, CA): GPU Programming Expert
š¤PROMPT OF THE DAY
BUSINESS AUDIT
š§½Refine Operations
Act as a business consultant and conduct a comprehensive audit of my current [Business Model (e.g., Business Description, Value Proposition, Channels, etc.)].
Business Model = [Insert Here]
šFINAL NOTE
If you found this useful, follow us on Twitter or provide honest feedback below. It helps us improve our content.
How was todayās newsletter?
ā¤ļøAI Pulse Review of The Day
āNice balance between the good, the bad, and the ugly.ā
šNOTION TEMPLATES
šØSubscribe to our newsletter for free and receive these powerful Notion templates:
āļø150 ChatGPT prompts for Copywriting
āļø325 ChatGPT prompts for Email Marketing
šSimple Project Management Board
ā±Time Tracker
Reply