
Welcome back, AI enthusiasts!
In today’s AI Report:
🍎Apple’s New MM1 AI Model
🔓Elon Musk’s xAI Open-Sources Grok-1
🔊VLOGGER Method Synthesizes Humans From Audio
🛠5 Trending Tools
💰Venture Capital Updates
💼Who’s Hiring?
Read Time: 3 minutes
🗞RECENT NEWS
APPLE
🍎Apple’s New MM1 AI Model

Image Source: Catalin Abagiu/ACC Marketing P
Apple researchers published a report showcasing MM1, a family of multimodal models that combine visual and language inputs to enable advanced capabilities.
Key Details:
The MM1 family comprises dense models and Mixture-of-Experts (MoE) variants, which contain several smaller expert sub-networks and activate only a few of them for each input.
In the MoE variants, a router sends each piece of a user’s prompt to the experts best suited to it, then combines their outputs into a single refined result.
The MM1 model was trained on a diverse dataset of image-caption pairs, interleaved image-text documents, and text-only data to understand and generate language based on a mix of visual and linguistic cues.
Apple claims the MM1 model sets a new standard in “AI’s ability to perform tasks such as image captioning, visual question answering, and natural language inference.”
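For readers who want to see the Mixture-of-Experts idea in miniature, here is a rough sketch of top-k routing in plain Python. It is not Apple's implementation: the expert count, the toy "experts" that just scale their input, the random router weights, and the names moe_forward and make_expert are all assumptions made for illustration.

```python
import math
import random

def softmax(scores):
    """Convert raw scores into probabilities that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def moe_forward(x, router_weights, experts, top_k=2):
    """Route input vector x to the top_k experts and blend their outputs."""
    # Router: one score per expert (dot product of x with that expert's row).
    scores = [sum(w * v for w, v in zip(row, x)) for row in router_weights]
    # Keep only the top_k highest-scoring experts.
    chosen = sorted(range(len(experts)), key=lambda i: scores[i], reverse=True)[:top_k]
    # Renormalize the chosen experts' scores into mixing weights.
    gates = softmax([scores[i] for i in chosen])
    # Weighted sum of the chosen experts' outputs; the others are never run.
    out = [0.0] * len(x)
    for gate, i in zip(gates, chosen):
        y = experts[i](x)
        out = [o + gate * yi for o, yi in zip(out, y)]
    return out, chosen

def make_expert(scale):
    """Hypothetical stand-in for an expert network: multiplies the input by a constant."""
    return lambda x: [scale * v for v in x]

random.seed(0)
dim, num_experts = 4, 8  # toy sizes, assumed for this sketch
experts = [make_expert(s) for s in range(1, num_experts + 1)]
router_weights = [[random.uniform(-1, 1) for _ in range(dim)] for _ in range(num_experts)]

output, chosen = moe_forward([1.0, 0.5, -0.5, 2.0], router_weights, experts)
print("experts used:", chosen)
print("blended output:", output)
```

Only two of the eight toy experts do any work for this input, which is the main appeal of the approach: total capacity grows with the number of experts while the cost per input stays close to that of a much smaller model.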
Why It’s Important:
Apple researchers highlighted the in-context learning abilities of MM1’s 30 billion parameter configuration, which can reason in multiple steps across several images using “chain-of-thought” prompting.
“Chain-of-thought” prompting asks a model to write out its intermediate reasoning before giving an answer. Given only a few worked examples in the prompt, MM1 can apply this to complex, open-ended problems.
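To make the idea concrete, here is a small, text-only sketch of how a few-shot chain-of-thought prompt can be assembled. The newsletter does not describe any API for MM1, so this only builds the prompt string. The build_cot_prompt helper, the "Q / Reasoning / A" layout, and the example questions are assumptions, and the images are described in words as placeholders.

```python
def build_cot_prompt(examples, question):
    """Assemble a few-shot prompt whose examples include their reasoning."""
    parts = []
    for ex in examples:
        parts.append(
            f"Q: {ex['question']}\n"
            f"Reasoning: {ex['reasoning']}\n"
            f"A: {ex['answer']}"
        )
    # End with the new question and an open "Reasoning:" cue so the model
    # writes out its steps before committing to an answer.
    parts.append(f"Q: {question}\nReasoning:")
    return "\n\n".join(parts)

# One worked example is enough to show the pattern (hypothetical content).
examples = [
    {
        "question": "Image 1 shows 2 apples. Image 2 shows 3 apples. How many apples in total?",
        "reasoning": "Image 1 has 2 apples. Image 2 has 3 apples. 2 + 3 = 5.",
        "answer": "5",
    }
]

prompt = build_cot_prompt(
    examples,
    "Image 1 shows 4 pens. Image 2 shows 6 pens. How many pens in total?",
)
print(prompt)
```

The worked example shows the model the format to imitate, and ending on "Reasoning:" nudges it to lay out its steps first.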
🚨Access the full report here.
ELON MUSK
🔓Elon Musk’s xAI Open-Sources Grok-1

Image Source: Marla Aufmuth/TED
Elon Musk’s xAI open-sourced Grok-1, releasing the base model’s weights and architecture so anyone can inspect the 314 billion parameter language model.
Key Details:
Grok, the conversational chatbot built on Grok-1, has a “rebellious streak” and “witty” personality and draws on real-time data from X to answer user queries.
Grok-1 is a 314 billion parameter Mixture-of-Experts (MoE) model with 25% of its weights active on a given token (i.e., a unit of text the model processes to generate its output).
The released model is the raw, pre-trained checkpoint from October 2023 and isn’t fine-tuned for any specific task.
xAI published Grok-1 on GitHub and Hugging Face, so developers can download the weights, inspect the code, and propose improvements.
Perplexity CEO Aravind Srinivas posted on X that he plans to fine-tune Grok-1 for conversational search and optimize the inference.
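As a quick back-of-the-envelope check on those numbers, the snippet below works out how many weights are active per token. The 314 billion and 25% figures come from the details above. The 2-bytes-per-weight storage estimate is an assumption (16-bit weights), not something xAI states here.

```python
TOTAL_PARAMS = 314e9       # total parameters, from the figures above
ACTIVE_FRACTION = 0.25     # share of weights active on a given token
BYTES_PER_PARAM = 2        # assumption: weights stored as 16-bit values

active_params = TOTAL_PARAMS * ACTIVE_FRACTION
weights_gb = TOTAL_PARAMS * BYTES_PER_PARAM / 1e9

print(f"Active per token: {active_params / 1e9:.1f}B parameters")  # 78.5B
print(f"All weights at 16-bit: {weights_gb:.0f} GB")               # 628 GB
```

So each token touches roughly 78.5 billion weights, but the full set still has to be stored somewhere, which is why running the model takes far more hardware than a typical desktop offers.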
Why It’s Important:
Several companies have already released open AI models, including Meta’s Llama, Mistral AI’s Mixtral 8x7B, and Google’s Gemma 7B.
By open-sourcing xAI’s Grok-1, Musk is putting public pressure on OpenAI to return to releasing its language models openly.
Musk’s decision to open-source xAI’s Grok-1 follows his lawsuit accusing OpenAI of transforming into a “closed-source de facto subsidiary” of Microsoft.
🩺 PULSE CHECK
Are you on Team Elon Musk or Team OpenAI?
AI RESEARCH
🔊VLOGGER Method Synthesizes Humans From Audio
Google researchers developed VLOGGER, an AI model that generates photorealistic talking avatars from a single image and audio clip.
VLOGGER’s human-to-3D-motion diffusion model generates controllable avatars that capture facial expressions and emotions.
The model was trained on MENTOR, a large video dataset covering 800,000 identities and annotated with 3D body pose and facial expressions.
Promising use cases include dubbing videos in other languages or creating avatars for video games with unique personalities, dialogue, and relationships, making the virtual world feel more lifelike.
🛠TRENDING TOOLS
🪵Wudpecker crafts tailored notes for your Zoom, Google Meet, and Microsoft Teams meetings.
📰Tailor organizes chaotic feeds into daily summaries of what matters most to you.
🛍Vetted is a shopping agent that identifies the best quality products for your budget.
🦺Galileo helps you build and evaluate GenAI apps faster.
⏳Userdoc scopes your software projects in minutes with defined user types, features, and roadmaps.
🔮Browse our always up-to-date AI Tools Database.
💰VENTURE CAPITAL UPDATES
💼WHO’S HIRING?
Live Nation (Remote): Data Engineer Intern, Summer 2024
PostEra (Remote): Machine Learning Intern, Summer 2024
Tesla (Palo Alto, CA): Software Engineer, Energy Engineering Intern, Fall 2024
Simbe (San Francisco, CA): Robotics Software Engineer Intern, Fall 2024
Mistral AI (San Francisco, CA): GPU Programming Expert
🤖PROMPT OF THE DAY
BUSINESS AUDIT
🧽Refine Operations
Act as a business consultant and conduct a comprehensive audit of my current [Business Model (e.g., Business Description, Value Proposition, Channels, etc.)].
Business Model = [Insert Here]
📒FINAL NOTE
If you found this useful, follow us on Twitter or provide honest feedback below. It helps us improve our content.
How was today’s newsletter?
❤️AI Pulse Review of The Day
“Nice balance between the good, the bad, and the ugly.”
🎁NOTION TEMPLATES
🚨Subscribe to our newsletter for free and receive these powerful Notion templates:
⚙️150 ChatGPT prompts for Copywriting
⚙️325 ChatGPT prompts for Email Marketing
📆Simple Project Management Board
⏱Time Tracker
