- The AI Pulse
- Posts
- š¤ Appleās ReALM Model Outperforms OpenAIās GPT-4
š¤ Appleās ReALM Model Outperforms OpenAIās GPT-4
PLUS: DALL-E 3 Gets Mind-Blowing Editing Powers, AI-Powered Robot Reads Braille at Record Pace
Welcome back AI enthusiasts!
In todayās AI Report:
šAppleās ReALM Model Outperforms OpenAIās GPT-4
šøDALL-E 3 Gets Mind-Blowing Editing Powers
šØAI-Powered Robot Reads Braille at Record Pace
š 5 Trending Tools
š°Venture Capital Updates
š¼Whoās Hiring?
Read Time: 3 minutes
šRECENT NEWS
APPLE
šAppleās ReALM Model Outperforms OpenAIās GPT-4
Image Source: Catalin Abagiu/ACC Marketing P
Apple researchers introduced ReALM, an AI model that enables Siri to monitor on-screen tasks, conversational context, and background processes.
Key Details:
Reference Resolution as Language Modeling (ReALM) offers a new approach to converting on-device screen information into text inputs, allowing ReALM to bypass bulky image recognition parameters when processing user requests.
In other words, the AI model considers whatās on the userās screen and what tasks are active.
Apple researchers outlined four sizes of ReALM:
ReALM-80M
ReALM-250M
ReALM-1B
ReALM-3B
The āMā and āBā represent the number of parameters in millions and billions, respectively. In comparison, OpenAIās GPT-3.5 has 175 billion parameters, while GPT-4 boasts 1.5 trillion parameters.
Appleās larger ReALM models outperform OpenAIās GPT-4, despite having fewer parameters.
Why Itās Important:
Digital assistants like Siri struggle to understand references in conversations. For example, if an iPhone user states, āCall the restaurant I visited last week,ā Siri struggles to recall the restaurant.
Appleās ReALM tackles this issue by converting all contextual information into text, including open apps (e.g., iMessage or Apple Maps). So, Siri can understand references in requests.
š©ŗ PULSE CHECK
Whatās a cool AI-enabled Siri feature youād like to see?Vote Below to View Live Results |
OPENAI
šøDALL-E 3 Gets Mind-Blowing Editing Powers
Image Source: Canva AI Image Generator
OpenAI unveiled a new feature that allows users to edit images generated by DALL-E 3 directly within ChatGPT.
Key Details:
The DALL-E editor enables users to edit images by selecting an area of the image to edit and describing the changes in ChatGPT.
The editor interface offers a wide range of options to alter photos:
Undo and Redo Buttons
Add, Update, and Remove Highlight Features
Color Change Chat Options
The DALL-E editor is accessible through the website interface or ChatGPT mobile app.
Why Itās Important:
Integrating native image editing capabilities into DALL-E 3 improves creative exploration and promotes a simplified workflow. Users can now effectively alter the details of visual outputs without restarting the image generation process.
Midjourney has a āVery Regionā button that lets users edit specific segments of generated images. Elon Musk is exploring a partnership between X and Midjourney to elevate xAIās Grok-1.5 product offerings.
AI RESEARCH
šØAI-Powered Robot Reads Braille at Record Pace
Researchers at the University of Cambridge leveraged machine learning algorithms to teach a robotic sensor to quickly slide over braille text lines.
An onboard camera captures blurred images and sharpens them with optimized machine-learning algorithms. These algorithms analyze the images to explore, analyze, and find meaning in complex data sets.
The robot could read the braille at 315 words per minute with nearly 90% accuracy, which is twice as fast and accurate as human braille readers.
š TRENDING TOOLS
š¦Magic Hour is an all-in-one video creation platform that streamlines content production from ideation to production.
šSemanticPDF automatically scans, converts, and embeds PDFs to find answers.
šCanyon streamlines your job search by creating resumes, tracking applications, and helping you prepare for interviews.
š¦øāāļøElicit analyzes research papers at superhuman speed.
āļøEllipsis reviews pull requests and converts GitHub comments into working, tested code.
š®Browse our always Up-To-Date AI Tools Database.
š°VENTURE CAPITAL UPDATES
Hailo lands $120M to keep battling Nvidia as most AI chip startups struggle.
Mike Lynch-backed legal technology startup Luminance raises $40M to build a legal personal assistant.
StealthMole secures a $7M Series A to develop an AI-powered dark web intelligence platform.
š¼WHOāS HIRING?
Uber (San Francisco, CA): Safety and Insurance Actuarial Analyst Intern, Summer 2024
Nutanix (San Jose, CA): Product Management Intern, Summer 2024
PayPal (Austin, TX): Mobile Dev Ops Intern, Fall 2024
Universal Electronics (Santa Ana, CA): Software Engineer New Grad
Open Asset (New York, NY): Support Engineer, Entry-Level New Grad
š¤PROMPT OF THE DAY
SEIZE OPPORTUNITIES
š£Hook Investors and Clients
Write me a compelling, concise, and unique 30-second elevator pitch for [Business].
Business = [Insert Here]
šFINAL NOTE
If you found this useful, follow us on Twitter or provide honest feedback below. It helps us improve our content.
How was todayās newsletter?
ā¤ļøAI Pulse Review of The Day
āI seriously enjoy reading these clear and concise daily AI updates.ā
šNOTION TEMPLATES
šØSubscribe to our newsletter for free and receive these powerful Notion templates:
āļø150 ChatGPT prompts for Copywriting
āļø325 ChatGPT prompts for Email Marketing
šSimple Project Management Board
ā±Time Tracker
Reply