OpenAI's Journey to Human-Level AI
Also: Google Gemini now powering robots
Welcome, AI enthusiasts
Exciting developments are underway at OpenAI as they unveil a new scale to measure AI's progress toward achieving Artificial General Intelligence (AGI). This scale defines stages from Level 1 to Level 5, with OpenAI's models nearing Level 2. Meanwhile, Google Gemini is breaking new ground by integrating large language models with robots, boosting their capacity to perform real-world tasks, while Elon Musk's xAI has announced a delay for Grok 2 until August. Let's dive in!
In today's insights:
OpenAI's Journey to Human-Level AI
Language Models Powering Real-World Robots
Elon Musk Delays Grok 2
Read time: 4 minutes
LATEST DEVELOPMENTS
OPENAI
OpenAI's Journey to Human-Level AI
Evolving AI: OpenAI has created a five-level scale, from Level 1 to Level 5, for rating how capable its AI systems are.
Key Points:
OpenAI's internal scale measures progress toward artificial general intelligence (AGI).
Current chatbots like ChatGPT are at Level 1; OpenAI says its models are nearing Level 2, where AI can solve basic problems as well as a person with a PhD.
Level 5 represents AI capable of performing the work of entire organizations.
Details:
OpenAI's scale marks the journey from simple chatbot abilities to AI handling complex organizational tasks. Level 1 covers today's chatbots; Level 2 means solving basic problems as well as a person with a PhD. Level 3 involves taking actions on a user's behalf, Level 4 signifies creating innovations, and Level 5, the pinnacle, involves performing work equivalent to entire organizations. Although predicted timelines differ, AGI remains a distant goal, requiring immense computing power. The new scale clarifies progress, aiding OpenAI in its AGI mission.
Source: Bloomberg reporting
Why It Matters:
Despite the challenges, the future of AI holds immense promise. OpenAI's human intelligence scale provides a structured approach to evaluate progress and identify areas that need further research and development. AGI is still only theoretical, meaning we're not yet sure if it will be achieved, but we believe it can be. Many believe that we still need one or two breakthroughs before AGI is within reach. However, OpenAI introducing this scale gives us some hope that we're on the right path. If they're preparing themselves for reaching AGI, it's probably best for you to do so too.
GOOGLE
Language Models Powering Real-World Robots
How can Gemini 1.5 Pro's long context window help robots navigate the world?
A thread of our latest experiments.
Google DeepMind (@GoogleDeepMind)
2:05 PM · Jul 11, 2024
Evolving AI: Researchers are integrating large language models with robots, enhancing their ability to perform real-world tasks.
Key Points:
Google DeepMind's robot uses Gemini to understand and execute complex commands.
Gemini's multimodal capabilities significantly improve human-robot interaction.
Industry and academic labs are rapidly advancing AI-enhanced robotics.
Details:
In California, a sleek, wheeled robot equipped with Google's Gemini language model serves as a tour guide and office assistant. This Gemini-enhanced robot processes both text and video, enabling it to navigate and perform tasks based on commands like "Find me somewhere to write," leading users to a whiteboard.
Introduced in December, Gemini's multimodal nature (handling video and text) powers this robot's impressive navigation abilities. Demonstrating up to 90% reliability, it excels even with complex instructions like "Where did I leave my coaster?" The researchers note the system's enhanced naturalness and usability in human-robot interactions.
This development illustrates the potential for large language models in practical applications beyond web browsers and apps. Gemini's ability to interpret visual data marks a significant leap in robot capabilities. Academic labs and startups are racing to leverage AI for robotics, with substantial investments fueling this innovation.
Why This Matters:
Researchers guided robots through real-world environments and showed them important locations. The robots were then able to find these locations again. A simple smartphone video is enough to give the robot an overview of the environment. The ease with which this can be achieved is significant and sets a precedent for robots being trained in the real world. Expect more headlines like this as robots grow more capable, until it becomes normal and we're all used to it. Dystopian? Maybe. Understandably, not everyone will love these developments, but this will be our future.
GROK
Elon Musk Delays Grok 2
Evolving AI: Elon Musk's xAI delays Grok 2 to August, while teasing an even bigger leap with Grok 3 by year's end.
Key Points:
Grok 2 postponed to August; Grok 3 promised by end of year.
xAI leases 24,000 H100 GPUs; plans to train Grok 3 on 100,000 H100s.
xAI is moving away from Oracle to build its own training cluster, which it aims to make the world's most powerful.
Grok's chatbot lags behind ChatGPT and has had issues with generating fake news stories.
Details:
Elon Musk's AI company, xAI, announced a delay in the release of Grok 2, initially scheduled for May, now pushed to August. Musk promises Grok 3, a major upgrade, by year-end. Grok 2 was revealed in March, with Musk boasting it would surpass OpenAI's GPT-4, though specifics were scant. xAI has secured 24,000 H100 GPUs from Oracle for Grok 2's training; the delay is attributed to the meticulous process of purging training data of content from other language models. Musk claims this will yield a substantial improvement. Grok 3, teased in July, will utilize 100,000 H100s and commence training on xAI's in-house infrastructure, aimed at building the most powerful training cluster. This shift from Oracle is part of xAI's strategy to maintain control and outpace competitors. Currently, Grok's chatbot lags behind ChatGPT and Claude, partly due to generating fake news from real-time data on X.
Our Thoughts:
We all know that Elon Musk makes a lot of predictions for his companies that will not always be met. However, xAI has received notable investments in the last few months and is constructing a new AI data center in Memphis with NVIDIA chips. Will he meet his goals? He surely knows that if he wants to catch up and be a serious competitor in the space, Grok 2 needs to be a hit.
SNAPSHOTS
Direct links to relevant AI articles.
IBM: Teaching the language of your business.
Samsung: Samsung announces their first-ever smart Galaxy Ring for $399.
Trending AI Tools
Evolving AI's Prompt Hub - The world's #1 ChatGPT Prompt Hub, featuring prompts that consistently produce great results
Salad - Provides you with an affordable GPU cloud for your business
Breezemail - Organize your inbox with AI
Pangea - Platform that connects startups with talent
Noah - An AI work assistant integrated with Google Drive, Notion, and more
Mailogy - Talk to your email with AI