🚀 Google's AI voice chat mode is here!

Also: Elon Musk's X unveils Grok-2

Welcome, AI enthusiasts

Gemini Live, Google’s much-anticipated answer to OpenAI’s ChatGPT voice assistant, has finally launched. Meanwhile, xAI has unveiled Grok-2, a language model with state-of-the-art reasoning capabilities and the ability to generate images. And Sakana AI has announced the AI Scientist, a fully automated pipeline for end-to-end research paper generation, powered by the latest advances in foundation models. Let’s dive in!

In today’s insights:

  • Gemini Live, Google’s answer to ChatGPT’s Advanced Voice Mode

  • xAI releases Grok-2, adds image generation on X

  • The world’s first autonomous AI Scientist is here

Read time: 5 minutes

🗞️ LATEST DEVELOPMENTS

Evolving AI: Google introduces Gemini Live, a voice-enabled AI that mimics human conversation, challenging ChatGPT’s Advanced Voice Mode.

Key Points:

  • Google launches Gemini Live, a voice mode allowing natural, conversational interactions with AI, currently available for Android.

  • Unlike OpenAI’s delayed ChatGPT Advanced Voice Mode, Google’s feature is more widely accessible.

  • Gemini Live integrates deeply with Android, offering hands-free operation and context-aware assistance.

Details:

Google has taken a leap in the generative AI race with the launch of Gemini Live, a new voice mode for its AI model, Gemini. Available through the Gemini app on Android, this feature lets users hold natural, free-flowing conversations with the AI, including interruptions and topic changes, much like a real phone call. OpenAI showcased a similar feature earlier, but its rollout has been limited. In contrast, Gemini Live began rolling out yesterday in English to Gemini Advanced subscribers on Android phones, and will expand to iOS and more languages in the coming weeks. The integration also enhances the Android experience with context-aware assistance, letting users interact with on-screen content seamlessly.

Why It Matters:

We’ll have to see how well this works in practice, of course. But while OpenAI promised three months ago that Advanced Voice Mode would launch "in the coming weeks," Google has actually delivered. Despite setbacks in the AI race, Google is now firmly back in contention with the launch of Gemini Live. The question now is, how will OpenAI respond?

🔥 TOGETHER WITH ARTISAN

Hire Ava, the Industry-Leading AI BDR

Ava automates your entire outbound demand generation so you can get leads delivered to your inbox on autopilot. She operates within the Artisan platform, which consolidates every tool you need for outbound:

  • 300M+ High-Quality B2B Prospects

  • Automated Lead Enrichment With 10+ Data Sources Included

  • Full Email Deliverability Management

  • Personalization Waterfall using LinkedIn, Twitter, Web Scraping & More

Evolving AI: xAI has launched Grok-2 and Grok-2 mini, expanding its AI capabilities to include image generation on the X platform for Premium users.

Key Points:

  • Grok-2 and its mini variant are now in beta on X.

  • The AI models offer advanced reasoning, coding, and chat capabilities.

  • Image generation, currently unrestricted, may raise concerns about misinformation.

Details:

Elon Musk's xAI has rolled out Grok-2 and its smaller counterpart, Grok-2 mini, in beta, offering enhanced reasoning and coding capabilities. Available exclusively to Premium and Premium+ users on X, Grok-2 now features image generation, powered by FLUX.1 from Black Forest Labs. However, the absence of robust safeguards has led to the creation of unfiltered content, including politically sensitive images. Despite these concerns, xAI is pushing forward, with plans to integrate Grok-2 into various AI-driven features on X and to make the models available to developers through an enterprise API later this month.

Why It Matters:

Grok-2 is already making a mark, ranking fourth on the model leaderboard. With its new image generation feature powered by FLUX.1 from Black Forest Labs, xAI is solidifying its position in the AI landscape. However, the lack of sufficient safeguards is likely to spark significant debate in the days ahead, raising concerns about the responsible use of such powerful tools.

Source: The-Decoder

Evolving AI: The AI Scientist autonomously conducts research, producing scientific papers for just $15 each.

Key Points:

  • The AI Scientist can independently brainstorm, experiment, and generate full scientific papers.

  • It costs under $15 to produce a research paper in areas like diffusion models and language models.

  • Despite promising results, limitations like inaccurate results and poor citation practices persist.

Details:

A new AI system, dubbed "The AI Scientist," has been developed by researchers from the University of British Columbia, University of Oxford, and startup Sakana AI. This groundbreaking system autonomously handles all aspects of scientific research, from idea generation to paper writing. In trials, it produced complete papers in machine learning domains such as diffusion models and transformer-based language models, all at an astonishingly low cost of $15 each. However, the system is not without flaws; challenges like limited experiment depth, citation issues, and occasional errors in interpreting results highlight the need for further refinement.

Why It Matters:

Yes, this is as groundbreaking as it sounds. While not yet on par with top human researchers, having an AI model that autonomously conducts research and self-improves is a significant step toward AGI. The final paper, which exceeded expectations, highlights the potential of AI-driven scientific discovery. For those interested in this breakthrough, the full announcement is available on their socials, and it is definitely worth a look.

💡 Tip of the Day

At MadeByGoogle '24, Google introduced major AI developments, with Gemini taking center stage. If you haven’t caught up on the announcements yet, we recommend watching the 2-minute recap below or the full keynote here.

🎯 SNAPSHOTS

Direct links to relevant AI articles.

🎮 VideoGameBunny: This AI understands video games.

👀 Akeana: Company raises over $100M and unveils new RISC-V chip designs.

🎨 AI art: Artists celebrate as copyright infringement case against AI image generators moves forward.

📈 Trending AI Tools

  • ✍🏽 NovelCrafter - Organize and craft your novels (link)

  • 🎥 GoEnhance - Transform videos and enhance images (link)

  • 🎥 Kaiber - Turn text, videos, photos and music into videos (link)

  • 🚀 Empress.eco - On-demand automation services for business growth and cost-cutting (link)

  • 🗣️ Deepgram - Build voice AI into your apps (link)
