What new AI tools is Google introducing for Gemini Advanced

Google is introducing several exciting new AI tools and features for Gemini Advanced subscribers, showcasing the company’s commitment to pushing the boundaries of artificial intelligence. These innovations aim to enhance user experience, productivity, and creative capabilities. Let’s explore the key developments in detail:
New AI Models
Google has unveiled two new experimental AI models for Gemini Advanced users, significantly expanding the platform’s capabilities:
Gemini 2.0 Pro Experimental
This model represents Google’s most advanced AI offering to date. It is designed to excel in complex tasks, particularly in areas such as coding and mathematics. Key features include:
- Enhanced factuality for more accurate information retrieval
- Improved performance on coding and math-related prompts
- Better navigation of complex tasks with increased ease and accuracy
Gemini 2.0 Flash Thinking
This experimental model builds upon the speed and performance of the 2.0 Flash model, with a unique focus on transparency and reasoning. Its notable features include:
- Real-time display of the AI’s thought process
- Ability to expand and view the reasoning behind responses
- Transparent showcasing of assumptions made during processing
These new models demonstrate Google’s commitment to advancing AI technology while also addressing concerns about transparency and explainability in AI decision-making processes.
Video Generation Tools
One of the most anticipated features teased by Google is the introduction of AI-powered video generation tools for Gemini Advanced subscribers. While specific details and release dates have not been provided, this capability has the potential to revolutionize content creation. Key aspects of this upcoming feature include:
- Integration with the Gemini platform for seamless video creation
- Potential use of Google’s Veo 2 video generation model, which is currently behind a Google Labs waitlist
- Possible inline editing features for generated videos
This tool could significantly impact various industries, from marketing and advertising to education and entertainment, by simplifying the video creation process and making it more accessible to a broader range of users.
Image Generation Enhancements
Google is also planning to improve its image generation capabilities within the Gemini platform:
- Current access to Imagen 3, Google’s latest image generation model
- Potential integration of inline editing features for generated images
- Improved user interface for easier image creation and manipulation
These enhancements could provide users with more control over the AI-generated images, allowing for finer adjustments and customizations to better suit their needs.

Audio Generation Tools
In addition to video and image generation, Google has hinted at introducing audio generation capabilities to Gemini Advanced. This feature is likely to leverage Google’s existing audio AI technologies:
- Possible integration of MusicLM for music generation
- Potential use of Lyria for other audio content creation
- Applications could range from creating background music for videos to generating voice-overs or sound effects
The addition of audio generation tools would complete a suite of multimedia creation capabilities within the Gemini platform, allowing users to produce comprehensive, multi-format content using AI assistance.
Agentic Tools
One of the most intriguing developments teased by Google is the introduction of agentic tools for Gemini Advanced subscribers. These tools are designed to perform tasks on behalf of users, potentially revolutionizing productivity and task management. Key aspects of this development include:
Project Mariner Integration
Google DeepMind’s Project Mariner, introduced at Google I/O 2024, is expected to play a significant role in these agentic capabilities:
- Ability to execute multiple complex tasks across various applications with a single command
- Potential for automating workflows and streamlining productivity
Workspace Integration
The agentic tools are likely to be deeply integrated with Google Workspace, offering features such as:
- Automatic organization of email attachments in Google Drive
- Generation of spreadsheets with data extracted from various sources
- Automated analysis of data using tools like Data Q&A
Task Automation
These tools aim to free up users’ time by taking on various tasks:
- Potential for scheduling, research, and basic data analysis
- Ability to interact with multiple Google services to complete complex workflows
The introduction of agentic tools represents a significant step towards more autonomous and intelligent AI assistants, potentially transforming how users interact with their digital workspaces.
Enhanced Multimodal Capabilities
Google is expanding Gemini’s ability to work with multiple types of input and output, enhancing its versatility and usefulness:
Native Tool Use
Gemini 2.0 introduces improved capabilities for native tool use, allowing the AI to interact more seamlessly with various applications and services.
Image Creation
For the first time, Gemini will be able to natively create images, expanding its creative capabilities beyond text generation.
Speech Generation
The addition of speech generation capabilities will allow Gemini to produce audio output, potentially opening up new use cases in areas like accessibility and voice-based interfaces.
AI Agents for Specific Domains
Google is exploring the practical application of AI agents in various domains, showcasing the versatility of Gemini 2.0:
Universal AI Assistant
A research prototype is being developed to explore the future capabilities of a universal AI assistant, potentially offering more comprehensive and context-aware help across various tasks.
Browser-Based Agent
Google is working on a research prototype that can understand and reason across browser screen information, navigate web interfaces, and perform actions like typing, scrolling, or clicking in active browser tabs.
Coding Agent
An AI coding experience called Jules is being previewed, which can integrate directly with GitHub workflows. This agent can develop and execute plans for code changes and resolve coding issues independently.
Gaming Agents
Experimental AI agents powered by Gemini 2.0 are being developed to provide real-time game analysis and suggestions. These agents can understand game rules and engage in real-time conversations about gameplay.
Improved Performance and Capabilities
The new Gemini models show significant improvements across various benchmarks:
General Knowledge
- MMLU-Pro: Gemini 2.0 Experimental Pro achieves 79.1% accuracy, up from 75.8% in the previous version
Coding
- LiveCodeBench (v5): Improved performance in Python code generation
- Bird-SQL (Dev): Enhanced ability to convert natural language questions into executable SQL
Reasoning and Factuality
- GPQA (diamond): Significant improvement in answering challenging questions in biology, physics, and chemistry
- SimpleQA: Major boost in world knowledge factuality without search enabled
Multilingual Capabilities
- Global MMLU (Lite): Enhanced performance across 15 languages, including both culturally sensitive and culturally agnostic samples
Mathematics
- MATH: Improved performance on challenging math problems across various disciplines
- HiddenMath: Better results on competition-level math problems
Long-context Understanding
- MRCR (1M): Enhanced ability to handle and understand long-context scenarios
Multimodal Capabilities
- MMMU: Improved performance on multi-discipline college-level multimodal understanding and reasoning problems
These performance improvements demonstrate Google’s commitment to enhancing Gemini’s capabilities across a wide range of tasks and domains.

Conclusion
Google’s introduction of these new AI tools and features for Gemini Advanced represents a significant leap forward in AI technology. From new experimental models to video generation tools, agentic capabilities, and domain-specific AI agents, these innovations promise to enhance user productivity, creativity, and problem-solving abilities.
As these features roll out in the coming months, Gemini Advanced subscribers will have access to an increasingly powerful and versatile AI platform. However, it’s important to note that many of these features are still in development or experimental stages, and their full capabilities and limitations will become clearer as they are released to users.
The introduction of these tools also raises important questions about the future of AI integration in daily work and life. As AI becomes more capable of performing complex tasks and making decisions, issues of ethics, privacy, and the changing nature of human-AI interaction will become increasingly important topics of discussion.
Google’s continued investment in AI research and development, as evidenced by these new tools, underscores the company’s commitment to remaining at the forefront of AI innovation. As these technologies evolve, they have the potential to reshape how we interact with technology, approach problem-solving, and conceptualize the role of AI in our personal and professional lives.
FAQs
Q: When will these new features be available to Gemini Advanced subscribers?
A: Google has not provided specific release dates for most of these features. They are expected to roll out “in the coming months,” with some experimental models already available to subscribers.
Q: Will these new features be available on all devices?
A: While Google has not specified device availability for all features, it’s likely that most will be accessible through the web interface initially, with mobile support following later.
Q: How does Gemini 2.0 Pro Experimental compare to other leading AI models?
A: Gemini 2.0 Pro Experimental is currently ranked as one of the world’s most powerful generative AI models on Imarena’s Chatbot Arena LLM Leaderboard, showcasing its competitive performance against other leading models.
Q: Will these new features be available in languages other than English?
A: Initially, many features may be limited to English. However, Google has a track record of expanding language support over time, so additional language options are likely to be introduced in the future.
Q: How will the introduction of these new features affect the pricing of Gemini Advanced?
A: Google has not announced any immediate changes to Gemini Advanced pricing related to these new features. However, as the platform evolves, pricing structures may be adjusted to reflect new capabilities.