AI Mode, Veo 3, Imagen 4, Android XR, and More


Google’s annual event I/O has returned this year, pushing the boundaries of AI further than ever before. From bringing agentic AI features to Google search to launching enhanced image and video generation models, Google I/O 2025 was an absolute spectacle! What started off with the keynote speech by Google CEO Sundar Pichai, highlighting the milestones accomplished by the tech giant, soon escalated to an ever-exciting show of AI-powered advancements and new generative AI tools. From the new AI mode on Google Search and Gemini Live to the launch of Veo 3, Imagen 4, and Flow, to the unveiling of Android XR and Samsung Moohan – Google was pulling one AI rabbit after the other out of the hat. Of all that was said and shown, this blog brings you the 5 biggest AI breakthroughs and launches announced at the Google I/O 2025 event.

1. Google Beam & Real-Time Translation in Google Meet

Google has taken video calling to a whole new level with Google Beam – an evolution of Project Starline that offers immersive 3D video communication. This new technology captures the views of the speaker from 6 different camera angles, along with their movement at 60 fps. It then puts them all together to generate a 3D version of the speaker, making it feel like the person is right in front of you. Aimed at making virtual interactions feel more lifelike, Google Beam will soon be available to Google Meet users in the US and then to other countries.

Complementing this, Google Meet now features real-time speech translation. Powered by AI, this translation feature can pick up your dialect, tone, and nuances to give accurate translations in real-time. Initially supporting English and Spanish, Google plans to add more languages soon, facilitating seamless multilingual conversations during video calls. This new feature has already been rolled out to US users and will soon be launched worldwide. Google Enterprise users will also get access to this feature towards the end of this year.

The biggest announcement made at Google I/O 2025 has got to be about the new AI Mode in Google Search. Owing to the widespread acceptance of AI overviews on Google Search, they have now brought the power of AI directly to the search bar with the AI Mode. This new feature lets users use AI directly to search for results, just as they would on ChatGPT, Gemini, or any other AI chatbot.

With an expanded search window, users can now add more context and ask multiple questions within the same search query. Google Search breaks user queries into multiple smaller queries and categories and runs parallel searches on all of them. With AI-powered reasoning capabilities, it then puts together all the info and generates a comprehensive and contextual response. This transforms Google Search into a more interactive experience.

Key Features

Google Search’s new AI Mode offers 7 new features:

  1. Personal Context: You can now get Google to give you personalized responses by integrating your search history and data from other Google apps and tools like Gmail. This integration lets the AI understand your style and choices, to generate smarter responses that are uniquely helpful to you.
  2. Deep Research: This feature multiples the web search capabilities of Google to do dozens or even hundreds of searches at the same time, to gather more information, resulting in more detailed and well-researched responses.
  3. Multiple Response Formats: The now AI-powered Google Search dynamically generates the best layout for each response, based on the query. For instance, it can intelligently generate interactive lists and graphs for sports and financial queries.
  4. Personalized Shopping Suggestions: Instead of simply listing out product pages and shopping links, Google Search can now give you personalized shopping suggestions based on your taste, previous searches, and purchase history. While you can add more context and details to your search query, Google also recommends points to consider to help you make the right choice.
  5. Virtual Outfit Trials: Another highlight of the AI Mode is the AI-powered shopping with virtual try-ons. You can now virtually try on clothes before buying them, directly on Google Search. Simply select the outfit, upload your image, and watch as Google magically dresses you up in that outfit right on the screen. This feature has also been rolled out to users in the US today.
  6. Search Live: You can now do live video calls to Google Search for real-time visual assistance, similar to the Gemini Live feature on the chatbot.
  7. AI-powered Visual Search: Where Google Lens would earlier find similar images based on the input image, it can now give AI overviews of any image you click or upload. It can basically explain anything that’s in front of your eyes, being a virtual companion, especially to the visually impaired.

Google announced some of its latest and most advanced generative AI tools at the Google I/O 2025 event. This included:

  1. Music AI Sandbox with Lyria 2: The Music AI Sandbox, powered by Lyria 2, enables users to generate music compositions using AI. It can create harmonies, rhythms, background scores, and even full compositions with orchestra based on user input.
  2. Imagen 4: Imagen 4 is Google’s latest text-to-image generation model, capable of producing high-quality, photorealistic images from textual descriptions. Not only does it get text and spelling right, but it can also intelligently select the right font, font size, etc., based on the query. Moreover, it works up to 10x faster than previous models.
  3. Genie 2: This advanced tool from Google can transform 2D Images into interactive 3D environments in just 2 steps and a prompt. It has a wide range of applications in gaming, virtual reality, and digital content creation.
  4. Veo 3: Google launched its latest version of Veo at the annual event. The upgraded Veo 3 takes AI-powered video generation to a whole new level, creating hyper-realistic and high-quality videos from text prompts. Along with video, it also generates realistic audio output including dialogues and background sounds.
  5. Flow: This new filmmaking tool from Google brings together the creative capabilities of Veo, Imagen, and Gemini. It allows users to generate short films from text or image prompts, integrating sound, dialogue, and visual effects. With text-to-image, image-to-video, and text-to-video features, it becomes a one-stop-shop for bringing imagination to reality. Moreover, it also comes with scene extension and editing features.

Most of these updates have been integrated into the Gemini chatbot making it much more advanced and creatively capable.

4. Gemini Live, Imagen 4, Veo 3, and More

Google I/O 2025 was more about Gemini than about AI, as proven by CEO Sundar Pichai’s word counter. Several announcements regarding Google’s Gemini chatbot were made at the event including updates on Deep Research and Canvas and integrations with Google’s latest generative AI tools.

Gemini Updates Launched at Google I/O 2025

Here’s a list of all the Gemini Updates revealed at the Google I/O event this year.

  1. Gemini Live: The biggest update of the Gemini chatbot announced at the Google I/O event this year was the Gemini Live feature, which lets users have live video calls with the AI-powered Gemini chatbot. It offers real-time AI assistance across devices, letting users engage in interactive camera conversations, receive on-the-go translations, and share screens or camera feeds for help. This feature is now available in over 45 languages across 150+ countries, to both Android and iOS users.
  2. Gemini in Chrome: The next big thing is that Google will soon be rolling out Gemini on Google Chrome as a web browsing AI agent. This lets users ask their search queries and follow-up questions about the search results, directly to the AI chatbot.
  3. Gemini Voice: Google has integrated native audio output into Gemini’s Voice Mode which lets it respond to users in a more personalized and nuanced manner. It can switch between languages, change tones, and even whisper during the same conversation. You can test out this updated version via the Gemini API.
  4. Deep Research: You can now upload your own files to guide the research agent while doing Deep Research using Google Gemini. You can also connect it to your Gmail and Google Drive to fetch more data or provide some context.
  5. Canvas: The Canvas feature on Gemini can now convert deep research reports into custom podcasts, quizzes, infographics, and more.
  6. Imagen 4: Google Gemini’s image generation capabilities are now powered by Imagen 4 making the images more realistic and detailed.
  7. Veo 3: Gemini can now generate realistic videos with accurate audio, dialogues, and background sound, thanks to the newly integrated Veo 3.

5. Android XR and Samsung Moohan

Android XR is Google’s first ever android platform, venturing into extended reality. This technology, powered by Gemini, fosters an immersive experience for users through hyper-realistic videos in real-time. Samsung’s Moohan, a newly designed pair of smart glasses, would be the first device that leverages Android XR for AI assistance. These glasses offer features like real-time navigation, translation, and camera live-streaming, aiming to enhance user interaction with the digital world.

With these glasses you can watch live events from your home, as if you were sitting in the front row of the stadium. With the ability to show Google Maps in 3D, it can visually take you places in real-time, giving you a realistic experience. Moreover, it comes with memory and can answers questions. Designed to provide AI assistance in real-time just like a human companion, Samsung Moohan can click pictures, make bookings, and even translate audio to text. Unlike most other smart glasses that come in a single sci-fi inspired design, these ones are going to be designed in various styles by Gentle Moster and Warby Parker.

Conclusion

Google I/O 2025 showcased the company’s ambitious strides in artificial intelligence, integrating advanced AI capabilities across its product spectrum. From enhancing everyday tools like Search and Meet to pioneering creative platforms like Flow and Genie 2, Google’s innovations aim to redefine user interaction with technology. As AI continues to evolve, these developments mark a significant step toward a more intuitive and immersive digital future.

K.C. Sabreena Basheer

Sabreena is a GenAI enthusiast and tech editor who’s passionate about documenting the latest advancements that shape the world. She’s currently exploring the world of AI and Data Science as the Manager of Content & Growth at Analytics Vidhya.

Login to continue reading and enjoy expert-curated content.

Leave a Comment