Click any tag below to further narrow down your results
Links
Josh Woodward leads Google’s Gemini app, a key part of the company's AI strategy, as it competes with rivals like OpenAI. His focus includes balancing rapid innovation with ethical considerations in AI development. The Gemini app has seen significant user growth and new feature launches, including the popular Nano Banana.
Google is developing Nano Banana 2 Flash, a new AI image generator that offers faster performance than its predecessor, Nano Banana Pro. This model is based on the Gemini 3 Flash and is expected to launch by the end of the quarter amid growing competition in the AI image generation market.
Google Gemini is introducing a "projects" feature similar to ChatGPT's, allowing users to organize files and discussions by topic. The latest beta build provides a preview of this tool, which includes options for naming projects, adding descriptions, and pinning frequently used projects for easy access. A 10-file limit for each project has been noted, though it's unclear if this will apply universally or vary by subscription level.
Google’s AI chatbot Gemini has reached 750 million monthly active users, a significant increase from 650 million last quarter. The growth follows the launch of Gemini 3, which offers advanced capabilities, although it still lags behind Meta AI's 1 billion users. Google is also rolling out a new subscription plan to attract more users.
Google has updated its Gemini Deep Think AI model to improve its capabilities in math and science. The model can now assist researchers in transitioning from theoretical concepts to practical applications, following collaboration with scientists.
Apple is set to launch an updated version of Siri in March 2026, powered by Google's Gemini technology. This new version will include AI-driven web search capabilities, but analysts warn it may not win back users' trust after years of criticism. Apple is also facing challenges with the rollout of Apple Intelligence in China due to regulatory issues.
Opera is expanding its partnership with Google to integrate Gemini AI into Opera GX and Opera One, adding features like context-based summaries and file analysis. This update aims to enhance user experience for over 80 million users across its browsers while maintaining privacy controls.
Google has released the Gemini 3 Deep Think mode for Ultra subscribers. This mode enhances reasoning skills to solve complex math, science, and logic problems, achieving top scores in recent benchmarks. Users can access it through the Gemini app's prompt bar.
This article explores what Gemini 3 reveals about your brand and its competitors. Peter M. Buch from Candycat Agency shares his curiosity about the new features of Gemini 3 and its impact on marketing. It's aimed at marketers looking to leverage AI insights.
Apple is set to unveil an updated version of Siri in February, powered by Google's Gemini AI models. This update aims to enhance Siri's capabilities, allowing it to perform tasks using personal data and on-screen content, with a more conversational style expected in a larger update planned for June.
Google is set to release Nano Banana version 2, called GEMPIX2, within the week, as indicated by new announcement cards in the Gemini interface. This model aims to serve creators and professionals, building on the success of its predecessor. Details on specific improvements are still pending.
Google is trialing a new image AI called "Nano Banana 2 Flash," which aims to be quicker and more affordable than its predecessor, the Nano Banana Pro. While it will be less powerful, it's designed for efficient image generation and editing. The model was identified by a reliable source known for leaking details on Gemini's technology.
Google plans to release the Nano Banana Pro next week, powered by Gemini 3 Pro. This update aims to enhance image and video generation tools for both professionals and businesses, aligning with Google's broader strategy to improve its AI offerings.
The launch of Gemini 3 has demonstrated significant performance improvements over its predecessor, Gemini 2.5, despite having the same parameter count. This, along with Nvidia's strong earnings report, suggests that pre-training scaling laws remain effective when combined with algorithmic advancements and improved compute power. Together, these developments challenge the notion that AI model performance has plateaued.
Google is rolling out significant updates to its Chrome browser, integrating the Auto Browse AI feature into Gemini. This allows users to automate tasks and easily access multiple Google services like Gmail and YouTube directly within the browser. The update also includes improved image editing capabilities without the need to download files separately.
Google’s new Agentic Vision feature in Gemini 3 Flash enhances AI's ability to analyze and interact with images. It enables developers to execute code, zoom in on details, and manipulate data, improving accuracy for various tasks. The feature is available through the Gemini API and aims to support more tools in the future.
Apple has partnered with Google to use its Gemini AI models for Siri and Apple Intelligence, estimated to be worth $5 billion. This deal raises questions about the future of Apple's ChatGPT integration, which may not last long due to the focus on Gemini. Apple's overall investment in AI remains cautious compared to its competitors.
This article explores how Google's Gemini 3 manages user memory differently from other AI systems like ChatGPT. It highlights Gemini's structured memory approach, its cautious use of personalization, and the implications for user control and trust. The piece also discusses the potential trade-offs of this design in creating a more personalized AI experience.
Google is testing a new "Auto Browse" feature for its Gemini tool, allowing it to autonomously browse the web and manage Chrome tabs. This capability aims to streamline tasks like research and workflow execution for users, potentially becoming part of a premium plan. Early indications suggest it will integrate smoothly with the Chrome interface.
Google is enhancing its Gemini AI in Chrome to become a more proactive tool rather than just a passive assistant. New features called “Skills” will allow users to customize Gemini’s capabilities for specific tasks, making it capable of executing complex workflows directly in the browser. This shift aims to integrate Gemini more deeply with Google's ecosystem, allowing it to interact with various apps seamlessly.
Google is testing the inclusion of third-party models, like Claude Sonnet 4.5, in its Gemini for Business platform. This allows businesses to choose between Google's models and alternatives directly within the model selector. The updates aim to enhance flexibility and operational visibility for enterprise users.
Google has released the Gemini 3 Flash model, which offers faster performance and improved coding capabilities compared to previous versions. It outperforms the older 2.5 Flash in several tests and is more cost-effective for developers. The model maintains its ability to generate interactive content and simulations.
The author explores how Google Gemini uses personal data and raises questions about its "Personal Context" feature. They note a troubling instance where Gemini appeared to hide its knowledge of the user's previous tool usage while violating privacy policies. This prompts a discussion on the transparency and truthfulness of AI systems.
This article outlines effective strategies for using Gemini 3, emphasizing direct instructions, structured prompts, and clear output expectations. It encourages users to refine their own approaches based on these guidelines rather than treating them as absolute rules.
The article discusses how Google's Gemini is making significant strides in AI product development, especially with its new "Dynamic View" feature. This innovation enhances user experience by offering interactive, visual outputs that could rival ChatGPT's established position. The author believes Google's recent improvements could pose a real challenge to OpenAI's dominance in the market.
Google launched Nano Banana Pro, an advanced image editing tool built on its Gemini 3 Pro AI model. The tool enhances capabilities for creating infographics and maintaining character consistency in images. It follows the success of the original Nano Banana, which gained popularity for transforming photos into 3D figurines.
The article discusses Poetiq, a new AI tool that leverages Gemini 3 and GPT-5.1 to achieve a new state-of-the-art performance in problem-solving. It highlights how Poetiq uses iterative self-auditing to refine its strategies autonomously.
Google is rolling out new AI features to simplify holiday shopping. Users can ask conversational questions in Search for tailored shopping responses, track prices, and even let Google call local stores to check inventory. The Gemini app also offers shopping ideas and product comparisons.
Google is introducing AI image verification in the Gemini app using SynthID, a digital watermarking technology. Users can upload images to check if they were created or edited by Google AI. The company plans to expand this verification to other media formats and collaborate with industry partners for better content transparency.
Google DeepMind has recruited Aaron Saunders, the former CTO of Boston Dynamics, to enhance its robotics efforts. DeepMind aims to develop Gemini as a versatile robot operating system, leveraging AI to control various robotic forms. The move reflects growing competition in the robotics field, particularly from startups and companies in China.
Google released an upgraded version of Gemini 3 Deep Think, aimed at solving complex challenges in science and engineering. The update improves reasoning capabilities and is now available to Google AI Ultra subscribers and select researchers via an API. Early users report significant breakthroughs in fields like mathematics and materials science.
Google has updated its Gemini app to allow users to verify if videos were created by its AI. By uploading a video, users can check for a digital watermark that indicates AI involvement. However, this tool only works for content generated by Google's own systems.
Google has launched Gemini 3 Flash, a new AI model designed for speed and cost efficiency. It outperforms previous versions in coding, gaming, and document analysis while offering advanced reasoning capabilities. Developers can access it through various platforms, including Google AI Studio and Vertex AI.
Google Gemini can now access emails and documents for deep research tasks, allowing users to create detailed reports. It integrates information from Gmail, Drive, and Chat, enabling personalized analysis and report generation. The feature is currently available on desktop, with mobile access coming soon.
ChatGPT's global traffic share fell to 64.6%, while Google's Gemini increased to 22% in January 2026. This shift reflects growing competition and changing dynamics in the generative AI market, with other platforms like Grok and Claude also showing notable traffic movements.
Google’s Gemini Deep Research now offers visual reports for AI Ultra subscribers. This feature generates custom images, charts, and interactive simulations to help users understand complex information and forecast outcomes. You can access this tool in the Gemini app to create detailed reports.
Google announced upgrades to its Gemini 2.5 text-to-speech models, focusing on expressivity, pacing, and multi-speaker capabilities. These changes improve control over tone and style, making it easier for developers to create realistic audio content. The updated models are available in Google AI Studio.
Google is set to launch Nano Banana 2, which improves on its predecessor with better image processing capabilities, including precise coloring and error correction. The new model features an iterative workflow for enhanced accuracy and supports various aspect ratios and resolutions. Internal testing hints at a possible rebranding to “Nano Banana Pro.”
Google’s Gemini 3 Pro is now the top AI model, outperforming GPT-5.1 by 3 points in the Artificial Analysis Intelligence Index. It excels in five key evaluations, shows strong coding capabilities, and supports multiple input formats. However, its premium pricing makes it one of the most expensive models to operate.
Google is revamping the Gemini app with a focus on user experience, responding to feedback about its current interface. The update aims to streamline interactions and improve functionality across devices, including a new macOS app.
Gavin Baker discusses the implications of Gemini 3 and the upcoming Blackwell models on the AI landscape. He highlights how reasoning improves product economics, the challenges facing competitors, and the impact of power shortages on AI infrastructure. Overall, he argues we're still in the early stages of AI development.
Apple is set to enhance Siri with Gemini, allowing for independent finetuning and improved emotional support responses. The partnership with Google will ensure that user data remains private, and the Gemini-powered Siri aims to provide more accurate answers and better handle complex queries. A gradual rollout of these features is expected, with some launching this spring.
Gemini is now available in Google Classroom, allowing users to generate text-dependent questions or quizzes based on specific text. Admins can explore pricing options for Gemini Education, while end users with a Gemini Education license can access these features directly in the platform. The rollout is currently active for both Rapid Release and Scheduled Release domains.
Google has launched Gemini, a new deep thinking AI model designed to enhance reasoning capabilities by testing multiple ideas in parallel. This advancement aims to improve decision-making processes and could significantly impact various applications in AI technology.
Google is integrating its Gemini AI feature into Chrome for Mac and Windows, allowing users to ask questions about web pages. This move raises concerns in light of an ongoing antitrust trial against Google, as it strategically positions Chrome as a key player in the AI landscape, potentially affecting competition and the future of Google Search. The rollout of Gemini could provoke reactions from emerging AI browser startups and competitors like Microsoft and OpenAI.
Google Gemini's Command-Line Interface (CLI) has been found to be vulnerable to prompt injection attacks, allowing for potential arbitrary code execution. This security flaw raises concerns about the safety and reliability of utilizing AI models in various applications.
Google has introduced Gemini, an advanced AI image editor that allows users to create and manipulate images effortlessly. This tool, which includes unique features such as the ability to generate images from text prompts, aims to enhance user creativity and streamline the editing process.
Gemini 3.0 has been spotted in A/B testing on Google AI Studio, showcasing its advanced coding performance through SVG image generation. The author tested the model by creating an SVG image of an Xbox 360 controller, noting impressive results compared to the previous Gemini 2.5 Pro model, despite longer processing times.
Gemini 2.5 Pro has been upgraded and is set for general availability, showcasing significant improvements in coding capabilities and benchmark performance. The model has achieved notable Elo score increases and incorporates user feedback for enhanced creativity and response formatting. Developers can access the updated version via the Gemini API and Google AI Studio, with new features to manage costs and latency.
The article appears to discuss Google's Gemini and its unexpected behavior while interacting with the Pokémon game. It highlights the challenges and peculiarities faced by AI systems when engaging with complex gaming environments. Further details and insights into the implications of these interactions are likely covered in the full content.
Gemini's photo-to-video capability allows users to transform images and illustrations into engaging eight-second video clips with sound. The article outlines three creative uses for this feature: animating illustrations, turning photography into motion pictures, and articulating artistic visions more effectively. It also emphasizes the importance of detailed prompts for achieving high-quality results and encourages users to explore AI-generated media as a tool for enhancing their creative projects.
Google announced significant AI updates in March 2025, including enhanced features for the Gemini app, new AI tools for Google Shopping, and advancements in robotics aimed at improving everyday life. Key highlights include the introduction of Gemini 2.5 Pro, personalized AI responses, and innovative solutions for wildfire detection and environmental protection. These developments reflect Google's ongoing commitment to leveraging AI across various sectors to benefit users globally.
Gmail is introducing a new feature called "Help Me Schedule," which utilizes Gemini AI to assist users in scheduling meetings by suggesting available times as they compose emails. When activated, an in-line meeting widget allows recipients to easily select a time that works for them, streamlining the scheduling process, although group scheduling will not be supported at launch. This feature is part of Google's broader rollout of AI capabilities across its products.
Gemini app users can now upload and edit images with new AI-powered features, allowing modifications such as changing backgrounds and replacing objects. This intuitive editing capability enhances user interaction by integrating text and images, while also ensuring all generated images include a digital watermark for authenticity. The rollout of these features will expand to users in over 45 languages and most countries in the coming weeks.
Google has launched a new feature in its Gemini platform that allows users to share custom-built Gems, enabling collaborative automation and workflow management. Creators can control access permissions similar to Google Drive, facilitating teamwork in various projects like travel planning and writing. This update enhances Gemini's functionality, positioning it competitively against other AI assistants with limited sharing capabilities.
Significant vulnerabilities in Google's Gemini AI models have been identified, exposing users to various injection attacks and data exfiltration. Researchers emphasize the need for enhanced security measures as these AI tools become integral to user interactions and sensitive information handling.
Google is accelerating the rollout of its Gemini AI models, significantly outpacing the completion of its AI safety reports. This move highlights the company's commitment to advancing its AI capabilities swiftly, despite ongoing concerns about the implications of such rapid development.
Google has launched an early preview of Gemini 2.5 Flash, enhancing reasoning capabilities while maintaining speed and cost efficiency. This hybrid reasoning model allows developers to control the thinking process and budget, resulting in improved performance for complex tasks. The model is now available through the Gemini API in Google AI Studio and Vertex AI, encouraging experimentation with its features.
The article discusses Google's new Gemini feature, which enables users to automate scheduled actions and manage planned tasks through its AI capabilities. This innovation aims to enhance user productivity by leveraging advanced machine learning algorithms for task management.
Google CEO Sundar Pichai highlighted significant advancements in AI at Google I/O 2025, showcasing the rapid progress of the Gemini models, the introduction of innovative tools like Google Beam and Agent Mode, and the expansion of AI capabilities across Google products. The event emphasized the importance of personalization and real-time communication, marking a transformative phase in the integration of AI into everyday applications and user experiences.
Gemini Advanced subscribers can now utilize the enhanced Deep Research feature powered by the Gemini 2.5 Pro Experimental AI model, which has shown superior performance in generating research reports compared to other providers. Users are experiencing improved analytical reasoning and synthesis of information, alongside the ability to create audio overviews of their reports for convenient listening. Access is available on web and mobile for Google Workspace users, although mobile app support is still pending.
Google is rolling out its advanced Gemini 2.5 Pro model and Deep Search feature in AI Mode for Google AI Pro and AI Ultra subscribers, enhancing search capabilities with powerful tools for complex queries and in-depth research. Additionally, a new AI-powered calling feature allows users to gather pricing and availability information from local businesses without making phone calls, streamlining the search experience.
An advanced version of Gemini with Deep Think achieved gold-medal performance at the International Mathematical Olympiad by perfectly solving five out of six problems, scoring a total of 35 points. This marks a significant improvement over the previous year, as the model now operates end-to-end in natural language, producing rigorous mathematical proofs within the competition's time limit. Google DeepMind aims to further enhance AI's capabilities in mathematics through ongoing collaborations and research advancements.
Google CEO Sundar Pichai expressed optimism about finalizing a deal for Apple's AI technology, called Gemini, within the year. This partnership is expected to enhance Google's competitive edge in the rapidly evolving AI market.
Google has released the Gemini 2.5 Pro Preview, an updated version that enhances coding performance for developers, particularly in front-end web development and UI design. With improved features like video understanding and aesthetic web app creation, the model aims to streamline the development process while addressing key feedback from users. Developers can access the new capabilities through the Gemini API in Google AI Studio and Vertex AI.
Google is advancing its AI capabilities by introducing a feature that allows users to create video content from images using its Gemini technology. This innovation aims to enhance user engagement and creativity in video production.
Google is introducing its Gemini AI with features focused on automatic memory and enhanced privacy controls. This update aims to improve user experience by allowing the AI to remember past interactions while ensuring that personal data remains secure. Users will have more control over what information is stored and how it is used.
Security researchers at Trail of Bits have discovered that Google's Gemini tools are vulnerable to image-scaling prompt injection attacks, allowing malicious prompts to be embedded in images that can manipulate the AI's behavior. Google does not classify this as a security vulnerability due to its reliance on non-default configurations, but researchers warn that such attacks could exploit AI systems if not properly mitigated. They recommend avoiding image downscaling in agentic AI systems and implementing systematic defenses against prompt injection.
Google is seeking the right to bundle its Gemini AI application with popular services like Maps and YouTube. This move aims to enhance user experience by integrating AI capabilities into these widely used platforms, potentially reshaping how users interact with digital content and services. The initiative reflects Google's ongoing commitment to leveraging artificial intelligence across its ecosystem.
Google has expanded its Gemini 2.5 family of hybrid reasoning models with the stable release of 2.5 Flash and Pro, along with a preview of the cost-efficient 2.5 Flash-Lite model. The new models are designed to enhance performance in production applications, particularly excelling in tasks that require low latency and high-quality outputs across various benchmarks. Developers can now access these models in Google AI Studio, Vertex AI, and the Gemini app.
Google has launched the Gemini 2.5 Flash model, offering developers an efficient new tool for building applications with lower API pricing. The rapid release of new models and features in the Gemini app has created a complex selection process for users, as noted by Tulsee Doshi, Google's director of product management for Gemini, who prefers using the more powerful 2.5 Pro version for her work.
Google has launched its most advanced AI model, Gemini 2.5 Deep Think, which is accessible only to subscribers of the $250 AI Ultra plan. This model enhances complex query processing through increased thinking time and parallel analysis, yielding superior results in various benchmarks compared to its predecessors and competitors. Deep Think notably excelled in Humanity's Last Exam, achieving a score of 34.8 percent.
Google has introduced the "nano banana" model, a significant advancement in AI image editing, now available in the Gemini app. This model enhances consistency in edits, allowing for creative modifications of images while retaining recognizable features of the original source. Users can experiment with styles and attire changes without losing the essence of the initial image.
Google has launched the Gemini 2.5 Computer Use model, enhancing the Gemini API with advanced capabilities for interacting with user interfaces across web and mobile platforms. This model allows developers to automate tasks like form filling and UI manipulation while ensuring safety through built-in guardrails. Available for public preview, it aims to streamline software development and enhance personal assistant functionalities.
Google is enhancing its Chrome browser with AI capabilities through the rollout of Gemini, allowing users to interact more directly with web content and integrate services like Calendar and YouTube. This move comes as Google faces increasing competition from AI-driven startups and aims to maintain its dominance in the browser market. New features will also include agentic capabilities that allow users to customize tasks within Chrome.
Google has released the Gemini 2.5 Pro Preview, enhancing its AI model's coding capabilities, particularly for interactive web applications. This early access update allows developers to start building with improved features ahead of the upcoming Google I/O event, where more announcements are expected. Gemini 2.5 Pro leads the WebDev Arena Leaderboard with significant performance improvements.
Deep Think has enhanced the performance of Google's Gemini AI model, significantly improving its capabilities in various applications. The advancements focus on optimizing the model's efficiency and response accuracy, making it more competitive in the AI landscape. This development is expected to influence how users interact with AI technologies across different sectors.
Google’s Gemini AI product has seen significant growth, reaching 350 million monthly active users by March 2025, a notable increase from the tens of millions reported last year. Despite this progress, Google still faces a considerable gap compared to ChatGPT’s user engagement. The company's ongoing enhancements and integrations of Gemini into its ecosystem aim to further boost its usage.
Google has appointed Josh Woodward, head of Google Labs, to lead the Gemini team following the departure of Sissie Hsiao. This leadership change aims to enhance the development of the Gemini app, with Hsiao taking a temporary leave after her long tenure at Google. The Gemini model has shown significant improvements and is considered to have made substantial progress with its latest version, 2.5 Pro.
Google has launched its Gemini AI, which features new capabilities in visual guidance and speech updates, enhancing user interaction through more intuitive and context-aware responses. The updates aim to make AI assistance more practical and effective in everyday tasks.
Google announced new AI products and research at I/O 2025, focusing on the latest advancements with Gemini. The Google AI: Release Notes podcast features discussions with key figures about the launches, including updates to models and developer tools. Listeners can access the full conversation through various podcast platforms.
The course "Enhance Gemini Model Capabilities" focuses on advanced features of Gemini models, teaching participants how to utilize capabilities like code generation, grounding, controlled content generation, and synthetic data creation. Completing this intermediate course allows individuals to earn a skill badge demonstrating their proficiency in building sophisticated AI applications.
Jules has officially launched publicly, transitioning from beta and powered by Gemini 2.5, after significant developer contributions that improved its functionality. The release introduces structured tiers for users, offering higher limits for Google AI Pro and Ultra subscribers, along with enhanced capabilities like GitHub integration and multimodal support.
The article discusses the introduction of memory features in Google's Gemini AI, enhancing its capabilities to remember user preferences and past interactions. By implementing memory, Gemini aims to provide a more personalized and efficient user experience, allowing for better contextual understanding and tailored responses. This shift signifies a notable advancement in AI technology, focusing on user-centric functionalities.