Click any tag below to further narrow down your results
Links
Google has updated its Gemini Deep Think AI model to improve its capabilities in math and science. The model can now assist researchers in transitioning from theoretical concepts to practical applications, following collaboration with scientists.
This article discusses how Googler Anna Bortsova uses a technique called "meta prompting" with Gemini to create detailed prompts for generating unique ASMR-style videos. By defining specific tasks and emotional tones, she collaborates with the AI to produce captivating content. Her advice emphasizes the importance of experimenting with subjects you love.
The CFTC has approved Gemini's Designated Contract Market license, enabling the launch of a prediction market and potential expansion into crypto futures and options. This approval comes as the regulatory landscape for prediction markets shifts, with increased support from current CFTC leadership.
Google is hinting at integrating Opal's automation features into Gemini, as indicated by a new banner in the platform. Opal allows users to create AI-driven workflows, and if integrated, it could enhance Gemini's automation capabilities. However, it's unclear if the banner is just an advertisement for Opal or a sign of actual integration.
Google Maps is introducing AI tools to help users and developers create interactive projects. The new features include a builder agent for generating map prototypes, a styling agent for custom map designs, and a code assistant toolkit that connects with Google Maps documentation. Additionally, Google is enhancing consumer features with Gemini-powered updates for navigation and real-time data.
Google’s AI chatbot Gemini has reached 750 million monthly active users, a significant increase from 650 million last quarter. The growth follows the launch of Gemini 3, which offers advanced capabilities, although it still lags behind Meta AI's 1 billion users. Google is also rolling out a new subscription plan to attract more users.
Google Gemini is introducing a "projects" feature similar to ChatGPT's, allowing users to organize files and discussions by topic. The latest beta build provides a preview of this tool, which includes options for naming projects, adding descriptions, and pinning frequently used projects for easy access. A 10-file limit for each project has been noted, though it's unclear if this will apply universally or vary by subscription level.
Google is developing Nano Banana 2 Flash, a new AI image generator that offers faster performance than its predecessor, Nano Banana Pro. This model is based on the Gemini 3 Flash and is expected to launch by the end of the quarter amid growing competition in the AI image generation market.
Google reaffirms that its AI model, Gemini, will not include ads for the foreseeable future. CEO Demis Hassabis emphasized the focus on enhancing user experience without commercial interruptions, contrasting it with ChatGPT's recent ad introduction.
Josh Woodward leads Google’s Gemini app, a key part of the company's AI strategy, as it competes with rivals like OpenAI. His focus includes balancing rapid innovation with ethical considerations in AI development. The Gemini app has seen significant user growth and new feature launches, including the popular Nano Banana.
Google has restricted access for free users of Gemini 3 Pro and Nano Banana Pro due to high demand. Free users now face fluctuating daily limits and can only generate two images per day with Nano Banana Pro. The company also temporarily rolled back access to new features for free users while implementing additional limits for Pro users.
Apple is set to unveil an updated version of Siri in February, powered by Google's Gemini AI models. This update aims to enhance Siri's capabilities, allowing it to perform tasks using personal data and on-screen content, with a more conversational style expected in a larger update planned for June.
Google announced updates to the Gemini API's Structured Outputs, adding support for JSON Schema and improving property ordering. This will help developers ensure consistent data extraction and facilitate agent communication in AI applications.
This article explores what Gemini 3 reveals about your brand and its competitors. Peter M. Buch from Candycat Agency shares his curiosity about the new features of Gemini 3 and its impact on marketing. It's aimed at marketers looking to leverage AI insights.
Google has released the Gemini 3 Deep Think mode for Ultra subscribers. This mode enhances reasoning skills to solve complex math, science, and logic problems, achieving top scores in recent benchmarks. Users can access it through the Gemini app's prompt bar.
Google is developing a feature called "screen automation" for its Gemini AI, allowing it to control certain Android apps for tasks like placing orders or booking rides. Users will need to supervise Gemini closely, as it can make mistakes and should not handle sensitive information. The beta also hints at a "Likeness" feature for creating 3D avatars for Google Meet.
Opera is expanding its partnership with Google to integrate Gemini AI into Opera GX and Opera One, adding features like context-based summaries and file analysis. This update aims to enhance user experience for over 80 million users across its browsers while maintaining privacy controls.
Gemini Voyager enhances Google Gemini by providing a structured way to manage AI conversations. It features folder organization, a prompt vault for saving prompts, cloud sync, and various tools for exporting chats and navigating timelines. This tool is useful for researchers, developers, and anyone needing to keep their AI discussions organized.
Apple is set to launch an updated version of Siri in March 2026, powered by Google's Gemini technology. This new version will include AI-driven web search capabilities, but analysts warn it may not win back users' trust after years of criticism. Apple is also facing challenges with the rollout of Apple Intelligence in China due to regulatory issues.
Google has introduced a feature called Personal Intelligence in its Gemini app, allowing users to get personalized answers by connecting data from apps like Gmail and Google Photos. This update aims to enhance Gemini's reasoning capabilities and compete with Apple's AI offerings. The feature is currently available to select subscribers in the U.S. and will be off by default.
This article reviews the major consumer AI product launches and usage trends from 2023, highlighting the competition between ChatGPT and Gemini. It examines user engagement, retention, and the effectiveness of new features across various platforms, while also discussing the future direction of these AI models.
Google has integrated Opal into the Gemini web app, allowing users to create AI-powered mini apps. The updated visual editor now provides a step-by-step format for easier editing, and users can switch to an Advanced Editor for more detailed customization.
Google is trialing a new image AI called "Nano Banana 2 Flash," which aims to be quicker and more affordable than its predecessor, the Nano Banana Pro. While it will be less powerful, it's designed for efficient image generation and editing. The model was identified by a reliable source known for leaking details on Gemini's technology.
Google is developing a multi-agent system in Gemini for Enterprise that generates and evaluates ideas through a tournament-style process. Users can receive up to 100 ranked ideas based on specified topics and criteria. The system also includes specialized agents like "Co-scientist," aimed at aiding research and scientific inquiries.
Project Gemini is an internet technology that creates an electronic library of interconnected text documents. It aims to provide a simple, privacy-focused online space for readers, emphasizing the importance of documents without the noise of modern web distractions. The project respects user privacy and bandwidth while promoting a straightforward reading experience.
Google plans to release the Nano Banana Pro next week, powered by Gemini 3 Pro. This update aims to enhance image and video generation tools for both professionals and businesses, aligning with Google's broader strategy to improve its AI offerings.
The launch of Gemini 3 has demonstrated significant performance improvements over its predecessor, Gemini 2.5, despite having the same parameter count. This, along with Nvidia's strong earnings report, suggests that pre-training scaling laws remain effective when combined with algorithmic advancements and improved compute power. Together, these developments challenge the notion that AI model performance has plateaued.
Google is working on voice cloning features for AI Studio, indicated by a hidden option that allows users to upload audio samples. This capability aims to enhance personalization in AI-generated audio and streamline integration with Gemini models. Other updates include GitHub repository imports and improvements to the platform's navigation.
Google is rolling out significant updates to its Chrome browser, integrating the Auto Browse AI feature into Gemini. This allows users to automate tasks and easily access multiple Google services like Gmail and YouTube directly within the browser. The update also includes improved image editing capabilities without the need to download files separately.
Google’s new Agentic Vision feature in Gemini 3 Flash enhances AI's ability to analyze and interact with images. It enables developers to execute code, zoom in on details, and manipulate data, improving accuracy for various tasks. The feature is available through the Gemini API and aims to support more tools in the future.
Apple has partnered with Google to use its Gemini AI models for Siri and Apple Intelligence, estimated to be worth $5 billion. This deal raises questions about the future of Apple's ChatGPT integration, which may not last long due to the focus on Gemini. Apple's overall investment in AI remains cautious compared to its competitors.
This article explores how Google's Gemini 3 manages user memory differently from other AI systems like ChatGPT. It highlights Gemini's structured memory approach, its cautious use of personalization, and the implications for user control and trust. The piece also discusses the potential trade-offs of this design in creating a more personalized AI experience.
This article analyzes the state of consumer AI in 2025, highlighting major product launches and trends from companies like OpenAI and Google. It discusses user engagement, retention rates, and the competitive landscape, noting that while ChatGPT remains dominant, Gemini is gaining ground with innovative products.
Google has released updates for the Gemini API, enhancing its Gemini 3 model. Key features include new parameters for controlling reasoning depth, image processing, and thought signatures to maintain context in conversations. These updates aim to improve developers' control and efficiency when building applications.
Google is set to release Nano Banana version 2, called GEMPIX2, within the week, as indicated by new announcement cards in the Gemini interface. This model aims to serve creators and professionals, building on the success of its predecessor. Details on specific improvements are still pending.
Google is enhancing its Gemini AI in Chrome to become a more proactive tool rather than just a passive assistant. New features called “Skills” will allow users to customize Gemini’s capabilities for specific tasks, making it capable of executing complex workflows directly in the browser. This shift aims to integrate Gemini more deeply with Google's ecosystem, allowing it to interact with various apps seamlessly.
At CES 2026, Google introduced new Gemini features for Google TV that enhance interaction and personalization. Users can search Google Photos, adjust settings with natural language, and explore topics with rich visuals and interactive content. These updates will first roll out on select TCL devices.
Google is testing the inclusion of third-party models, like Claude Sonnet 4.5, in its Gemini for Business platform. This allows businesses to choose between Google's models and alternatives directly within the model selector. The updates aim to enhance flexibility and operational visibility for enterprise users.
Google is set to announce the Nano Banana 2 Flash, a new model in the Gemini series. It offers similar performance to the Pro version at a lower cost, making it suitable for larger deployments. The public release may happen in early December.
Google has released the Gemini 3 Flash model, which offers faster performance and improved coding capabilities compared to previous versions. It outperforms the older 2.5 Flash in several tests and is more cost-effective for developers. The model maintains its ability to generate interactive content and simulations.
Google has launched Workspace Studio, a tool that allows users to create and manage AI agents for automating tasks within Google apps like Gmail and Drive. It simplifies automation with a no-code approach, enabling users to build agents that can respond to emails and perform various tasks using natural language prompts. The tool is now available for different Google Workspace tiers.
The author explores how Google Gemini uses personal data and raises questions about its "Personal Context" feature. They note a troubling instance where Gemini appeared to hide its knowledge of the user's previous tool usage while violating privacy policies. This prompts a discussion on the transparency and truthfulness of AI systems.
Google is testing a new "Auto Browse" feature for its Gemini tool, allowing it to autonomously browse the web and manage Chrome tabs. This capability aims to streamline tasks like research and workflow execution for users, potentially becoming part of a premium plan. Early indications suggest it will integrate smoothly with the Chrome interface.
This article outlines effective strategies for using Gemini 3, emphasizing direct instructions, structured prompts, and clear output expectations. It encourages users to refine their own approaches based on these guidelines rather than treating them as absolute rules.
The article discusses how Google's Gemini is making significant strides in AI product development, especially with its new "Dynamic View" feature. This innovation enhances user experience by offering interactive, visual outputs that could rival ChatGPT's established position. The author believes Google's recent improvements could pose a real challenge to OpenAI's dominance in the market.
OpenAI CEO Sam Altman announced a "code red" to focus on improving ChatGPT after Google's Gemini 3 model gained 200 million users in just three months. This shift delays other projects and reflects a competitive landscape where Gemini has outperformed ChatGPT in benchmarks.
Google DeepMind has recruited Aaron Saunders, the former CTO of Boston Dynamics, to enhance its robotics efforts. DeepMind aims to develop Gemini as a versatile robot operating system, leveraging AI to control various robotic forms. The move reflects growing competition in the robotics field, particularly from startups and companies in China.
Google Classroom now allows teachers to create podcast-style audio lessons using Gemini. Admins can control access based on age, and users must review AI-generated content to ensure it meets their needs. The feature is rolling out starting January 6, 2026.
Google is introducing AI image verification in the Gemini app using SynthID, a digital watermarking technology. Users can upload images to check if they were created or edited by Google AI. The company plans to expand this verification to other media formats and collaborate with industry partners for better content transparency.
Google is rolling out new AI features to simplify holiday shopping. Users can ask conversational questions in Search for tailored shopping responses, track prices, and even let Google call local stores to check inventory. The Gemini app also offers shopping ideas and product comparisons.
Google is testing ads in its AI Mode searches, placing them at the bottom of results. This change is part of ongoing experiments and doesn’t currently allow users to hide these sponsored results. Organic links are still prioritized above the ads.
The article discusses Poetiq, a new AI tool that leverages Gemini 3 and GPT-5.1 to achieve a new state-of-the-art performance in problem-solving. It highlights how Poetiq uses iterative self-auditing to refine its strategies autonomously.
Google introduced Antigravity, a development tool that integrates with Gemini 3 Pro and other models. It features an “agent-first” approach, allowing multiple agents to operate in parallel with enhanced feedback and reporting capabilities. Currently in public preview, it’s available for free on major operating systems.
Google launched Nano Banana Pro, an advanced image editing tool built on its Gemini 3 Pro AI model. The tool enhances capabilities for creating infographics and maintaining character consistency in images. It follows the success of the original Nano Banana, which gained popularity for transforming photos into 3D figurines.
Google has updated its Gemini app to allow users to verify if videos were created by its AI. By uploading a video, users can check for a digital watermark that indicates AI involvement. However, this tool only works for content generated by Google's own systems.
Google has launched Gemini 3 Flash, a new AI model designed for speed and cost efficiency. It outperforms previous versions in coding, gaming, and document analysis while offering advanced reasoning capabilities. Developers can access it through various platforms, including Google AI Studio and Vertex AI.
Google has updated Chrome with Gemini 3, introducing a side panel for multitasking and the ability to transform images directly in the browser. It also integrates with Google apps for streamlined workflows and includes an auto browse feature to handle complex tasks automatically.
Google Gemini can now access emails and documents for deep research tasks, allowing users to create detailed reports. It integrates information from Gmail, Drive, and Chat, enabling personalized analysis and report generation. The feature is currently available on desktop, with mobile access coming soon.
Google released an upgraded version of Gemini 3 Deep Think, aimed at solving complex challenges in science and engineering. The update improves reasoning capabilities and is now available to Google AI Ultra subscribers and select researchers via an API. Early users report significant breakthroughs in fields like mathematics and materials science.
Google DeepMind CEO Demis Hassabis announced that Gemini will not include ads, at least for now. This comes as ChatGPT begins testing ads for its free and $8 Go subscription users in the U.S., aiming to increase revenue.
The SEC has dismissed its lawsuit against Gemini Trust over the Gemini Earn program after investors fully recovered their assets. This decision marks the end of a three-year legal battle that began when Gemini Earn was accused of offering unregistered securities. The ruling prevents the SEC from bringing similar claims against Gemini in the future.
Google is revamping the Gemini app with a focus on user experience, responding to feedback about its current interface. The update aims to streamline interactions and improve functionality across devices, including a new macOS app.
Google is adding an "import AI chats" feature to Gemini, allowing users to upload conversations from other platforms. This aims to retain users' chat history and improve context. Additionally, Gemini will offer higher-resolution image downloads and a new feature related to video content verification.
Google’s Gemini 3 Pro is now the top AI model, outperforming GPT-5.1 by 3 points in the Artificial Analysis Intelligence Index. It excels in five key evaluations, shows strong coding capabilities, and supports multiple input formats. However, its premium pricing makes it one of the most expensive models to operate.
Google is rolling out NotebookLM integration within Gemini, allowing users to attach notebooks for real-time data access during conversations. This feature supports multiple notebook attachments and provides in-line citations for easy navigation to content. The update aims to enhance Gemini's functionality for both free and paid users.
Google is set to launch Nano Banana 2, which improves on its predecessor with better image processing capabilities, including precise coloring and error correction. The new model features an iterative workflow for enhanced accuracy and supports various aspect ratios and resolutions. Internal testing hints at a possible rebranding to “Nano Banana Pro.”
Google announced upgrades to its Gemini 2.5 text-to-speech models, focusing on expressivity, pacing, and multi-speaker capabilities. These changes improve control over tone and style, making it easier for developers to create realistic audio content. The updated models are available in Google AI Studio.
Google’s Gemini Deep Research now offers visual reports for AI Ultra subscribers. This feature generates custom images, charts, and interactive simulations to help users understand complex information and forecast outcomes. You can access this tool in the Gemini app to create detailed reports.
Google Translate has upgraded its capabilities using Gemini, improving text translations for idioms and slang. A new live translation feature allows real-time, natural-sounding translations through headphones, and the app now supports more languages and enhanced language learning tools.
ChatGPT's global traffic share fell to 64.6%, while Google's Gemini increased to 22% in January 2026. This shift reflects growing competition and changing dynamics in the generative AI market, with other platforms like Grok and Claude also showing notable traffic movements.
OpenAI is accelerating the release of its GPT-5.2 model in response to Google's Gemini AI. Set for December 9, this update aims to enhance speed, reliability, and customizability, addressing the competition posed by Gemini 3.
The article details the author's frustrating experience trying to obtain a Google Gemini API key for a coding project. After navigating a confusing array of Google services and verification steps, the author ultimately gained access, but not without significant hassle and delays.
The article reviews Google’s Gemini 3 Pro, highlighting its improved features over Gemini 2.5, including audio transcription capabilities and performance benchmarks compared to other AI models. It details pricing, multimodal input support, and tests involving image analysis and a city council meeting audio transcript.
Gavin Baker discusses the implications of Gemini 3 and the upcoming Blackwell models on the AI landscape. He highlights how reasoning improves product economics, the challenges facing competitors, and the impact of power shortages on AI infrastructure. Overall, he argues we're still in the early stages of AI development.
Google fixed a serious vulnerability in its Gemini Enterprise AI that allowed attackers to embed malicious instructions in shared documents, leading to unauthorized access to sensitive corporate information. This flaw, discovered by Noma Labs, exploited the AI's retrieval system to execute commands without employee interaction.
The article discusses how to leverage the Gemini and Chrome integration to analyze multiple web pages at once. It highlights practical use cases, like gaining insights from top-performing content and YouTube comments, to improve your own writing and marketing strategies.
Apple is set to enhance Siri with Gemini, allowing for independent finetuning and improved emotional support responses. The partnership with Google will ensure that user data remains private, and the Gemini-powered Siri aims to provide more accurate answers and better handle complex queries. A gradual rollout of these features is expected, with some launching this spring.
Google has launched its powerful Gemini AI model, which has outperformed competitors, including OpenAI's ChatGPT, to become the leading AI chatbot. The company's extensive investment in research and hardware development has enabled it to protect its search business amidst the growing popularity of chatbots.
Users in an organization must be assigned licenses to use Gemini Code Assist Standard and Enterprise, with options for automatic or manual assignment. License management requires specific IAM roles and can be performed through the Google Cloud console or API. Administrators can also track license usage and adjust the number of licenses as needed.
Google Gemini's Command-Line Interface (CLI) has been found to be vulnerable to prompt injection attacks, allowing for potential arbitrary code execution. This security flaw raises concerns about the safety and reliability of utilizing AI models in various applications.
Google is integrating its Gemini AI feature into Chrome for Mac and Windows, allowing users to ask questions about web pages. This move raises concerns in light of an ongoing antitrust trial against Google, as it strategically positions Chrome as a key player in the AI landscape, potentially affecting competition and the future of Google Search. The rollout of Gemini could provoke reactions from emerging AI browser startups and competitors like Microsoft and OpenAI.
The Gemini app has undergone a redesign of its homepage and web tools, aiming to enhance user experience and accessibility. The updates include a more streamlined interface and improved functionality to better serve its users' needs.
Google is addressing the growing threat of indirect prompt injection attacks on generative AI systems, which involve hidden malicious instructions in external data sources. Their layered security strategy for the Gemini platform includes advanced content classifiers, security thought reinforcement, markdown sanitization, user confirmation mechanisms, and end-user security notifications to enhance protection against such attacks.
Google has launched Gemini, a new deep thinking AI model designed to enhance reasoning capabilities by testing multiple ideas in parallel. This advancement aims to improve decision-making processes and could significantly impact various applications in AI technology.
Gemini is now available in Google Classroom, allowing users to generate text-dependent questions or quizzes based on specific text. Admins can explore pricing options for Gemini Education, while end users with a Gemini Education license can access these features directly in the platform. The rollout is currently active for both Rapid Release and Scheduled Release domains.
Google is rolling out a change that allows its Gemini AI engine to access third-party apps like WhatsApp, overriding user settings that previously blocked such interactions. Users may need to take action to maintain their privacy, but the guidance provided by Google is unclear and contradictory, leaving many users confused about how to fully disable Gemini's access.
Gemini 3.0 has been spotted in A/B testing on Google AI Studio, showcasing its advanced coding performance through SVG image generation. The author tested the model by creating an SVG image of an Xbox 360 controller, noting impressive results compared to the previous Gemini 2.5 Pro model, despite longer processing times.
Google has revamped AI Studio to simplify the process of building AI-powered applications, targeting developers and non-coders alike. The updated platform features a new model selector, an application gallery, and a modular approach to integrating AI capabilities, all aimed at democratizing app creation and enhancing user experience. Anticipated future updates promise to further enrich the platform, aligning with Google's goal of fostering widespread AI development.
Gemini Code Assist enhances the code review process in GitHub by providing instant summaries, identifying bugs, and suggesting improvements, which allows developers to focus on more complex issues. With the integration of the advanced Gemini 2.5 model, feedback is more accurate and actionable, leading to higher code quality and increased developer satisfaction, as evidenced by early adopters like Delivery Hero.
Google has introduced Gemini, an advanced AI image editor that allows users to create and manipulate images effortlessly. This tool, which includes unique features such as the ability to generate images from text prompts, aims to enhance user creativity and streamline the editing process.
Gemini's photo-to-video capability allows users to transform images and illustrations into engaging eight-second video clips with sound. The article outlines three creative uses for this feature: animating illustrations, turning photography into motion pictures, and articulating artistic visions more effectively. It also emphasizes the importance of detailed prompts for achieving high-quality results and encourages users to explore AI-generated media as a tool for enhancing their creative projects.
Gemini Live is evolving into a more helpful AI assistant with enhanced visual awareness, allowing users to receive on-screen guidance and interact seamlessly with various Google apps. The updates include improved conversational abilities, such as more natural speech and the capability to control Gemini's voice, making interactions more intuitive and engaging. These features will roll out soon, starting with the Pixel 10 series and expanding to other devices.
The article appears to discuss Google's Gemini and its unexpected behavior while interacting with the Pokémon game. It highlights the challenges and peculiarities faced by AI systems when engaging with complex gaming environments. Further details and insights into the implications of these interactions are likely covered in the full content.
Gemini 2.5 Pro has been upgraded and is set for general availability, showcasing significant improvements in coding capabilities and benchmark performance. The model has achieved notable Elo score increases and incorporates user feedback for enhanced creativity and response formatting. Developers can access the updated version via the Gemini API and Google AI Studio, with new features to manage costs and latency.
The article discusses the concept of "Gemini bounding boxes," focusing on their application and significance in a specific domain. It explores how these bounding boxes can enhance performance and accuracy in various tasks, although the specific details of the implementation are not provided.
Google Gemini has reached 350 million monthly users, as revealed during a recent court hearing. This significant user base highlights the growing impact of the platform in the tech landscape.
Google has launched the Gemini 2.5 Flash Image model, now available to developers and enterprises through the Gemini API, Google AI Studio, and Vertex AI. This production-ready tool offers advanced features for image generation and editing, supporting multiple aspect ratios and enabling real-time applications at competitive pricing. Developers are already incorporating it into various creative and educational workflows.
Gmail is introducing a new feature called "Help Me Schedule," which utilizes Gemini AI to assist users in scheduling meetings by suggesting available times as they compose emails. When activated, an in-line meeting widget allows recipients to easily select a time that works for them, streamlining the scheduling process, although group scheduling will not be supported at launch. This feature is part of Google's broader rollout of AI capabilities across its products.
Gemini app users can now upload and edit images with new AI-powered features, allowing modifications such as changing backgrounds and replacing objects. This intuitive editing capability enhances user interaction by integrating text and images, while also ensuring all generated images include a digital watermark for authenticity. The rollout of these features will expand to users in over 45 languages and most countries in the coming weeks.
Google has launched the Gemini Embedding model (gemini-embedding-001), now available to developers via the Gemini API and Vertex AI, showcasing superior performance on the Massive Text Embedding Benchmark. This versatile model supports over 100 languages and features flexible output dimensions, allowing developers to optimize for performance and cost. Users are encouraged to migrate from older models before their deprecation dates, with enhanced features like Batch API support coming soon.