Quit Emailing Yourself

# kubernetes → gke

10 links tagged with all of: kubernetes + gke

Click any tag below to further narrow down your results

+ cloud-computing (3) + ai (2) + cloud (2) + identity-management (1) + autopilot (1) + ai-serving (1) + inference-gateway (1) + llm (1) + ebook (1) + storage (1) + performance (1) + data-cache (1) + automation (1) + ipam (1) + innovation (1)

Links

EKS vs. GKE — Security

The article compares the security features of AWS Elastic Kubernetes Service (EKS) and Google Kubernetes Engine (GKE), focusing on key areas such as identity and access management, network traffic control, configuration management, vulnerability management, and runtime threat detection. It highlights the differences in default settings and capabilities of both managed services, emphasizing aspects like IAM integration, firewall options, and runtime security tools.

Saved by tldr-importer · Last saved October 29, 2025 · 6 min read

+ security + eks gke ✓ kubernetes ✓ + identity-management

Run OpenAI’s new gpt-oss model at scale with GKE | Google Cloud Blog

OpenAI has released its new gpt-oss model, and Google is now supporting its deployment on Google Kubernetes Engine (GKE) with optimized configurations. GKE is designed to manage large-scale AI workloads, offering scalability and performance with advanced infrastructure, including GPU and TPU accelerators. Users can quickly get started with the GKE Inference Quickstart tool, which simplifies the setup and provides benchmarking capabilities.

Saved by tldr-importer · Last saved October 29, 2025 · 2 min read

+ openai gke ✓ + gpt-oss kubernetes ✓ + ai-workloads

10 years of GKE ebook | Google Cloud Blog

Google Kubernetes Engine (GKE) celebrates its 10th anniversary with the launch of an ebook detailing its evolution and impact on businesses. Highlighting customer success stories, including Signify and Niantic, the article emphasizes GKE's role in facilitating scalable cloud-native AI solutions while allowing teams to focus on innovation rather than infrastructure management.

Saved by tldr-importer · Last saved October 29, 2025 · 2 min read

gke ✓ kubernetes ✓ + cloud-computing + ai + ebook

Implementing High-Performance LLM Serving on GKE: An Inference Gateway Walkthrough | Google Cloud Blog

The guide outlines how to deploy large language models (LLMs) at scale using Google Kubernetes Engine (GKE) and the GKE Inference Gateway, which optimizes load balancing by considering AI-specific metrics. It provides a step-by-step walkthrough for setting up an inference pipeline with the vLLM framework, ensuring efficient resource management and performance for AI workloads. Key features include intelligent load balancing, simplified operations, and support for multiple models and hardware configurations.

Saved by tldr-importer · Last saved October 29, 2025 · 6 min read

gke ✓ + llm + inference-gateway kubernetes ✓ + ai-serving

GKE Data Cache, now GA, accelerates stateful apps | Google Cloud Blog

GKE Data Cache is now generally available, enhancing Google Kubernetes Engine's performance for stateful and stateless applications by utilizing high-speed local SSDs as a caching layer for persistent disks. This solution provides significant improvements in read latency and throughput, making it easier to manage data access while potentially lowering costs. Users can configure caching for their workloads with straightforward setup instructions and options for data consistency.

Saved by tldr-importer · Last saved October 29, 2025 · 3 min read

gke ✓ + data-cache kubernetes ✓ + performance + storage

How GKE powers AI innovation | Google Cloud Blog

Google Kubernetes Engine (GKE) is enhancing its capabilities to support AI workloads, with new features like Cluster Director for managing large clusters, GKE Inference Quickstart for simplifying AI model deployment, and GKE Autopilot for optimizing resource usage. These advancements aim to empower platform teams to efficiently scale and manage AI applications without needing to overhaul their existing Kubernetes investments.

Saved by tldr-importer · Last saved October 29, 2025 · 6 min read

kubernetes ✓ + ai gke ✓ + cloud + innovation

GKE Auto-IPAM simplifies IP address management | Google Cloud Blog

GKE auto IPAM simplifies IP address management in Google Kubernetes Engine by dynamically allocating and deallocating IP address ranges as clusters grow, reducing manual intervention and administrative overhead. This feature enhances IP efficiency, prevents IP exhaustion, and supports demanding workloads, making it easier to scale applications confidently.

Saved by tldr-importer · Last saved October 29, 2025 · 2 min read

gke ✓ + ipam kubernetes ✓ + cloud + automation

Multi-subnet support for GKE clusters increases scalability | Google Cloud Blog

Google Kubernetes Engine (GKE) clusters now support multi-subnet functionality, allowing for increased scalability and optimized resource utilization by adding additional subnets to existing clusters. This enhancement helps prevent IP exhaustion by enabling new node pools to utilize new subnets, thus facilitating easier cluster growth without the need for recreation.

Saved by tldr-importer · Last saved October 29, 2025 · 1 min read

gke ✓ kubernetes ✓ + multi-subnet + scalability + cloud-computing

GKE Autopilot now available to all qualifying clusters | Google Cloud Blog

GKE Autopilot is now extended to all qualifying GKE clusters, allowing users to benefit from a fully managed Kubernetes environment that simplifies operations and enhances resource efficiency. This update includes a dynamic container-optimized compute platform and new compute classes, making it easier for customers to scale their workloads while optimizing costs. Users can enable these features by enrolling in the Rapid release channel and upgrading their cluster versions.

Saved by tldr-importer · Last saved October 29, 2025 · 4 min read

gke ✓ + autopilot kubernetes ✓ + cloud-computing + resource-management

Understanding Yahoo Mail's multi-tenant GKE platform design | Google Cloud Blog

Yahoo Mail is migrating its application to Google Cloud using a multi-tenant Google Kubernetes Engine (GKE) platform, incorporating a combination of lift-and-shift and strategic replatforming. The design emphasizes high availability, efficient workload management, and robust security measures, while addressing technical challenges throughout the migration process. The collaboration between Yahoo and Google aims to optimize the mail system for future demands.

Saved by tldr-importer · Last saved October 29, 2025 · 6 min read

+ google-cloud gke ✓ + multi-tenancy + migration kubernetes ✓