10 links
tagged with all of: kubernetes + gke
Click any tag below to further narrow down your results
Links
The article compares the security features of AWS Elastic Kubernetes Service (EKS) and Google Kubernetes Engine (GKE), focusing on key areas such as identity and access management, network traffic control, configuration management, vulnerability management, and runtime threat detection. It highlights the differences in default settings and capabilities of both managed services, emphasizing aspects like IAM integration, firewall options, and runtime security tools.
OpenAI has released its new gpt-oss model, and Google is now supporting its deployment on Google Kubernetes Engine (GKE) with optimized configurations. GKE is designed to manage large-scale AI workloads, offering scalability and performance with advanced infrastructure, including GPU and TPU accelerators. Users can quickly get started with the GKE Inference Quickstart tool, which simplifies the setup and provides benchmarking capabilities.
Google Kubernetes Engine (GKE) celebrates its 10th anniversary with the launch of an ebook detailing its evolution and impact on businesses. Highlighting customer success stories, including Signify and Niantic, the article emphasizes GKE's role in facilitating scalable cloud-native AI solutions while allowing teams to focus on innovation rather than infrastructure management.
The guide outlines how to deploy large language models (LLMs) at scale using Google Kubernetes Engine (GKE) and the GKE Inference Gateway, which optimizes load balancing by considering AI-specific metrics. It provides a step-by-step walkthrough for setting up an inference pipeline with the vLLM framework, ensuring efficient resource management and performance for AI workloads. Key features include intelligent load balancing, simplified operations, and support for multiple models and hardware configurations.
GKE Data Cache is now generally available, enhancing Google Kubernetes Engine's performance for stateful and stateless applications by utilizing high-speed local SSDs as a caching layer for persistent disks. This solution provides significant improvements in read latency and throughput, making it easier to manage data access while potentially lowering costs. Users can configure caching for their workloads with straightforward setup instructions and options for data consistency.
Google Kubernetes Engine (GKE) is enhancing its capabilities to support AI workloads, with new features like Cluster Director for managing large clusters, GKE Inference Quickstart for simplifying AI model deployment, and GKE Autopilot for optimizing resource usage. These advancements aim to empower platform teams to efficiently scale and manage AI applications without needing to overhaul their existing Kubernetes investments.
GKE auto IPAM simplifies IP address management in Google Kubernetes Engine by dynamically allocating and deallocating IP address ranges as clusters grow, reducing manual intervention and administrative overhead. This feature enhances IP efficiency, prevents IP exhaustion, and supports demanding workloads, making it easier to scale applications confidently.
Google Kubernetes Engine (GKE) clusters now support multi-subnet functionality, allowing for increased scalability and optimized resource utilization by adding additional subnets to existing clusters. This enhancement helps prevent IP exhaustion by enabling new node pools to utilize new subnets, thus facilitating easier cluster growth without the need for recreation.
GKE Autopilot is now extended to all qualifying GKE clusters, allowing users to benefit from a fully managed Kubernetes environment that simplifies operations and enhances resource efficiency. This update includes a dynamic container-optimized compute platform and new compute classes, making it easier for customers to scale their workloads while optimizing costs. Users can enable these features by enrolling in the Rapid release channel and upgrading their cluster versions.
Yahoo Mail is migrating its application to Google Cloud using a multi-tenant Google Kubernetes Engine (GKE) platform, incorporating a combination of lift-and-shift and strategic replatforming. The design emphasizes high availability, efficient workload management, and robust security measures, while addressing technical challenges throughout the migration process. The collaboration between Yahoo and Google aims to optimize the mail system for future demands.