Click any tag below to further narrow down your results
Links
AWS has introduced a Responsible AI Lens and updated its Machine Learning and Generative AI Lenses within the Well-Architected Framework. These updates aim to help professionals design and manage AI systems with a focus on ethics, risk management, and operational best practices.
This article discusses Autocomp, a framework designed to optimize code for tensor accelerators using large language models. It highlights how Autocomp outperforms human experts in efficiency and portability, particularly when applied to AWS Trainium. The authors explore the challenges of programming tensor accelerators and the unique optimizations required for effective performance.
Explore a wide range of AI and data tools available in the AWS Marketplace, designed to enhance data integration, machine learning, and analytics capabilities. Users can access these tools with their AWS account, starting with free trials and flexible pay-as-you-go billing. The platform provides technical guidance and top tools for various data-driven use cases within an AWS environment.
Vector search for Amazon ElastiCache is now generally available, allowing customers to index and search billions of high-dimensional vector embeddings with low latency and high recall. It is particularly useful for applications such as semantic caching for large language models, recommendation engines, and anomaly detection. Users can implement this feature on new or existing clusters by upgrading to Valkey version 8.2 at no additional cost.