10 links
tagged with all of: nvidia + gpu
Click any tag below to further narrow down your results
Links
NVIDIA has introduced native Python support for its CUDA platform, which allows developers to write CUDA code directly in Python without needing to rely on additional wrappers. This enhancement simplifies the process of leveraging GPU capabilities for machine learning and scientific computing, making it more accessible for Python users.
NVIDIA CEO Jensen Huang promoted the benefits of AI during his visits to Washington, D.C. and Beijing, meeting with officials to discuss AI's potential to enhance productivity and job creation. He also announced updates on NVIDIA's GPU applications and emphasized the importance of open-source AI research for global advancement and economic empowerment.
Alibaba Cloud has introduced a new pooling system that reportedly reduces the use of Nvidia GPUs by 82%. This innovative approach aims to optimize cloud resource management and enhance efficiency for users relying on high-performance computing. The initiative reflects Alibaba's efforts to compete in the cloud services market against other major players.
Nvidia has introduced DGX Cloud Lepton, a service that expands access to its AI chips across various cloud platforms, targeting artificial intelligence developers. This initiative aims to connect users with Nvidia's network of cloud providers, enhancing the availability of its graphics processing units (GPUs) beyond major players in the market.
NVIDIA's new Rubin CPX technology is set to challenge AMD's current strategies, potentially forcing them to reevaluate their approach in the competitive GPU market. The advancements in performance and efficiency presented by NVIDIA could shift the balance, prompting AMD to innovate further to keep up.
AWS has announced updates to the pricing and usage model for Amazon EC2 instances powered by NVIDIA GPUs, including the introduction of savings plans for P6-B200 instances and significant price reductions for P5, P5en, P4d, and P4de instances. These changes, effective June 2025, aim to enhance accessibility to advanced GPU computing across various global regions.
The author critiques NVIDIA's design decisions regarding their RTX 40 and 50 series GPUs, particularly focusing on the problematic 12VHPWR power connector and its inherent flaws that lead to overheating issues. The article also discusses the company's reliance on proprietary technologies and the stagnant performance of ray tracing, questioning the value of high-priced graphics cards that still require upscaling to achieve acceptable frame rates in demanding games.
Researchers have successfully demonstrated a Rowhammer attack against the GDDR6 memory of an NVIDIA A6000 GPU, revealing that a single bit flip could drastically reduce the accuracy of deep neural network models from 80% to 0.1%. Nvidia has acknowledged the findings and suggested enabling error-correcting code (ECC) as a mitigation strategy, although it may impact performance and memory capacity. The researchers have also created a dedicated website for their proof-of-concept code and shared their detailed findings in a published paper.
Nvidia has introduced a new GPU specifically designed for long context inference, aimed at enhancing performance in AI applications that require processing extensive data sequences. This innovation promises to improve efficiency and effectiveness in complex tasks, catering to the growing demands of AI technologies.
Alibaba's new AI chip is designed to compete directly with NVIDIA’s H200, aiming to capture a share of the growing AI hardware market. The chip boasts advanced capabilities tailored for AI workloads and is positioned to challenge NVIDIA's dominance in the sector. With significant investments in AI technology, Alibaba is poised to leverage its infrastructure to enhance performance and efficiency.