The article discusses recent advancements in Kubernetes GPU management, focusing on dynamic resource allocation (DRA) and a new workload abstraction. DRA allows for more flexible GPU requests, while the workload abstraction aims to improve scheduling for complex AI deployments.
Nvidia is reportedly no longer providing VRAM to its GPU partners, pushing them to source memory independently amid a worsening memory shortage. This change could strain smaller vendors, while larger ones may adapt more easily. The rumor raises concerns about increased GPU prices and market confusion.
CoreWeave has raised over $25 billion to finance its GPU infrastructure, but its complex financing structure reflects significant market risks. The lack of a liquid forward curve for GPU compute leads to high borrowing costs and uncertain residual values. As market infrastructure develops, CoreWeave's competitive advantage may diminish.
The article explores the potential for a new era in computing, driven by cheap GPU supercomputers and innovative applications in various fields. It argues that while current large language models have limitations, the real advancements will come from leveraging these technologies in underserved industries, leading to breakthroughs in science and engineering.
LMCache is an engine designed to optimize large language model (LLM) serving by reducing time-to-first-token (TTFT) and increasing throughput. It efficiently caches reusable text across various storage solutions, saving GPU resources and improving response times for applications like multi-round QA and retrieval-augmented generation.
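The core idea behind KV-cache reuse can be illustrated with a toy sketch (this is an illustration of the general prefix-caching concept, not LMCache's actual implementation): previously computed prefixes are stored, and a new request only needs fresh computation for the tokens past its longest cached prefix.

```python
class PrefixCache:
    """Toy illustration of prefix reuse: map token prefixes to a stand-in
    for precomputed KV state, so only the unseen suffix needs compute."""
    def __init__(self):
        self._store = {}  # tuple(tokens) -> cached state (here: prefix length)

    def put(self, tokens):
        # Store every prefix of the sequence as reusable.
        for i in range(1, len(tokens) + 1):
            self._store[tuple(tokens[:i])] = i  # stand-in for real KV tensors

    def longest_prefix(self, tokens):
        # Longest already-cached prefix of the incoming request.
        for i in range(len(tokens), 0, -1):
            if tuple(tokens[:i]) in self._store:
                return i
        return 0

cache = PrefixCache()
cache.put([1, 2, 3, 4])
hit = cache.longest_prefix([1, 2, 3, 9, 9])
print(hit)  # 3 tokens reused; only the last 2 need fresh computation
```

Real systems cache large KV tensors across GPU, CPU, and disk tiers, but the lookup logic reduces to this longest-shared-prefix match.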
This article details the engineering behind Modal Notebooks, a cloud-based Jupyter notebook that provides fast GPU access and real-time collaboration. It covers the systems work involved in achieving low-latency performance, efficient container management, and persistent storage for interactive computing.
This article discusses the evolution of Nvidia's architectures from Volta to Blackwell, highlighting strengths and weaknesses. It also examines performance trade-offs and potential future developments in the Vera Rubin architecture. The insights stem from a combination of practical experience and recent industry discussions.
OpenAI is partnering with AMD to secure up to six gigawatts of GPUs, starting with the MI450 model in 2026. The deal includes stock warrants that could give OpenAI about 10% ownership of AMD, providing a significant boost to its computing resources amidst rising AI demand.
This article discusses the growing complexity of graphics APIs and the issues caused by outdated designs. It argues for a streamlined approach that better matches modern GPU capabilities, particularly in relation to the overwhelming size of pipeline state object caches. The author critiques the historical evolution of these APIs and suggests that it's time to rethink their structure.
The article outlines the rapid growth of the AI market, expected to reach $3.6 trillion by 2034. It highlights the importance of GPUs for AI infrastructure and lists several promising crypto projects, including Monai and Blaster, which are gaining attention from key opinion leaders.
The article details Modal's approach to maintaining the health of over 20,000 GPUs across various cloud providers. It covers instance selection, machine image preparation, boot checks, and ongoing health monitoring to ensure performance and reliability. The insights aim to guide others in effectively utilizing cloud GPUs.
This article explains how the Triton compiler uses warp specialization to enhance GPU kernel performance. By creating specialized code paths for each warp, it reduces control flow divergence and optimizes resource usage. The post also outlines current implementations and future development plans within the Triton community.
The article discusses how rapid advancements in GPU technology could lead to significant depreciation issues for AI hyperscalers. As companies upgrade frequently to stay competitive, they may find their investments in hardware losing value much faster than anticipated, especially amid rising costs and uncertain profitability in the AI sector.
This article explains tensor parallelism (TP) in transformer models, focusing on how it allows for efficient matrix multiplication across multiple GPUs. It details the application of TP in both the Multi-Head Attention and Feed-Forward Network components, highlighting its constraints and practical usage with the Hugging Face library.
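The two TP patterns the article describes can be verified with plain Python arithmetic (a minimal sketch with lists standing in for GPU shards): column parallelism splits a weight matrix by columns and concatenates outputs, while row parallelism splits by rows and sums partial results, which is the all-reduce step.

```python
def matmul(A, B):
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

X = [[1., 2.]]           # activations (batch=1, hidden=2)
W1 = [[1., 2., 3., 4.],
      [5., 6., 7., 8.]]  # hidden -> 4*hidden (FFN up-projection)

# Column parallelism: each "GPU" holds half of W1's columns;
# the shard outputs are simply concatenated.
W1_a = [row[:2] for row in W1]
W1_b = [row[2:] for row in W1]
Y_a, Y_b = matmul(X, W1_a), matmul(X, W1_b)
Y = [Y_a[0] + Y_b[0]]
assert Y == matmul(X, W1)

# Row parallelism: each GPU holds half of W2's rows plus the matching
# activation shard; partial products are summed (the all-reduce).
W2 = [[1.], [2.], [3.], [4.]]  # 4*hidden -> hidden (down-projection)
Z_a = matmul([Y[0][:2]], W2[:2])
Z_b = matmul([Y[0][2:]], W2[2:])
Z = [[Z_a[0][0] + Z_b[0][0]]]
assert Z == matmul(Y, W2)
print(Z)
```

Chaining column-parallel then row-parallel layers is what lets an FFN block run with a single all-reduce at the end, which is why the pattern maps so cleanly onto Feed-Forward and attention projections.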
This article explains how NetBird created a distributed AI inference infrastructure that connects GPU resources across various cloud providers. It highlights the ease of multi-cloud networking using existing technologies without the usual complications of VPNs and firewall configurations.
Moore Threads introduced its "Huagang" architecture at the MUSA Developer Conference, promising substantial performance boosts for gaming and AI. The upcoming "Lushan" GPU claims a 15x improvement in gaming and a 50x increase in ray tracing performance, while the "Huashan" AI GPU is set to rival Nvidia's offerings.
This article examines how GPUs are transitioning from computing tools to financial assets, creating a new market. It highlights the challenges of valuing these assets, their rapid depreciation, and the lack of mature trading infrastructure. The discussion also touches on the implications of NVIDIA's investment strategy and the potential for tokenization and derivatives in this evolving space.
This article explains the High Bandwidth Memory (HBM) needs when fine-tuning AI models, detailing what consumes memory and how to estimate requirements. It covers strategies like Parameter-Efficient Fine-Tuning (PEFT) and quantization to reduce memory usage, as well as methods for scaling training across multiple GPUs.
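A common back-of-the-envelope estimate (a rule of thumb, not the article's exact formula) counts weights, gradients, and optimizer state per parameter, with activations left out because they depend on batch size and sequence length:

```python
def training_memory_gb(params_b, bytes_per_param=2, optimizer="adamw"):
    """Rough fine-tuning memory estimate in GB for a model with
    `params_b` billion parameters. Ignores activations, which scale
    with batch size and sequence length."""
    weights = params_b * bytes_per_param   # e.g. bf16 weights: 2 bytes each
    grads = params_b * bytes_per_param     # gradients in the same dtype
    # AdamW keeps two fp32 moments per parameter: 8 bytes.
    opt = params_b * 8 if optimizer == "adamw" else 0
    return weights + grads + opt           # GB, since params_b is in billions

print(training_memory_gb(7))  # full fine-tune of a 7B model: ~84 GB before activations
```

Numbers like this make it obvious why PEFT (training only a small adapter, so gradients and optimizer state shrink to the adapter's size) and quantization (cutting `bytes_per_param`) are the standard levers for fitting fine-tuning into limited HBM.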
The article discusses the rising adoption of GPUs for AI workloads and how organizations are increasingly using serverless compute services like AWS Lambda and Google Cloud Run. It highlights the inefficiencies in resource utilization across various platforms and the growing use of Kubernetes features like Horizontal Pod Autoscaler to optimize resource management.
The article examines how GPU utilization affects market volatility across three GPU models: H200, H100, and A100. It reveals that H200 shows a strong positive correlation between high utilization and increased volatility, while A100 demonstrates the opposite trend, suggesting that higher utilization indicates stable demand. The findings highlight the different stages of market maturity and their implications for buyers and sellers.
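The statistic behind claims like "H200 utilization correlates positively with volatility" is an ordinary Pearson correlation; a minimal sketch with made-up numbers (the data below is purely illustrative, not from the article):

```python
def pearson_r(xs, ys):
    """Sample Pearson correlation coefficient between two series."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)

# Illustrative (invented) weekly utilization vs. price-volatility pairs:
util = [0.62, 0.71, 0.80, 0.88, 0.93]
vol = [0.05, 0.08, 0.09, 0.14, 0.16]
print(round(pearson_r(util, vol), 2))  # strongly positive, H200-style
```

A value near +1 matches the H200 pattern the article describes, while a negative value would match the A100's stable-demand behavior.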
This article explains how to implement large-scale inference for language models using Kubernetes. It covers key concepts like batching strategies, performance metrics, and intelligent routing to optimize GPU usage. Practical deployment examples and challenges in managing inference are also discussed.
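The key batching idea can be sketched in a few lines (a toy model of continuous batching, not any particular serving framework's scheduler): finished sequences free their slot immediately, so waiting requests join mid-flight instead of stalling until the whole batch drains.

```python
from collections import deque

def continuous_batching(requests, max_batch=4):
    """Toy continuous-batching loop. `requests` is a list of
    (request_id, tokens_to_generate); returns (decode_steps, finish_order)."""
    pending = deque(requests)
    active, steps, finished = {}, 0, []
    while pending or active:
        # Admit new requests into any free slots before each decode step.
        while pending and len(active) < max_batch:
            rid, n = pending.popleft()
            active[rid] = n
        # One decode step advances every active sequence by one token.
        for rid in list(active):
            active[rid] -= 1
            if active[rid] == 0:
                finished.append(rid)
                del active[rid]
        steps += 1
    return steps, finished

steps, order = continuous_batching(
    [("a", 2), ("b", 5), ("c", 1), ("d", 3), ("e", 2)])
print(steps, order)  # 5 steps; short requests finish without waiting on long ones
```

With static batching the same workload would take 5 steps for the first batch of four plus 2 more for the straggler; here request "e" slips into the slot "c" vacates, so total steps stay at the longest request's length.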
DigitalOcean has launched observability metrics for GPU Droplets and DOKS clusters, enabling users to monitor GPU performance metrics like utilization, temperature, and power consumption. These features require no setup and provide real-time insights to optimize AI workloads.
This article explores the performance of powerful GPUs when paired with a Raspberry Pi compared to traditional desktop PCs. It highlights tests involving media transcoding, 3D rendering, and AI tasks, revealing that the Raspberry Pi can deliver competitive performance at a fraction of the cost and power consumption.
Rmlx is an R package that connects to Apple's MLX framework, allowing users to leverage GPU computing on Apple Silicon. It supports various backend configurations for efficient matrix operations and automatic differentiation. The package facilitates high-performance computations directly from R, making it suitable for data analysis and machine learning tasks.
cuTile Python is a programming language designed for NVIDIA GPUs, enabling users to run parallel computations. It requires CUDA Toolkit 13.1+ and includes a C++ extension for performance. The article covers installation, usage examples, and testing procedures.
This article discusses how AWS and NVIDIA expanded GPU management capabilities to edge environments using Run:ai with Amazon EKS. It outlines the challenges organizations face when deploying AI workloads at the edge and details new features that support GPU fractionalization and orchestration across various infrastructures.
Docker Model Runner now supports vLLM on Docker Desktop for Windows, allowing developers to run AI models with high-throughput inference using NVIDIA GPUs. This update simplifies running generative AI models on Windows, which was previously limited to Linux environments.
Intel's CEO Lip-Bu Tan announced the hiring of a new chief architect for GPUs, crucial for AI infrastructure. Despite a recent stock rally, Intel has struggled to keep pace with competitors like Nvidia and AMD in the semiconductor market. Tan also highlighted ongoing challenges in the memory chip sector due to rising AI demand.
Azure's ND GB300 v6 virtual machines achieved a record-breaking performance of 1.1 million tokens per second on the Llama 2 70B model. This surpasses the previous record by 27% and features enhanced hardware optimizations for better inference workloads. The results were verified by Signal65.
NVIDIA has introduced native Python support for its CUDA platform, which allows developers to write CUDA code directly in Python without needing to rely on additional wrappers. This enhancement simplifies the process of leveraging GPU capabilities for machine learning and scientific computing, making it more accessible for Python users.
DigitalOcean offers a range of GradientAI GPU Droplets tailored for various AI and machine learning workloads, including large model training and inference. Users can choose from multiple GPU types, including AMD and NVIDIA options, each with distinct memory capacities and performance benchmarks, all designed for cost-effectiveness and high efficiency. New users can benefit from a promotional credit to explore these GPU Droplets.
Cloudflare discusses its innovative methods for optimizing AI model performance by utilizing fewer GPUs, which enhances efficiency and reduces costs. The company leverages unique techniques and infrastructure to manage and scale AI workloads effectively, paving the way for more accessible AI applications.
GPUHammer demonstrates that Rowhammer bit flips are practical on GPU memories, specifically on GDDR6 in NVIDIA A6000 GPUs. By exploiting these vulnerabilities, attackers can significantly degrade the accuracy of machine learning models, highlighting a critical security concern for shared GPU environments.
NVIDIA CEO Jensen Huang promoted the benefits of AI during his visits to Washington, D.C. and Beijing, meeting with officials to discuss AI's potential to enhance productivity and job creation. He also announced updates on NVIDIA's GPU applications and emphasized the importance of open-source AI research for global advancement and economic empowerment.
A demo showcases a unified Rust codebase that can run on various GPU platforms, including CUDA, SPIR-V, Metal, DirectX 12, and WebGPU, without relying on specialized shader or kernel languages. This achievement is made possible through collaborative projects like Rust GPU, Rust CUDA, and Naga, enabling seamless cross-platform GPU compute. While still in development, this milestone demonstrates Rust's potential for GPU programming and enhances developer experience by simplifying the coding process.
Nvidia has introduced DGX Cloud Lepton, a service that expands access to its AI chips across various cloud platforms, targeting artificial intelligence developers. This initiative aims to connect users with Nvidia's network of cloud providers, enhancing the availability of its graphics processing units (GPUs) beyond major players in the market.
The article explores the workings of GPUs, focusing on key performance factors such as compute and memory hierarchy, performance regimes, and strategies for optimization. It highlights the imbalance between computational speed and memory bandwidth, using the NVIDIA A100 GPU as a case study, and discusses techniques like operator fusion and tiling to enhance performance. Additionally, it addresses the importance of arithmetic intensity in determining whether operations are memory-bound or compute-bound.
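The memory-bound vs. compute-bound distinction follows from a simple roofline check; a sketch using rough A100 40GB figures (the peak numbers below are approximate datasheet values, assumed here for illustration):

```python
def bound(flops, bytes_moved, peak_flops=312e12, peak_bw=1.555e12):
    """Roofline classification: compare a kernel's arithmetic intensity
    (FLOPs per byte) to the hardware ridge point (peak FLOP/s / peak B/s).
    Defaults approximate an A100 40GB: ~312 TFLOP/s BF16, ~1.555 TB/s HBM."""
    intensity = flops / bytes_moved
    ridge = peak_flops / peak_bw  # ~200 FLOP/byte for these numbers
    return "compute-bound" if intensity >= ridge else "memory-bound"

# Elementwise fp32 add: 1 FLOP per 12 bytes (two loads + one store).
print(bound(1, 12))
# Large square matmul: O(n^3) FLOPs over O(n^2) bytes of fp32 traffic.
n = 4096
print(bound(2 * n**3, 3 * n * n * 4))
```

This is why fusion and tiling help: fusing elementwise ops avoids round-trips to HBM, and tiling raises a kernel's effective intensity by reusing data already on chip.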
Alibaba Cloud has introduced a new pooling system that reportedly reduces the use of Nvidia GPUs by 82%. This innovative approach aims to optimize cloud resource management and enhance efficiency for users relying on high-performance computing. The initiative reflects Alibaba's efforts to compete in the cloud services market against other major players.
Amazon Web Services (AWS) has announced a price reduction of up to 45% for its NVIDIA GPU-accelerated Amazon EC2 instances, including P4 and P5 instance types. This reduction applies to both On-Demand and Savings Plan pricing across various regions, aimed at making advanced GPU computing more accessible to customers. Additionally, AWS is introducing new EC2 P6-B200 instances for large-scale AI workloads.
The article surveys the race to build distributed GPU runtimes, highlighting the advances made and the challenges organizations face along the way. It emphasizes the role of these systems in improving computational efficiency and scalability as demand for high-performance computing continues to grow.
Sirius is a GPU-native SQL engine that integrates with existing databases like DuckDB using the Substrait query format, achieving approximately 10x speedup over CPU query engines for TPC-H workloads. It is designed for interactive analytics and supports various AWS EC2 instances, with detailed setup instructions for installation and performance testing. Sirius is currently in active development, with plans for additional features and support for more database systems.
A new compiler called Mirage Persistent Kernel (MPK) transforms large language model (LLM) inference into a single, high-performance megakernel, significantly reducing latency by 1.2-6.7 times. By fusing computation and communication across multiple GPUs, MPK maximizes hardware utilization and enables efficient execution without the overhead of multiple kernel launches. The compiler is designed to be user-friendly, requiring minimal input to compile LLMs into optimized megakernels.
Python data science workflows can be significantly accelerated using GPU-compatible libraries like cuDF, cuML, and cuGraph with minimal code changes. The article highlights seven drop-in replacements for popular Python libraries, demonstrating how to leverage GPU acceleration to enhance performance on large datasets without altering existing code.
oLLM is a lightweight Python library designed for large-context LLM inference, allowing users to run substantial models on consumer-grade GPUs without quantization. The latest update includes support for various models, improved VRAM management, and additional features like AutoInference and multimodal capabilities, making it suitable for tasks involving large datasets and complex processing.
NVIDIA's new Rubin CPX technology is set to challenge AMD's current strategies, potentially forcing them to reevaluate their approach in the competitive GPU market. The advancements in performance and efficiency presented by NVIDIA could shift the balance, prompting AMD to innovate further to keep up.
The article discusses the rapid evolution of hardware, particularly focusing on AMD EPYC CPUs and the increasing number of cores and memory bandwidth over the past several years. It also highlights the advancements in GPU architectures for AI workloads and the challenges posed by latency, emphasizing the need for software to evolve alongside these hardware changes.
Tile Language (tile-lang) is a domain-specific language designed to simplify the creation of high-performance GPU/CPU kernels with a Pythonic syntax, built on the TVM infrastructure. Recent updates include support for Apple Metal, Huawei Ascend chips, and various performance enhancements for AMD and NVIDIA GPUs. The language allows developers to efficiently implement complex AI operations while focusing on productivity and optimization.
Polars, a DataFrame library designed for performance, has introduced GPU execution capabilities that can achieve up to a 70% speed increase compared to its CPU execution. This enhancement is particularly beneficial for data processing tasks, making it a powerful tool for data engineers and analysts looking to optimize their workflows.
TRL has introduced co-located vLLM to improve the efficiency of training large language models by allowing both training and inference to run on the same GPUs, eliminating idle time and reducing hardware costs. This integration enhances throughput, simplifies deployment, and makes the system more robust for online learning setups like GRPO. The new approach is supported by a series of performance experiments demonstrating significant speedups compared to traditional server setups.
DigitalOcean has announced the availability of AMD Instinct MI300X GPUs for its customers, enhancing options for AI and machine learning workloads. These GPUs are designed for high-performance computing applications, enabling large model training and inference with significant memory capacity. Additionally, AMD Instinct MI325X GPUs will be introduced later this year, further improving performance and efficiency for AI tasks.
Apple has announced the M5 chip, which significantly enhances AI performance with over 4x peak GPU compute capability compared to its predecessor, the M4. The M5 features a next-generation 10-core GPU with Neural Accelerators, a faster 16-core Neural Engine, and improved memory bandwidth, making it ideal for AI-driven applications across devices like the 14-inch MacBook Pro, iPad Pro, and Apple Vision Pro. Pre-orders for these devices are available now.
Chris Lattner, creator of LLVM and the Swift language, discusses the development of Mojo, a new programming language aimed at optimizing GPU productivity and ease of use. He emphasizes the importance of balancing control over hardware details with user-friendly features, advocating for a programming ecosystem that allows for specialization and democratization of AI compute resources.
GPUs are critical for high-performance computing, particularly for neural network inference workloads, but achieving optimal GPU utilization can be challenging. This guide outlines three key metrics of GPU utilization—allocation, kernel, and model FLOP/s utilization—and discusses strategies to improve efficiency and performance in GPU applications. Modal's solutions aim to enhance GPU allocation and kernel utilization, helping users achieve better performance and cost-effectiveness.
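The third metric, model FLOP/s utilization (MFU), has a standard back-of-the-envelope form (the 6N FLOPs-per-token approximation for dense transformers; the workload numbers below are hypothetical):

```python
def mfu(tokens_per_s, params, peak_flops):
    """Model FLOP/s utilization: achieved model FLOP/s over hardware peak.
    Uses the common ~6*N FLOPs-per-token approximation for training a
    dense N-parameter transformer."""
    achieved = 6 * params * tokens_per_s
    return achieved / peak_flops

# Hypothetical: a 7B model training at 3,000 tokens/s on a ~312 TFLOP/s GPU.
print(f"{mfu(3000, 7e9, 312e12):.1%}")
```

Allocation utilization (are you paying for idle GPUs?) and kernel utilization (is the SM busy?) can both look healthy while MFU stays low, which is why the three metrics are worth tracking separately.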
AWS has announced updates to the pricing and usage model for Amazon EC2 instances powered by NVIDIA GPUs, including the introduction of savings plans for P6-B200 instances and significant price reductions for P5, P5en, P4d, and P4de instances. These changes, effective June 2025, aim to enhance accessibility to advanced GPU computing across various global regions.
This roadmap offers an introduction to GPU architecture for those new to the technology, emphasizing the differences between GPUs and CPUs. It outlines objectives such as understanding GPU features, implications for program construction in GPGPU, and specifics about NVIDIA GPU components. Familiarity with high-performance computing concepts may be beneficial but is not required.
The article discusses the implications of GPU technology for the legal landscape, particularly in areas such as intellectual property and regulatory compliance. It argues that advances in AI and machine learning are prompting a reevaluation of existing laws and regulations, and that new legal frameworks are needed to keep pace with the unique challenges these technologies pose.
The article discusses five common performance bottlenecks in pandas workflows, providing solutions for each issue, including using faster parsing engines, optimizing joins, and leveraging GPU acceleration with cudf.pandas for significant speed improvements. It also highlights how users can access GPU resources for free on Google Colab, allowing for enhanced data processing capabilities without code modifications.
Qualcomm has issued security patches for three zero-day vulnerabilities in the Adreno GPU driver, which are being actively exploited in targeted attacks. The vulnerabilities include two critical flaws related to memory corruption and a high-severity use-after-free issue, with updates provided to OEMs to address these risks. Additionally, Qualcomm has addressed other security flaws in its systems that could allow unauthorized access to sensitive user information.
Kompute is a flexible GPU computing framework supported by the Linux Foundation, offering a Python module and C++ SDK for high-performance asynchronous and parallel processing. It enables easy integration with existing Vulkan applications and includes a robust codebase with extensive testing, making it suitable for machine learning, mobile development, and game development. The platform also supports community engagement through Discord and various educational resources like Colab Notebooks and conference talks.
Nebius Group has entered a five-year agreement with Microsoft to provide GPU infrastructure valued at $17.4 billion, significantly boosting Nebius's shares by over 47%. The deal highlights the increasing demand for high-performance computing capabilities essential for advancing AI technologies.
Rack-scale networking is becoming essential for massive AI workloads, offering significantly higher bandwidth compared to traditional scale-out networks like Ethernet and InfiniBand. Companies like Nvidia and AMD are leading the charge with advanced architectures that facilitate pooling of GPU compute and memory across multiple servers, catering to the demands of large enterprises and cloud providers. These systems, while complex and expensive, are designed to handle increasingly large AI models and their memory requirements.
Researchers have successfully demonstrated a Rowhammer attack against the GDDR6 memory of an NVIDIA A6000 GPU, revealing that a single bit flip could drastically reduce the accuracy of deep neural network models from 80% to 0.1%. Nvidia has acknowledged the findings and suggested enabling error-correcting code (ECC) as a mitigation strategy, although it may impact performance and memory capacity. The researchers have also created a dedicated website for their proof-of-concept code and shared their detailed findings in a published paper.
KTransformers is a Python-based framework designed for optimizing large language model (LLM) inference with an easy-to-use interface and extensibility, allowing users to inject optimized modules effortlessly. It supports various features such as multi-GPU setups, advanced quantization techniques, and integrates with existing APIs for seamless deployment. The framework aims to enhance performance for local deployments, particularly in resource-constrained environments, while fostering community contributions and ongoing development.
The author critiques NVIDIA's design decisions regarding their RTX 40 and 50 series GPUs, particularly focusing on the problematic 12VHPWR power connector and its inherent flaws that lead to overheating issues. The article also discusses the company's reliance on proprietary technologies and the stagnant performance of ray tracing, questioning the value of high-priced graphics cards that still require upscaling to achieve acceptable frame rates in demanding games.
GPU-accelerated databases and query engines are revolutionizing large-scale data analytics by significantly improving performance compared to traditional CPU-based systems. NVIDIA and IBM's collaboration integrates NVIDIA cuDF with the Velox execution engine, enabling efficient GPU-native query execution in platforms like Presto and Apache Spark, while enhancing data processing capabilities through optimized operators and multi-GPU support. The open-source initiative aims to streamline GPU utilization across various data processing ecosystems.
A comprehensive guide for deploying AI models using vLLM on Azure Kubernetes Service (AKS) with NVIDIA H100 GPUs and Multi-Instance GPU (MIG) technology is provided. It outlines the necessary prerequisites, steps for infrastructure creation, GPU component installation, and model deployment, enabling efficient utilization of resources and cost savings through hardware isolation.
Lemonade is a tool designed to help users efficiently run local large language models (LLMs) by configuring advanced inference engines for their hardware, including NPUs and GPUs. It supports both GGUF and ONNX models, offers a user-friendly interface for model management, and is utilized by various organizations, from startups to large companies like AMD. The platform also provides an API and CLI for Python application integration, alongside extensive hardware support and community collaboration opportunities.
The author designed a low-latency video codec named Pyrowave, specifically for game streaming over local networks. By simplifying traditional codec features and focusing on intra-only compression and efficient rate control, the codec achieves remarkably fast encoding and decoding speeds suitable for real-time applications. The approach sacrifices some compression efficiency for speed and error resilience, making it effective for high-bandwidth local streaming.
The blog post details a reverse-engineering effort of Flash Attention 4 (FA4), a new CUDA kernel optimized for Nvidia's Blackwell architecture, achieving a ~20% speedup over previous versions. It explores the kernel's architecture and asynchronous operations, making it accessible for software engineers without CUDA experience, while providing insights into its tile-based computation processes and optimizations for generative AI tasks.
RAPIDS version 25.06 introduces significant enhancements, including a Polars GPU streaming engine for large dataset processing, a unified API for graph neural networks that streamlines multi-GPU workflows, and zero-code changes for support vector machines, improving performance in existing scikit-learn frameworks. The release also features updates to memory management and compatibility with the latest Python and NVIDIA CUDA versions.
Nvidia has introduced a new GPU designed specifically for long-context inference, aimed at AI applications that process extensive data sequences. The design targets better efficiency on such workloads as demand for long-context AI continues to grow.
Many pandas workflows slow down significantly with large datasets, leading to frustration for data analysts. By utilizing NVIDIA's GPU-accelerated cuDF library, common tasks like analyzing stock prices, processing text-heavy job postings, and building interactive dashboards can be dramatically sped up, often by up to 20 times faster. Additionally, advancements like Unified Virtual Memory allow for processing larger datasets than the GPU's memory, simplifying the workflow for users.
Alibaba's new AI chip is designed to compete directly with NVIDIA’s H200, aiming to capture a share of the growing AI hardware market. The chip boasts advanced capabilities tailored for AI workloads and is positioned to challenge NVIDIA's dominance in the sector. With significant investments in AI technology, Alibaba is poised to leverage its infrastructure to enhance performance and efficiency.
The article discusses the evolution of GPU architecture, emphasizing the growing disparity between the increasing performance of GPUs and the limited data bandwidth available through traditional buses like PCI Express. It argues for a reevaluation of how data is moved to and from powerful GPUs, highlighting the need for new architectures to address bottlenecks in performance and energy efficiency.
VectorWare is launching as a company focused on developing GPU-native software, aiming to shift the software industry towards utilizing GPUs more effectively as their importance grows in various applications. They emphasize the convergence of CPUs and GPUs and the need for improved tools and abstractions to fully leverage GPU capabilities. With a team of experienced developers and investors, VectorWare is poised to lead this new era of software development.
The article introduces Cuq, a framework that translates Rust's Mid-level Intermediate Representation (MIR) into Coq, aiming to establish formal semantics for Rust GPU kernels compiled to NVIDIA's PTX. It addresses the lack of verified mapping from Rust's compiler IR to PTX while focusing on memory model soundness and offers a prototype for automating this translation and verification process. Future developments may include integrating Rust's ownership and lifetime reasoning into the framework.
Alibaba Cloud has developed a new pooling system called Aegaeon that significantly reduces the number of Nvidia GPUs required for large language model inference by 82%, allowing 213 GPUs to perform like 1,192. This innovative approach virtualizes GPU access at the token level, enhancing overall output and efficiency during periods of fluctuating demand. The findings, which were published in a peer-reviewed paper, highlight the potential for cloud providers to maximize GPU utilization in constrained markets like China.
The article recounts a bug encountered while using PyTorch that caused a training loss plateau, initially attributed to user error but ultimately traced back to a GPU kernel bug on the MPS backend for Apple Silicon. The author details the investigative process which deepened their understanding of PyTorch internals, illustrating the importance of debugging and exploration in mastering the framework. A minimal reproduction script is provided for others interested in the issue.