3 links
tagged with all of: nvidia + performance
Click any tag below to further narrow down your results
Links
The NVIDIA HGX B200, now available in the Cirrascale AI Innovation Cloud, offers an 8-GPU configuration that significantly enhances AI performance, achieving up to 15X faster inference compared to the previous generation. With advanced features such as the second-generation Transformer Engine and NVLink interconnect, it is designed for demanding AI and HPC workloads, ensuring efficient scalability and lower operational costs.
Perplexity evaluates OpenAI's newly released open-weight models, gpt-oss-20b and gpt-oss-120b, focusing on their implementation on NVIDIA H200 GPUs. The article discusses infrastructure decisions, kernel modifications, and performance optimizations made to efficiently integrate these models into their inference engine, ROSE.
Cerebras Systems has boasted about outperforming Nvidia's Blackwell architecture, claiming superior performance in AI tasks. The company highlights advancements in its Wafer Scale Engine technology that enable extensive parallel processing capabilities, which they believe set them apart in the competitive landscape of AI hardware.