7 min read
|
Saved October 29, 2025
|
Copied!
Do you care about this?
Perplexity evaluates OpenAI's newly released open-weight models, gpt-oss-20b and gpt-oss-120b, focusing on their implementation on NVIDIA H200 GPUs. The article discusses infrastructure decisions, kernel modifications, and performance optimizations made to efficiently integrate these models into their inference engine, ROSE.
If you do, here's more
Click "Generate Summary" to create a detailed 2-4 paragraph summary of this article.
Questions about this article
No questions yet.