NVIDIA's Nemotron-H-8B-Base-8K is a large language model designed for text completion, featuring a hybrid architecture and a context length of 8K. It supports multiple languages and offers customization tools through the NeMo Framework for enhanced performance in research and development. The model is intended for use on NVIDIA GPU-accelerated systems and is part of the Nemotron-H collection, governed by specific licensing terms.
Nvidia has released the Nemotron-Nano-9B-V2, a small language model with 9 billion parameters, optimized for deployment on a single Nvidia A10 GPU. It features a unique toggle for AI reasoning, allowing users to manage internal reasoning and improve performance across various languages and applications.