AIResearchAIResearch
Machine Learning

NVIDIA Releases Open Models Across Five AI Verticals

NVIDIA released open models across five AI verticals alongside 10 trillion training tokens, targeting language, robotics, autonomous vehicles, and biomedical AI.

3 min read
NVIDIA Releases Open Models Across Five AI Verticals

TL;DR

NVIDIA released open models across five AI verticals alongside 10 trillion training tokens, targeting language, robotics, autonomous vehicles, and biomedical AI.

NVIDIA has shipped its largest coordinated open-model release to date, covering five verticals simultaneously. Published on the NVIDIA blog, the announcement details new open models, datasets, and training frameworks spanning language AI, physical AI, autonomous vehicles, robotics, and biomedicine. Alongside the models, NVIDIA contributed an open dataset of 10 trillion language training tokens, 500,000 robotics trajectories, 455,000 protein structures, and 100 terabytes of vehicle sensor data.

Rather than releasing a single flagship model, NVIDIA is shipping an ecosystem where each family targets a specific deployment context. The open data contribution is meant to lower the barrier to domain-specific fine-tuning and downstream community development.

The technical lineup

The Nemotron family, NVIDIA's agentic AI line, gains three new variants: speech, multimodal RAG, and safety. Bosch is already deploying the speech model for in-vehicle driver interaction. Cadence and IBM are running pilots with the multimodal RAG variant to improve search and reasoning over dense technical documentation, while CrowdStrike, Cohesity, and Fortinet have adopted the safety model to strengthen AI trustworthiness in security-sensitive environments.

Beyond Nemotron, the Cosmos platform addresses physical AI systems that must reason about the real world. The new Alpamayo family targets autonomous vehicle development, backed by 100TB of vehicle sensor data; Isaac GR00T continues the robotics push with 500,000 robot trajectories in the open dataset; Clara extends into biomedical AI with 455,000 protein structures available in the open commons.

Industry adoption spans enterprise software, security, and physical hardware. Palantir is integrating Nemotron into its Ontology framework for specialized agentic stacks. ServiceNow is training its Apriel model family on Nemotron-derived datasets for cost-efficient multimodal performance, and CodeRabbit is using the models to scale its AI code review pipeline. Other named adopters include Salesforce, Hitachi, Humanoid, Franka Robotics, and Uber.

Strategic context

What distinguishes this release is not any individual model but the combination of scale and openness. According to the llm-stats.com landscape tracker, open-weight models from DeepSeek, Qwen, and Moonshot AI have been closing the gap with proprietary alternatives across multiple benchmarks. NVIDIA appears to be betting that supplying open infrastructure - data, frameworks, and base models - is more durable than any single performance advantage.

The 10 trillion token dataset is a significant practical contribution. Practitioners working on domain-specific fine-tuning consistently cite data scarcity as a binding constraint, and counts at this scale can meaningfully shift what smaller fine-tuned models can achieve. For teams exploring artificial intelligence in medicine, the 455,000-structure protein dataset alongside Clara addresses a documented shortage. As pricepertoken.com coverage of the open-model market illustrates, smaller open-weight models are increasingly competitive on cost-per-token output - and better training data directly feeds that competitive pressure.

Caveats remain. NVIDIA's announcement lists industry adopters but does not publish benchmark comparisons for the new Nemotron variants against current open-weight alternatives. Teams evaluating these models for production deployment will want third-party evaluations before committing fine-tuning compute. The physical AI datasets are also early stage: 500,000 robot trajectories is a meaningful starting point, but generalization across diverse robot morphologies and task distributions typically demands far greater variety than raw counts suggest.

NVIDIA's release positions it as a full-stack platform vendor at a moment when the artificial intelligence ecosystem is fragmenting into specialized vertical applications. Hardware dominance created the company's current standing; open models and data may be the mechanism for extending relevance as inference shifts toward smaller, domain-specific models running on more diverse hardware configurations.

Whether community fine-tuning on NVIDIA's open contributions produces models that outperform closed alternatives for specific domains remains an open empirical question - and its answer will reveal whether this constitutes a genuine research contribution or a developer-relations play designed to make the GPU supply chain stickier.

Frequently asked questions

What models did NVIDIA release?
NVIDIA released five families: Nemotron for agentic AI (with speech, multimodal RAG, and safety variants), Cosmos for physical AI, Alpamayo for autonomous vehicle development, Isaac GR00T for robotics, and Clara for biomedical AI.

What open data did NVIDIA contribute alongside the models?
The release includes 10 trillion language training tokens, 500,000 robotics trajectories, 455,000 protein structures, and 100 terabytes of vehicle sensor data, all contributed to the open commons per the NVIDIA announcement.

Which companies are already using NVIDIA's open models?
Early adopters include Bosch, Palantir, ServiceNow, CrowdStrike, Cohesity, Fortinet, CodeRabbit, Cadence, IBM, Salesforce, Hitachi, Uber, Humanoid, and Franka Robotics.

How does Alpamayo differ from Cosmos?
Cosmos is NVIDIA's broader physical AI platform; Alpamayo is a specific model family within that ecosystem focused exclusively on autonomous vehicle development, paired with 100TB of dedicated vehicle sensor data.

About the Author

Guilherme A.

Guilherme A.

Former dentist (MD) from Brazil, 41 years old, husband, and AI enthusiast. In 2020, he transitioned from a decade-long career in dentistry to pursue his passion for technology, entrepreneurship, and helping others grow.

Connect on LinkedIn