August 23, 2024

NVIDIA Unveils New Small Language Model With Cutting-Edge Accuracy

by Alisa Davidson

Published: August 23, 2024 at 5:49 am Updated: August 23, 2024 at 5:49 am

by Anastasiia O

Edited and fact-checked: August 23, 2024 at 5:49 am

In Brief

NVIDIA released Mistral-NeMo-Minitron 8B, a streamlined version of the open NeMo 12B model created by Mistral AI.

NVIDIA has released Mistral-NeMo-Minitron 8B, a smaller, pruned and distilled version of the open NeMo 12B model created by Mistral AI. The new model achieves high accuracy across nine widely used benchmarks relevant to chatbots, virtual assistants, content generation, coding, and educational tools.

Developers can start using Mistral-NeMo-Minitron 8B through an NVIDIA NIM microservice that exposes a standard application programming interface (API), or they can download the model directly from Hugging Face.
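As a rough illustration of what calling such a microservice might look like, the sketch below builds an HTTP request for a text completion. The article does not describe the API in detail, so the endpoint URL, the model name, and the JSON fields are all assumptions made for illustration, modeled on a common JSON-over-HTTP completions style. They should be replaced with the values documented for an actual deployment.

```python
import json
import urllib.request

# Hypothetical values, not taken from the article: replace them with the
# endpoint and model name documented for your own deployment.
BASE_URL = "http://localhost:8000/v1"
MODEL_NAME = "example/mistral-nemo-minitron-8b"


def build_completion_request(prompt, max_tokens=64,
                             base_url=BASE_URL, model=MODEL_NAME):
    """Build (but do not send) a POST request for a text completion."""
    # Assumed payload shape: model name, prompt text, and a token limit.
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url=f"{base_url}/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request):
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `send(build_completion_request("Write a haiku about GPUs."))` would only succeed against a running service that accepts this request format. Building the request separately from sending it makes the payload easy to inspect first.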

Mistral-NeMo-Minitron 8B is among the most capable models in its size class, pairing high benchmark accuracy with lower computational requirements than its 12B parent. NVIDIA built it by width-pruning the Mistral AI NeMo 12B base model and then lightly retraining the pruned model using distillation.
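To make the two steps concrete, here is a toy sketch in plain Python. It is not NVIDIA's actual procedure. It assumes a tiny two-layer network, ranks hidden units by mean activation on a few calibration inputs (an assumed importance measure), removes the weakest ones (width pruning), and then measures how far the smaller "student" output drifts from the original "teacher" output using a KL-divergence distillation loss. All function names are hypothetical.

```python
import math


def matvec(W, x):
    """Multiply matrix W (list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def forward(W1, W2, x):
    """Tiny two-layer MLP: logits = W2 @ relu(W1 @ x)."""
    hidden = [max(0.0, v) for v in matvec(W1, x)]
    return matvec(W2, hidden), hidden


def width_prune(W1, W2, calibration, keep):
    """Keep the `keep` hidden units with the largest mean activation."""
    n_hidden = len(W1)
    scores = [0.0] * n_hidden
    for x in calibration:
        _, hidden = forward(W1, W2, x)
        for i, h in enumerate(hidden):
            scores[i] += abs(h) / len(calibration)
    ranked = sorted(range(n_hidden), key=lambda i: scores[i], reverse=True)
    kept = sorted(ranked[:keep])
    W1_small = [W1[i] for i in kept]                   # drop rows of W1
    W2_small = [[row[i] for i in kept] for row in W2]  # drop matching columns of W2
    return W1_small, W2_small, kept


def softmax(logits):
    """Numerically stable softmax."""
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits):
    """KL(teacher || student) between the two output distributions."""
    p = softmax(teacher_logits)
    q = softmax(student_logits)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
```

In a real distillation run, this loss would be minimized by gradient descent over a training corpus so that the pruned model recovers the accuracy lost in pruning. The sketch only computes the loss and omits the training loop.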

Among models of similar size, Mistral-NeMo-Minitron 8B performs strongly on well-regarded language model benchmarks. These benchmarks cover a range of tasks, including language understanding, common sense reasoning, mathematical reasoning, summarization, coding, and the ability to generate accurate answers.

The model is packaged as an NVIDIA NIM microservice optimized for low latency, which means faster responses, and for high throughput, which improves computational efficiency in production environments.

Today we released Mistral-NeMo-Minitron 8B, a pruned and distilled version of the open @MistralAI NeMo 12B model, achieving high accuracy across nine popular benchmarks for chatbots, virtual assistants, content generation, coding, and educational tools.
➡️… pic.twitter.com/N8oS9hF0fd
— NVIDIA AI Developer (@NVIDIAAIDev) August 21, 2024

NVIDIA Introduces Nemotron-Mini-4B-Instruct For Efficient Memory Usage

NVIDIA is best known for pioneering and advancing graphics processing units (GPUs). The company designs GPUs for gaming, cryptocurrency mining, and professional applications, along with chip systems for vehicles, robotics, and other technologies. Its largest revenue source is the Compute and Networking business segment, which includes AI, and AI has recently become a major focus for the firm.

Recently, NVIDIA also unveiled Nemotron-Mini-4B-Instruct, a small language model designed for efficient memory usage and faster response times on PCs and laptops powered by the company’s GPUs.
