NVIDIA Brings its AI Capabilities to the Cloud for Increased Accessibility

by Cindy Tan

Published: February 23, 2023 at 12:00 pm Updated: February 23, 2023 at 12:00 pm

In Brief

NVIDIA is partnering with leading cloud service providers to offer AI as a service.

Customers will be able to access NVIDIA’s AI supercomputer, acceleration libraries, software, or pretrained generative AI models as a cloud service.

The NVIDIA DGX™ AI supercomputer is accessible through NVIDIA DGX Cloud, which is already offered on Oracle Cloud Infrastructure, with Microsoft Azure and Google Cloud Platform coming soon.

Artificial intelligence computing company NVIDIA announced on Wednesday a new initiative to offer artificial intelligence as a service (AIaaS) through partnerships with major cloud service providers.

This new service will provide enterprise customers with access to NVIDIA’s cutting-edge AI platform, which includes an AI supercomputer, acceleration libraries, software, and pretrained generative AI models.

It will also allow customers to engage each layer of NVIDIA AI through their browser. The NVIDIA DGX AI supercomputer will be accessible through the NVIDIA DGX Cloud, which is already offered on Oracle Cloud Infrastructure, with Microsoft Azure, Google Cloud Platform, and others expected to follow soon.

“AI is at an inflection point, setting up for broad adoption reaching into every industry,” said Jensen Huang, founder and CEO of NVIDIA, in a press release.

“From startups to major enterprises, we are seeing accelerated interest in the versatility and capabilities of generative AI. We are set to help customers take advantage of breakthroughs in generative AI and large language models. Our new AI supercomputer, with H100 and its Transformer Engine and Quantum-2 networking fabric, is in full production,” he added.

Customers using NVIDIA’s AI as a service will have access to two layers of the NVIDIA AI platform. The first is the AI platform software layer, where they can access NVIDIA AI Enterprise to train and deploy large language models or other AI workloads.

The second layer is the AI-model-as-a-service layer, where customers can use NVIDIA’s NeMo and BioNeMo customizable AI models to build proprietary generative AI models and services for their businesses.

In recent years, NVIDIA has become increasingly focused on developing specialized AI chips and services to meet the growing demand for AI applications, especially with generative AI services like ChatGPT opening a new market for AI chips.

Among NVIDIA’s most notable AI chips are its Tensor Core GPUs, such as the H100, which are designed with deep learning workloads in mind. Tensor Cores can perform massive numbers of matrix operations in parallel, which is essential for training deep neural networks.

Another important NVIDIA AI product line is the Jetson family of embedded computing modules, which is designed for edge computing applications. Jetson devices are small, low-power computers that can be integrated into robots, drones, and other devices to enable AI-powered capabilities such as object detection and recognition and autonomous navigation.

In addition to its hardware offerings, NVIDIA also provides a range of AI services, including the NVIDIA Deep Learning Institute (DLI), which offers training and certification for developers, researchers, and data scientists looking to expand their AI skills.

NVIDIA also offers several cloud-based AI services, such as the NVIDIA GPU Cloud (NGC), which provides access to pre-built deep learning models and software tools for developers.
