NVIDIA has announced the launch of its new NVIDIA AI Foundry service together with NVIDIA NIM inference microservices, aimed at advancing generative AI capabilities for enterprises worldwide. The initiative features the Llama 3.1 collection of openly available models, released to provide businesses with advanced AI tools.
Custom AI Solutions for Enterprises
With NVIDIA AI Foundry, enterprises and nations can now build bespoke ‘supermodels’ tailored to their specific industry needs using Llama 3.1 and NVIDIA’s technology. These models can be trained with proprietary data as well as synthetic data generated from Llama 3.1 405B and the NVIDIA Nemotron Reward model.
The AI Foundry is powered by the NVIDIA DGX Cloud AI platform, co-engineered with leading public cloud providers, offering scalable compute resources to meet evolving AI demands. The service aims to help enterprises and nations build sovereign AI strategies and custom large language models (LLMs) for domain-specific applications.
Key Industry Adoption
Accenture is the first to use NVIDIA AI Foundry to create custom Llama 3.1 models for its clients. Companies such as Aramco, AT&T, and Uber are among the early adopters of the new Llama 3.1 NVIDIA NIM microservices, indicating strong interest across a range of industries.
“Meta’s openly available Llama 3.1 models mark a pivotal moment for the adoption of generative AI within the world’s enterprises,” said Jensen Huang, founder and CEO of NVIDIA. “Llama 3.1 opens the floodgates for every enterprise and industry to build state-of-the-art generative AI applications. NVIDIA AI Foundry has integrated Llama 3.1 throughout and is ready to help enterprises build and deploy custom Llama supermodels.”
Enhanced AI Capabilities
NVIDIA NIM inference microservices for Llama 3.1 are now available for download, promising up to 2.5x higher throughput compared with conventional inference methods. Enterprises can also pair them with the new NVIDIA NeMo Retriever NIM microservices to create advanced AI retrieval pipelines for digital assistants and human avatars.
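NIM microservices expose an OpenAI-compatible HTTP API, so existing client code can usually be pointed at a deployed container with little more than a base-URL change. The snippet below is a minimal sketch of such a call; the local endpoint, placeholder API key, and the model ID "meta/llama-3.1-8b-instruct" are assumptions based on a typical self-hosted NIM deployment, not details from the announcement.

```python
# Minimal sketch: querying a locally deployed Llama 3.1 NIM container.
# Assumptions: the container exposes an OpenAI-compatible API at
# http://localhost:8000/v1 and serves the model ID "meta/llama-3.1-8b-instruct".
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",
    api_key="not-needed-for-local-nim",  # placeholder; local NIM may not require a key
)

response = client.chat.completions.create(
    model="meta/llama-3.1-8b-instruct",
    messages=[{"role": "user", "content": "Summarize our Q3 support tickets in three bullets."}],
    max_tokens=256,
    temperature=0.2,
)
print(response.choices[0].message.content)
```

A retrieval pipeline built with NeMo Retriever NIM microservices would typically sit in front of a call like this, supplying retrieved passages as additional context in the messages.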
Accenture, using its AI Refinery framework, is pioneering the use of NVIDIA AI Foundry to develop custom Llama 3.1 models. “The world’s leading enterprises see how generative AI is transforming every industry and are eager to deploy applications powered by custom models,” said Julie Sweet, chair and CEO of Accenture. “Accenture has been working with NVIDIA NIM inference microservices for our internal AI applications, and now, using NVIDIA AI Foundry, we can help clients quickly create and deploy custom Llama 3.1 models to power transformative AI applications for their own business priorities.”
Comprehensive AI Model Services
NVIDIA AI Foundry offers an end-to-end service that includes model curation, synthetic data generation, fine-tuning, retrieval, and evaluation. Enterprises can use Llama 3.1 models and the NVIDIA NeMo platform to create domain-specific models, with the option to generate synthetic data to boost model accuracy.
NVIDIA and Meta have also collaborated to provide a distillation recipe for Llama 3.1, enabling developers to build smaller, custom models suitable for a wide range of infrastructure, from AI workstations to laptops.
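The announcement does not detail that recipe, but the underlying idea is knowledge distillation: a smaller student model is trained to match the output distribution of a larger teacher such as Llama 3.1 405B. The sketch below illustrates the general technique only; the temperature, loss weighting, and training loop are illustrative assumptions, not NVIDIA’s or Meta’s published recipe.

```python
# Generic knowledge-distillation sketch (illustrative only, not the NVIDIA/Meta recipe).
# A small "student" is trained to match the softened output distribution of a
# larger, frozen "teacher" while still fitting the ground-truth next tokens.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    # Soft targets: KL divergence between softened teacher and student distributions.
    soft = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Hard targets: standard cross-entropy against the ground-truth tokens.
    hard = F.cross_entropy(
        student_logits.view(-1, student_logits.size(-1)), labels.view(-1)
    )
    return alpha * soft + (1 - alpha) * hard

# Typical use inside a training step (teacher kept frozen):
# with torch.no_grad():
#     teacher_logits = teacher(input_ids).logits
# student_logits = student(input_ids).logits
# loss = distillation_loss(student_logits, teacher_logits, labels)
# loss.backward()
```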
Leading companies across healthcare, energy, financial services, retail, transportation, and telecommunications are already integrating NVIDIA NIM microservices for Llama 3.1, which was trained on more than 16,000 NVIDIA H100 Tensor Core GPUs.
Future Prospects
Production support for Llama 3.1 NIM and NeMo Retriever NIM microservices is available through NVIDIA AI Enterprise. In addition, members of the NVIDIA Developer Program will soon have free access to NIM microservices for research, development, and testing.
For more information, visit the NVIDIA Newsroom.
Image source: Shutterstock