DigitalCIO
No Result
View All Result
  • Home
  • Tech News
  • Market Insights
  • CIO Interviews
  • Events and Conferences
  • Opinion and Analysis
  • Resources
DigitalCIO
  • Home
  • Tech News
  • Market Insights
  • CIO Interviews
  • Events and Conferences
  • Opinion and Analysis
  • Resources
No Result
View All Result
Digitalcio
No Result
View All Result
Home Tech News

NVIDIA Corporation – NVIDIA Enters Production With Dynamo, the Broadly Adopted Inference Operating System for AI Factories

DigitalCIO Bureau by DigitalCIO Bureau
March 18, 2026
in Tech News
0
NVIDIA Corporation – NVIDIA Enters Production With Dynamo, the Broadly Adopted Inference Operating System for AI Factories
75
SHARES
1.2k
VIEWS
Share on FacebookShare on Twitter

NVIDIA announced NVIDIA Dynamo 1.0, open source software for generative and agentic inference at scale, with widespread global adoption. Together with the NVIDIA Blackwell platform, Dynamo 1.0 enables cloud providers, AI innovators and global enterprises to deliver high-performance AI inference with unmatched scale, efficiency and speed.

As agentic AI systems move into production across industries, scaling inference within a data center has become a complex challenge of resource orchestration, with requests of varying sizes and modalities, as well as performance objectives, arriving in unpredictable bursts.

Just as a computer’s operating system coordinates hardware and applications, Dynamo 1.0 functions as the distributed “operating system” of AI factories, seamlessly orchestrating GPU and memory resources across the cluster to power complex AI workloads. In recent industry benchmarks, Dynamo boosted the inference performance of NVIDIA Blackwell GPUs by up to 7x, lowering token cost and increasing revenue opportunity for millions of GPUs with free, open source software.

“Inference is the engine of intelligence, powering every query, every agent and every application,” said Jensen Huang, founder and CEO of NVIDIA. “With NVIDIA Dynamo, we’ve created the first-ever ‘operating system’ for AI factories. The rapid adoption across our ecosystem shows this next wave of agentic AI is here, and NVIDIA is powering it at global scale.”

Dynamo 1.0 splits inference work across GPUs by adding smarter “traffic control” and the ability to move data between GPUs and lower-cost storage, reducing wasted work and easing memory limits. For agentic AI and long prompts, it can route requests to GPUs that already have the most relevant “short-term memory” from earlier steps, then offload that memory when it is not needed.

NVIDIA Inference Platform Gains Momentum
NVIDIA is accelerating the open source ecosystem by integrating Dynamo and NVIDIA TensorRT-LLM library optimizations into popular frameworks from providers such as LangChain, llm-d, LMCache, SGLang, vLLM and more. Core Dynamo building blocks like KVBM for smarter memory management, NVIDIA NIXL for fast GPU-to-GPU data movement and NVIDIA Grove for simplified scaling are also available as standalone modules. NVIDIA also contributes TensorRT-LLM CUDA kernels to the FlashInfer project so they can be natively integrated into open source frameworks.

The NVIDIA inference platform is supported across the AI ecosystem, including:

  • Cloud Service Providers: Amazon Web Services (AWS), Microsoft Azure, Google Cloud, OCI
  • NVIDIA Cloud Partners: Alibaba Cloud, CoreWeave, Crusoe, DigitalOcean, Gcore, GMI Cloud, Lightning AI, Nebius, Nscale, Together AI, Vultr
  • AI-Native Companies: Cursor, Hebbia, Perplexity
  • Inference Endpoint Providers: Baseten, Deep Infra, Fireworks
  • Global Enterprises: AstraZeneca, BlackRock, ByteDance, Coupang, Instacart, Meituan, PayPal, Pinterest, Shopee, SoftBank Corp.

Chen Goldberg, executive vice president of product and engineering at CoreWeave, said: “As AI moves from experimental pilots to continuous, large-scale production, the underlying infrastructure must be as dynamic as the models it supports. Supporting NVIDIA Dynamo allows us to offer a more seamless, resilient environment for deploying complex AI agents. This foundation provides the durability and high-performance orchestration required to move the industry’s most ambitious agentic workloads into global production.”

Danila Shtan, chief technology officer of Nebius, said: “Delivering reliable AI inference at scale isn’t just about powerful GPUs, it’s about the software that turns that performance into real customer outcomes. We value how NVIDIA’s software stack, from Dynamo to TensorRT-LLM, brings deep optimization, predictable performance and faster time to deployment, helping us offer customers a simpler, higher-performance path to production AI.”

Matt Madrigal, chief technology officer of Pinterest, said: “Delivering an intuitive, multimodal AI experience to hundreds of millions of users requires real-time intelligence at global scale. As a significant adopter in open source, we’re committed to building scalable AI technologies. With NVIDIA Dynamo optimizing our deployment, we’re expanding the seamless and personalized experiences we deliver, powered by high-performance AI infrastructure.”

Vipul Ved Prakash, cofounder and CEO of Together AI, said: “AI natives require inference that can reliably and efficiently scale with their application. NVIDIA Dynamo 1.0, combined with cutting-edge inference research from Together AI, helps us deliver a high-performance stack to offer accelerated, cost-effective inference for large-scale production workloads.”

Tags: Agentic AIAI Factoriesartificial intelligenceDynamoNVIDIA
Share30Tweet19
DigitalCIO Bureau

DigitalCIO Bureau

Recommended For You

IBM and OpenAI Introduce Frontier AI to Cyber Defense to Help Enterprises Match Machine-Speed Threats

by DigitalCIO Bureau
June 23, 2026
0
IBM and OpenAI Introduce Frontier AI to Cyber Defense to Help Enterprises Match Machine-Speed Threats

IBM has announced its participation in the OpenAI Daybreak Cyber Partner Program, integrating advanced frontier AI capabilities into its security operations to help enterprises respond to machine-speed threats....

Read moreDetails

Randstad Digital releases list of top 10 high-demand AI tech jobs overcoming the enterprise integration gap

by DigitalCIO Bureau
June 23, 2026
0
Randstad Digital releases list of top 10 high-demand AI tech jobs overcoming the enterprise integration gap

New Randstad Digital data reveals a structural shift in tech hiring. As enterprises move from AI experimentation to implementation, AI-augmented developer roles have surged 597%, creating a premium...

Read moreDetails

SUSE Appoints Marshal Correia as General Manager for India and South Asia

by DigitalCIO Bureau
June 22, 2026
0
SUSE Appoints Marshal Correia as General Manager for India and South Asia

SUSE has announced the appointment of Marshal Correia as General Manager for India and South Asia. The move underscores SUSE’s commitment to strengthening its presence in one of...

Read moreDetails

Accenture to Bolster Critical Infrastructure Security with End-to-End Cybersecurity Platform Amid Rising AI-Driven Threats and Geopolitical Risks

by DigitalCIO Bureau
June 22, 2026
0
Accenture to Bolster Critical Infrastructure Security with End-to-End Cybersecurity Platform Amid Rising AI-Driven Threats and Geopolitical Risks

Agrees to acquire a majority stake in Dragos, a leading operational technology cybersecurity platform Also agrees to acquire runZero, a top asset intelligence and exposure assessment firm, and...

Read moreDetails

HCLTech unveils AI Innovation Zone showcasing Enterprise Solutions powered by Intel

by DigitalCIO Bureau
June 19, 2026
0
HCLTech unveils AI Innovation Zone showcasing Enterprise Solutions powered by Intel

HCLTech announced the launch of an AI Innovation Zone in Chennai, aimed at helping enterprises innovate and deploy Intel-based AI products alongside HCLTech’s AI solutions, speeding up the...

Read moreDetails
Next Post
Tech Mahindra and Fortinet Partner to Deliver Managed SASE Solutions for Secured Digital Transformation

Tech Mahindra and Fortinet Partner to Deliver Managed SASE Solutions for Secured Digital Transformation

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Related News

Kaspersky uncovers investment scam

Kaspersky uncovers investment scam

January 2, 2024

How Technology Helps in Assisting Motor Vehicle Departments in India

October 22, 2019

FutureCalls Technology Launches Innovative Software Solution iSPARK

March 10, 2020

Browse by Category

  • Acquisition
  • Appointment
  • Archive
  • Artificial Intelligence
  • CIO Interviews
  • Cloud
  • Datacenter
  • Events and Conferences
  • Market Insights
  • News
  • Opinion and Analysis
  • Products
  • Resources
  • Security
  • Storage
  • Tech News
  • Telecom
Digitalcio

Welcome to DigitalCIO, your ultimate source for staying ahead in the ever-evolving world of technology and business.

BROWSE BY TAG

Accenture Acquisition AI Appointment artificial intelligence Artificial Intelligence and Machine Learning AWS Big Data and Analytics Blockchain CISCO Cloud Computing Cloudflare CrowdStrike Cybersecurity Digital Transformation E-books Enterprises Fortinet Gartner Generative AI Google Cloud HCLTech IBM India Infographics Infosys Internet of Things (IoT) Kaspersky NTT DATA NVIDIA Palo Alto Networks Panel Discussion ServiceNow Sophos Strategic Partnership Tata Consultancy Services TCS Tenable Trend Micro Veeam Veeam Software Vertiv Webinars Whitepaper Zscaler

CATEGORIES

  • Tech News
  • Market Insights
  • CIO Interviews
  • Events and Conferences
  • Opinion and Analysis
  • Resources
  • Archive

NAVIGATION

  • Home
  • About Us
  • Advertise with Us
  • Contact Us

© 2024 digitalcio.in - All rights reserved.

No Result
View All Result
  • Home
  • Tech News
  • Market Insights
  • CIO Interviews
  • Events and Conferences
  • Opinion and Analysis
  • Resources

© 2024 digitalcio.in - All rights reserved.

Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?