DigitalCIO
No Result
View All Result
  • Home
  • Tech News
  • Market Insights
  • CIO Interviews
  • Events and Conferences
  • Opinion and Analysis
  • Resources
DigitalCIO
  • Home
  • Tech News
  • Market Insights
  • CIO Interviews
  • Events and Conferences
  • Opinion and Analysis
  • Resources
No Result
View All Result
Digitalcio
No Result
View All Result
Home Tech News

Red Hat AI now runs on AWS Trainium and Inferentia chips

DigitalCIO Bureau by DigitalCIO Bureau
December 5, 2025
in Tech News
0
Red Hat AI now runs on AWS Trainium and Inferentia chips
75
SHARES
1.2k
VIEWS
Share on FacebookShare on Twitter

Red Hat has announced an expanded collaboration with Amazon Web Services (AWS) to power enterprise-grade generative AI (gen AI) on AWS with Red Hat AI and AWS AI silicon. With this collaboration, Red Hat focuses on empowering IT decision-makers with the flexibility to run high-performance, efficient AI inference at scale, regardless of the underlying hardware.

“By enabling our enterprise-grade Red Hat AI Inference Server, built on the innovative vLLM framework, with AWS AI chips, we’re empowering organizations to deploy and scale AI workloads with enhanced efficiency and flexibility. Building on Red Hat’s open source heritage, this collaboration aims to make generative AI more accessible and cost-effective across hybrid cloud environments,” said Joe Fernandes, vice president and general manager, AI Business Unit, Red Hat.

Red Hat’s collaboration with AWS empowers organizations with a full-stack gen AI strategy by bringing together Red Hat’s comprehensive platform capabilities with AWS cloud infrastructure and AI chipsets, AWS Inferentia2 and AWS Trainium3. Key aspects of the collaboration include:

Upstream community contribution: Red Hat and AWS are collaborating to optimize an AWS AI chip plugin up-streamed to vLLM. As the top commercial contributor to vLLM, Red Hat is committed to enabling vLLM on AWS to help accelerate AI inference and training capabilities for users. vLLM is also the foundation of llm-d, an open source project focused on delivering inference at scale and now available as a commercially supported feature in Red Hat OpenShift AI 3.

Red Hat AI Inference Server on AWS AI chips: Red Hat AI Inference Server, powered by vLLM, will be enabled to run with AWS AI chips, including AWS Inferentia2 and AWS Trainium3, to deliver a common inference layer that can support any gen AI model, helping customers achieve higher performance, lower latency and cost-effectiveness for scaling production AI deployments, delivering up to 30-40% better price performance than current comparable GPU-based Amazon EC2 instances.

Enabling AI on Red Hat OpenShift: Red Hat worked with AWS to develop an AWS Neuron operator for Red Hat OpenShift, Red Hat OpenShift AI and Red Hat OpenShift Service on AWS, a comprehensive and fully managed application platform on AWS, providing customers with a more seamless, supported pathway to run their AI workloads with AWS accelerators.

Ease of access and deployment: By supporting AWS AI chips, Red Hat will offer enhanced and easier access to high-demand, high-capacity accelerators for Red Hat customers on AWS. In addition, Red Hat recently released the amazon.ai Certified Ansible Collection for Red Hat Ansible Automation Platform to enable orchestrating AI services on AWS.

The AWS Neuron community operator is now available in the Red Hat OpenShift OperatorHub for customers using Red Hat OpenShift or Red Hat OpenShift Service on AWS. Red Hat AI Inference Server support for AWS AI chips is expected to be available in developer preview in January 2026.

Colin Brace, vice president, Annapurna Labs, AWS, said, “Our collaboration with Red Hat provides customers with a supported path to deploying generative AI at scale, combining the flexibility of open source with AWS infrastructure and purpose-built AI accelerators to accelerate time-to-value from pilot to production.”

Share30Tweet19
DigitalCIO Bureau

DigitalCIO Bureau

Recommended For You

Sumit Chadha Appointed as Chief Technology Officer at IIFL Home Loans

by DigitalCIO Bureau
May 8, 2026
0
Sumit Chadha Appointed as Chief Technology Officer at IIFL Home Loans

IIFL Home Loans has announced the appointment of Sumit Chadha as its new Chief Technology Officer (CTO), reinforcing the company’s commitment to accelerating its digital transformation and enhancing...

Read moreDetails

Airtel Business Launches Airtel Secure Workforce for Protecting Enterprises with a Hybrid Workforce

by DigitalCIO Bureau
May 8, 2026
0
Airtel Business Launches Airtel Secure Workforce for Protecting Enterprises with a Hybrid Workforce

Airtel Business, the B2B arm of Bharti Airtel has launched Airtel Secure Workforce, a fully-managed and unified Zero Trust Architecture (ZTA) security platform with an end-to-end, compliance-ready security...

Read moreDetails

IBM and Yotta Collaborate to Bring Agentic AI Platform to Enterprises in India

by DigitalCIO Bureau
May 7, 2026
0
IBM and Yotta Collaborate to Bring Agentic AI Platform to Enterprises in India

IBM and Yotta Data Services have announced plans to collaborate on a new sovereign Agentic AI platform aimed at enterprises and government organizations in India. The platform is...

Read moreDetails

ServiceNow and Accenture Announce FDE Program to Scale Agentic AI Across the Enterprise

by DigitalCIO Bureau
May 7, 2026
0
ServiceNow and Accenture Announce FDE Program to Scale Agentic AI Across the Enterprise

ServiceNow and Accenture have launched a Forward Deployed Engineering (FDE) program to help enterprises scale agentic AI from pilot stages to full production. Through the program, ServiceNow’s AI-native...

Read moreDetails

SAP to Acquire Prior Labs, Launching a World-Class Frontier AI Lab in Europe

by DigitalCIO Bureau
May 6, 2026
0
SAP to Acquire Prior Labs, Launching a World-Class Frontier AI Lab in Europe

SAP and Prior Labs announced that they have entered into a definitive agreement for SAP to purchase Prior Labs, accelerating SAP’s success in TFMs that started with SAP-RPT-1,...

Read moreDetails
Next Post
Kiteworks’ New Survey Reveals Critical Need to Shift From Legacy Web Forms

Kiteworks' New Survey Reveals Critical Need to Shift From Legacy Web Forms

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Related News

Schneider And Nvidia Collaborate on AI Data Center Designs

Schneider Electric And NVIDIA Present Reference Design For Liquid-Cooled AI Clusters

December 6, 2024

NETGEAR Exhibits its Range of Wi-Fi Devices for Home and Businesses at CES 2020

January 8, 2020
Qlik Acquires Upsolver for Apache Iceberg Optimisation

Qlik Acquires Upsolver for Apache Iceberg Optimisation

January 17, 2025

Browse by Category

  • Acquisition
  • Appointment
  • Archive
  • Artificial Intelligence
  • CIO Interviews
  • Cloud
  • Datacenter
  • Events and Conferences
  • Market Insights
  • News
  • Opinion and Analysis
  • Products
  • Resources
  • Security
  • Storage
  • Tech News
  • Telecom
Digitalcio

Welcome to DigitalCIO, your ultimate source for staying ahead in the ever-evolving world of technology and business.

BROWSE BY TAG

Accenture Acquisition AI Appointment artificial intelligence Artificial Intelligence and Machine Learning AWS Big Data and Analytics Blockchain CISCO Cloud Computing Cloudflare Commvault CrowdStrike Cybersecurity Digital Transformation E-books Fortinet Gartner Generative AI Google Cloud HCLTech IBM Infographics Infosys Internet of Things (IoT) Kaspersky NTT DATA NVIDIA Palo Alto Networks Panel Discussion Qlik Salesforce ServiceNow Sophos Tata Consultancy Services TCS Tenable Trend Micro Veeam Veeam Software Vertiv Webinars Whitepaper Zscaler

CATEGORIES

  • Tech News
  • Market Insights
  • CIO Interviews
  • Events and Conferences
  • Opinion and Analysis
  • Resources
  • Archive

NAVIGATION

  • Home
  • About Us
  • Advertise with Us
  • Contact Us

© 2024 digitalcio.in - All rights reserved.

No Result
View All Result
  • Home
  • Tech News
  • Market Insights
  • CIO Interviews
  • Events and Conferences
  • Opinion and Analysis
  • Resources

© 2024 digitalcio.in - All rights reserved.

Are you sure want to unlock this post?
Unlock left : 0
Are you sure want to cancel subscription?