Cloudera Unveils AI Inference Service with Embedded NVIDIA NIM Microservices to Accelerate GenAI Dev
Cloudera's AI Inference service boosts LLM performance speeds by up to 36x using NVIDIA accelerated computing and NVIDIA NIM microservices, providing enhanced performance, robust security, and scalable flexibility for enterprises
Combined capability brings together companies’ differentiators in a single offering: Cloudera’s trusted data as the foundation for trusted AI with NVIDIA accelerated computing and the NVIDIA AI Enterprise software platform to deploy secure and performant AI applications privately on Cloudera
SANTA CLARA, Calif. and NEW YORK, Oct. 08, 2024 (GLOBE NEWSWIRE) - Cloudera, the only true hybrid platform for data, analytics, and AI, today launched Cloudera AI Inference powered by NVIDIA NIM microservices, part of the NVIDIA AI Enterprise platform. As one of the industry’s first AI inference services to provide embedded NIM microservice capability, Cloudera AI Inference uniquely streamlines the deployment and management of large-scale AI models, allowing enterprises to harness their data’s true potential to advance GenAI from pilot phases to full production.
Recent data from Deloitte reveals that the biggest barriers to GenAI adoption for enterprises are compliance risks and governance concerns. Even so, adoption of GenAI is progressing at a rapid pace, with over two-thirds of organizations increasing their GenAI budgets in Q3 this year. To mitigate these concerns, businesses must run AI models and applications privately, whether on premises or in public clouds. This shift requires secure and scalable solutions that avoid complex, do-it-yourself approaches.
Cloudera AI Inference protects sensitive data from leaking to non-private, vendor-hosted AI model services by providing secure development and deployment within enterprise control. Powered by NVIDIA technology, the service helps to build trusted data for trusted AI with high-performance speeds, enabling the efficient development of AI-driven chatbots, virtual assistants, and agentic applications impacting both productivity and new business growth.
The launch of Cloudera AI Inference comes on the heels of the company’s collaboration with NVIDIA, reinforcing Cloudera’s commitment to driving enterprise AI innovation at a critical moment, as industries navigate the complexities of digital transformation and AI integration.
Developers can build, customize, and deploy enterprise-grade LLMs with up to 36x faster performance using NVIDIA Tensor Core GPUs and nearly 4x throughput compared with CPUs. The seamless user experience integrates UI and APIs directly with NVIDIA NIM microservice containers, eliminating the need for command-line interfaces (CLI) and separate monitoring systems. The service integration with Cloudera’s AI Model Registry also enhances security and governance by managing access controls for both model endpoints and operations. Users benefit from a unified platform where all models—whether LLM deployments or traditional models—are seamlessly managed under a single service.
Additional key features of Cloudera AI Inference include:
- Advanced AI Capabilities: Utilize NVIDIA NIM microservices to optimize open-source LLMs, including Llama and Mistral, for cutting-edge advancements in natural language processing (NLP), computer vision, and other AI domains.
- Hybrid Cloud & Privacy: Run workloads on premises or in the cloud, with VPC deployments for enhanced security and regulatory compliance.
- Scalability & Monitoring: Rely on auto-scaling, high availability (HA), and real-time performance tracking to detect and correct issues, and deliver efficient resource management.
- Open APIs & CI/CD Integration: Access standards-compliant APIs for model deployment, management, and monitoring for seamless integration with CI/CD pipelines and MLOps workflows.
- Enterprise Security: Secure model access with service accounts, access control, lineage, and auditing features.
- Risk-Managed Deployment: Conduct A/B testing and canary rollouts for controlled model updates.
“Enterprises are eager to invest in GenAI, but it requires not only scalable data but also secure, compliant, and well-governed data,” said industry analyst Sanjeev Mohan. “Productionizing AI at scale privately introduces complexity that DIY approaches struggle to address. Cloudera AI Inference bridges this gap by integrating advanced data management with NVIDIA's AI expertise, unlocking data's full potential while safeguarding it. With enterprise-grade security features like service accounts, access control, and audit, organizations can confidently protect their data and run workloads on prem or in the cloud, deploying AI models efficiently with the necessary flexibility and governance.”
“We are excited to collaborate with NVIDIA to bring Cloudera AI Inference to market, providing a single AI/ML platform that supports nearly all models and use cases so enterprises can both create powerful AI apps with our software and then run those performant AI apps in Cloudera as well,” said Dipto Chakravarty, Chief Product Officer at Cloudera. “With the integration of NVIDIA AI, which facilitates smarter decision-making through advanced performance, Cloudera is innovating on behalf of its customers by building trusted AI apps with trusted data at scale.”
“Enterprises today need to seamlessly integrate generative AI with their existing data infrastructure to drive business outcomes,” said Kari Briski, vice president of AI software, models and services at NVIDIA. “By incorporating NVIDIA NIM microservices into Cloudera's AI Inference platform, we're empowering developers to easily create trustworthy generative AI applications while fostering a self-sustaining AI data flywheel.”
These new capabilities will be unveiled at Cloudera's premier AI and data conference, Cloudera EVOLVE NY, taking place Oct. 10. These latest updates deepen Cloudera’s commitment to elevating enterprise data from pilot to production with GenAI.
About Cloudera
Cloudera is the only true hybrid platform for data, analytics, and AI. With 100x more data under management than cloud-only vendors, Cloudera empowers global enterprises to transform data of all types, on any public or private cloud, into valuable, trusted insights. Our open data lakehouse delivers scalable and secure data management with portable cloud-native analytics, enabling customers to bring GenAI models to their data while maintaining privacy and ensuring responsible, reliable AI deployments. The world's largest brands in financial services, insurance, media, manufacturing, and government rely on Cloudera to use their data to solve what seemed impossible, today and in the future.
To learn more, visit Cloudera.com and follow us on LinkedIn and X. Cloudera and associated marks are trademarks or registered trademarks of Cloudera, Inc. All other company and product names may be trademarks of their respective owners.
Contact
Jess Hohn-Cabana
cloudera@v2comms.com