US-based Deepgram Partners with Fortanix and NVIDIA to Shift On-Premise Voice AI to Confidential Computing Deployments
2026-06-03 17:36
Favorite

en.Wedoany.com Reported - Recently, US-based real-time voice AI infrastructure company Deepgram partnered with data security firm Fortanix to leverage Fortanix Confidential AI and NVIDIA's confidential computing capabilities, offering voice AI deployment solutions that can run in on-premise environments for regulated industries. The solution targets scenarios such as healthcare, finance, government, enterprise customer service, and internal operations, with a focus on protecting voice data and model weights during inference.

This collaboration pushes the competition in voice AI beyond recognition accuracy, low latency, and multilingual capabilities, further into the realm of "whether it can enter highly sensitive business environments." In many industries, voice data inherently contains significant privacy and business secrets, such as doctor-patient conversations, financial transaction records, customer identity information, enterprise meeting content, ticket processing, and internal service requests. While traditional cloud-based voice recognition or voice agent solutions facilitate rapid integration, they often encounter limitations in data residency, compliance audits, model intellectual property protection, and internal security boundaries. Deepgram's adoption of an on-premise deployment path enables enterprises to run voice recognition, text-to-speech, and voice-to-voice capabilities within their own environments; Fortanix, through its Confidential AI technology and trusted execution environments, provides hardware-isolated protection for audio data and model weights during processing, reducing risks of model theft, unauthorized data access, or inference exposure to the underlying infrastructure.

The solution runs on GPUs supporting NVIDIA's confidential computing, ensuring that audio data and AI model weights remain encrypted and isolated even during active processing. Deepgram, Fortanix, and NVIDIA position this combination as a pre-integrated technology stack for high-security environments.

Voice is becoming a new entry point for enterprise systems. In the past, call center recordings, meeting minutes, quality assurance, and voice transcription were primarily post-processing workflows, where enterprises uploaded audio to service providers to obtain text, summaries, or analysis results. As real-time voice agents enter customer service, medical records, field maintenance, IT service desks, and compliance review processes, voice data directly participates in business decisions and automated execution, extending security requirements from "encryption at rest and in transit" to "protection during use." The addition of confidential computing addresses a critical bottleneck in the production deployment of voice AI: models and data also need isolation during GPU execution, preventing platform administrators, underlying operating systems, and infrastructure environments from directly accessing plaintext data or model assets. For regulated enterprises, this capability helps run real-time voice applications within their own security boundaries while maintaining low-latency interactive experiences.

Deepgram itself offers voice-to-text, text-to-speech, and voice-to-voice capabilities, supporting deployment methods such as cloud APIs, self-hosting, and on-premise APIs. The company states that over 200,000 developers and 1,400 organizations currently use its platform, processing over 50,000 years of audio and transcribing over 1 trillion words. By combining with Fortanix and NVIDIA, Deepgram can further extend its voice AI capabilities, originally more focused on developers and platform-type customers, to hospitals, banks, insurance companies, public sectors, enterprise internal service desks, and industry clients with high sovereignty data requirements. Fortanix's role centers on protecting the full lifecycle security of data, AI models, and applications in on-premise, multi-cloud, and high-security environments, particularly by connecting confidential computing, key control, zero-trust execution, and model protection.

Subsequent variables focus on enterprise implementation costs, the availability of GPU confidential computing environments, compliance audit adaptation, and the replicability of voice agents in industry workflows. If the solution can establish stable use cases in regulated industries, voice AI deployment methods will shift from a "cloud-first" approach to a hybrid architecture of "on-premise operation, protected models, and data within boundaries." For voice recognition and conversational AI companies, future competition will simultaneously focus on model performance, real-time capabilities, private delivery, confidential computing integration, and industry compliance capabilities.

This article is compiled by Wedoany. All AI citations must indicate the source as "Wedoany". If there is any infringement or other issues, please notify us promptly, and we will modify or delete it accordingly. Email: news@wedoany.com