Tin tức ngành

Cập nhật những diễn biến mới nhất trong công nghệ giọng nói AI, tổng hợp giọng nói và bối cảnh pháp lý đang thay đổi

Microsoft Unveils MAI-Voice-1: Hyper-Realistic Speech Generation from Just One Minute of Audio
Technology

Microsoft Unveils MAI-Voice-1: Hyper-Realistic Speech Generation from Just One Minute of Audio

Microsoft launches three new foundational AI models including MAI-Voice-1, which delivers hyper-realistic voice synthesis and custom brand voice creation, marking a major leap in enterprise TTS capabilities.

👤 FreeReadText Team📅 April 2, 2026
MicrosoftMAI-Voice-1Enterprise TTSVoice SynthesisFoundry
Đọc thêm
ElevenLabs Reaches $11 Billion Valuation, Eyes IPO as Voice AI Becomes Enterprise Standard
Business

ElevenLabs Reaches $11 Billion Valuation, Eyes IPO as Voice AI Becomes Enterprise Standard

AI voice startup ElevenLabs raises $500 million at an $11 billion valuation, tripling its worth in just over a year while forging major partnerships with IBM and planning a potential IPO.

👤 FreeReadText Team📅 February 4, 2026
ElevenLabsFundingIPOIBM PartnershipEnterprise AI
Đọc thêm
Global AI Voice Regulation Tightens: EU AI Act Deepfake Rules Take Effect as Voice Cloning Crosses 'Indistinguishable Threshold'
Regulation

Global AI Voice Regulation Tightens: EU AI Act Deepfake Rules Take Effect as Voice Cloning Crosses 'Indistinguishable Threshold'

As voice cloning technology reaches human-level quality, regulators worldwide respond with new laws — the EU AI Act's deepfake labeling rules, the US ELVIS Act, and emerging biometric voice data protections reshape the industry landscape.

👤 FreeReadText Team📅 March 15, 2026
EU AI ActELVIS ActVoice CloningDeepfakeBiometric DataCompliance
Đọc thêm
OpenAI Launches Voice Engine to the Public: Real-Time Conversational TTS Now Available to All Developers
Technology

OpenAI Launches Voice Engine to the Public: Real-Time Conversational TTS Now Available to All Developers

After over a year of limited preview, OpenAI opens Voice Engine to all API developers, introducing real-time streaming TTS with emotional awareness and 40+ language support at significantly reduced pricing.

👤 FreeReadText Team📅 March 28, 2026
OpenAIVoice EngineReal-Time TTSAPIConversational AI
Đọc thêm
Google DeepMind Brings Studio-Quality TTS to Smartphones with SoundStorm 2 Edge — No Internet Required
Technology

Google DeepMind Brings Studio-Quality TTS to Smartphones with SoundStorm 2 Edge — No Internet Required

Google DeepMind announces SoundStorm 2 Edge, a compact on-device TTS model that runs entirely on mobile hardware, delivering studio-quality voice synthesis without cloud connectivity and opening new possibilities for offline accessibility.

👤 FreeReadText Team📅 March 20, 2026
Google DeepMindSoundStorm 2On-Device AIMobile TTSAccessibility
Đọc thêm
AI Dubbing Market Surges Past $2 Billion as Hollywood, Streaming Giants, and Game Studios Embrace Automated Localization
Business

AI Dubbing Market Surges Past $2 Billion as Hollywood, Streaming Giants, and Game Studios Embrace Automated Localization

The AI-powered dubbing and localization market crosses the $2 billion mark in Q1 2026, driven by adoption from Netflix, Disney+, and major game publishers seeking to reach global audiences at a fraction of traditional costs.

👤 FreeReadText Team📅 April 5, 2026
AI DubbingLocalizationNetflixGamingVoice ActingStreaming
Đọc thêm
Apple Unveils 'Personal Voice 2.0' in iOS 20: On-Device Voice Cloning Creates Your Digital Twin in 3 Minutes
Technology

Apple Unveils 'Personal Voice 2.0' in iOS 20: On-Device Voice Cloning Creates Your Digital Twin in 3 Minutes

Apple announces Personal Voice 2.0 at its spring event, allowing users to create a highly realistic clone of their own voice in just 3 minutes of recording — all processed entirely on-device with Apple Silicon, positioning it as the privacy-first alternative to cloud-based voice AI.

👤 FreeReadText Team📅 April 8, 2026
AppleiOS 20Personal VoiceOn-Device AIPrivacyVoice Cloning
Đọc thêm
Spotify Rolls Out AI Voice Translation for Podcasts Globally: Your Favorite Hosts Now Speak 40 Languages in Their Own Voice
Business

Spotify Rolls Out AI Voice Translation for Podcasts Globally: Your Favorite Hosts Now Speak 40 Languages in Their Own Voice

Spotify launches its AI-powered podcast translation feature worldwide, using voice cloning technology to automatically dub podcasts into 40 languages while preserving each host's unique voice characteristics — opening 100,000+ shows to global audiences overnight.

👤 FreeReadText Team📅 April 7, 2026
SpotifyPodcast TranslationVoice CloningLocalizationStreaming AudioCreator Economy
Đọc thêm
FDA Clears First AI Voice Assistant for Clinical Use: Voice-Based Patient Screening Enters the Hospital
Technology

FDA Clears First AI Voice Assistant for Clinical Use: Voice-Based Patient Screening Enters the Hospital

The FDA grants its first clearance for an AI voice assistant designed for clinical patient interaction, allowing automated voice-based symptom screening and triage in emergency departments — marking a historic milestone for voice AI in healthcare.

👤 FreeReadText Team📅 April 10, 2026
Healthcare AIFDAVoice AssistantClinical AIPatient ScreeningHippocratic AI
Đọc thêm
Meta Releases Llama-Voice: First Fully Open-Source TTS Model to Match Commercial Giants in 50+ Languages
Technology

Meta Releases Llama-Voice: First Fully Open-Source TTS Model to Match Commercial Giants in 50+ Languages

Meta drops Llama-Voice under an Apache 2.0 license, delivering near state-of-the-art voice synthesis, zero-shot voice cloning from 10 seconds of audio, and 52-language coverage — all runnable on a single consumer GPU.

👤 FreeReadText Team📅 April 12, 2026
MetaLlama-VoiceOpen SourceMultilingual TTSVoice CloningHugging Face
Đọc thêm
NVIDIA Launches Voice Foundry NIM: Blackwell-Optimized Microservices Cut Real-Time TTS Costs by 70%
Technology

NVIDIA Launches Voice Foundry NIM: Blackwell-Optimized Microservices Cut Real-Time TTS Costs by 70%

NVIDIA unveils Voice Foundry, a dedicated suite of NIM inference microservices for TTS and STT optimized for Blackwell GB200 hardware, promising sub-80ms first-token latency and 70% lower per-character costs for enterprise voice applications.

👤 FreeReadText Team📅 April 15, 2026
NVIDIAVoice FoundryNIMBlackwellEnterprise InfrastructureTensorRT
Đọc thêm
Audible Opens AI-Narrated Audiobook Catalog to 400,000 Backlist Titles — Narrators Split on Landmark Royalty Model
Business

Audible Opens AI-Narrated Audiobook Catalog to 400,000 Backlist Titles — Narrators Split on Landmark Royalty Model

Amazon's Audible launches the industry's largest AI-narrated audiobook catalog, adding 400,000 previously unnarrated titles using voice clones of consenting narrators, with a first-of-its-kind per-listen residual model that splits the narration community.

👤 FreeReadText Team📅 April 17, 2026
AudibleAmazonAI NarrationAudiobooksVoice ActingRoyaltiesSAG-AFTRA
Đọc thêm