Tin tức ngành

Cập nhật những diễn biến mới nhất trong công nghệ giọng nói AI, tổng hợp giọng nói và bối cảnh pháp lý đang thay đổi

Microsoft Unveils MAI-Voice-1: Hyper-Realistic Speech Generation from Just One Minute of Audio

Microsoft launches three new foundational AI models including MAI-Voice-1, which delivers hyper-realistic voice synthesis and custom brand voice creation, marking a major leap in enterprise TTS capabilities.

Tin tức ngành

Microsoft Unveils MAI-Voice-1: Hyper-Realistic Speech Generation from Just One Minute of Audio

ElevenLabs Reaches $11 Billion Valuation, Eyes IPO as Voice AI Becomes Enterprise Standard

Global AI Voice Regulation Tightens: EU AI Act Deepfake Rules Take Effect as Voice Cloning Crosses 'Indistinguishable Threshold'

OpenAI Launches Voice Engine to the Public: Real-Time Conversational TTS Now Available to All Developers

Google DeepMind Brings Studio-Quality TTS to Smartphones with SoundStorm 2 Edge — No Internet Required

AI Dubbing Market Surges Past $2 Billion as Hollywood, Streaming Giants, and Game Studios Embrace Automated Localization

Apple Unveils 'Personal Voice 2.0' in iOS 20: On-Device Voice Cloning Creates Your Digital Twin in 3 Minutes

Spotify Rolls Out AI Voice Translation for Podcasts Globally: Your Favorite Hosts Now Speak 40 Languages in Their Own Voice

FDA Clears First AI Voice Assistant for Clinical Use: Voice-Based Patient Screening Enters the Hospital

Meta Releases Llama-Voice: First Fully Open-Source TTS Model to Match Commercial Giants in 50+ Languages

NVIDIA Launches Voice Foundry NIM: Blackwell-Optimized Microservices Cut Real-Time TTS Costs by 70%

Audible Opens AI-Narrated Audiobook Catalog to 400,000 Backlist Titles — Narrators Split on Landmark Royalty Model

Google Launches Gemini 3.1 Flash TTS: 70+ Languages, Multi-Speaker Dialogue, and a Top Spot on the Artificial Analysis Leaderboard

OpenAI Launches GPT-Realtime-2: Voice Models with GPT-5-Class Reasoning, Live Translation, and Streaming Transcription

Microsoft Launches MAI-Voice-2 at Build 2026: Expressive Speech and Zero-Shot Voice Cloning Across 15 Languages

Wispr Hits ~$2 Billion Valuation as AI Voice Dictation Becomes a Workplace Standard

FTC Begins Enforcing the TAKE IT DOWN Act: Platforms Face $53,088-Per-Violation Penalties for AI Deepfakes

Poland Government Takes Stake in ElevenLabs, Launches AI Lab to Build Voice AI from Europe

ElevenLabs Launches Dubbing v2: Emotion-Preserving AI Dubbing Across 90+ Languages

ElevenLabs Partners with UK Government to Bring Voice AI to Public Services, Doubles London Headquarters

Rumik Launches Silk Mulberry 1.5: 'Describe a Voice Into Existence' with Plain-Language Prompts, Matching Commercial TTS Giants at 95% Lower Cost

Michael Caine's AI Voice Narrates 13-Hour 'The Odyssey' Audiobook — 20 AI Characters, Original Score, Built by 4 Producers in 6 Weeks

Five9 Launches Voice AI Agents with ElevenLabs, Deepgram, and OpenAI Under the Hood — Targeting Legacy IVR Replacement

xAI Launches Voice Agent Builder: No-Code Platform Harnesses Grok Voice to Beat GPT and Gemini in Telephony Benchmarks

Bland.ai Raises $50M Series C After 180 Investor Rejections, Now Powers 3.5 Million Voice Calls Per Week

NetEase Youdao Releases Confucius4-TTS: Open-Source 14-Language Voice Cloning from Just 3 Seconds of Audio

NO FAKES Act Unanimously Passes Senate Judiciary Committee, Creating Federal Voice and Likeness Protection

Kotoba Technologies Raises $10 Million to Bring Real-Time Voice AI to East Asian Languages

ViiTorVoice-NAR Goes Open Source: First TTS Model That Edits Single Words Inside Finished Audio

OpenAI Launches GPT-Live: Full-Duplex Voice Model Lets ChatGPT Listen and Speak Simultaneously

Gradium Raises $100M Seed Round Backed by Nvidia to Build Ultra-Low-Latency Voice AI

Tencent Cloud Partners with Inworld AI to Deliver One-Stop Real-Time Voice AI with Sub-130ms Latency Across 100+ Languages