Amazon Nova Sonic

Amazon Nova understanding models deliver advanced intelligence with unmatched price-to-performance.

Amazon Nova Sonic powers real-time, human-like voice conversations with natural tone and responsiveness. Supporting up to 300k tokens and optimized for English (US, UK) and Spanish dialogue, Nova Sonic enables enterprises to build voice-driven applications, contact center AI, and interactive agents that feel seamless to end users. Integrated with Amazon Bedrock features such as Knowledge Bases and Agents, Nova Sonic provides production-ready infrastructure for scalable voice AI deployments.

Why Nova Sonic?

Amazon Nova Sonic is built for natural, real-time conversations that make applications sound and respond like a human.

Use-Cases

Organizations use Nova Sonic to power contact center assistants, build interactive voice copilots, enable multilingual customer engagement, and deliver real-time conversational AI experiences.

What it's best for

Nova Sonic is best for scalable, production-grade voice AI where latency, accuracy, and natural tone are critical.

Amazon Nova Lite - Halo Radius
Image Generated with Amazon Nova

MODEL CARD

Amazon Nova Sonic

Text to Speech + Dialogue

Generates natural, conversational speech from text with multilingual support.

Context

Handles up to 300k tokens, enabling extended voice interactions.

Latency

Optimized for real-time dialogue with minimal delay.

Cost

Balanced for scalable deployments of conversational AI across enterprises.

Your questions answered

Common questions about Amazon Nova Sonic

Nova Sonic is optimized for English (US, UK) and Spanish, with support for real-time conversational use cases.

Unlike text or multimodal models, Nova Sonic specializes in real-time speech and dialogue generation.

Yes. Nova Sonic integrates with Amazon Bedrock Knowledge Bases and Agents, allowing enterprises to build conversational AI that connects to proprietary data.

Typical use cases include contact center automation, voice-driven copilots, interactive agents, and multilingual customer engagement platforms.

Amazon Nova Models

Compare Amazon Nova Models and Capabilities

Amazon Nova Micro

A low-latency, text-only model optimized for cost-efficient workloads. It supports over 200 languages, offers up to 128k token capacity, and can be fine-tuned for custom use cases.

Amazon Nova Lite

A high-speed multimodal model designed to handle text, image, and video inputs with exceptional responsiveness. It supports fine-tuning, processes up to 300k tokens, and works across 200+ languages.

Amazon Nova Pro

Amazon Nova Pro offers the best balance of accuracy, speed, and cost for general-purpose AI tasks. It supports 200+ languages, handles up to 300k tokens, and is fully fine-tunable for specialized needs.

Amazon Nova Premier

Nova Premier is the most advanced model in the Nova family, built for complex reasoning and high-stakes tasks. It supports 1 million tokens, over 200 languages, and is ideal for use as a teacher model in distillation workflows.

Amazon Nova Canvas

A customizable image generation model that turns text into visuals with precision and control. It supports prompts up to 1,024 characters, fine-tuning, and is optimized for English inputs.

Amazon Nova Reel

Generate high-quality video from text or image prompts, enabling fast visual storytelling. It supports up to 512 input characters and is optimized for English-language inputs.

Amazon Nova Sonic

Power real-time, human-like voice conversations with natural responsiveness and tone. It supports up to 300k tokens and is optimized for English (US, UK) and Spanish dialogue.

Amazon Nova Act

An autonomous agent model accessible via SDK, trained to interact with web browsers like a human user. It can click elements, perform searches, and answer questions—enabling real task execution, not just simulation.
Scroll to Top