# AI Voice Agents

> AI systems that communicate through natural speech, handling phone calls, voice commands, and spoken conversations in real-time.

Category: Applications

Source: https://chipp.ai/ai/glossary/ai-voice-agents

AI voice agents are AI systems that communicate through natural speech, enabling real-time voice conversations over phone calls, VoIP, and other audio channels. They combine speech recognition, language understanding, AI reasoning, and speech synthesis to have natural-sounding phone conversations.

The voice agent pipeline includes: speech-to-text (converting the caller's voice to text), language understanding (processing the text with an LLM), response generation (creating an appropriate response), text-to-speech (converting the response back to natural-sounding speech), and conversation management (handling turn-taking, interruptions, and silence).

Key capabilities of modern AI voice agents include: natural-sounding voices (indistinguishable from human speech in many cases), real-time conversation (sub-second response times), multilingual support (handling calls in 100+ languages), tool integration (booking appointments, looking up information during calls), and sentiment detection (recognizing emotional cues and adjusting tone).

Common use cases include: receptionist duties (answering calls, routing, taking messages), appointment scheduling (checking availability and booking in real-time), customer support (resolving common issues over the phone), outbound calls (follow-ups, reminders, surveys), and emergency triage (assessing urgency and routing appropriately).

Platforms like Chipp enable builders to create voice agents using LiveKit technology, with customizable voices, personalities, and integrations, deployable with a phone number and no coding required.
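The pipeline stages described above (speech-to-text, LLM response, text-to-speech, and conversation management) can be illustrated with a minimal Python sketch. This is not how Chipp or LiveKit implement voice agents; every name here (`VoiceAgent`, `fake_stt`, `fake_respond`, `fake_tts`) is a hypothetical placeholder, and the stubs pass UTF-8 text around in place of real audio so the example runs without any speech services.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# Hypothetical stage signatures (assumptions for illustration, not a vendor API).
SpeechToText = Callable[[bytes], str]                 # caller audio -> transcript
Responder = Callable[[List[Tuple[str, str]]], str]    # conversation history -> reply text
TextToSpeech = Callable[[str], bytes]                 # reply text -> audio


@dataclass
class VoiceAgent:
    stt: SpeechToText
    respond: Responder
    tts: TextToSpeech
    history: List[Tuple[str, str]] = field(default_factory=list)

    def handle_turn(self, caller_audio: bytes) -> bytes:
        """Run one caller turn through the pipeline and return reply audio."""
        text = self.stt(caller_audio).strip()
        if not text:
            # Conversation management: an empty transcript is treated as silence.
            reply = "Are you still there?"
        else:
            self.history.append(("caller", text))
            reply = self.respond(self.history)
        self.history.append(("agent", reply))
        return self.tts(reply)


# Stub stages standing in for real speech and language services.
def fake_stt(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_respond(history: List[Tuple[str, str]]) -> str:
    last_caller_text = history[-1][1]
    if "appointment" in last_caller_text.lower():
        return "Sure, what day works for you?"
    return "How can I help you today?"


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


agent = VoiceAgent(stt=fake_stt, respond=fake_respond, tts=fake_tts)
```

Calling `agent.handle_turn(b"I need an appointment")` runs a single turn end to end. A production agent would typically stream audio through each stage and handle interruptions mid-reply, which this turn-by-turn sketch leaves out.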
## Related Terms

- [AI Agents](https://chipp.ai/ai/glossary/ai-agents.md): Autonomous AI systems that can perceive their environment, make decisions, and take actions to achieve specific goals.
- [Conversational AI](https://chipp.ai/ai/glossary/conversational-ai.md): AI systems designed to engage in natural, contextual dialogue with humans across text and voice channels.
- [Natural Language Processing (NLP)](https://chipp.ai/ai/glossary/natural-language-processing.md): The field of AI focused on enabling computers to understand, interpret, and generate human language in useful ways.
- [Multimodal AI](https://chipp.ai/ai/glossary/multimodal-ai.md): AI systems that can process and generate multiple types of data (text, images, audio, and video) within a single model.

---

This term is part of the [Chipp AI Glossary](https://chipp.ai/ai/glossary), a reference of AI concepts written for builders and businesses. Build AI agents with no code at https://chipp.ai.