Agents, Freemium, Open Source

PIPECAT

Open-source framework for voice and multimodal conversational AI

BSD-2-Clause

ABOUT

Building real-time voice and multimodal AI agents means orchestrating speech recognition (STT), language models (LLM), text-to-speech (TTS), and transport layers, all streaming with very low latency over WebRTC or WebSocket connections. This is complex to implement from scratch. Pipecat provides composable pipelines, 40+ service integrations, and a transport abstraction, so developers can wire STT, LLM, TTS, and image generation services into working voice and multimodal agents without building the streaming infrastructure themselves.
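
The idea of a composable streaming pipeline can be illustrated with a minimal sketch in plain Python. This is not Pipecat's API: the stage functions (`fake_stt`, `fake_llm`, `fake_tts`), the `compose` helper, and the `mic` source are hypothetical stand-ins that only show how frames could flow through chained STT, LLM, and TTS stages as an async stream.

```python
import asyncio
from typing import AsyncIterator, Callable, List

# A stage consumes an async stream of frames and yields transformed frames.
# All names below are illustrative assumptions, not Pipecat classes.
Stage = Callable[[AsyncIterator[str]], AsyncIterator[str]]


async def fake_stt(frames: AsyncIterator[str]) -> AsyncIterator[str]:
    # Stand-in for speech-to-text: strip the "audio:" marker to get a transcript.
    async for audio in frames:
        yield audio.removeprefix("audio:")


async def fake_llm(frames: AsyncIterator[str]) -> AsyncIterator[str]:
    # Stand-in for a language model: echo the transcript back as a reply.
    async for text in frames:
        yield f"You said: {text}"


async def fake_tts(frames: AsyncIterator[str]) -> AsyncIterator[str]:
    # Stand-in for text-to-speech: wrap the reply in a speech marker.
    async for text in frames:
        yield f"speech<{text}>"


def compose(stages: List[Stage], source: AsyncIterator[str]) -> AsyncIterator[str]:
    # Chain the stages so each one consumes the previous stage's output stream.
    stream = source
    for stage in stages:
        stream = stage(stream)
    return stream


async def mic() -> AsyncIterator[str]:
    # Stand-in for a transport delivering audio chunks (e.g. over WebRTC).
    for chunk in ["audio:hello", "audio:what time is it"]:
        yield chunk


async def run_demo() -> List[str]:
    pipeline = compose([fake_stt, fake_llm, fake_tts], mic())
    return [out async for out in pipeline]


if __name__ == "__main__":
    print(asyncio.run(run_demo()))
```

Because each stage only depends on the stream interface, a stage can be swapped (for example, a different TTS stand-in) without touching the others, which is the property the pipeline description relies on.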

INSTALL
pip install pipecat-ai

INTEGRATION GUIDE

1. Build voice assistants with natural, streaming conversations and interruption handling
2. Create AI companions such as coaches, meeting assistants, and interactive characters
3. Develop customer support and intake bots with guided conversational flows and function calling

TAGS

voice-ai, multimodal, realtime, conversational-ai, streaming, tts, stt, pipeline, webrtc