Agents | Freemium | Open Source

PIPECAT

Open-source framework for voice and multimodal conversational AI

12.9k stars | BSD-2-Clause

ABOUT

Building real-time voice and multimodal AI agents means orchestrating speech recognition, language models, text-to-speech, and transport layers, all streaming at very low latency over WebRTC or WebSocket connections. That is complex to implement from scratch. Pipecat provides composable pipelines, 40+ service integrations, and a transport abstraction, so developers can wire STT, LLM, TTS, and image generation services into working voice and multimodal agents without building the streaming infrastructure themselves.
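
To make the "composable pipeline" idea concrete, here is a minimal sketch in plain Python that chains stand-in STT, LLM, and TTS stages over a stream of frames. It deliberately does not use Pipecat's own classes: the names fake_stt, fake_llm, fake_tts, compose, and mic are hypothetical and exist only for illustration, and the real services would stream audio rather than strings.

```python
import asyncio
from typing import AsyncIterator, Callable

# A stage consumes an async stream of frames and yields a transformed stream.
# Illustrative only: these are NOT Pipecat's classes or function names.
Stage = Callable[[AsyncIterator[str]], AsyncIterator[str]]


async def fake_stt(frames: AsyncIterator[str]) -> AsyncIterator[str]:
    # Stand-in for speech-to-text: strip the "audio:" tag to get a "transcript".
    async for chunk in frames:
        yield chunk.removeprefix("audio:")


async def fake_llm(frames: AsyncIterator[str]) -> AsyncIterator[str]:
    # Stand-in for a language model: produce a canned reply per transcript.
    async for text in frames:
        yield f"reply to {text}"


async def fake_tts(frames: AsyncIterator[str]) -> AsyncIterator[str]:
    # Stand-in for text-to-speech: wrap the reply to mark it as synthesized.
    async for text in frames:
        yield f"speech({text})"


def compose(*stages: Stage) -> Stage:
    # Chain stages so each one reads from the previous stage's output stream.
    def pipeline(source: AsyncIterator[str]) -> AsyncIterator[str]:
        stream = source
        for stage in stages:
            stream = stage(stream)
        return stream

    return pipeline


async def mic() -> AsyncIterator[str]:
    # Hypothetical input transport emitting two tagged audio chunks.
    for chunk in ("audio:hello", "audio:world"):
        yield chunk


async def run_demo() -> list[str]:
    pipeline = compose(fake_stt, fake_llm, fake_tts)
    return [frame async for frame in pipeline(mic())]


if __name__ == "__main__":
    print(asyncio.run(run_demo()))
```

Because every stage shares the same stream-in, stream-out shape, swapping one provider for another only changes a single argument to compose. Frames also flow through one at a time rather than in batches, which is the property that keeps latency low in a streaming design.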

INSTALL

pip install pipecat-ai

INTEGRATION GUIDE

1. Build voice assistants with natural, streaming conversations and interruption handling
2. Create AI companions such as coaches, meeting assistants, and interactive characters
3. Develop customer support and intake bots with guided conversational flows and function calling
