Phone calls, SMS, email, WhatsApp — provisioned, routed, billed, and compliant. One server, any provider, any agent.
But production AI communication needs more than a wrapper.
Hard-coded to one provider. Switching means rewriting your entire integration from scratch.
Agent disconnects mid-call and the caller hears silence. No voicemail, no transfer, no recovery.
TCPA time-of-day rules, DNC lists, CAN-SPAM, GDPR consent — gaps that become lawsuits.
No per-agent cost tracking, no spending caps, no way to monetize when you deploy for agents.
Unsigned webhooks, no rate limiting, no replay prevention. Open doors for abuse.
Caller speaks Spanish, your agent only works in English. No translation, no reach.
Your AI agent is the brain. Butt-Dial is the telephone system.
The server never generates AI responses. It handles transport, compliance, and delivery. Your agent stays in control.
Twilio, Vonage, Resend, ElevenLabs, OpenAI TTS — all pluggable. Switch in config, not in code.
One API call, under 10 seconds. Phone number, SMS, email, WhatsApp — all channels ready.
Features that take months to build. Included.
Complete voice infrastructure. From one-way recordings to live AI conversations and conference calls.
Per-agent language settings. Caller speaks one language, agent works in another. Translated in both directions.
Your AI agent takes real actions during a live phone call. Not after — during.
Pluggable provider architecture. Swap telephony, email, TTS, or STT providers without touching application code.
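To make the idea concrete, here is a minimal sketch of what a pluggable provider layer can look like. It is an illustration under stated assumptions, not Butt-Dial's actual code: the `SmsProvider` interface, the stub adapters, the `REGISTRY` table, and the `sms_provider` config key are all hypothetical names.

```python
from typing import Callable, Dict, Protocol


class SmsProvider(Protocol):
    """Hypothetical interface: anything that can send an SMS and return an ID."""

    def send_sms(self, to: str, body: str) -> str: ...


class StubTwilio:
    """Stand-in adapter; a real one would call the vendor SDK here."""

    def send_sms(self, to: str, body: str) -> str:
        return f"twilio:{to}"


class StubVonage:
    """Second stand-in adapter with the same method shape."""

    def send_sms(self, to: str, body: str) -> str:
        return f"vonage:{to}"


# Maps a config value to an adapter factory (hypothetical registry).
REGISTRY: Dict[str, Callable[[], SmsProvider]] = {
    "twilio": StubTwilio,
    "vonage": StubVonage,
}


def build_sms_provider(config: dict) -> SmsProvider:
    """Pick the adapter named in config; the key name is an assumption."""
    name = config["sms_provider"]
    try:
        return REGISTRY[name]()
    except KeyError:
        raise ValueError(f"unknown SMS provider: {name!r}") from None


def send_reminder(provider: SmsProvider, to: str) -> str:
    """Application code depends only on the interface, never on a vendor."""
    return provider.send_sms(to, "Your appointment is tomorrow at 10:00.")
```

With this shape, changing `{"sms_provider": "twilio"}` to `{"sms_provider": "vonage"}` swaps the vendor while `send_reminder` stays untouched.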
Self-hosted by design. Message content passes through — never stored. Credentials encrypted at rest. Logs redacted automatically.
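As an illustration of automatic log redaction, the sketch below scrubs a log record before it is written. The field names in `SENSITIVE_KEYS`, the placeholder strings, and the two regular expressions are assumptions made for this example; they are deliberately simple and would need tuning for real traffic.

```python
import re

# Deliberately simple patterns for illustration only.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\+\d{7,15}")  # E.164-style numbers with a leading "+"

# Hypothetical field names whose values are always dropped.
SENSITIVE_KEYS = {"auth_token", "api_key", "password", "body"}


def redact(record: dict) -> dict:
    """Return a copy of a flat log record with secrets and contact data masked."""
    clean = {}
    for key, value in record.items():
        if key in SENSITIVE_KEYS:
            clean[key] = "[REDACTED]"
        elif isinstance(value, str):
            value = EMAIL_RE.sub("[EMAIL]", value)
            value = PHONE_RE.sub("[PHONE]", value)
            clean[key] = value
        else:
            clean[key] = value
    return clean
```

Running every record through a function like this at the logging boundary keeps message bodies and credentials out of log files even when a caller forgets to strip them.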
Per-agent cost tracking with tiered plans. Deploy for agents and monetize from day one.
Send and receive WhatsApp voice notes. Text in, voice out. Voice in, text out. Automatic transcription with closed captions.
Every conversation is an isolated tunnel. Thousands of concurrent sessions per agent. Messages never leak between sessions.
Verify who you're talking to. Send OTP on one channel, verify on another. The cross-channel identity bridge.
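A cross-channel check of this kind can be sketched as a small one-time-code store: the code is issued and sent on one channel (say SMS), and whatever the user reads back on another channel (say a voice call) is verified against it. The `OtpStore` class, its 5-minute default lifetime, and the single-attempt policy are assumptions for this example, not the product's documented behavior.

```python
import hashlib
import hmac
import secrets
import time


class OtpStore:
    """In-memory one-time codes keyed by an identity string (hypothetical)."""

    def __init__(self, ttl_seconds: float = 300, now=time.time):
        self._ttl = ttl_seconds
        self._now = now  # injectable clock, handy for tests
        self._pending = {}  # identity -> (code digest, expiry timestamp)

    @staticmethod
    def _digest(code: str) -> str:
        return hashlib.sha256(code.encode()).hexdigest()

    def issue(self, identity: str) -> str:
        """Create a 6-digit code; the caller sends it on channel A."""
        code = f"{secrets.randbelow(10**6):06d}"
        self._pending[identity] = (self._digest(code), self._now() + self._ttl)
        return code

    def verify(self, identity: str, code: str) -> bool:
        """Check a code received on channel B. Each issued code allows one attempt."""
        entry = self._pending.pop(identity, None)
        if entry is None:
            return False
        digest, expires_at = entry
        if self._now() > expires_at:
            return False
        # Constant-time comparison avoids leaking how many characters matched.
        return hmac.compare_digest(digest, self._digest(code))
```

The store never learns which channel carried the code, which is what lets issuing and verification happen on different channels.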
Agents find and message each other directly. Build multi-agent workflows where AI agents collaborate without human intermediaries.
Beyond text: location pins, contact cards, polls, interactive buttons, typing indicators, read receipts, reactions, and more.
No helmet. No cors package. Every security layer built from scratch.
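One such layer, signed webhooks with replay prevention, can be sketched with the standard library alone. This is a generic HMAC scheme for illustration, not the signature format of Twilio, Vonage, or any other vendor; the `WebhookVerifier` class, the `timestamp.body` message layout, and the 5-minute window are assumptions.

```python
import hashlib
import hmac
import time


class WebhookVerifier:
    """Generic HMAC-SHA256 webhook check with a freshness window (hypothetical)."""

    def __init__(self, secret: bytes, max_age: float = 300, now=time.time):
        self._secret = secret
        self._max_age = max_age
        self._now = now  # injectable clock, handy for tests
        self._seen = set()  # signatures already accepted

    def sign(self, timestamp: int, body: bytes) -> str:
        """Sign "timestamp.body" so the timestamp cannot be altered separately."""
        message = str(timestamp).encode() + b"." + body
        return hmac.new(self._secret, message, hashlib.sha256).hexdigest()

    def verify(self, timestamp: int, body: bytes, signature: str) -> bool:
        # Reject stale or far-future requests.
        if abs(self._now() - timestamp) > self._max_age:
            return False
        # Constant-time comparison of expected and supplied signatures.
        if not hmac.compare_digest(self.sign(timestamp, body), signature):
            return False
        # Reject an exact repeat of a request that was already accepted.
        if signature in self._seen:
            return False
        self._seen.add(signature)
        return True
```

In this sketch the set of seen signatures grows without bound; a longer-lived service would evict entries once they are older than the freshness window.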
Three steps from zero to a fully connected AI agent.
Create an account and get your security token. Link your AI agent via MCP in under a minute.
Add your Twilio, Vonage, or Resend credentials in the admin panel. Test with one click.
Your AI agent can now call, text, email, and WhatsApp — across 5 channels, 16 languages.
Everything you need to monitor, debug, and run in production.
Real-time view of agents, calls, messages, and system health.
Export to Grafana, Datadog, or any metrics backend.
JSON logs with correlation IDs across every request.
Test everything without live API calls. Safe for development.
The missing infrastructure layer between AI agents and the real world.
Choose the right voice pipeline for your use case. Switch anytime from the admin panel.
Single model. STT+LLM+TTS in one WebSocket. Sub-1s latency. 70+ languages.
Premium voice quality. Custom LLM brain. 5000+ voices. Managed platform.
Mix any STT + LLM + TTS. Deepgram, Cartesia, OpenAI — your choice, your control.
Deploy anywhere. No usage fees, no vendor dashboards, no data leaving your network.