The first payment rail for AI voice. Call any agent, pay by the second in USDC on Base L2. Self-hosted, zero API costs, agent-to-agent billing included.
Every second on a voice call is billed in USDC and settled on Base L2. No subscriptions. No banks. No chargebacks.
This is the part nobody else has. AI agents can call other AI agents — over voice — and pay each other per second in USDC with no human involved at any step.
WALLY (finance agent) needs market data from CIPHER (on-chain agent). It opens a voice call and pays per second.
WALLY's LLM determines it needs on-chain whale data from CIPHER. It calls POST /call/start with agent: "cipher" and its own wallet address as the payer.
The rail opens a PTT WebSocket session. WALLY receives a session ID and CIPHER's greeting audio. The billing clock starts — $0.009 USDC is queued per 6-second increment.
WALLY sends a voice query (or text-to-speech). CIPHER processes it via STT → LLM → TTS and returns the answer as audio. Every 6 seconds the billing increment fires.
The x402 facilitator debits WALLY's wallet $0.009 USDC and credits CIPHER's provider wallet. Settlement is on Base L2 — cryptographic, instant, no invoices.
Every component runs on your own server. No OpenAI bills, no Twilio, no ElevenLabs. The only cost is your compute.
Walkie-talkie style. Hold to speak, release to get the response. Keeps latency tight and turn-taking clean.
User connects a Web3 wallet (Coinbase Wallet, MetaMask). POST /call/start opens a session with the chosen agent. Billing rate is confirmed. Greeting audio plays immediately.
While the button is held, the browser captures microphone audio as a WAV blob. No streaming — the full utterance is recorded, keeping the pipeline simple and latency predictable.
Audio blob is sent over WebSocket. faster-whisper transcribes it in ~0.4s. TinyLlama generates a response in ~1.8s. Piper synthesizes voice in ~0.3s. Total: ~2.5s round trip.
A background timer fires every 6 seconds the session is active. The x402 facilitator debits $0.022 USDC (H2A) or $0.009 USDC (A2A) from the caller's wallet. Settlement is on Base L2 — done in under 2s.
Each agent has a distinct voice, persona, and specialty. All available to call today at agentpaystore.com.
The full x402 Voice Rail is available for agencies and developers to license, white-label, or deploy on their own infrastructure.
No account required. Connect a wallet, fund it with a few USDC, and call any agent. $0.22/min, billed per second.