H
    Hestur
    Platform Comparison · 2026

    LiveKit vs Vapi

    Open-source media infrastructure vs managed voice AI platform. Real per-minute cost breakdown and the volume crossover where LiveKit's economics pull decisively ahead.

    We build production voice AI on both. This is our honest take.

    LiveKit→ production at scale
    vs
    Vapi→ fast prototypes

    TL;DR

    LiveKitProduction at scale

    Best for: engineering-led teams who need full control of the media stack, video + voice in one platform, volume above 50k min/month, self-hosted data sovereignty, or HIPAA on-prem deployments.

    • Open-source (Apache 2.0) — zero vendor lock-in
    • ~$0.006/min media cost on LiveKit Cloud
    • Video + voice in one real-time platform
    • Self-hosted option for complete data control
    VapiFast prototypes

    Best for: product teams that need a working voice agent in days, startups without dedicated infra engineers, and moderate-volume deployments under 50k min/month where managed simplicity outweighs higher per-minute cost.

    • Pre-built STT → LLM → TTS pipeline, ship fast
    • $0.05/min platform fee, no infra overhead
    • Managed cloud — no server maintenance
    • Built-in call analytics and dashboard

    Architecture

    Managed Pipeline vs Open-Source Stack

    The core difference isn't features — it's ownership. Vapi is a managed black box that handles everything. LiveKit is an open WebRTC transport layer you assemble into a pipeline using their Agents SDK.

    Vapi — managed pipeline

    Your app calls the Vapi API. Vapi handles STT routing, LLM connection, TTS synthesis, call lifecycle, and webhooks. You configure via dashboard or API; you don't write any media code.

    // your app calls

    Your App ←→ Vapi API

    ↓ managed by Vapi

    STT routing

    LLM connection (BYOK)

    TTS synthesis (BYOK)

    Phone / WebRTC

    Time to first working agent: 1–3 days

    LiveKit — open-source stack

    Your code uses the LiveKit Agents SDK (Python or Node) to assemble a pipeline — you choose and wire each plugin: STT, LLM, TTS. LiveKit handles WebRTC transport. You own every layer.

    // your code assembles

    Your Agent ←→ Agents SDK

    ↓ you wire each plugin

    STT plugin (you choose)

    LLM plugin (you choose)

    TTS plugin (you choose)

    LiveKit Server (open-source)

    Time to first working agent: 1–3 weeks

    The Real Pricing Math

    What You Actually Pay Per Minute

    LLM and TTS costs are identical on both — you bring your own keys. The difference is entirely in the platform fee. LiveKit Cloud trades integration time for a ~75% lower media cost.

    Cost componentVapiLiveKit Cloud
    Platform / media fee
    LiveKit Cloud charges per participant-minute (~$0.003 × 2)
    $0.05 / min~$0.006 / min
    Speech-to-text (Deepgram BYOK)
    Identical — both use your own Deepgram key
    ~$0.008 / min~$0.008 / min
    LLM (GPT-4o mini, BYOK)
    Same cost on both, depends on turn length
    ~$0.02–0.05 / min~$0.02–0.05 / min
    TTS (ElevenLabs Flash, BYOK)
    Identical — you bring your own ElevenLabs key
    ~$0.10–0.15 / min~$0.10–0.15 / min
    Telephony (Twilio)
    Same cost if using Twilio for PSTN on both platforms
    ~$0.009 / min~$0.009 / min
    Effective platform + STT
    LiveKit is ~75% cheaper on infrastructure alone
    ~$0.058 / min~$0.014 / min
    Typical all-in total
    LiveKit saves ~$0.05/min on same BYOK stack
    $0.23–0.33 / min$0.18–0.27 / min

    LiveKit Cloud pricing based on published participant-minute rates. Self-hosted LiveKit brings media cost near zero beyond infra. BYOK costs assume GPT-4o mini, Deepgram Nova-2, ElevenLabs Flash Turbo v2.

    Cost Crossover

    Where LiveKit's Economics Win

    LiveKit Cloud is cheaper per-minute from day one. But integrating the Agents SDK takes 2–4 engineer-weeks of setup. The question is when that investment pays off. The answer is ~10k–50k minutes/month sustained, depending on your engineering cost.

    10,000 min / mo

    Vapi~$2,300–3,300
    LiveKit Cloud~$1,800–2,700
    Monthly saving~$500 / mo

    LiveKit cheaper

    Savings are real but integration cost (~$12k–20k eng) takes 24+ months to recoup

    50,000 min / mo

    Vapi~$11,500–16,500
    LiveKit Cloud~$9,000–13,500
    Monthly saving~$2,500–3,000 / mo

    Crossover point

    Integration investment recovers in 5–8 months — this is where LiveKit wins on TCO

    100,000 min / mo

    Vapi~$23,000–33,000
    LiveKit Cloud~$18,000–27,000
    Monthly saving~$5,000–6,000 / mo

    LiveKit decisively cheaper

    Self-hosted option cuts media cost to near-zero — the gap widens further

    Feature Comparison

    Capability by Capability

    FeatureVapiLiveKit
    Voice AI agents
    Vapi via managed pipeline; LiveKit via Agents SDK — full code, full control
    Video + voice in one platform
    LiveKit handles real-time video and voice natively; Vapi is voice-only
    Open-source / self-hostable
    LiveKit server is Apache 2.0 — run on your own cloud or on-prem
    Managed cloud option
    Vapi is cloud-only; LiveKit offers both LiveKit Cloud and self-hosted
    Pre-built AI voice pipeline
    Vapi bundles STT → LLM → TTS orchestration; LiveKit requires code assembly
    Full WebRTC stack control
    LiveKit exposes the full media layer; Vapi abstracts WebRTC entirely
    SIP / PSTN support
    Both support SIP trunking; LiveKit has deeper native WebRTC↔SIP bridging
    BYOK LLM
    Both support GPT-4o, Claude, Gemini, and custom model endpoints
    BYOK STT / TTS
    Both support Deepgram, ElevenLabs, Cartesia, Azure, and others
    Multi-participant rooms
    LiveKit supports multi-user rooms natively; Vapi handles one caller per call
    Built-in analytics dashboard
    Vapi has native call dashboards; LiveKit requires custom observability setup
    HIPAA self-hosted control
    Self-hosted LiveKit gives full PHI control; Vapi HIPAA eligibility unconfirmed

    Scaling Economics

    How Each Platform Scales

    Vapi scales linearly — with your bill

    Vapi charges per minute regardless of volume. No servers to manage, no scaling events — but your cost grows exactly with usage. At $0.05/min platform fee, every 100k additional minutes adds $5,000 to your monthly infrastructure bill with no compression possible.

    • Predictable but non-compressible per-minute pricing
    • Zero infra overhead — fully managed by Vapi
    • Enterprise volume discounts available — negotiate directly

    LiveKit scales with infra — not per-minute

    Self-hosted LiveKit runs on a fixed server. A $200/month instance handles millions of participant-minutes. As volume grows, per-minute media cost collapses toward zero. At 500k min/month self-hosted, your effective media cost is ~$0.0004/min — over 100× cheaper than Vapi's platform fee alone.

    • Media cost compresses as volume grows
    • Horizontal scaling with open-source server clusters
    • Requires engineers to manage infra, upgrades, and monitoring

    The Decision Tree

    How We Choose Internally

    This is the actual decision flow we walk clients through. Answer yes to the first question that applies.

    1

    Do you need video + voice in the same platform?

    Yes

    Use LiveKit

    Only option here — Vapi is strictly voice-only

    No

    Continue ↓

    2

    Building a prototype or MVP in under 4 weeks?

    Yes

    Use Vapi

    Pre-built pipeline — working agents in days, not weeks

    No

    Continue ↓

    3

    Does your team have engineers to own the media pipeline?

    Yes

    Continue ↓

    No

    Use Vapi

    LiveKit Agents SDK requires dedicated integration work

    4

    Volume above 50k minutes / month (or heading there fast)?

    Yes

    Use LiveKit

    Economics shift decisively — integration cost recovers in months

    No

    Continue ↓

    5

    Need full data sovereignty or HIPAA self-hosted compliance?

    Yes

    Use LiveKit

    Self-host for complete PHI control and infra ownership

    No

    Continue ↓

    Final default

    When in doubt, start with Vapi.

    Ship a working agent first. When your monthly bill crosses $3,000–5,000 and you have engineering bandwidth, that's when a LiveKit migration pays for itself.

    Our Role

    We Build on Both. Here's How We Engage.

    Hestur AI is a platform-agnostic voice AI engineering firm. We don't have partner agreements with Vapi or LiveKit. We recommend the platform that fits your stage and scale, then build on it.

    Vapi engagements

    Typically early-stage and product-led companies that need a working agent quickly. We build on Vapi when speed-to-market matters more than per-minute cost — usually under 50k min/month or when the product is still validating PMF.

    Common scope: inbound support agents, sales qualification, appointment booking, customer service deflection.

    LiveKit engagements

    Typically engineering-led companies at growth stage with significant voice volume, HIPAA requirements, or multi-modal (video + voice) needs. We architect the Agents SDK pipeline, set up cloud or self-hosted infrastructure, and implement observability.

    Common scope: high-volume outbound calling, telehealth platforms, video + voice agents, on-prem enterprise deployments.

    Not Sure Which Platform Fits?

    Book a 30-minute call. We'll review your use case, volume projections, and engineering resources — and give you a clear platform recommendation with no pitch attached.