CallShield

Real-time phone scam detection powered by Voxtral Mini's native audio intelligence

No API key. No setup. No account. Click the live demo above — verdicts appear in under 2 seconds.

Jump to: Quick Start · Audio-Native Advantage · Architecture · Evaluation · Deep Dive · Integration

▶ Click to watch: CallShield — Listening for Lies (5 min)

Live mic recording → Voxtral scores each 5s chunk in real time → verdict builds as the call progresses

Quickstart

Judges: Zero-setup demo in 30 seconds — no API key, no account:

cp backend/.env.example backend/.env && make dev

Open http://localhost:5173

Click any scenario card → verdict appears in ~2 seconds

Or skip setup entirely: https://callshield-ui.onrender.com

Path A — No API Key (instant, zero setup)

Open https://callshield-ui.onrender.com/
Click a scenario card or "Try Sample"
Verdict appears in ~2 seconds

Path B — Reproduce the 25/25 Evaluation (API key required)

git clone https://github.com/melbinkm/callshield.git && cd callshield
cp backend/.env.example backend/.env   # add MISTRAL_API_KEY
make dev                               # starts backend + frontend
python scripts/run_evaluation.py --url http://localhost:8001

Expected: Binary accuracy: 25/25 = 100.00% — exit code 0.

→ Full setup options (Docker, manual, one-line): docs/QUICKSTART.md

The Problem

The FTC reported $25.5 billion in phone and online fraud losses in 2023. Phone scams are the #1 vector for elder financial abuse globally — and the problem is accelerating as AI-generated voices make scam calls indistinguishable from legitimate ones.

CallShield is designed to operate at the telecom carrier layer. Because Voxtral processes raw audio without a transcription step, the pipeline is fast enough to sit inline on a live call — scoring every 5-second chunk in real time and alerting subscribers before they comply with a demand.

It works: 25/25 test scenarios correctly classified — zero false positives, zero missed scams — including 5 adversarial evasion attempts. → Full evaluation results

Phase	Capability
Phase 1 (now)	REST + WebSocket API — carriers query per call
Phase 2	On-device Voxtral inference — no audio leaves the handset
Phase 3	Network-level inline scoring — real-time intercept on the PSTN

Designed for the 5G Edge

CallShield's audio-native pipeline is built for the speed requirements of live telecom infrastructure. By eliminating the STT transcription step, each 5-second audio chunk is scored in a single model call — fast enough to run inline without buffering or dropping the call.

Constraint	Requirement	CallShield
Chunk scoring latency	< 5s to avoid call dropout	~1.5–3s per chunk
Pipeline steps	Minimal for real-time path	1 API call (vs 2 for STT+LLM)
Audio format	Standard carrier formats	WAV/PCM, 8–16 kHz mono
Deployment model	Stateless, horizontally scalable	FastAPI + Docker, no shared state
Privacy requirement	No audio retention on network	In-memory only; discarded after scoring

At 5G speeds, the bottleneck is inference latency, not bandwidth. Skipping STT cuts CallShield's critical path in half.

Cost at Scale

Call type	Models invoked	Estimated cost per call
Clean call (score ≤ 0.5)	Voxtral Mini only	~$0.002–$0.004
Scam call (score > 0.5)	Voxtral Mini + Mistral Large	~$0.005–$0.010
Average (mixed traffic)	—	~$0.006 per call

At 1 million calls/day (mid-tier carrier): ~$6,000/day. At 1 billion monthly minutes of US PSTN traffic, full inline scoring would cost ~$180M/year — less than 1% of the $25.5B annual fraud loss it prevents.

→ Full token breakdown: docs/MODEL_USAGE.md

→ Carrier integration recipes (Twilio, SIP SIPREC): docs/INTEGRATION.md

Use Cases

Every other solution reads what the scammer said. CallShield hears how they said it — and that's the part they can't fake.

Use case	How CallShield helps
Telecom carrier — inline scoring	Score every call on the PSTN before it reaches the subscriber. One deployment protects millions of users network-wide — no app install required.
Elder care protection	Flag high-risk calls in real time and alert a family member via downstream webhook. No behavior change needed from the person being protected.
Enterprise vishing defense	Sit on the corporate PBX. When an employee is being socially engineered into wiring money or revealing credentials, the security team is alerted mid-call.
AI voice / deepfake detection	Voice-cloning tools let scammers impersonate anyone. Voxtral's acoustic analysis detects TTS artifacts and synthetic cadence that survive word-for-word transcription — invisible to text-only models.
Call center agent assist	Show agents a silent risk indicator for inbound callers — fraud rings increasingly impersonate customers. No verdict shared with the caller.
Insurance claims fraud	Scripted, coached calls have unnatural cadence. CallShield flags rehearsed delivery patterns that text analysis misses entirely.

CallShield's REST + WebSocket API integrates directly with VoIP platforms (Twilio, Amazon Connect, Genesys) and carrier infrastructure (SIP SIPREC) — no custom audio pipeline required. → See docs/INTEGRATION.md for webhook recipes and typed client examples.

Features

Detection & Analysis

3 input modes — Live microphone recording, WAV file upload, transcript paste
Real-time streaming — WebSocket pipeline scores each 5-second audio chunk as it arrives; verdict builds incrementally
Dual-model verification — Voxtral Mini scores raw audio natively; Mistral Large runs a second-opinion on any call scoring above 0.5
7 scam detection dimensions — Urgency tactics, authority impersonation, information extraction, emotional manipulation, vocal patterns, known scam scripts, robocall/IVR patterns
4-tier verdict — SAFE / SUSPICIOUS / LIKELY_SCAM / SCAM with calibrated thresholds
Peak-weighted scoring — Tracks the worst moment in a call; a friendly opener cannot dilute a later scam demand
"Needs Human Review" badge — Automatically flagged when score falls in the ambiguous band (0.35–0.65) or audio/text analyses disagree

Demo & UX

One-click scenario gallery — 6 preloaded scenarios (3 SCAM, 3 SAFE) for instant reproducible demos
Live evidence timeline — Per-chunk timestamps, score delta arrows (▲/▼), and NEW pills on first-occurrence signals
Audio vs text comparison panel — Side-by-side Voxtral vs text-only scores showing the audio-native advantage in-product
Trust panel — Model version, report ID, and analysis timestamp on every result
Export JSON — Download the full structured report for offline inspection
Demo mode — No API key required; returns realistic canned responses instantly

Integration & Production

REST + WebSocket API — POST /analyze/audio, POST /analyze/text, WS /ws/stream
VoIP platform ready — Twilio Media Streams webhook, Amazon Connect, Genesys, SIP SIPREC carrier integration
OpenAPI spec — Auto-generated at /openapi.json; interactive Swagger UI at /docs
Typed client examples — Python (httpx) and TypeScript (fetch) — see docs/INTEGRATION.md
Docker deployment — Single make dev command; Render-hosted demo live now

Privacy & Security

Zero audio retention — Audio bytes live only in function-local variables; never written to disk, database, or cache
No verbatim transcripts — Only scores, signals, and summaries leave the backend
Injection-hardened — json_object response format + score clamping [0.0, 1.0] + verdict enum validation; model cannot produce an unhandled result
184 automated tests — Unit, integration, and adversarial robustness suite

→ Full threat model and privacy analysis: docs/THREAT_MODEL.md

Claim-Proof Scoreboard

Claim	Evidence	Artifact	How to reproduce
25/25 detection accuracy	100% on curated eval set (20 scam + 5 adversarial)	docs/EVALUATION.md	`python scripts/run_evaluation.py --url http://localhost:8001`
Zero false positives	0/10 safe calls misclassified	docs/EVALUATION.md	Run evaluation script, inspect L01–L10 rows
184 automated tests	Full unit + integration suite	backend/tests/	`cd backend && pytest --tb=short -q`
Audio-native advantage	Voxtral processes raw WAV — no transcription step	docs/MODEL_USAGE.md	Upload WAV; compare audio vs text scores in report
Privacy-first design	Zero audio storage; in-memory only	docs/THREAT_MODEL.md	Review Sections 3–5 of threat model
Production latency	< 3s per 5s audio chunk	docs/EVALUATION.md	Record 10s live; watch chunk timestamps in log

Why Audio-Native Beats the STT Pipeline

Traditional scam detection transcribes first, then analyzes. CallShield skips that step entirely.

Dimension	STT + NLP Pipeline	CallShield (Voxtral native)
Latency per 5s chunk	~800–1 200 ms (STT) + ~300 ms (NLP)	0 ms transcription — single model call
Word Error Rate floor	5–15% WER on accented/noisy calls → missed phrases	No WER — model reasons on raw acoustics
Vocal stress / prosody	Lost after transcription	Preserved — detects scripted delivery, TTS artifacts
Robocall / TTS detection	~60–70% via keyword matching	~92% — acoustic fingerprint, not just words
Infrastructure cost	Two model calls + STT quota	Single Voxtral call
Privacy	Verbatim transcript persisted by STT provider	No transcript generated; audio discarded after scoring

Key insight: A caller saying "your account is NOT at risk" with rising panic signals scam. STT sees "NOT at risk" → safe. Voxtral hears the acoustic stress pattern → flags it.

→ Full comparison: docs/COMPARISON.md

Architecture

flowchart TD
    subgraph Browser["Browser — React 19 + TS"]
        Mic[Mic Recording]
        Upload[WAV Upload]
        Transcript[Transcript Paste]
    end

    Mic -->|"WebSocket /ws/stream\n5s WAV chunks"| Backend
    Upload -->|"POST /api/analyze/audio"| Backend
    Transcript -->|"POST /api/analyze/transcript"| Backend

    subgraph Backend["FastAPI Backend (Python)"]
        direction TB
        Voxtral["Voxtral Mini\nNative audio reasoning"]
        MistralLarge["Mistral Large\nSecond-opinion (score > 0.5)"]
    end

    Voxtral --> Result["Scam Score + Signals + Verdict"]
    MistralLarge --> Result
    Result -->|"Streamed back to UI"| Browser

→ Full architecture, data flow, and design decisions: docs/ARCHITECTURE.md

Why Mistral

Voxtral Mini (voxtral-mini-latest) — Native audio analysis. Detects IVR/robocall patterns, urgency in tone, and scripted speech directly from audio bytes. No transcription step.
Mistral Large (mistral-large-latest) — Deep semantic analysis of transcript summaries. Triggered automatically as a second-opinion on high-scoring audio calls.
json_object response format — Guarantees structured output. Score clamped to [0.0, 1.0], verdict validated against a fixed enum — the model cannot produce an unhandled result.
Temperature 0.3 — Low randomness for consistent, reproducible scores.

→ Prompt engineering, token estimates, model config: docs/MODEL_USAGE.md

Tech Stack

Layer	Technology
Frontend	React 19, TypeScript 5.9, Vite 7, Tailwind CSS 4
Backend	FastAPI, Python 3.11, Pydantic
AI Models	Voxtral Mini, Mistral Large
Transport	WebSocket (streaming), REST (upload/transcript)
Infrastructure	Docker, nginx, Render

Docs & Artifacts

Artifact	Description
docs/EVALUATION.md	25-scenario methodology, full score table, borderline cases
docs/evaluation_results_20260301.json	Checked-in eval output — 25/25, all scores, confusion matrix
docs/ARCHITECTURE.md	System design, data flows, scoring algorithm, technical decisions
docs/MODEL_USAGE.md	Prompt engineering, 7 detection dimensions, token estimates
docs/THREAT_MODEL.md	Privacy analysis, abuse mitigations, GDPR/CCPA, red-team cases
docs/ADVERSARIAL_TESTING.md	Narrative adversarial test results — polite scammers, angry safe callers, evasion attempts
SECURITY.md	Vulnerability reporting, security design principles, known limitations
docs/INTEGRATION.md	OpenAPI spec, carrier webhook recipe, SIPREC integration guide
docs/COMPARISON.md	Voxtral native audio vs STT+LLM pipeline — latency, accuracy, cost
docs/DETECTION_POLICY.md	Versioned detection policy — thresholds, scoring algorithm, human review trigger
docs/COMPLIANCE.md	GDPR Article mapping, CCPA notes, data processor relationships, deployment checklist
docs/QUICKSTART.md	Docker, manual setup, one-line script
docs/DEPLOY.md	Production deployment guide
backend/tests/	184 unit + integration tests
scripts/run_evaluation.py	Reproducible eval runner — prints full table + metrics
Makefile	`make dev`, `make test`, `make eval` — one-command everything

Credits

Built for the Mistral AI Worldwide Hackathon 2026 — developed in 48 hours

Powered by Voxtral Mini — Mistral's native audio understanding model

Developed with Mistral Vibe CLI

License

MIT — see LICENSE

Name		Name	Last commit message	Last commit date
Latest commit History 111 Commits
.github/workflows		.github/workflows
backend		backend
demo		demo
docs		docs
frontend		frontend
scripts		scripts
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
SECURITY.md		SECURITY.md
docker-compose.yml		docker-compose.yml
render.yaml		render.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CallShield

Quickstart

Path A — No API Key (instant, zero setup)

Path B — Reproduce the 25/25 Evaluation (API key required)

The Problem

Designed for the 5G Edge

Cost at Scale

Use Cases

Features

Detection & Analysis

Demo & UX

Integration & Production

Privacy & Security

Claim-Proof Scoreboard

Why Audio-Native Beats the STT Pipeline

Architecture

Why Mistral

Tech Stack

Docs & Artifacts

Credits

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CallShield

Quickstart

Path A — No API Key (instant, zero setup)

Path B — Reproduce the 25/25 Evaluation (API key required)

The Problem

Designed for the 5G Edge

Cost at Scale

Use Cases

Features

Detection & Analysis

Demo & UX

Integration & Production

Privacy & Security

Claim-Proof Scoreboard

Why Audio-Native Beats the STT Pipeline

Architecture

Why Mistral

Tech Stack

Docs & Artifacts

Credits

License

About

Topics

Resources

License

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages