Compare · LLM Guard

LLM Guard alternative for image and audio prompt injection

LLM Guard (Protect AI) is the de-facto open-source guardrail library for LLM applications. Its PromptInjection scanner is one of the strongest free options in the category — and it is, by design, a text-only library. If your attack surface includes user-uploaded images or voice input, LLM Guard cannot see those bytes, and the right move is almost always to add a second detector at a different point in the pipeline rather than to replace anything.

TL;DR

Keep LLM Guard on the text path. Put Glyphward in front of any image or audio that lands in your model. The two compose cleanly because they look at different bytes — Glyphward never tries to compete with LLM Guard on text PI, and LLM Guard does not try to read pixels. Free tier on Glyphward (10 scans/day, no card) is enough to benchmark against your own samples before you commit.

Where LLM Guard sits in 2026

LLM Guard ships as a Python package under an MIT-style licence, with a maintained registry of input scanners (Anonymize, BanSubstrings, BanTopics, Code, Language, PromptInjection, Toxicity, Secrets, …) and output scanners (NoRefusal, Sensitive, Bias, …). The community ships new scanner classes, the Hugging Face transformer backing PromptInjection gets re-trained, and the integration cost for a Python LLM app is essentially zero — pip install, instantiate, call scan_prompt().

That is the design surface, and it is excellent. It is also explicitly text-in / text-out. There is no ImagePromptInjection scanner that takes bytes; BanSubstrings works on strings, not pixels. If you want to inspect an image for FigStep- or AgentTypo-style typographic injection before it reaches your VLM, LLM Guard does not have a hook for that — you would have to write one, train it, host the model, maintain a corpus of known payloads, and version it as new attack variants emerge. That is a real engineering programme, not a config change.

Architectural difference

	LLM Guard	Glyphward
Distribution	OSS Python library, self-host	Managed API, drop-in REST + SDK
Primary modality	Text in / text out	Image bytes + audio bytes
PI coverage	Text PromptInjection (transformer-backed)	OCR + CLIP + text-in-image head + waveform anomaly + transcript filter
Image PI	Not in scope	FigStep / AgentTypo / typographic / indirect-via-image
Audio PI	Not in scope	WhisperInject-class + spoken-jailbreak + ultrasonic carriers
Pricing	Free (BYO compute)	Free 10/day · $29/mo Pro · $99/mo Team
Lock-in	None (OSS)	Standard REST, no SDK lock
Operational ownership	You host, monitor, update	We host the detector and corpus

The story is not "LLM Guard is missing something" — it is that LLM Guard solves the text half of a problem whose multimodal half lives at a different layer of the stack. Typographic PI walks past character-level text detectors because the attack vector never becomes text inside your application; the model reads it directly off the rendered pixels. A Python guardrail library that runs on a string cannot, by construction, see what the VLM sees.

The run-both pattern

The deployment most multimodal teams converge on:

Text path — system prompt + user text + retrieved context → LLM Guard.scan_prompt() with PromptInjection, BanTopics, and any custom rule set you maintain. Block or sanitise as your policy requires.
Image path — every uploaded image (avatar, screenshot, document, attachment) → POST glyphward.com/v1/scan before it is forwarded to the VLM. Block on score > threshold, log on score in the grey band, pass on clean.
Audio path — every uploaded clip or live ASR-bound stream → Glyphward audio scanner before STT, ideally co-inspected with the transcript LLM Guard already produces.
Cross-modal join — keep a per-request trace ID so a flagged image + a clean transcript can be reviewed as a single composite incident, not three orphaned signals.

Two independent detectors at two pipeline stages is also the architectural posture we argue for in detail in our cornerstone post on why text scanners miss image PI — no single PI detector should be load-bearing for the whole stack, and adding the modality you do not yet cover gives you the largest marginal lift.

When to pick which

Stay on LLM Guard alone if your application is text-only and you have no plans to accept user-uploaded images or audio. There is no marginal value to layering Glyphward on a pure-text RAG.
Add Glyphward if you are about to ship — or already shipped — image upload or voice features, want a drop-in detector with a curated multimodal-PI corpus, and would rather not run and version a pixel-level model in-house. The free tier covers benchmarking.
Run both as the default for any multimodal app. They are not in the same category once you look at what they read.

What a switch costs

There is no switch — that is the point. Adding Glyphward to an existing LLM Guard deployment is one new HTTP call on the upload path. No Python dependency change, no retraining, no migration of allow-lists or block-lists. If you decide to leave Glyphward later, you remove the call; LLM Guard keeps running. The lock-in surface is intentionally tiny.

Get early access · See full market comparison