Question 1

What types of data can Shield redact before it reaches an LLM?

Accepted Answer

Shield can redact PII (names, emails, SSNs, phone numbers, addresses), PHI (medical record numbers, dates of birth, insurance IDs), secrets (API keys, tokens, passwords), cloud credentials (AWS, GCP, Azure keys, database connection strings), and custom patterns you define. It uses a four-layer detection stack: regex for known formats, entropy analysis for unknown tokens, keyword matching for labeled secrets, and custom user-defined patterns. All redaction happens locally on your machine before any data leaves your device.

Question 2

What happens to the redacted data? Does Shield store or log it?

Accepted Answer

No. Shield is a local desktop application — it runs entirely on your machine. Redacted content is replaced with block characters (████) in the outgoing API request. The original sensitive data never leaves your device, is never stored, logged, or cached by Shield. There is no cloud component, no telemetry, and no configuration that sends data off-device. This is architecturally different from SaaS-based guardrails that necessarily see your plaintext.

Question 3

Can Shield redact custom patterns specific to my business?

Accepted Answer

Yes. Shield's filter pack system includes a fully configurable custom pattern engine. You define regex patterns matching your proprietary data — project codes, internal filenames, customer IDs, proprietary identifiers — and Shield applies them alongside the built-in packs. Patterns are stored locally in your Shield configuration and never shared. Enterprise deployments can push standard patterns across their fleet.

Question 4

Does redaction slow down my LLM API calls?

Accepted Answer

Shield's redaction engine runs at wire speed. For typical prompt sizes (a few thousand tokens), redaction adds 5-15 milliseconds — imperceptible to users. The overhead comes from regex scanning, which is natively fast in the Rust-based engine. Even on large codebases or chat histories, the latency impact is sub-second. Shield runs as a local sidecar, not a network proxy, so there's no additional network hop.

Question 5

How does Shield handle false positives — data mistakenly flagged as sensitive?

Accepted Answer

Each filter pack can be toggled on/off per deployment. Within packs, individual patterns can be disabled or tuned. Shield also supports allowlisting — you can specify strings, patterns, or context rules that override detection. The goal is to catch real secrets without blocking legitimate prompts. Shield's transparency feature shows you exactly what was redacted and why, so you can tune your configuration based on actual usage.

Question 6

How is Shield different from prompt injection guards or content moderation APIs?

Accepted Answer

Prompt injection guards block malicious instructions; Shield redacts sensitive data. They solve different problems. Content moderation APIs (like OpenAI's moderation endpoint) classify toxicity, not data sensitivity — they won't catch an API key or a medical record number. Most importantly, SaaS-based guardrails receive your plaintext — they have to, to scan it. Shield scans locally, so the sensitive data never reaches a third party. For regulated industries, this architectural difference is the compliance boundary.

LLM Data Redaction What Gets Caught and What Doesn't

How Shield's Filter Packs Work

PII / PHI

Secrets & Tokens

Cloud Credentials

Custom / Advanced

Interactive Redaction Sandbox

Match Details — hover or click to see why each was caught

Filter Pack Reference

Frequently Asked Questions

See Shield Redact in Your Own Environment