Question 1

How is the cost estimate calculated?

Accepted Answer

The estimate is based on IBM's annual Cost of a Data Breach Report methodology, adapted for AI prompt leaks. The model multiplies industry-specific per-record costs (healthcare: $250/record, finance: $210, tech: $180, legal: $190, e-commerce: $165) by company size multipliers, data sensitivity weights, and detection-time factors. These numbers include direct costs (fines, legal fees, notification) and indirect costs (customer churn, reputation damage, operational disruption). Real-world AI leaks at Samsung (source code), Google (training data extraction), and Samsung again (ChatGPT ban after employee data leak) confirm the multiplier effect — small initial exposures cascade.

Question 2

Why are API key leaks so expensive?

Accepted Answer

A single leaked API key can cascade into multiple downstream breaches. If an AWS access key appears in an LLM prompt and that prompt is logged by the provider, anyone with log access can pivot into your cloud infrastructure. The 2023 Capgemini/Tenable breach began with a single exposed secret in a code repository. In AI contexts, the damage compounds because LLM providers log prompts for 30-90+ days — the key is exposed across multiple retention windows, not just one transmission.

Question 3

How does PurfectShield prevent these costs?

Accepted Answer

Shield is a local desktop application that runs on your machine. It sits between your apps and LLM providers, redacting sensitive information before it ever leaves your device. API keys are tokenized into opaque placeholders, PII is replaced with type-preserving synthetic data, and source code is filtered by entropy analysis. Because redaction happens before TLS encryption, no sensitive data ever reaches the network, provider logs, or training pipelines. Detection time drops from months to zero — the leak never happens in the first place.

Question 4

What's the difference between a regular data breach and an AI prompt leak?

Accepted Answer

Traditional breaches target stored data — databases, file servers, backups. AI prompt leaks target data in transit, moving through LLM provider infrastructure you don't control. The key differences: (1) you may never know it happened — providers don't notify you when an employee reads your prompt logs, (2) the data spreads across multiple systems instantly (provider logs, sub-processors, training pipelines), and (3) the legal framework is newer and murkier — is a prompt a 'transmission' under data processing agreements? Is the provider a processor or a controller? These ambiguities increase legal costs.

Question 5

Can't I just use provider contracts and DPAs to protect myself?

Accepted Answer

Data Processing Agreements (DPAs) are necessary but insufficient. They govern what the provider promises to do with your data, but they don't prevent accidental exposure. A misconfigured logging pipeline, a rogue employee with admin access, or a provider sub-processor incident bypasses your DPA's protections. Shield provides technical enforcement — redaction at the source — which complements contractual protections. In regulatory investigations, demonstrating technical controls carries more weight than pointing to a signed document.

Question 6

Does Shield work with any LLM provider?

Accepted Answer

Yes. Shield is vendor-agnostic by design. It operates as a local HTTPS proxy — any application or SDK that makes HTTP requests to an LLM API can route through Shield by setting one environment variable. This means Shield works with Anthropic, OpenAI, Google, DeepSeek, local models via Ollama, and any OpenAI-compatible endpoint. You configure it once per machine, not per provider. The filter packs are also provider-agnostic — they parse JSON request bodies regardless of which vendor's API format you're using.

The Real Cost of an AI Data Leak

Leak Cost Estimator

What Leaks in a Single Prompt

API Keys & Tokens

Source Code

Customer PII

Health Records (PHI)

Financial Data

System Prompts & Architecture

The Cost Multiplier Effect

Immediate Response

Legal & Notification

Regulatory & Remediation

Real-World Scenarios

The API Key Cascade

The Customer Data Exposure

The PHI Breach

Stop the leak before it starts

Frequently Asked Questions