Question 1

What is Squirrelops AI Deception?

Accepted Answer

Squirrelops AI Deception is a deception and attribution platform for LLM-exposed surfaces. It sits in front of a customer-facing language model and inspects every incoming prompt with a two-layer detection pipeline (rule-based detection and ML detection). Clean prompts go to the real model untouched. Suspicious ones (jailbreak attempts, credential probing, prompt injection, tool-call abuse) are routed to a decoy backed by a model fine-tuned to engage convincingly without leaking real secrets. On the v1.0.4 internal adversarial test suite, the pipeline captured 98.5% of attack turns with zero false positives on benign traffic.
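The inspect-then-route flow described above can be sketched roughly as follows. This is a minimal illustration, not the product's implementation: the rule patterns, the placeholder second layer, and the names `inspect` and `route` are all assumptions made for the example.

```python
import re
from dataclasses import dataclass
from typing import Callable

# Hypothetical rule patterns for illustration only; the shipped ruleset
# is not described in this FAQ.
RULE_PATTERNS = [
    re.compile(r"ignore (all )?previous instructions", re.IGNORECASE),
    re.compile(r"\b(api[_ ]?key|password|secret)\b", re.IGNORECASE),
]

# Placeholder for the second layer: a real deployment would call a trained
# classifier here. This stand-in only flags long base64-like runs.
ENCODED_RUN = re.compile(r"[A-Za-z0-9+/]{40,}")


@dataclass
class Verdict:
    suspicious: bool
    layer: str  # "rules", "ml", or "none"


def rule_layer(prompt: str) -> bool:
    return any(p.search(prompt) for p in RULE_PATTERNS)


def ml_layer(prompt: str) -> float:
    """Stand-in score in [0, 1]; assumed, not the real model."""
    return 1.0 if ENCODED_RUN.search(prompt) else 0.0


def inspect(prompt: str, ml_threshold: float = 0.5) -> Verdict:
    # Layer 1: rules. Layer 2: classifier score.
    if rule_layer(prompt):
        return Verdict(True, "rules")
    if ml_layer(prompt) >= ml_threshold:
        return Verdict(True, "ml")
    return Verdict(False, "none")


def route(prompt: str,
          real_model: Callable[[str], str],
          decoy_model: Callable[[str], str]) -> str:
    # Clean prompts reach the real model; suspicious ones go to the decoy.
    target = decoy_model if inspect(prompt).suspicious else real_model
    return target(prompt)
```

In this sketch the caller passes both models as plain callables, so the routing decision can be exercised without either model being present.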

Question 2

How is this different from an LLM guardrail product?

Accepted Answer

Guardrails sit in-band and refuse: when an attacker tries a jailbreak, the model returns an 'I cannot help with that' message and the attacker iterates. Squirrelops engages: it routes the attacker into a decoy that responds with plausible-but-fabricated content and issues trackable fake credentials, turning every attempted abuse into a labeled, attributable signal. Guardrails block. Squirrelops attributes.
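One way to picture a trackable fake credential is a token that carries a keyed tag and is recorded against the session that received it. The sketch below assumes an HMAC-based tag, an in-memory registry, and a made-up `sk_decoy_` prefix; none of these details come from the product and they are shown only to illustrate the idea of attribution on reuse.

```python
import hashlib
import hmac
import secrets
from typing import Dict, Optional

# Hypothetical per-deployment signing key and token prefix.
SIGNING_KEY = secrets.token_bytes(32)
PREFIX = "sk_decoy_"

# Hypothetical in-memory registry: issued token -> session that received it.
ISSUED: Dict[str, str] = {}


def _tag(session_id: str, nonce: str) -> str:
    msg = f"{session_id}:{nonce}".encode()
    return hmac.new(SIGNING_KEY, msg, hashlib.sha256).hexdigest()[:16]


def issue_fake_credential(session_id: str) -> str:
    """Mint a fake credential and remember which session it was given to."""
    nonce = secrets.token_hex(8)  # 16 hex characters
    token = f"{PREFIX}{nonce}{_tag(session_id, nonce)}"
    ISSUED[token] = session_id
    return token


def attribute(token: str) -> Optional[str]:
    """Return the originating session if the token is one of ours."""
    session_id = ISSUED.get(token)
    if session_id is None:
        return None
    nonce = token[len(PREFIX):len(PREFIX) + 16]
    tag = token[len(PREFIX) + 16:]
    # Constant-time comparison guards against forged look-alike tokens.
    if hmac.compare_digest(tag, _tag(session_id, nonce)):
        return session_id
    return None
```

If a token issued this way later appears in a login attempt or another session, `attribute` maps it back to the session that originally received it.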

Question 3

What is the threat capture rate?

Accepted Answer

98.5% on the v1.0.4 internal adversarial test suite — 66 of 67 attack turns correctly routed into the decoy, with zero false positives on benign traffic. The suite covers jailbreak, role-play injection, encoded-instruction smuggling, credential probing, and tool-call abuse.
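The headline figure follows directly from the two counts above. A short check, with `capture_rate` as a hypothetical helper name:

```python
def capture_rate(captured: int, total: int) -> float:
    """Percentage of attack turns routed into the decoy."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * captured / total


rate = capture_rate(66, 67)
print(f"{rate:.1f}%")  # prints 98.5%
```

One missed turn out of 67 accounts for the remaining 1.5%.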

Question 4

How does per-tenant tuning work?

Accepted Answer

Operators can add their own detection rules as overlays on top of the shipped ruleset. Overlays are validated at load time and are additive only: they can raise a threat score but never lower it, so the baseline detection that comes with every profile is preserved. The audit log shows exactly which rule (operator or default) fired on every event.
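The additive-only property can be illustrated with a small sketch. It assumes keyword rules with positive weights and a score capped at 1.0; the rule format, the names `load_overlay` and `score`, and the example rules are hypothetical and not the product's actual schema.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Rule:
    name: str
    keyword: str   # matched case-insensitively as a substring
    weight: float  # contribution to the threat score
    source: str    # "default" or "operator"


# Hypothetical baseline rule standing in for the shipped ruleset.
BASELINE = [
    Rule("jailbreak-phrase", "ignore previous instructions", 0.8, "default"),
]


def load_overlay(raw_rules: List[Dict]) -> List[Rule]:
    """Validate operator rules at load time; reject anything non-additive."""
    rules = []
    for raw in raw_rules:
        weight = float(raw["weight"])
        if weight <= 0:
            raise ValueError(f"rule {raw['name']!r}: weight must be positive")
        rules.append(Rule(raw["name"], raw["keyword"].lower(), weight, "operator"))
    return rules


def score(prompt: str, overlay: List[Rule]) -> Tuple[float, List[Tuple[str, str]]]:
    """Return the threat score and the (source, rule name) pairs that fired."""
    text = prompt.lower()
    fired = [r for r in BASELINE + overlay if r.keyword in text]
    total = min(1.0, sum(r.weight for r in fired))
    return total, [(r.source, r.name) for r in fired]
```

Because every accepted weight is positive, adding an overlay can only leave a score unchanged or raise it, and the second return value plays the role of the audit trail naming which rule fired.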

Question 5

What does 'reproducible profile bundle' mean?

Accepted Answer

Every profile bundle is a signed, sealed artifact. Given the source materials, our reproducibility check rebuilds the bundle and proves it is bit-identical to the one deployed — same bytes, same signature, same hash.

No supply-chain ambiguity. No “did someone swap the model on the way to production?”

The check runs in two modes: a fast staged-input rebuild (~15 seconds, for CI) and a full end-to-end rebuild (~15 minutes, for incident-response forensics).
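A bit-identical rebuild check generally depends on building the artifact deterministically and then comparing digests. The sketch below assumes the bundle is an uncompressed tar archive with normalized metadata and that SHA-256 is the hash; the actual bundle format and the signature verification step are not described here, so both are left out or assumed.

```python
import hashlib
import io
import tarfile
from typing import Dict


def build_bundle(files: Dict[str, bytes]) -> bytes:
    """Build a tar archive whose bytes depend only on file names and contents."""
    buf = io.BytesIO()
    with tarfile.open(fileobj=buf, mode="w", format=tarfile.USTAR_FORMAT) as tar:
        for name in sorted(files):  # fixed ordering
            data = files[name]
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            # Normalize metadata that would otherwise vary between builds.
            info.mtime = 0
            info.uid = info.gid = 0
            info.uname = info.gname = ""
            info.mode = 0o644
            tar.addfile(info, io.BytesIO(data))
    return buf.getvalue()


def digest(bundle: bytes) -> str:
    return hashlib.sha256(bundle).hexdigest()


def verify_rebuild(sources: Dict[str, bytes], deployed_digest: str) -> bool:
    """Rebuild from sources and compare against the deployed bundle's digest."""
    return digest(build_bundle(sources)) == deployed_digest
```

A CI job in the fast mode might call `verify_rebuild` with already-staged inputs, while a forensic run would first regenerate those inputs from scratch before making the same comparison.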

Question 6

What does a pilot engagement look like?

Accepted Answer

A pilot is a 4-to-6-week engagement: Squirrelops delivers a signed profile bundle scoped to the customer's model and use cases; the customer runs it against their own adversarial traffic (real or simulated); Squirrelops provides the detection feed and the methodology to evaluate the results. Pilots require an internal red-team or pen-test capability on the customer side.

Metric	Result
Threat capture rate (adversarial test suite)	98.5% (66 of 67 attack turns)
Tracked credentials issued (87-turn campaign)	121
False positives on benign traffic	0
Production rule update turnaround	~10 minutes
Profile-bundle reproducibility	Bit-identical rebuilds verified
Independent test suite (release gate)	815 tests passing

Dimension	LLM Guardrail	Squirrelops AI Deception
Response to a jailbreak attempt	Refuses (“I cannot help with that”)	Engages with plausible-but-fake content
What the attacker learns	That this attack didn't work; iterates	Nothing — they think it worked
Signal to defender	Refusal events (often noisy)	Labeled, attributable threat events with technique mapping
Credential leakage on novel attacks	Depends on the model behind it	Decoy model trained to emit only recognizable fakes; layered with cryptographic signing
Attribution across sessions	None	Trackable fake credentials correlate sessions; reuse anywhere points back to origin
Per-tenant policy	Usually fixed at the vendor	Additive-only operator overlays on top of baseline ruleset
Supply-chain trust	Vendor-attested	Cryptographically signed, bit-identically reproducible bundles

Turn every LLM jailbreak attempt into a labeled threat signal.

Deceive. Attribute. Scale.

Inspect. Route. Capture.

Measurable, reproducible, repeatable.

Four independent layers.

Rule-based detection

ML detection

Decoy model

Cryptographic signing

Tune detection per tenant — without weakening the baseline.

Prove what's running in production is what you tested.

LLM guardrail vs. Squirrelops AI Deception.

A new attack surface. The old tooling doesn't see it.

Pilot-ready for security teams with a defined LLM attack surface.