AUCTC 2026 · Acadia University · June 10

The Red Team
Pressure Test

Using AI to stress-test strategy
before implementation does it for you.

Doug Langille · Manager, Digital Innovation
Nova Scotia Community College

linkedin.com/in/douglangille · digital.douglangille.ca

doug.langille@nscc.ca · share.douglangille.ca/content/red-team-pressure-test

01 / 09

The Problem

AI is a story completion engine.

Your plan looks like a story that wants to succeed. The model finishes it that way.

It's not broken. It's doing exactly what it was built to do.

"Having your brain farts flattered
and sprinkled with glitter."

02 / 09

Why This Isn't an Accident

Three forces, same direction.

Training signal — RLHF rewards agreement. Human raters prefer pleasant responses. The model learned that agreement is correct.
Business model — engagement optimization selects for responses that feel good and keep you coming back. Sycophancy is commercially incentivized.
Multi-turn drift — short generations stay honest. Long sessions and multi-turn planning conversations compound toward yes-man territory. Every time.

You cannot prompt your way around a training signal.

03 / 09

The Evidence

78.5%

sycophancy persistence across ChatGPT, Claude, and Gemini — regardless of context, model, or prompt quality

SycEval — 1,000+ interactions, 3 models — AAAI AIES 2025

96%

of simulations chose the optimistic strategy — not because it was right, but because that's what the training rewarded

Romasanta, Thomas & Levina — 6 models, 7 tensions — HBR March 2026

More sophisticated prompting and richer context did not dislodge the bias. The vocabulary of progress activates training associations regardless of strategic merit.

04 / 09

The Lineage

Red-teaming is 200 years old.

1812 — Prussian Kriegsspiel formalizes adversarial simulation as a planning discipline
1973 — Israel institutionalizes a dedicated contrarian unit after the Yom Kippur intelligence failure
1976 — CIA Team B challenges the NIE on Soviet capabilities. Lesson: the red team can be captured too
2007 — Gary Klein's pre-mortem: assume it already failed, work backwards. Prospective hindsight raises correct failure identification by 30%

This isn't a prompt trick. It's a 200-year-old discipline with documented failures — and documented conditions under which it fails.

A model trained on your institution's assumptions will produce adversarial critique bounded by those same assumptions. The red team can be captured too.

05 / 09

The Method

Three steps. One rule.

These prompts don't fix the training signal. They change the story the model is completing — from "help this succeed" to "find why it fails." That's the difference.

Step 01

Role first

Assign the adversarial role before the model sees your plan. Change the story it's completing.

Step 02

Failure modes only

Name false assumptions, unaccounted stakeholders, likely implementation failures. No solutions allowed.

Step 03

Steel-man the opposition

Make the case for a legitimate, well-informed opponent of the plan. Someone in the room who hasn't spoken yet. Use AI when no one safe to argue with is available — or when ego is in the way.

06 / 09

The Persona Technique

Ask Gordon Ramsay.

Fictional characters have no institutional stake and no career to protect. Their honesty ceiling is higher than anyone in the room.

You are Gordon Ramsay reviewing a strategic plan.
You do not lead with strengths. You do not soften feedback.
You identify every reason this plan will fail, every assumption
that's wishful thinking, and every stakeholder who's been ignored.
Be specific. Be brutal. Be right.

Here is the plan: [paste your outline]

Other personas by context: John Taffer for operational failures, Simon Cowell for communications and presentations, hostile board member for governance documents.

07 / 09

The Spectrum

Pick your spot.

Devil's Advocate

One counter-argument. Low stakes.

Socratic Inquiry

Questions only. No conclusions.

Full Red Team

Three-step sequence. Pre-committee.

Abrasive Persona

Gordon Ramsay. John Taffer. Your call.

The goal isn't a better plan. It's a plan that's been argued with.

AI adversarial review is the pre-flight checklist. Human deliberation — someone in the room making the case against the plan — is the pilot. Don't confuse the checklist for the flight.

08 / 09

Prompt Reference

Step 01+02 — Role + Failure Modes

You are a hostile expert reviewer.
Find every reason this plan will fail.
No strengths. No solutions. Identify:
false assumptions, unaccounted
stakeholders, implementation failures.
[paste your plan here]

Step 03 — Steel-Man

Someone in the room opposes this
plan. They're not obstructionist —
they have a legitimate argument
you haven't heard. Make their case.
[paste your plan here]

Gordon Ramsay Persona

You are Gordon Ramsay reviewing a
strategic plan. No strengths. No
softening. Every reason it fails,
every ignored stakeholder. Be right.
[paste your plan here]

Full Pre-Committee Audit

Premise audit · Stakeholder gaps
Implementation failure points
Evidence audit · Alt. hypothesis
Red team summary (2 sentences each)
[paste proposal here]

Did you ask it what could go wrong?

09 / 09

The Red TeamPressure Test

AI is a story completion engine.

Three forces, same direction.

Red-teaming is 200 years old.

Three steps. One rule.

Ask Gordon Ramsay.

Pick your spot.

Notes

The Red Team
Pressure Test