Skip to content
All posts

Evidence-grade controls are boring on purpose

June 14, 20265 min readDhruv Jain

If you sit in risk, compliance, legal, IT, or operations, the uncomfortable truth is simple: an AI control that only exists in a meeting is not a control.

It is an opinion with a calendar invite.

The board can approve the policy. Legal can review the clause. IT can bless the tool. None of that matters if, three months later, nobody can find the artifact that proves what happened.

This is where regulated firms get exposed.

They can explain the intention. They cannot prove the execution.

The artifact is the control

I keep coming back to one sentence because it makes the whole problem easier to see.

A control only exists when someone can find the artifact after the meeting is over.

That artifact can be boring. It can be a use-case register, a model approval memo, a data-retention note, a blocked-tool log, a human review record, or a vendor answer attached to the right workflow.

Boring is the point.

Regulated firms do not need AI theatre. They need proof that survives staff turnover, vendor changes, product updates, and the quarterly audit cycle.

A policy tells people what should happen. An artifact shows what did happen.

That is the difference between a governance story and a governance system.

Your committee needs fewer opinions

Most AI governance meetings create more language than proof.

People discuss risk appetite. They debate which tools are acceptable. They agree that human review is needed. They ask for a working group. The meeting ends with a note that says the firm is taking a measured approach.

That note feels responsible.

But later, when a team uses AI on a live workflow, the question becomes more specific.

Who approved this use case? What data went into it? Was the vendor allowed to retain prompts? Did a human review the output? Which department owns the risk? When was the decision last checked?

Those questions cannot be answered by tone.

They need artifacts.

Policy layerEvidence-grade controlHuman review requiredReviewer name, timestamp, and decision noteOnly approved tools allowedCurrent tool register with owner and data classVendor risk assessedAI-specific DDQ answers attached to the workflowNo confidential uploadsLogging or monitoring that shows where data movedBoard oversightBoard pack with open risks and accepted exceptions

The table is intentionally plain. Official guidance is already pointing here. The PCPD framework talks about risk assessment, human oversight, implementation, monitoring, and internal governance. The European Banking Authority's AI Act mapping lands on similar muscles: documentation, logs, risk management, traceability, human oversight, post-market monitoring, and incident reporting.

That is what makes it useful. If a control cannot be explained in a row like this, the team probably does not know where the proof lives.

Build a control that leaves a trail

The mistake is trying to govern AI from the top of the firm only.

You need board language, yes. You need policy language, yes. But the real control has to sit close to the workflow where AI is used.

That is where evidence gets created.

  • Name the workflow: Do not approve "AI" in general. Approve client-note drafting, complaint triage, contract review, or support summarization.

  • Classify the data: Separate public content, internal material, customer data, regulated records, and legal material.

  • Attach the vendor answer: Keep model training, retention, subprocessor, and audit-log answers next to the use case.

  • Record the human decision: Keep who reviewed the output, when they reviewed it, and what they changed.

  • Recheck on a schedule: Model behavior, vendor terms, and staff habits change. The control has to be reviewed after the first approval.

This does not need to be fancy at the start. When I pressure-test this with a team, I ask for one workflow and five artifacts: owner, approval note, vendor answer, data class, and review trail.

A simple register with real owners beats a beautiful policy with no operational proof.

The deeper point is that evidence-grade controls change the conversation. Instead of asking whether the firm is "comfortable with AI," you ask what the firm can prove about one specific workflow.

That question is less glamorous.

It is also much harder to dodge.

The control has to live where work happens

A lot of firms accidentally build AI governance as a committee artifact.

That means the proof lives far away from the work. The board pack is in one folder. The vendor answer is in procurement. The policy is in legal. The team using the tool is in a fourth system. When someone asks what happened, everyone has to reconstruct the story by hand.

That is fragile.

The better design is to put the evidence beside the workflow.

If the claims team uses AI to draft complaint summaries, the approval note, data class, vendor answer, and review record should sit next to that workflow. If the legal team uses AI to compare contracts, the same evidence trail should live with that use case. If a support team uses an internal assistant, the tool owner and the human review rule should be easy to find.

The control becomes less impressive on paper and more useful in practice.

This is also how you reduce fear inside the firm.

People are less anxious when they know where the line is. They are more honest when the approval path is clear. They are more likely to use the safer tool when it fits the way they already work.

Evidence-grade control design is not about making AI feel heavy.

It is about making the safe path easier to prove than the unsafe path is to hide.

Once that happens, governance stops being a quarterly scramble. It becomes a habit the firm can inspect.

There is another benefit that rarely gets discussed.

Good evidence reduces internal politics. When the artifact is clear, the conversation stops depending on who speaks with the most confidence. The record shows what was approved, what was rejected, what changed, and who owns the next review.

That makes the control calmer before the audit arrives.

Sources behind this note

When you're ready

Look at one AI workflow your team already uses this week.

If you cannot find the owner, approval note, vendor answer, data class, and review trail in under fifteen minutes, the control is still mostly a story.

Reply with the workflow you are worried about. I can help you pressure-test what proof should exist before it turns into an audit finding.

Book the 20-minute AI review

Bring the messy version: public AI usage, unclear policy, vendor pressure, or a department asking for approval. Leave with what to inspect first.

20 minutes: first exposure, first owner, next decision
Your data stays yours — NDA on day one
Book AI review

Opens Cal.com to select your slot

01

First department

Where AI usage is already creating risk, leverage, or process drift.

02

Exposure surface

The workflow, data path, or approval gap leadership cannot see yet.

03

Next decision

Audit, workshop, or private pilot scope if the risk is real.

Need context first? Read the proof, case studies or get the weekly brief.

Q2 AI readiness window

Find the shadow-AI risk before it becomes policy debt.

In 20 minutes, we'll identify the department to review first, the AI usage surface you can't see yet, and whether a readiness audit, workshop, or private AI pilot is the right next step.

NDA-ready20-minute executive reviewNo tool pitchFor regulated or data-sensitive teams

Best fit: CTOs, operators, and compliance leads who need a governed first AI use case.

Review output

Your first governed AI use case

Actionable
01

First department to review

Where AI usage is already creating leverage, risk, or hidden process drift.

02

Shadow-AI exposure surface

The workflows, data paths, and approval gaps leadership cannot currently see.

03

Approval-worthy next step

A readiness audit, workshop, or private pilot scoped for governance first.

The urgency is not hype. Once teams normalize ungoverned AI habits, cleanup becomes policy debt, retraining, and slower approvals.