AI Consulting

Your data.
Your AI.

Private RAG, validated ML, BI analytics and governance for regulated life sciences. Your R&D spend and SOPs get analysed where they live — not in a vendor's logs. Your prompts don't train someone else's model. Your auditors get evidence they can point to, not vibes.

The problem

This isn't theoretical.

These aren't predictions; they're the last 24 months of enterprise AI in regulated industries.

Receipts · 2023–2025 · 5 incidents, cross-sector

  • 3 leaks in 20 days (Samsung 2023 · ChatGPT)
  • IP breach during drug discovery (London pharma · 2025)
  • 83% of pharma firms have no automated AI data controls (Kiteworks · 2025)
  • 65% of top-20 pharmas ban ChatGPT outright (ZoomRx · 2024)
  • +56.4% YoY rise in AI-related security incidents (Stanford AI Index · 2025)
01

Your prompts train their models

When your scientists paste synthesis routes, clinical drafts or batch records into ChatGPT, Gemini or Claude.ai, that text becomes training data unless you explicitly opt out. OpenAI retains prompts for 30 days; others keep them longer. "We don't use your data" on the marketing page is a preference you set — not a default state.

02

No audit trail regulators accept

21 CFR Part 11 demands signed, reviewed, traceable actions. EU Annex 11 demands validated systems with change control. A chat log dumped into a vendor's opaque database is neither. When an auditor asks "who approved this prompt, and where is it retained?", a consumer LLM can't answer.

03

Data sovereignty vs. GDPR vs. patient data

Your clinical data must stay in the EU under GDPR. Your IP is governed by the contracts you signed. Consumer AI sends prompts to US servers, often with unclear subprocessor chains. For pharma, that isn't a compliance gap — it's a notifiable breach waiting to happen.

What we help with

AI with a job description.

Nine concrete use cases we've built or are actively building with pharma, biotech, and medtech clients. All of them run inside your tenant.

Private document RAG

Point an AI at your SOPs, batch records, and deviation logs — without those documents ever leaving your tenant. Questions get answered with citations. No training, no retention, no leakage.

  • SharePoint / Confluence / document-DB connectors
  • Every answer cites its source document + page
  • Tenant-local embeddings, zero-retention inference
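
A minimal sketch of the retrieval half of that loop. The chunks carry provenance so every answer can cite source and page; the term-frequency "embedding" is a dependency-free stand-in for a real tenant-local model such as BGE, and every document name below is invented.

```python
import math
import re
from collections import Counter

def embed(text):
    # Stand-in for a tenant-local embedding model (BGE, nomic, etc.):
    # a term-frequency vector keeps the sketch dependency-free.
    return Counter(re.findall(r"\w+", text.lower()))

def cosine(a, b):
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# Every chunk carries its provenance, so every answer can cite source + page.
chunks = [
    {"doc": "SOP-014 Deviation Handling", "page": 3,
     "text": "Deviations must be logged within 24 hours of discovery."},
    {"doc": "SOP-021 Batch Release", "page": 7,
     "text": "Batch records require QP review before release."},
]

def retrieve(query):
    q = embed(query)
    return max(chunks, key=lambda c: cosine(q, embed(c["text"])))

hit = retrieve("how fast must a deviation be logged?")
citation = f'{hit["doc"]}, p.{hit["page"]}'   # returned alongside the answer
```

In production the retrieved chunk goes to the model as context and the citation travels with the answer; nothing in this loop requires the documents to leave the tenant.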

Deviation & CAPA analysis

AI drafts investigation proposals and CAPA steps from your historical quality data. QA approves, rejects, or edits — humans stay in control. Every AI action logged, signed, and auditable.

  • Proposal, not autonomy — AI-assisted, QA-approved
  • Full audit trail on every AI-generated action
  • Maps cleanly into your existing eQMS workflow
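
The proposal-not-autonomy split can be as simple as a state machine whose only path to effect runs through a named human. Field names and IDs below are illustrative, not a product API.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from enum import Enum

class Status(Enum):
    PROPOSED = "proposed"   # AI-drafted; has no effect until reviewed
    APPROVED = "approved"   # QA e-signed
    REJECTED = "rejected"

@dataclass
class CapaProposal:
    deviation_id: str
    draft: str
    model_version: str
    status: Status = Status.PROPOSED
    approver: str = ""
    decided_at: str = ""

    def decide(self, qa_user, approve):
        # The decision itself is the auditable event: who, what, when.
        self.status = Status.APPROVED if approve else Status.REJECTED
        self.approver = qa_user
        self.decided_at = datetime.now(timezone.utc).isoformat()

p = CapaProposal("DEV-2025-041", "Retrain operators on line clearance.", "model-v12")
p.decide("qa.lead@example.com", approve=True)
```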

Regulatory intelligence

Continuous monitoring of FDA, EMA, Swedish MPA, and EudraLex publications. The AI surfaces what matters for your pipeline, products, and indication — in plain English, with source links.

  • FDA / EMA / MPA / EudraLex feeds
  • Filtered to your pipeline & indication
  • Weekly digest + real-time alerts on critical changes
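
Filtering to pipeline and indication can start as keyword matching over already-fetched feed entries; the terms and entries below are invented placeholders.

```python
# Hypothetical indication/pipeline terms -- in practice maintained by RA.
PIPELINE_TERMS = {"atmp", "oncology", "biosimilar"}

entries = [  # shape of parsed feed items (e.g. from an RSS parser)
    {"source": "EMA", "title": "Revised ATMP comparability guideline"},
    {"source": "FDA", "title": "Draft guidance on cosmetics labelling"},
]

def relevant(entry):
    title = entry["title"].lower()
    return any(term in title for term in PIPELINE_TERMS)

digest = [e for e in entries if relevant(e)]   # weekly digest input
```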

Validated AI in GxP production

IQ/OQ/PQ adapted for machine learning. ALCOA+ extended to AI outputs. Model change control, re-validation triggers, explainability docs — everything your quality function needs to actually deploy AI in a GxP process.

  • IQ/OQ/PQ protocols adapted for ML
  • ALCOA+ principles applied to AI outputs
  • Change control for model updates & re-training
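
One re-validation trigger can be made mechanical: score every model or prompt change against a frozen eval set and compare to the qualified baseline. The questions, baseline, and tolerance below are placeholders for values your change-control SOP would fix.

```python
# Frozen at PQ: (query, string the answer must contain). Illustrative only.
FROZEN_EVAL = [
    ("deviation logging window", "24 hours"),
    ("batch release approver", "QP"),
]
QUALIFIED_BASELINE = 1.0   # accuracy recorded during qualification
TOLERANCE = 0.05           # agreed in the change-control SOP

def score(answer_fn):
    hits = sum(expected in answer_fn(q) for q, expected in FROZEN_EVAL)
    return hits / len(FROZEN_EVAL)

def needs_revalidation(answer_fn):
    # A candidate must not fall below baseline minus tolerance.
    return score(answer_fn) < QUALIFIED_BASELINE - TOLERANCE

candidate = lambda q: "Log within 24 hours; release needs QP sign-off."
```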

R&D & financial analytics

Automated analysis of R&D spend, trial cost-per-site, and programme budgets. The AI surfaces variance, flags anomalies, and drafts management summaries — all running where your financial data already lives, not in a SaaS vendor you have to sign another DPA with.

  • R&D spend variance & anomaly detection
  • Trial cost-per-site and programme-level burn rate
  • Narrative summaries generated on demand
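
Variance flagging needs no ML to start: a simple dispersion screen over monthly burn already surfaces the month worth asking about. All figures below are invented.

```python
from statistics import mean, stdev

monthly_burn = {  # hypothetical programme-level monthly spend
    "2025-01": 410_000, "2025-02": 395_000, "2025-03": 402_000,
    "2025-04": 780_000, "2025-05": 398_000,
}

values = list(monthly_burn.values())
mu, sigma = mean(values), stdev(values)

# 1.5 sigma is a loose screen suited to short series; a production version
# would use a robust statistic (e.g. median absolute deviation).
anomalies = {m: v for m, v in monthly_burn.items() if abs(v - mu) > 1.5 * sigma}
```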

Pharmacovigilance & signal detection

AI triages adverse-event reports and drafts ICSR narratives from source documents. Cross-case signal detection on your AE database. Human review and e-sign before anything goes to regulators — compliant with your existing PV SOPs and GVP Module VI.

  • ICSR triage and case-narrative drafting
  • Cross-case signal detection on your AE database
  • Fits your PV SOP + e-signature flow
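
"Cross-case signal detection" can take many forms; one classical disproportionality screen is the proportional reporting ratio (PRR), sketched here with invented counts as an illustrative choice, not a claim about any specific deployment.

```python
def prr(a, b, c, d):
    """Proportional reporting ratio from a 2x2 contingency table.
    a: suspect drug + event of interest     b: suspect drug, other events
    c: all other drugs + event of interest  d: all other drugs, other events
    """
    return (a / (a + b)) / (c / (c + d))

def flags_signal(a, b, c, d):
    # Common screen: PRR >= 2 with at least 3 cases (full criteria
    # also include a chi-square component, omitted here).
    return a >= 3 and prr(a, b, c, d) >= 2.0
```

Anything the screen flags still goes to a human assessor; the statistic prioritises review, it never files anything.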

SOP & document authoring

AI helps draft, update, and redline SOPs, WIs, and validation protocols against your house style and regulatory expectations. Tracked changes, version control, and full diff history — not a wholesale rewrite.

  • Structured drafting against your templates
  • Diff-preserving redlines
  • QA review baked in
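
Diff-preserving redlines fall out of ordinary line diffs; the SOP text and version labels below are invented.

```python
import difflib

old = ["Deviations are logged within 48 hours.",
       "QA reviews the log monthly."]
new = ["Deviations are logged within 24 hours of discovery.",
       "QA reviews the log monthly."]

# A unified diff keeps every AI edit visible line by line -- a redline,
# not a wholesale rewrite.
redline = "\n".join(
    difflib.unified_diff(old, new, "SOP-014 v3", "SOP-014 v4 draft", lineterm="")
)
```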

Supplier & third-party risk intel

Continuous monitoring of your critical suppliers, vendors, and subprocessors. Breach disclosures, Schrems-II posture changes, SOC2 lapses, GxP-relevant news — all flagged before your legal or QA team hears about it elsewhere.

  • Breach + news monitoring by supplier
  • Schrems-II / GDPR posture tracking
  • Alerts filtered to your vendor list

Clinical trial operations assistant

Protocol-deviation tracking, site-query drafting, and enrolment signals across your study portfolio. The AI reads your TMF and CTMS and surfaces what needs attention — your CRAs focus on exceptions, not inbox.

  • TMF / CTMS integration
  • Protocol-deviation tracking & categorization
  • Site-query drafting + enrolment signals
Architecture patterns

Three ways to run AI without leaking.

Same outcomes, different trade-offs on sovereignty, cost, and capability. We pick the pattern that fits your data, your risk posture, and your ops.

Pattern A

Fully isolated

[Diagram: user → prompt → model → data, all inside your hardware]

Everything stays on your hardware. No cloud, no API, no outbound network call. Maximum sovereignty — you provide the GPUs.

  • Runtime: Ollama, vLLM, text-generation-webui
  • Models: Llama 3, Mistral, Qwen (open weights)

Trade-off: Your ops team runs it. Capability ceiling below frontier models.
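
For Pattern A the entire request loop terminates at localhost. A minimal client against Ollama's default local endpoint might look like the sketch below; the endpoint and payload shape follow Ollama's documented REST API, while the model tag is an assumption.

```python
import json
from urllib import request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local port

def build_payload(prompt, model="llama3"):
    # stream=False returns one JSON object instead of a token stream.
    return {"model": model, "prompt": prompt, "stream": False}

def ask_local(prompt, model="llama3"):
    # Nothing leaves the machine: prompt, model, and data all stay local.
    data = json.dumps(build_payload(prompt, model)).encode()
    req = request.Request(OLLAMA_URL, data=data,
                          headers={"Content-Type": "application/json"})
    with request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```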

Pattern B

Tenant-isolated cloud

[Diagram: user → prompt → model → data, all inside your cloud tenant]

Frontier-model capability with contractual data boundaries. Inputs stay in your cloud tenant, governed by your existing BAA/DPA.

  • Runtime: Azure OpenAI, AWS Bedrock, Google Vertex AI
  • Models: GPT-4o, Claude 3.5, Gemini family

Trade-off: Vendor processes your prompts — within the tenant boundary you already trust.

Recommended Pattern C

Hybrid

[Diagram: user and data stay in your tenant; documents are embedded locally, and only minimal context crosses a zero-retention API to the model]

Your documents are embedded and indexed locally. Only the minimal context needed for a query is sent — via a zero-retention API — to a frontier model.

  • Embeddings: on-prem (sentence-transformers, BGE, nomic)
  • Inference: Anthropic / OpenAI / Google zero-retention endpoints

Trade-off: Most practical for most pharma. Requires careful prompt-scrubbing & query-audit design.
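
Part of that prompt-scrubbing design can be deterministic redaction before any context crosses the tenant boundary. The identifier patterns below are hypothetical stand-ins; a real list would come from your data-classification work.

```python
import re

# Hypothetical patterns -- your real list comes from the classification matrix.
PATTERNS = [
    (re.compile(r"\b[A-Z]{2}\d{6}\b"), "[SUBJECT-ID]"),  # subject-ID shape
    (re.compile(r"\b\d{4}-\d{2}-\d{2}\b"), "[DATE]"),    # ISO dates
]

def scrub(text):
    # Applied to retrieved context before the zero-retention API call.
    for pattern, replacement in PATTERNS:
        text = pattern.sub(replacement, text)
    return text

scrubbed = scrub("Subject AB123456 dosed on 2025-03-14.")
# → "Subject [SUBJECT-ID] dosed on [DATE]."
```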

Governance & policy

Seven documents, one framework.

The artifacts your QA, IT, and Legal teams need to let AI into a regulated process. Built from our template, calibrated to your specific data classes, tools, and workflow.

  1. Acceptable Use Policy · Policy

     What's in scope, what's not. Which tools, data classes, and roles — the thing that stops shadow-AI.

  2. Data classification × AI matrix · Matrix

     For each data classification (public → regulated), which AI patterns are allowed — cloud, tenant, or on-prem.

  3. Model inventory · Inventory

     Every model and AI service in use: owner, data scope, validation state, last review. Your GxP system-inventory pattern, applied to AI.

  4. Change control for AI systems · Procedure

     Documented triggers for re-validation. Which model or prompt changes need QA sign-off — and which don't.

  5. Explainability & audit trail · Template

     For every AI action: inputs, outputs, approver, model version, retrieval source. Evidence pack, not chat logs.

  6. AI-specific incident response · Playbook

     Hallucinated outputs, prompt-based data leaks, model drift, vendor breach — distinct playbooks because the failure modes are different.

  7. AI governance committee charter · Charter

     The cross-functional body (Quality, IT, Legal, Clinical) that approves use cases, reviews incidents, sets policy. Membership and cadence included.
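
The explainability and audit-trail template reduces, per AI action, to a record like the one sketched below. Field names and values are illustrative, not a schema we ship.

```python
from dataclasses import dataclass, asdict

@dataclass(frozen=True)   # frozen: the record is evidence, never edited in place
class AiActionRecord:
    action_id: str
    prompt: str
    output: str
    model_version: str
    retrieval_sources: tuple   # (document, page) for every chunk used
    approver: str
    timestamp_utc: str

rec = AiActionRecord(
    action_id="AI-2025-00192",
    prompt="Draft CAPA for DEV-2025-041",
    output="Proposed CAPA: retrain operators on line clearance.",
    model_version="model-v12",
    retrieval_sources=(("SOP-014", 3),),
    approver="qa.lead@example.com",
    timestamp_utc="2025-06-02T09:14:00+00:00",
)
```

Serialised (for example via `asdict`) into write-once storage, these records are the evidence pack an auditor can walk: input, output, approver, model version, retrieval source.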

How we engage

Four phases, no mystery.

From "what's even happening?" to "ship it" to "keep it governed." A predictable rhythm — adapted to your scope, not a slideshow.

Phase 01

Assessment

2 weeks

Discover what AI is (and isn't) already in use, which data classes it touches, and where the governance gaps are.

  • Shadow-AI inventory (tools, users, data classes)
  • Risk & regulatory gap map
  • Scoring against MSET AI readiness framework
Phase 02

Strategy & Policy

2–4 weeks

Turn findings into a defensible strategy and the governance documents that let you actually move forward.

  • Target architecture & use-case roadmap
  • All 7 governance documents (see §5)
  • Executive + QA sign-off package
Phase 03

Implementation

Scope-dependent

Build and deploy. Private RAG, validated ML, BI analytics — whatever the strategy prioritised. Iteratively, with QA in the loop.

  • Working systems in your tenant
  • IQ/OQ/PQ and validation evidence
  • Operational runbooks
Phase 04

Ongoing governance

Quarterly retainer

We stay on as governance stewards — cadence reviews, incident response, new use cases as they emerge.

  • Quarterly governance review
  • Model inventory updates & change control
  • Incident response on retainer
Frequently asked

Things you'll want to know.

Straight answers to the questions we get most often before an engagement.

Can we just use ChatGPT or Copilot?

For personal productivity on non-regulated data, yes. For anything touching SOPs, batch records, clinical data, IP, or patient information, no. The reason isn't paranoia — it's that consumer-grade AI ships your prompts to vendor infrastructure with retention periods, unclear subprocessor chains, and no audit trail a regulator accepts.

Enterprise-tier equivalents (ChatGPT Enterprise, Claude for Work) improve the data position but still don't give you the GxP-compatible audit trail or tenant isolation regulated workflows require. We build the version that does.

What about Microsoft 365 Copilot specifically?

M365 Copilot is a special case — it runs inside your Microsoft 365 tenant, which solves the data-residency question for content already in M365. Good for: summarising Teams meetings, drafting Outlook replies, working inside Word/Excel on already-classified content.

Not good for: anything you need to prove didn't leak, cite its source, or pass a Part 11 audit. It also reads Graph data broadly — if your permission hygiene isn't perfect, it can surface documents to users who technically have access but shouldn't. We help audit your Copilot deployment and set the guardrails.

How long does a private RAG take to build?

A first working version — one document source, answering with citations — is typically 4–6 weeks. A production-ready deployment with permissions, audit trail, and QA review workflow is 2–4 months depending on scope.

The long tail is never the model — it's the data pipeline (which SharePoint sites? which SOPs are current? who should see what?) and the evaluation harness (how do you know the answers are right?). We time-box both explicitly.

Which model should we use?

It depends on what the AI is doing, where your data lives, and what your auditor will accept. For on-prem regulated workloads: Llama 3.1/3.3 70B or Mistral Large. For tenant-isolated cloud: Claude 3.5 Sonnet, GPT-4o, or Gemini 2.5. For embeddings: BGE, nomic-embed-text, or OpenAI text-embedding-3-large.

We benchmark 3–5 candidates against your actual use case before committing — and re-benchmark quarterly, because the leaderboard moves.

Do we need a GPU cluster?

For Patterns B and C (tenant-isolated cloud, hybrid), no — inference runs on vendor infrastructure. For Pattern A (fully isolated), yes, but smaller than you'd expect.

A single server with 2×H100 or 4×L40S can run a 70B-class model at useful throughput for a few hundred users. We size the hardware to actual concurrency, not a glossy brochure spec.
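
The sizing arithmetic behind that claim is back-of-envelope; every number below is an assumption to be replaced with your measured throughput and usage pattern.

```python
# Assumed: batched aggregate throughput of a 70B-class model on one server.
aggregate_tokens_per_s = 1500
# Assumed usage: one ~300-token answer per user per minute.
tokens_per_answer = 300
answers_per_user_per_min = 1

demand_per_user = tokens_per_answer * answers_per_user_per_min / 60  # tokens/s
max_concurrent_users = aggregate_tokens_per_s / demand_per_user
```

At these assumed numbers one server sustains about 300 concurrent users; the point is that actual concurrency, not peak user count, drives the hardware bill.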

What does validated AI look like for audit?

A change-controlled model card; an IQ/OQ/PQ package adapted for ML (hardware qualification, functional tests against a frozen eval set, performance tests on representative queries); an ALCOA+ audit trail capturing input, retrieval context, output, approver, and model version for every AI-generated action.

Plus a re-validation trigger policy (when does a new model version or prompt change require re-qualification?) and a use-case-specific explainability document. We build all of these from templates we've already run past auditors.

How much does this cost?

Pricing follows the phases. Phases 01 Assessment and 02 Strategy & Policy are fixed-price engagements — scoped and signed before we start. Phase 03 Implementation is scope-dependent; we quote after phases 01–02 give us real numbers, either fixed or milestone-based. Phase 04 is a quarterly retainer.

No hourly billing games — you know the number before each phase begins, starting with a short scoping call.

Less hand-waving. More working AI.

A 30-minute intake call, a two-week assessment, a defensible strategy by week six, working systems by week ten. No "AI transformation" slideshows.

We store and handle your contact-form details only to reply to you. No tracking cookies, no analytics profiling. See our privacy policy for the full picture.