NHS Apps and the AI Act

Umma Gohil

NHS Apps and the AI Act

Sep 29, 2025

12 min read

The EU AI Act is now enforceable, and NHS applications using artificial intelligence fall squarely within its scope. While the healthcare sector already operates under strict regulatory frameworks like DCB 0129 and DCB 0160, the AI Act introduces additional requirements that intersect with existing NHS digital standards in complex ways.

This guide examines the practical implementation challenges and provides working code examples for achieving compliance in NHS applications.

Understanding the Regulatory Intersection

NHS applications that incorporate AI for clinical decision support, risk assessment, or patient monitoring are classified as high-risk systems under the AI Act. This classification triggers specific obligations around quality management, documentation, and human oversight that must be integrated with existing NHS clinical governance requirements.

The challenge isn’t implementing either framework independently - it’s designing systems that satisfy both simultaneously while maintaining clinical safety and operational efficiency.

Implementing Compliant AI Decision Logging

Every AI decision in your NHS application needs comprehensive audit trails - it’s fundamental to patient safety and regulatory compliance. When an AI system influences clinical decisions, you need to be able to trace back exactly what factors contributed to that recommendation, particularly if patient outcomes are questioned later.

The React hook below addresses a critical safety requirement: ensuring that every AI decision can be reconstructed and audited. Notice how the humanOversightRequired flag is automatically triggered when confidence drops below 85% - this directly implements the AI Act’s requirement for human oversight of uncertain decisions, which is particularly crucial in healthcare where wrong decisions can have life-threatening consequences:

import { useState, useCallback } from "react";

interface AIDecisionLog {
  timestamp: Date;
  modelVersion: string;
  inputDataHash: string;
  prediction: any;
  confidence: number;
  explanation: string[];
  humanOversightRequired: boolean;
  clinicalContext: string;
  biasMetrics?: BiasAssessment;
}

interface BiasAssessment {
  demographicParity: number;
  equalizedOdds: number;
  calibrationScore: number;
  flaggedForReview: boolean;
}

export const useAIDecisionLogging = () => {
  const [logs, setLogs] = useState<AIDecisionLog[]>([]);

  const logAIDecision = useCallback(
    async (
      prediction: any,
      confidence: number,
      inputData: any,
      clinicalContext: string,
      modelVersion: string
    ) => {
      const inputHash = await hashSensitiveData(inputData);
      const biasMetrics = await assessBias(inputData, prediction);

      const logEntry: AIDecisionLog = {
        timestamp: new Date(),
        modelVersion,
        inputDataHash: inputHash,
        prediction,
        confidence,
        explanation: await generateExplanation(inputData, prediction),
        humanOversightRequired:
          confidence < 0.85 || biasMetrics.flaggedForReview,
        clinicalContext,
        biasMetrics,
      };

      // Dual logging: NHS clinical risk management + AI Act compliance
      await Promise.all([
        logToNHSClinicalAudit(logEntry),
        logToAIActCompliance(logEntry),
      ]);

      setLogs((prev) => [...prev, logEntry]);

      return logEntry;
    },
    []
  );

  return { logAIDecision, logs };
};

The quality gate validation process addresses a critical safety requirement; ensuring that AI systems don’t degrade over time without detection. The 90-day review cycle is based on the recognition that AI systems in healthcare need regular validation to ensure they’re still performing safely as clinical practices, patient populations, and underlying data distributions change. The evidence tracking for each quality gate creates an audit trail that’s essential for both AI Act compliance and NHS clinical governance.

The component treats non-compliance as a blocking condition, systems that fail quality gates cannot be marked as ready for clinical use. This hard stop is crucial for patient safety, preventing AI systems with known issues from continuing to influence clinical decisions.

Building Human Oversight into Clinical Workflows

The AI Act’s human oversight requirement mandates meaningful clinical review where healthcare professionals can understand, challenge, and override AI recommendations. Clinicians need to see not just what the AI recommends, but why it made that recommendation and what clinical factors it considered.

The ClinicalAIReview component below implements several critical safety mechanisms. The automatic escalation for low-confidence predictions (below 70%) ensures that uncertain AI decisions always get additional clinical oversight. The structured explanation section breaks down the AI’s reasoning into clinically relevant categories - primary factors, clinical reasoning, and contraindications - giving clinicians the context they need to make informed decisions about whether to trust or override the AI recommendation:

import React, { useState, useEffect } from "react";
import { Card, CardHeader, CardContent } from "@/components/ui/card";
import { Alert, AlertDescription } from "@/components/ui/alert";

interface AIRecommendation {
  prediction: string;
  confidence: number;
  explanation: {
    primaryFactors: string[];
    clinicalReasoning: string;
    contraindications: string[];
  };
  modelInfo: {
    version: string;
    lastValidated: Date;
  };
}

interface ClinicalReviewProps {
  recommendation: AIRecommendation;
  patientId: string;
  onReviewComplete: (decision: ClinicalDecision) => void;
}

interface ClinicalDecision {
  accepted: boolean;
  override: boolean;
  clinicalNotes: string;
  clinicianId: string;
  reviewTimestamp: Date;
}

export const ClinicalAIReview: React.FC<ClinicalReviewProps> = ({
  recommendation,
  patientId,
  onReviewComplete,
}) => {
  const [clinicalNotes, setClinicalNotes] = useState("");
  const [decision, setDecision] = useState<"accept" | "override" | null>(null);
  const [isSubmitting, setIsSubmitting] = useState(false);

  const requiresEscalation = recommendation.confidence < 0.7;

  const handleSubmitReview = async () => {
    if (!decision || !clinicalNotes.trim()) return;

    setIsSubmitting(true);

    const clinicalDecision: ClinicalDecision = {
      accepted: decision === "accept",
      override: decision === "override",
      clinicalNotes,
      clinicianId: getCurrentClinician().id,
      reviewTimestamp: new Date(),
    };

    // Log for AI Act compliance and NHS clinical governance
    await logClinicalReview({
      patientId,
      aiRecommendation: recommendation,
      clinicalDecision,
      complianceFrameworks: ["AI_ACT_ARTICLE_14", "NHS_DCB_0129"],
    });

    if (requiresEscalation || decision === "override") {
      await escalateForSecondOpinion(
        patientId,
        recommendation,
        clinicalDecision
      );
    }

    onReviewComplete(clinicalDecision);
    setIsSubmitting(false);
  };

  return (
    <Card className="w-full max-w-4xl">
      <CardHeader>
        <h3 className="text-lg font-semibold">AI Clinical Decision Review</h3>
        <p className="text-sm text-gray-600">
          Patient ID: {patientId} | Model: {recommendation.modelInfo.version}
        </p>
      </CardHeader>

      <CardContent className="space-y-4">
        {requiresEscalation && (
          <Alert>
            <AlertDescription>
              Low confidence prediction (
              {(recommendation.confidence * 100).toFixed(1)}%). Second clinical
              opinion will be automatically requested.
            </AlertDescription>
          </Alert>
        )}

        <div className="bg-blue-50 p-4 rounded-lg">
          <h4 className="font-medium text-blue-900">AI Recommendation</h4>
          <p className="text-blue-800 mt-1">{recommendation.prediction}</p>
          <p className="text-sm text-blue-700 mt-2">
            Confidence: {(recommendation.confidence * 100).toFixed(1)}%
          </p>
        </div>

        <div className="space-y-2">
          <h4 className="font-medium">Clinical Reasoning</h4>
          <p className="text-sm text-gray-700">
            {recommendation.explanation.clinicalReasoning}
          </p>

          <div className="grid grid-cols-2 gap-4 mt-3">
            <div>
              <h5 className="font-medium text-sm">Primary Factors</h5>
              <ul className="text-sm text-gray-600 mt-1 space-y-1">
                {recommendation.explanation.primaryFactors.map(
                  (factor, idx) => (
                    <li key={idx}>• {factor}</li>
                  )
                )}
              </ul>
            </div>

            <div>
              <h5 className="font-medium text-sm">Contraindications</h5>
              <ul className="text-sm text-gray-600 mt-1 space-y-1">
                {recommendation.explanation.contraindications.map(
                  (contra, idx) => (
                    <li key={idx}>• {contra}</li>
                  )
                )}
              </ul>
            </div>
          </div>
        </div>

        <div className="space-y-3">
          <label className="block">
            <span className="font-medium">Clinical Assessment</span>
            <textarea
              value={clinicalNotes}
              onChange={(e) => setClinicalNotes(e.target.value)}
              placeholder="Document your clinical reasoning and any additional factors considered..."
              className="mt-1 w-full p-3 border rounded-lg h-32"
              required
            />
          </label>

          <div className="flex gap-3">
            <button
              onClick={() => setDecision("accept")}
              className={`px-4 py-2 rounded-lg border-2 transition-colors ${
                decision === "accept"
                  ? "border-green-500 bg-green-50 text-green-700"
                  : "border-gray-200 hover:border-green-300"
              }`}
            >
              Accept AI Recommendation
            </button>

            <button
              onClick={() => setDecision("override")}
              className={`px-4 py-2 rounded-lg border-2 transition-colors ${
                decision === "override"
                  ? "border-amber-500 bg-amber-50 text-amber-700"
                  : "border-gray-200 hover:border-amber-300"
              }`}
            >
              Override with Clinical Judgment
            </button>
          </div>

          <button
            onClick={handleSubmitReview}
            disabled={!decision || !clinicalNotes.trim() || isSubmitting}
            className="w-full py-2 px-4 bg-blue-600 text-white rounded-lg disabled:opacity-50 disabled:cursor-not-allowed"
          >
            {isSubmitting ? "Submitting Review..." : "Complete Clinical Review"}
          </button>
        </div>
      </CardContent>
    </Card>
  );
};

The dual logging approach in the handleSubmitReview function ensures that both the AI Act compliance system and NHS clinical governance have records of every human override decision. This creates accountability trails, that are essential when investigating adverse events or quality incidents. If a clinician overrides an AI recommendation and the patient outcome is poor, the detailed clinical notes and reasoning captured here become part of the clinical record that informs future practice and system improvements.

Real-Time Bias Monitoring Implementation

Bias in healthcare AI isn’t just a fairness issue - it’s a patient safety issue. Consider an AI system that’s less accurate at detecting cardiac risk in women or people from certain ethnic backgrounds. The AI Act’s bias monitoring requirements exist specifically to prevent these kinds of systematic failures that could lead to missed diagnoses or inappropriate treatments.

The useBiasMonitoring hook below implements continuous bias surveillance with automatic escalation thresholds. The key safety feature here is the automatic escalation when bias metrics exceed defined thresholds - particularly the immediate escalation for high-severity bias issues.

import { useEffect, useState, useCallback } from "react";

interface BiasMetrics {
  demographicParity: number;
  equalizedOdds: number;
  calibration: number;
  overallRiskScore: number;
}

interface BiasAlert {
  severity: "low" | "medium" | "high";
  attribute: string;
  metric: string;
  value: number;
  threshold: number;
  timestamp: Date;
}

export const useBiasMonitoring = (modelId: string) => {
  const [currentMetrics, setCurrentMetrics] = useState<BiasMetrics | null>(
    null
  );
  const [alerts, setAlerts] = useState<BiasAlert[]>([]);
  const [isMonitoring, setIsMonitoring] = useState(false);

  const checkBias = useCallback(
    async (predictions: any[], demographicData: any[]) => {
      const metrics = await calculateBiasMetrics(predictions, demographicData);
      setCurrentMetrics(metrics);

      // Check thresholds defined by AI Act Article 10
      const biasThresholds = {
        demographicParity: 0.1,
        equalizedOdds: 0.1,
        calibration: 0.05,
      };

      const newAlerts: BiasAlert[] = [];

      Object.entries(biasThresholds).forEach(([metric, threshold]) => {
        const value = metrics[metric as keyof BiasMetrics] as number;
        if (value > threshold) {
          const severity =
            value > threshold * 2
              ? "high"
              : value > threshold * 1.5
              ? "medium"
              : "low";

          newAlerts.push({
            severity,
            attribute: "demographic",
            metric,
            value,
            threshold,
            timestamp: new Date(),
          });
        }
      });

      if (newAlerts.length > 0) {
        setAlerts((prev) => [...prev, ...newAlerts]);

        // Auto-escalate high severity bias issues
        const highSeverityAlerts = newAlerts.filter(
          (alert) => alert.severity === "high"
        );
        if (highSeverityAlerts.length > 0) {
          await escalateBiasIncident(modelId, highSeverityAlerts);
        }

        // Log to NHS incident reporting system
        await logToNHSIncidentReporting({
          type: "AI_BIAS_DETECTED",
          modelId,
          alerts: newAlerts,
          complianceFramework: "AI_ACT_ARTICLE_10",
        });
      }

      return metrics;
    },
    [modelId]
  );

  useEffect(() => {
    if (isMonitoring) {
      // Set up continuous monitoring
      const interval = setInterval(async () => {
        const recentPredictions = await fetchRecentPredictions(modelId);
        const recentDemographics = await fetchRecentDemographicData(modelId);

        if (recentPredictions.length > 50) {
          // Minimum sample size
          await checkBias(recentPredictions, recentDemographics);
        }
      }, 300000); // Check every 5 minutes

      return () => clearInterval(interval);
    }
  }, [isMonitoring, modelId, checkBias]);

  return {
    currentMetrics,
    alerts,
    checkBias,
    startMonitoring: () => setIsMonitoring(true),
    stopMonitoring: () => setIsMonitoring(false),
    clearAlerts: () => setAlerts([]),
  };
};

The continuous monitoring setup in the useEffect hook addresses a fundamental safety challenge. AI systems can drift over time, and bias can emerge gradually as patient populations or clinical practices change. The 5-minute monitoring interval with a minimum sample size of 50 predictions ensures that bias detection happens frequently enough to catch problems early, while maintaining statistical validity. The integration with NHS incident reporting systems means that bias issues are treated as clinical safety incidents, not just technical problems.

Quality Management System Integration

The AI Act’s quality management requirements align closely with NHS clinical risk management frameworks like DCB 0129 but they add specific obligations around AI system lifecycle management.

The AIQualityManagement component below demonstrates how safety-critical quality gates work in practice. Each quality gate represents a different aspect of system safety, bias testing ensures the system doesn’t systematically harm certain patient groups, clinical validation ensures the AI recommendations are clinically sound, and explainability verification ensures clinicians can understand and challenge AI decisions when necessary.

The key safety principle here is that all quality gates must pass before a system can be considered compliant for clinical use. The component’s visual indicators serve a crucial safety function - they provide immediate visibility into system readiness for clinical teams and risk managers:

import React, { useState, useEffect } from "react";

interface QualityGate {
  name: string;
  status: "passed" | "failed" | "pending";
  requirement: string;
  lastChecked: Date;
  evidence?: string[];
}

interface AISystemQuality {
  modelId: string;
  version: string;
  qualityGates: QualityGate[];
  overallStatus: "compliant" | "non_compliant" | "under_review";
  nextReviewDate: Date;
}

export const AIQualityManagement: React.FC<{ modelId: string }> = ({
  modelId,
}) => {
  const [qualityStatus, setQualityStatus] = useState<AISystemQuality | null>(
    null
  );
  const [isLoading, setIsLoading] = useState(true);

  useEffect(() => {
    const loadQualityStatus = async () => {
      setIsLoading(true);

      // Quality gates required by AI Act + NHS standards
      const requiredGates: Omit<QualityGate, "status" | "lastChecked">[] = [
        {
          name: "bias_testing",
          requirement: "AI Act Article 10 - Bias detection and mitigation",
        },
        {
          name: "clinical_validation",
          requirement: "NHS DCB 0129 - Clinical risk management",
        },
        {
          name: "explainability_verification",
          requirement: "AI Act Article 13 - Transparency obligations",
        },
        {
          name: "human_oversight_integration",
          requirement: "AI Act Article 14 - Human oversight",
        },
        {
          name: "data_governance_compliance",
          requirement: "GDPR + NHS Data Security Standards",
        },
        {
          name: "performance_monitoring",
          requirement: "AI Act Article 15 - Accuracy and robustness",
        },
      ];

      const qualityGates = await Promise.all(
        requiredGates.map(async (gate) => {
          const status = await checkQualityGate(modelId, gate.name);
          return {
            ...gate,
            status: status.passed ? "passed" : "failed",
            lastChecked: new Date(),
            evidence: status.evidence,
          } as QualityGate;
        })
      );

      const overallStatus = qualityGates.every(
        (gate) => gate.status === "passed"
      )
        ? "compliant"
        : qualityGates.some((gate) => gate.status === "failed")
        ? "non_compliant"
        : "under_review";

      setQualityStatus({
        modelId,
        version: await getModelVersion(modelId),
        qualityGates,
        overallStatus,
        nextReviewDate: new Date(Date.now() + 90 * 24 * 60 * 60 * 1000), // 90 days
      });

      setIsLoading(false);
    };

    loadQualityStatus();
  }, [modelId]);

  if (isLoading) {
    return <div className="p-4">Loading quality management status...</div>;
  }

  if (!qualityStatus) {
    return <div className="p-4">Unable to load quality status</div>;
  }

  const statusColor =
    qualityStatus.overallStatus === "compliant"
      ? "green"
      : qualityStatus.overallStatus === "non_compliant"
      ? "red"
      : "amber";

  return (
    <div className="p-6 bg-white rounded-lg border">
      <div className="flex justify-between items-start mb-6">
        <div>
          <h2 className="text-xl font-semibold">
            AI System Quality Management
          </h2>
          <p className="text-gray-600">
            Model: {modelId} v{qualityStatus.version}
          </p>
        </div>
        <div
          className={`px-3 py-1 rounded-lg text-sm font-medium bg-${statusColor}-100 text-${statusColor}-700`}
        >
          {qualityStatus.overallStatus.replace("_", " ").toUpperCase()}
        </div>
      </div>

      <div className="space-y-4">
        {qualityStatus.qualityGates.map((gate, index) => (
          <div
            key={index}
            className="flex items-start justify-between p-4 border rounded-lg"
          >
            <div className="flex-1">
              <div className="flex items-center gap-2">
                <h3 className="font-medium capitalize">
                  {gate.name.replace("_", " ")}
                </h3>
                <span
                  className={`px-2 py-1 rounded text-xs font-medium ${
                    gate.status === "passed"
                      ? "bg-green-100 text-green-700"
                      : gate.status === "failed"
                      ? "bg-red-100 text-red-700"
                      : "bg-amber-100 text-amber-700"
                  }`}
                >
                  {gate.status}
                </span>
              </div>
              <p className="text-sm text-gray-600 mt-1">{gate.requirement}</p>
              <p className="text-xs text-gray-500 mt-2">
                Last checked: {gate.lastChecked.toLocaleDateString()}
              </p>
            </div>

            {gate.evidence && (
              <div className="ml-4">
                <button className="text-sm text-blue-600 hover:text-blue-700">
                  View Evidence ({gate.evidence.length})
                </button>
              </div>
            )}
          </div>
        ))}
      </div>

      <div className="mt-6 pt-4 border-t">
        <p className="text-sm text-gray-600">
          Next scheduled review:{" "}
          {qualityStatus.nextReviewDate.toLocaleDateString()}
        </p>
        <div className="flex gap-3 mt-3">
          <button className="px-4 py-2 bg-blue-600 text-white rounded-lg text-sm">
            Generate Compliance Report
          </button>
          <button className="px-4 py-2 border border-gray-300 text-gray-700 rounded-lg text-sm">
            Schedule Review
          </button>
        </div>
      </div>
    </div>
  );
};

Implementation Timeline and Practical Steps

The AI Act’s full enforcement began in 2025 but implementation should start immediately for NHS applications using AI. The key phases are:

Immediate (2025): Audit existing AI capabilities and implement logging systems
Q2 2025: Deploy human oversight interfaces and bias monitoring
Q3 2025: Complete quality management system integration
Q4 2025: Full compliance validation and documentation

Integration with NHS Digital Transformation

By 2028, the NHS App will integrate lifestyle, demographic, and genomic data - functionality that will almost certainly involve AI processing. Applications built today need to anticipate these compliance requirements and build them into their architecture from the ground up.

The organisations that succeed will treat AI Act compliance as a fundamental system requirement, not a regulatory afterthought. This means comprehensive logging, transparent decision-making processes, and robust human oversight integrated into everyday clinical workflows.