GDPR and LLM (GDPR-compliant LLM)
GDPR and LLM (GDPR-compliant LLM) — A GDPR-compliant LLM deployment means personal data passed to the model is processed under GDPR principles of lawfulness, minimization, integrity, and data-subject rights — including right to be forgotten, access, and rectification. In practice it requires a DPA with the LLM provider, training opt-out, EU data residency, audit logging, and a DPIA.
What a GDPR-compliant LLM must satisfy
- DPA (Data Processing Agreement) with the LLM provider — OpenAI, Anthropic, Google, AWS Bedrock all offer one.
- Opt-out from training — by default ChatGPT consumer trains on user data; OpenAI API does not. Verify per endpoint.
- Data residency — personal data should not leave the EU without legal basis. Vertex AI europe-west1, Azure OpenAI EU, self-hosted Mistral/Llama, hybrid setups.
- Audit log — who, when, what query, what response. Mandatory under regulators.
- Right to be forgotten — strategies for removing data from the system (re-train, fine-tuning unlearning, RAG-only).
- DPIA (Data Protection Impact Assessment) — formal risk assessment, ready for regulator conversation.
- Data minimization — do you actually need the customer’s name in the prompt, or is an ID enough?
Common pitfalls
- OpenAI API with personal data in the prompt without a DPA — most common 2024–2025 mistake.
- Embedding personal data in a vector DB — embeddings preserve information about personal data.
- Confusing EU AI Act with GDPR — two different regimes; satisfy both.
- No fallback for users invoking right to be forgotten — full retraining is often too expensive.
How fewtokensai helps
6 years of AI deployments in regulated environments (IG Group + FCA, CFTC, JFSA). I run LLM/AI compliance audits, design EU data-residency architectures, write DPIAs, work directly with your Compliance and Legal. GDPR/EU AI Act compliance audit or reach out.