How to Remove PII Before Using AI Tools
A step-by-step guide to GDPR-compliant AI use — with ChatGPT, Claude, Gemini, Copilot and more
Remove PII automatically in seconds → Use our free redaction tool — no signup needed
Every time you paste a document into ChatGPT, Claude, Gemini, or any cloud-based AI tool, its content is transmitted to and processed on that provider's external servers. If that document contains personally identifiable information (PII) — names, email addresses, national IDs, medical data, or financial identifiers — you may be in breach of GDPR Article 28, HIPAA, or other applicable data protection law.
The solution is straightforward: remove the PII before the document reaches the AI tool. This guide explains exactly how to do that — quickly, reliably, and at no cost.
📋 On This Page
Why This Matters for GDPR Compliance
Under GDPR Article 28, AI providers such as OpenAI, Anthropic, and Google are third-party data processors. Transferring personal data to them requires a signed Data Processing Agreement (DPA) and a valid lawful basis for processing. Most organisations using AI tools in their day-to-day workflows have neither.
Beyond the contractual requirements, GDPR Chapter V governs international transfers. US-based AI server infrastructure means personal data is routinely sent outside the EU — potentially triggering Standard Contractual Clauses (SCCs) or a Transfer Impact Assessment (TIA).
Cleaning PII out of the document before it leaves your device eliminates all of these obligations in one step. No PII in the document = no personal data transfer = no GDPR exposure.
Which AI Tools Require PII Removal?
Any AI tool that processes your text on external servers requires PII removal before use. This includes all major AI assistants currently in use by businesses and professionals:
Identify PII in Your Document
Before you clean the document, it helps to know what you're looking for. Common PII types found in business documents include:
High-risk document types include CVs and resumes, HR records, medical notes, invoices, contracts, and email threads. For a complete breakdown, see our What Is PII? →
Open PrivacyPromptAI
Go to the PrivacyPromptAI homepage. No account, signup, or installation is required. The tool runs entirely in your browser — nothing is sent to any server at any point.
You have two input methods:
- Paste text — copy your document content and paste it directly into the text area on the homepage.
- Upload File(s) — use the Clean a Document page to upload a TXT or DOCX file. Pro users can also upload PDF, XLSX, CSV, and other formats.
Run Automatic PII Detection
Click "Clean PII". The detection engine will scan your text using two methods simultaneously:
- Pattern matching — advanced regular expressions detect emails, phone numbers, IBANs, credit card numbers, national IDs, IP addresses, and dates across country-specific formats.
- Named Entity Recognition (NER) — detects personal names and organisation names without relying on fixed patterns, covering 23 European languages.
Each detected item is replaced with a numbered token: [NAME_1], [EMAIL_1], [IBAN_1]. The numbering lets you refer back to specific cleaned items if needed.
Review the Cleaned Output
Automated tools are highly reliable, but a quick human review adds an important final layer of assurance — especially for high-risk documents such as medical records or legal contracts.
When reviewing, look for:
- Context-specific identifiers — project codenames, internal employee codes, or organisation-specific reference numbers that no general pattern matcher would know to flag.
- Indirect identifiers — combinations of data that, while individually innocuous, could identify someone in context (e.g. "the head of department in the Brussels office, appointed in 2019").
- Third-party data — references to clients, customers, or colleagues who are not the primary subject of the document but whose data is still present.
Use the Clean Document with Your AI Tool
Copy the cleaned text and paste it into ChatGPT, Claude, Gemini, Copilot, or any AI assistant. The document is now safe to process.
Because no personal data remains in the text, no personal data transfer occurs — the GDPR obligations under Article 28 do not apply to anonymised or pseudonymised data that cannot be re-linked to a real person.
When your AI output is ready, reverse the process: replace the tokens with the original values from your local reference to produce the final personalised document. You can download a cleaning report (Pro) or simply keep your original document open alongside the AI output.
✅ GDPR AI Compliance Checklist
Use this checklist each time you prepare a document for an AI tool:
- Identified the document type and typical PII it contains
- Pasted or uploaded the document into PrivacyPromptAI
- Run automatic PII detection — all patterns and NER
- Reviewed output for context-specific and indirect identifiers
- Confirmed no names, emails, IDs, or financial data remain
- For medical documents: verified all 18 HIPAA Safe Harbor identifiers are removed
- Pasted clean document into the AI tool
- Stored original (non-cleaned) document securely and separately
For a deeper guide on GDPR and AI tools, see our GDPR Compliance for AI Tools guide →
Try It Now — Free, No Signup
Paste any document and remove all PII in under 10 seconds. 100% local processing. GDPR-compliant. Works in 23 languages.
Clean My Document — Free