How to Remove PII Before Using AI Tools

A step-by-step guide to GDPR-compliant AI use — with ChatGPT, Claude, Gemini, Copilot and more

Remove PII automatically in seconds → Use our free redaction tool — no signup needed

Every time you paste a document into ChatGPT, Claude, Gemini, or any cloud-based AI tool, its content is transmitted to and processed on that provider's external servers. If that document contains personally identifiable information (PII) — names, email addresses, national IDs, medical data, or financial identifiers — you may be in breach of GDPR Article 28, HIPAA, or other applicable data protection law.

The solution is straightforward: remove the PII before the document reaches the AI tool. This guide explains exactly how to do that — quickly, reliably, and at no cost.

📋 On This Page

  1. Why This Matters for GDPR Compliance
  2. Which AI Tools Require PII Removal?
  3. Step 1: Identify PII in Your Document
  4. Step 2: Open PrivacyPromptAI
  5. Step 3: Run Automatic Cleaning
  6. Step 4: Review the Output
  7. Step 5: Use the Clean Document with Your AI Tool
  8. GDPR AI Compliance Checklist
⚖️

Why This Matters for GDPR Compliance

Under GDPR Article 28, AI providers such as OpenAI, Anthropic, and Google are third-party data processors. Transferring personal data to them requires a signed Data Processing Agreement (DPA) and a valid lawful basis for processing. Most organisations using AI tools in their day-to-day workflows have neither.

Beyond the contractual requirements, GDPR Chapter V governs international transfers. US-based AI server infrastructure means personal data is routinely sent outside the EU — potentially triggering Standard Contractual Clauses (SCCs) or a Transfer Impact Assessment (TIA).

GDPR fines for unlawful data transfers can reach €20 million or 4% of global annual turnover. The risk is not theoretical — regulators across the EU have actively enforced these rules against organisations using US-based SaaS tools.

Cleaning PII out of the document before it leaves your device eliminates all of these obligations in one step. No PII in the document = no personal data transfer = no GDPR exposure.

Which AI Tools Require PII Removal?

Any AI tool that processes your text on external servers requires PII removal before use. This includes all major AI assistants currently in use by businesses and professionals:

🤖
ChatGPTOpenAI — US servers
🔮
ClaudeAnthropic — US servers
GeminiGoogle — US/EU servers
🪟
Microsoft CopilotAzure — global servers
🔍
PerplexityUS servers
🌐
Any other LLMCloud-based AI tool
💡 The rule is simple: if text leaves your device to be processed, remove the PII first. Locally-running models (e.g. Ollama, LM Studio) are the only exception — and even then, removing PII is good practice.
1

Identify PII in Your Document

Before you clean the document, it helps to know what you're looking for. Common PII types found in business documents include:

Full names Email addresses Phone numbers Postal addresses National ID numbers Social Security Numbers IBANs / bank details Credit card numbers Dates of birth IP addresses Passport numbers Health data

High-risk document types include CVs and resumes, HR records, medical notes, invoices, contracts, and email threads. For a complete breakdown, see our What Is PII? →

💡 You don't need to identify every instance manually — PrivacyPromptAI will find them automatically in Step 3. But being aware of what's in your document helps you verify the output in Step 4.
2

Open PrivacyPromptAI

Go to the PrivacyPromptAI homepage. No account, signup, or installation is required. The tool runs entirely in your browser — nothing is sent to any server at any point.

You have two input methods:

💡 For email threads: copy the entire thread including headers and signatures. PrivacyPromptAI will redact sender names, email addresses, and phone numbers from signatures automatically.
3

Run Automatic PII Detection

Click "Clean PII". The detection engine will scan your text using two methods simultaneously:

Each detected item is replaced with a numbered token: [NAME_1], [EMAIL_1], [IBAN_1]. The numbering lets you refer back to specific cleaned items if needed.

💡 Using the Settings page? You can toggle individual detection types (e.g. disable credit card detection for documents where those numbers are not PII) and add custom keywords for project-specific terms under the Pro licence.
4

Review the Cleaned Output

Automated tools are highly reliable, but a quick human review adds an important final layer of assurance — especially for high-risk documents such as medical records or legal contracts.

When reviewing, look for:

For healthcare documents: HIPAA's Safe Harbor method requires removal of 18 specific identifiers. Verify the output against the full Safe Harbor list — see our Compliance page →
5

Use the Clean Document with Your AI Tool

Copy the cleaned text and paste it into ChatGPT, Claude, Gemini, Copilot, or any AI assistant. The document is now safe to process.

Because no personal data remains in the text, no personal data transfer occurs — the GDPR obligations under Article 28 do not apply to anonymised or pseudonymised data that cannot be re-linked to a real person.

You're now compliant. The AI receives only the anonymised version. All PII remains on your device inside PrivacyPromptAI's local session — it is never uploaded anywhere.

When your AI output is ready, reverse the process: replace the tokens with the original values from your local reference to produce the final personalised document. You can download a cleaning report (Pro) or simply keep your original document open alongside the AI output.

✅ GDPR AI Compliance Checklist

Use this checklist each time you prepare a document for an AI tool:

For a deeper guide on GDPR and AI tools, see our GDPR Compliance for AI Tools guide →

Try It Now — Free, No Signup

Paste any document and remove all PII in under 10 seconds. 100% local processing. GDPR-compliant. Works in 23 languages.

Clean My Document — Free

Frequently Asked Questions

Yes, if you are subject to GDPR. Pasting a document into ChatGPT transmits its content to OpenAI's servers — a third-party processor under GDPR Article 28. Without a valid DPA and lawful basis, this may constitute an unlawful data transfer. Removing PII first means no personal data is transmitted, eliminating the risk entirely. Once clean, you can store the document in pCloud (Swiss encrypted storage) or sign it with BoldSign — both GDPR-compliant. See our GDPR AI Compliance guide →
Both OpenAI and Anthropic offer enterprise DPAs and have EU data processing terms. However, from a pure GDPR risk perspective, the safest approach with any cloud AI tool is to remove PII before the document leaves your device — regardless of which provider you use. This approach is provider-agnostic and requires no contractual setup. If you also need to transmit the clean document securely, NordVPN encrypts your connection with a verified no-logs policy.
Microsoft 365 Copilot is subject to the same GDPR rules as any other AI processor. Enterprise tenants with a Microsoft DPA in place may have stronger protections, but individual and SME users should still remove PII before using Copilot features on sensitive documents. Use our Clean a Document tool → to redact DOCX files before opening them in Word.
For most tasks — summarising, drafting, translating, classifying — the AI only needs the content, not the identities. Replacing names with [NAME_1] and emails with [EMAIL_1] preserves the structure and meaning of the document. The AI can still summarise a contract, draft a follow-up email, or extract key clauses — without ever needing the personal details.