What each provider does with your data, enterprise agreements, and HIPAA/GDPR considerations.
Before any enterprise can deploy AI at scale, legal, compliance, and security teams need answers to specific questions: Does this provider train models on our data? Where is our data processed? Can we get a signed Data Processing Agreement? Does the provider have SOC 2 certification?
These aren't just box-checking exercises. Confidential customer data, attorney-client privileged communications, protected health information, and financial records are all potentially passing through AI APIs. The answers to these questions determine what you can legally and responsibly use AI for.
This article covers what each major provider actually offers for enterprise deployments — not the consumer product defaults, but the enterprise-grade options.
One of the most important distinctions in AI data privacy is API access versus consumer products.
Consumer products (ChatGPT free, Claude.ai free tier) may use your conversations to improve models, depending on your settings and the provider's current policies. Default settings vary and change over time.
API access — paying for model inference via API key — operates under different, generally stricter terms. Most providers' API terms explicitly state that they do not use API inputs and outputs to train models. Verify this in current documentation, as terms change.
For any enterprise deployment, you should be using API access or a managed enterprise product, not consumer products with business data.
Claude API: Anthropic's API terms state that prompts and completions are not used to train models. Data is retained for a limited period for safety monitoring and abuse detection, then deleted.
Claude.ai Teams and Enterprise: The enterprise tier offers a signed DPA, SOC 2 Type II certification, admin controls, single sign-on (SSO), and audit logging. Data is not used for model training. Anthropic offers a Business Associate Agreement (BAA) for HIPAA-covered use cases under enterprise agreements.
Key gap: Data residency options are limited compared to hyperscale cloud providers. If you have strict requirements for data to remain within a specific geographic region, verify current availability.
API access: OpenAI's API terms state that API data is not used to train models by default. The company retains data for up to 30 days for safety purposes, then deletes it.
ChatGPT Enterprise: Designed for business deployments. Data is not used for training, AES-256 encryption at rest and in transit, SOC 2 Type II, SSO via SAML, admin controls, and usage analytics. ChatGPT Enterprise also supports HIPAA compliance with a BAA.
Azure OpenAI Service: For organizations already in the Microsoft/Azure ecosystem, Azure OpenAI offers the same models with Azure's enterprise compliance infrastructure — including data residency options, private networking, Azure Active Directory integration, and the full Azure compliance portfolio (HIPAA, FedRAMP, GDPR, etc.).
Gemini API (Google AI Studio): Developer-tier access with standard terms. Not recommended for production enterprise use with sensitive data.
Vertex AI: Google's enterprise AI platform. Gemini models accessed via Vertex AI operate under Google Cloud's enterprise terms — data is not used for training, comprehensive compliance certifications (SOC 1/2/3, ISO 27001, HIPAA, FedRAMP), data residency options, VPC Service Controls for network isolation. For regulated industries, Vertex AI is the appropriate access path.
Gemini for Google Workspace: Integrated into Google Workspace (Gmail, Docs, Drive). Enterprise data protections apply: data not used for training, existing Workspace compliance certifications carry over.
Microsoft 365 Copilot operates within Microsoft's existing enterprise compliance framework. Business data is not used to train foundation models. Microsoft's compliance portfolio is extensive — HIPAA BAA, GDPR, FedRAMP, ISO 27001, SOC 1/2 — and Copilot inherits these certifications. Data residency follows your existing Microsoft 365 tenant configuration.
If you process data from EU residents, GDPR requires a lawful basis for processing and appropriate safeguards for transfers outside the EU. Key requirements for AI use:
Using AI with Protected Health Information (PHI) requires a Business Associate Agreement (BAA) with the provider. Both Anthropic (Claude Enterprise) and OpenAI (ChatGPT Enterprise) offer BAAs. Google Cloud (Vertex AI) also supports HIPAA under a BAA. Not all providers or tiers offer BAAs — verify before any healthcare deployment.
Even with a BAA, you remain responsible for ensuring PHI is handled appropriately. Implement the minimum necessary standard: only include PHI in prompts when genuinely required.
A DPA is a contract between you (data controller) and the provider (data processor) that specifies how personal data will be handled. It's a legal requirement under GDPR and best practice regardless. Enterprise tiers from all major providers include DPA templates or signed agreements.
The compliance landscape for enterprise AI is maturing quickly. What required extensive negotiation in 2023 is now a standard enterprise tier feature. The major providers are investing heavily in compliance infrastructure because enterprise contracts are a significant portion of their revenue.
Have a follow-up question about this topic?
Ask AI