Data Privacy & Enterprise Options

What each provider does with your data, enterprise agreements, and HIPAA/GDPR considerations.

Why This Matters for Enterprises

Before any enterprise can deploy AI at scale, legal, compliance, and security teams need answers to specific questions: Does this provider train models on our data? Where is our data processed? Can we get a signed Data Processing Agreement? Does the provider have SOC 2 certification?

These aren't just box-checking exercises. Confidential customer data, attorney-client privileged communications, protected health information, and financial records are all potentially passing through AI APIs. The answers to these questions determine what you can legally and responsibly use AI for.

This article covers what each major provider actually offers for enterprise deployments — not the consumer product defaults, but the enterprise-grade options.

API Access vs Consumer Products

One of the most important distinctions in AI data privacy is API access versus consumer products.

Consumer products (ChatGPT free, Claude.ai free tier) may use your conversations to improve models, depending on your settings and the provider's current policies. Default settings vary and change over time.

API access — paying for model inference via API key — operates under different, generally stricter terms. Most providers' API terms explicitly state that they do not use API inputs and outputs to train models. Verify this in current documentation, as terms change.

For any enterprise deployment, you should be using API access or a managed enterprise product, not consumer products with business data.

Provider-by-Provider Overview

Anthropic

Claude API: Anthropic's API terms state that prompts and completions are not used to train models. Data is retained for a limited period for safety monitoring and abuse detection, then deleted.

Claude.ai Teams and Enterprise: The enterprise tier offers a signed DPA, SOC 2 Type II certification, admin controls, single sign-on (SSO), and audit logging. Data is not used for model training. Anthropic offers a Business Associate Agreement (BAA) for HIPAA-covered use cases under enterprise agreements.

Key gap: Data residency options are limited compared to hyperscale cloud providers. If you have strict requirements for data to remain within a specific geographic region, verify current availability.

OpenAI

API access: OpenAI's API terms state that API data is not used to train models by default. The company retains data for up to 30 days for safety purposes, then deletes it.

ChatGPT Enterprise: Designed for business deployments. Data is not used for training, AES-256 encryption at rest and in transit, SOC 2 Type II, SSO via SAML, admin controls, and usage analytics. ChatGPT Enterprise also supports HIPAA compliance with a BAA.

Azure OpenAI Service: For organizations already in the Microsoft/Azure ecosystem, Azure OpenAI offers the same models with Azure's enterprise compliance infrastructure — including data residency options, private networking, Azure Active Directory integration, and the full Azure compliance portfolio (HIPAA, FedRAMP, GDPR, etc.).

Google

Gemini API (Google AI Studio): Developer-tier access with standard terms. Not recommended for production enterprise use with sensitive data.

Vertex AI: Google's enterprise AI platform. Gemini models accessed via Vertex AI operate under Google Cloud's enterprise terms — data is not used for training, comprehensive compliance certifications (SOC 1/2/3, ISO 27001, HIPAA, FedRAMP), data residency options, VPC Service Controls for network isolation. For regulated industries, Vertex AI is the appropriate access path.

Gemini for Google Workspace: Integrated into Google Workspace (Gmail, Docs, Drive). Enterprise data protections apply: data not used for training, existing Workspace compliance certifications carry over.

Microsoft (Copilot)

Microsoft 365 Copilot operates within Microsoft's existing enterprise compliance framework. Business data is not used to train foundation models. Microsoft's compliance portfolio is extensive — HIPAA BAA, GDPR, FedRAMP, ISO 27001, SOC 1/2 — and Copilot inherits these certifications. Data residency follows your existing Microsoft 365 tenant configuration.

Key Compliance Considerations

GDPR

If you process data from EU residents, GDPR requires a lawful basis for processing and appropriate safeguards for transfers outside the EU. Key requirements for AI use:

A signed Data Processing Agreement (DPA) with your AI provider is required. All major providers offer DPAs for enterprise customers.
Data minimization: only include personal data in prompts when necessary.
Purpose limitation: AI processing must be consistent with the original purpose for which data was collected.
Consider data residency requirements — some EU deployments require data to remain in the EU.

HIPAA

Using AI with Protected Health Information (PHI) requires a Business Associate Agreement (BAA) with the provider. Both Anthropic (Claude Enterprise) and OpenAI (ChatGPT Enterprise) offer BAAs. Google Cloud (Vertex AI) also supports HIPAA under a BAA. Not all providers or tiers offer BAAs — verify before any healthcare deployment.

Even with a BAA, you remain responsible for ensuring PHI is handled appropriately. Implement the minimum necessary standard: only include PHI in prompts when genuinely required.

Data Processing Agreements

A DPA is a contract between you (data controller) and the provider (data processor) that specifies how personal data will be handled. It's a legal requirement under GDPR and best practice regardless. Enterprise tiers from all major providers include DPA templates or signed agreements.

Practical Procurement Guidance

1.Start with your compliance requirements: identify applicable regulations (GDPR, HIPAA, SOC 2, FedRAMP, etc.) and which provider tiers can meet them.

1.Get the DPA signed early: don't wait until procurement finalization. Request the DPA during evaluation.

1.Verify model training terms in writing: don't rely on marketing pages. Read the data processing terms and API ToS, and have legal review them.

1.Understand data retention: how long does the provider retain your data? For what purposes?

1.Consider audit logging: for regulated use cases, you need to be able to demonstrate what data was processed. Enterprise tiers with audit logging make this feasible.

1.Plan for access controls: enterprise tiers allow restricting which users can access AI features, which models they can use, and what data they can process. Configure these controls rather than using open access.

The compliance landscape for enterprise AI is maturing quickly. What required extensive negotiation in 2023 is now a standard enterprise tier feature. The major providers are investing heavily in compliance infrastructure because enterprise contracts are a significant portion of their revenue.

Have a follow-up question about this topic?

Ask AI

← Previous

Choosing a Provider for Your Product

AI Use Cases by Industry