What ChatGPT Health Means for SaaS Procurement: Questions to Ask Vendors
Use the ChatGPT Health launch to pressure-test AI SaaS vendors on data handling, retention, subprocessors, and contract terms.
The launch of ChatGPT Health is more than a consumer AI headline. For procurement teams, it is a reminder that any AI-enabled document, signing, or workflow platform may now ingest regulated data, mix it with external sources, and produce outputs that influence business decisions. If you buy SaaS for OCR, e-signature, intake automation, claims workflows, HR packets, or records management, the procurement bar has changed: you are no longer just evaluating features; you are evaluating data handling, privacy boundaries, contract terms, and operational controls.
That shift matters because the most valuable AI features are often the ones closest to your most sensitive content. Whether a vendor analyzes medical records, employment forms, tax documents, patient onboarding packets, or legal agreements, the same questions apply: where is data stored, who can access it, what is retained, what is used for model improvement, and what happens when the vendor changes infrastructure or business model? If you need a broader framework for evaluating vendors and workflow fit, pair this guide with our vendor due diligence checklist and our SaaS procurement checklist.
This guide turns the ChatGPT Health launch into a practical procurement checklist for teams evaluating AI-enabled document and signing platforms. You will find a vendor questionnaire, a security and privacy review framework, contract language to request, and a comparison table you can reuse during procurement. For teams comparing OCR, intake, and signing platforms, also review our document scanning tools directory, e-signature platforms comparison, and AI document processing guide.
1. Why ChatGPT Health Changes the Procurement Conversation
AI features now touch regulated data by default
ChatGPT Health shows how quickly a general-purpose AI product can become a sensitive-data product. The BBC reported that OpenAI said ChatGPT Health can analyze medical records and app data to generate more personalized responses, while also saying chats are stored separately and not used to train its models. That is exactly the kind of dual message procurement teams need to interrogate: a vendor may promise privacy separation, but your job is to verify how that separation is implemented and whether it survives every downstream system in the stack. In practice, regulated data is rarely isolated cleanly unless the contract, architecture, and retention settings all enforce that boundary.
For document and signing SaaS, the risk surface is not limited to health records. The same pattern appears with identity documents, insurance forms, signed contracts, payroll files, and case notes. An AI assistant embedded in a scanning or signing tool may summarize, classify, extract, route, or recommend next steps on content that contains protected personal information. That makes your procurement process similar to a clinical or financial security review even when the product markets itself as productivity software.
“Not used for training” is necessary, not sufficient
Vendors often advertise that customer data is not used to train foundation models. That statement is helpful, but it does not answer the more important procurement questions: Is data used for product analytics? Is it stored in logs? Is it exposed to human reviewers? Is it retained in backups? Is it transmitted to subprocessors that have their own retention policies? A strong privacy posture requires operational proof, not a marketing line.
This is where teams often need to compare privacy claims against actual contractual controls. The distinction between model training, service improvement, debugging, abuse monitoring, and support access matters because each process has different legal and technical implications. If you are building a repeatable review process, our security questionnaire template and privacy review guide can help standardize the first pass before legal and security sign off.
Procurement is now a data governance function
Historically, SaaS procurement focused on features, price, and integration. With AI-enabled systems, procurement becomes a governance checkpoint for data minimization, residency, access control, retention, and deletion. The buying team needs enough technical fluency to challenge architecture claims and enough legal fluency to spot contract gaps. That is especially true if the tool ingests regulated records or generates summaries used in business decisions.
In that sense, ChatGPT Health is a useful stress test. If a consumer AI feature can prompt concern around medical records, then enterprise teams should assume any AI-enabled document platform needs to prove its controls with the same rigor. For a more general approach to vetting AI tools, see our AI vendor risk assessment framework and our compliance notes library.
2. The Data Questions Every Vendor Must Answer
What data is collected, derived, and stored?
Start with a precise inventory of data types. A vendor may collect uploaded files, extracted text, metadata, usage telemetry, prompts, embedded comments, audit logs, signatures, timestamps, IP addresses, device identifiers, and workflow events. Each of these can become sensitive when combined, and each may be retained on a different schedule. Ask the vendor to separate “customer content” from “system telemetry” and explain whether derived data, such as OCR output or AI-generated summaries, is treated as customer content.
This distinction matters because many vendors store derived artifacts longer than the original file, especially for indexing or search. If a document platform ingests medical records or government IDs, that extracted text may be just as sensitive as the source file. Procurement teams should insist on a data flow diagram or architecture statement that maps what enters the system, what is transformed, where it is stored, and what is deleted when the customer requests removal.
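The split between customer content and derived artifacts can be made concrete in the review itself. The sketch below is a minimal, hypothetical inventory (artifact names and retention figures are assumptions, not any vendor's real policy) with a check that flags derived content retained longer than its source file:

```python
# Hypothetical data inventory for an AI document platform review.
# Artifact names and retention figures are illustrative assumptions.
ARTIFACTS = {
    "source_file":  {"class": "customer_content", "retention_days": 90},
    "ocr_text":     {"class": "customer_content", "retention_days": 365},
    "ai_summary":   {"class": "customer_content", "retention_days": 365},
    "usage_events": {"class": "telemetry",        "retention_days": 30},
}

def derived_outliving_source(artifacts, source="source_file"):
    """Flag derived customer content retained longer than the source file."""
    source_days = artifacts[source]["retention_days"]
    return [
        name for name, spec in artifacts.items()
        if name != source
        and spec["class"] == "customer_content"
        and spec["retention_days"] > source_days
    ]

print(derived_outliving_source(ARTIFACTS))  # → ['ocr_text', 'ai_summary']
```

In this hypothetical, the OCR text and AI summaries outlive the source file by nine months, which is exactly the kind of gap the data flow diagram should surface.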
Where does the data live, and who can reach it?
Ask for the hosting regions, backup locations, support access model, and subprocessor list. Data residency is not only about geography; it is also about which service teams can access data, what jurisdictions the vendor operates in, and whether support access is time-bound and logged. If your organization has regional residency requirements, a vendor should be able to explain whether content stays in-region, whether failover crosses borders, and whether support tooling respects those boundaries.
Teams handling regulated records should also ask whether encryption keys are vendor-managed or customer-managed, and whether privileged access is role-based, approved, and audited. If the platform integrates with your identity stack, verify whether SSO, SCIM, and least-privilege role controls are available. For a related security angle, our zero-trust cloud guide and secure smart office patterns show how access boundaries should work in connected environments.
How long is data retained, and what is deleted?
Retention policy is one of the most under-specified areas in SaaS procurement. Many vendors promise “configurable retention,” but the actual controls may only apply to active content while logs, caches, and backups follow a separate schedule. Ask how retention is configured for source files, extracted text, signatures, workflow metadata, support attachments, and AI conversation history. Then ask how deletion propagates across production, backups, replicas, and analytics stores.
For regulated environments, deletion should be predictable, documented, and verifiable. You want a written retention policy, a deletion SLA, and an explanation of what happens upon contract termination. If the vendor cannot explain how quickly content is deleted, or whether backups are eventually purged on a fixed schedule, treat that as a material risk. For deeper contract and operational planning, see our retention policy checklist and data processing agreement guide.
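One way to make the deletion-propagation question testable is to record the vendor's per-store purge commitments and flag any store without one. This is a sketch under assumed store names and SLA numbers, not a real vendor's answers:

```python
# Sketch: verify deletion propagates to every store the vendor names.
# Store names and SLA numbers are hypothetical examples.
DELETION_SLAS = {
    "production": 30,   # days until purge after a delete request
    "replicas":   30,
    "backups":    90,
    "analytics":  None, # vendor gave no commitment
}

def deletion_gaps(slas, max_days=90):
    """Stores with no purge commitment, or one slower than max_days."""
    return sorted(
        store for store, days in slas.items()
        if days is None or days > max_days
    )

print(deletion_gaps(DELETION_SLAS))  # → ['analytics']
```

Any store the function returns is a material-risk candidate: content may persist there indefinitely regardless of what the retention UI shows.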
3. Security Questions for AI-Enabled Document and Signing Platforms
How is regulated data protected in transit and at rest?
Encryption is baseline, not a differentiator, but procurement teams still need specifics. Ask about TLS versions, encryption at rest, key management, key rotation, and whether the vendor uses field-level encryption or tokenization for especially sensitive data. For platforms that process signed contracts, identity documents, or health-related materials, the question is not whether encryption exists; it is whether the implementation aligns with your risk posture and compliance obligations.
Vendors should also explain how they isolate tenant data and whether AI processing happens in the same environment as primary storage. If the architecture uses external inference services, ask whether those services inherit the same encryption and logging controls. Security reviews should verify whether the AI layer introduces new exposure through prompts, embeddings, vector stores, or cross-tenant retrieval paths.
What is the incident response and disclosure process?
Every serious procurement review should ask how the vendor detects, investigates, and notifies customers of incidents involving customer content or model outputs. The relevant issue is not only a breach of storage systems but also unauthorized access through support tooling, model logs, or misconfigured integrations. Ask for the notification timeline, the evidence customers receive, and whether the vendor commits to cooperate with forensic analysis.
It is also worth asking whether the vendor distinguishes between security incidents and privacy incidents. A product can be “secure” in the narrow infrastructure sense and still mishandle data in a privacy-violating way by over-retaining content or over-sharing with subprocessors. If you manage sensitive workloads, our cloud hosting security lessons article is a useful companion for framing incident expectations.
How are AI prompts and outputs monitored?
AI systems create a new class of telemetry: prompts, completions, tool calls, citations, and confidence signals. Ask whether prompts are logged, whether outputs are stored, whether human reviewers can inspect them, and whether those artifacts are redacted before support access. This matters even more if users paste regulated data into free-text fields, because those prompts may not follow the same policies as uploaded files.
Procurement teams should confirm whether monitoring is limited to abuse prevention and service quality or whether it can be repurposed for product training and analytics. If a vendor cannot clearly separate the two, you may be buying a system whose internal data flows are too ambiguous for regulated use. That is a red flag for legal, privacy, and infosec teams alike.
4. The Contract Terms Procurement Should Push Hardest
Data processing agreement terms are not boilerplate
The DPA should define roles, processing purposes, categories of data, security measures, subprocessors, breach notification, and deletion obligations. Do not accept vague language that leaves the vendor free to process customer content for broad “service improvement” purposes if your data includes regulated records. Ask for a narrow processing purpose and make sure it aligns with the product’s actual behavior.
Teams should also confirm whether the vendor will sign an addendum for sensitive data categories, such as health information, financial data, or government identifiers. If the product is used in a regulated workflow, the DPA should match that reality rather than the vendor’s generic terms. If your legal team wants a structured starting point, our DPA template for SaaS and contract terms checklist are useful review aids.
Subprocessors must be visible and controlled
Subprocessors are one of the most common blind spots in procurement. A vendor may promise not to use customer data for training while still routing content through multiple infrastructure, analytics, support, or AI service providers. Ask for a complete and current subprocessor list, including the function each subprocessor performs, its region, and whether it can access customer content or only metadata.
Then ask about subprocessor change notifications. You want advance notice and a clear objection mechanism for material changes, especially if a new subprocessor is introduced for AI inference, observability, or customer support. A strong procurement process treats subprocessor churn as a risk signal, not an administrative footnote.
Retention, deletion, and audit rights should be explicit
Contract language should define how long data is retained after termination, whether export is available before deletion, and how backup deletion is handled. If the vendor supports AI features, make sure the contract states whether prompts and generated outputs follow the same retention schedule as source documents. You should also ask for audit rights or, at minimum, independent assurance reports such as SOC 2, ISO 27001, or a similar control attestation.
For highly sensitive use cases, procurement may need rights to review architecture changes, significant incident summaries, and periodic compliance attestations. If the vendor uses a shared AI platform underneath its application, ask for transparency on how that dependency is governed. This is where our security review checklist and subprocessor tracking guide can help standardize vendor responses.
5. Practical Vendor Questionnaire for AI Procurement
Core questions to ask before the demo
Before you let a vendor walk you through polished workflows, ask the hard questions up front. Request written answers to: What data types do you ingest? Do you store prompts, extracted text, embeddings, or generated summaries? Is any customer content used to train or fine-tune models? How long is each data category retained? Which subprocessors can access content? What are the data residency options? How are admin and support access controlled? What deletion guarantees do you provide on termination?
These questions are deliberately simple, because a vendor with a mature program should answer them quickly and consistently. If the answers are evasive, contradictory, or overloaded with marketing language, that is a signal that the product team has not aligned legal, security, and engineering around sensitive data handling. In procurement, clarity is usually a proxy for maturity.
Questions for AI-specific risk
AI adds questions that older SaaS questionnaires often miss. Ask whether outputs can be hallucinated, whether the system exposes confidence scores, whether users can disable AI features, whether prompts are isolated by tenant, and whether the vendor has controls to prevent prompt injection or retrieval poisoning. If the system reads external documents, verify whether it validates file provenance and blocks untrusted links or embedded instructions.
Also ask how the vendor evaluates output quality for regulated use cases. If a platform is used to summarize a medical file, a contract, or a signed authorization, errors can have operational consequences even if the system is not making final decisions. That is why the best procurement teams treat AI features as assistive tools that still require human review, particularly in regulated workflows.
Questions for integrations and APIs
Integrations can quietly expand your risk surface. Ask whether APIs expose raw content, whether webhooks include sensitive fields, whether logs contain payloads, and whether third-party connectors inherit your retention and access settings. If the product integrates with CRM, ECM, identity, or workflow automation platforms, make sure the access tokens are scoped minimally and can be rotated or revoked easily.
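The webhook concern above can be illustrated with a small redaction pass: strip sensitive fields before a payload reaches logs or third-party connectors. Field names here are hypothetical examples, not any platform's real event schema:

```python
# Sketch: mask sensitive fields in an outbound webhook payload before
# it is logged or forwarded. Field names are hypothetical.
SENSITIVE_FIELDS = {"ssn", "bank_account", "date_of_birth", "document_text"}

def redact(payload, sensitive=SENSITIVE_FIELDS):
    """Return a copy of the payload with sensitive values masked."""
    return {
        key: "[REDACTED]" if key in sensitive else value
        for key, value in payload.items()
    }

event = {"event": "document.signed", "doc_id": "d-123",
         "signer": "a.smith", "ssn": "123-45-6789"}
print(redact(event))
```

The procurement question is whether the vendor applies this kind of filtering by default, or whether every integration receives the full payload unless you build the filter yourself.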
For organizations building their own workflow automation, our API integration guide and workflow automation patterns show how to think about data flow, tokens, and event boundaries. The bigger procurement lesson is simple: every integration is a new processor, and every processor needs review.
6. A Comparison Table You Can Use in Procurement
Use the table below as a working matrix when comparing AI-enabled document and signing vendors. The goal is not to score features in isolation, but to expose the controls that matter when regulated data may be processed by AI systems. If a vendor cannot answer a row confidently, mark it as a risk and escalate to security or legal before moving forward.
| Evaluation Area | What to Ask | Preferred Answer | Red Flag | Why It Matters |
|---|---|---|---|---|
| Model training | Is customer content used to train or fine-tune any model? | No, by default, with contract language confirming it | “We may use data to improve services” | Training use can create long-lived exposure of regulated data |
| Retention policy | How long are source files, prompts, outputs, logs, and backups retained? | Category-specific retention with deletion SLA | “Retained as needed” | Ambiguity makes compliance and deletion hard to prove |
| Subprocessors | Which subprocessors can access customer content? | Current list with function, region, and notice period | No public list or vague categories | Hidden processor chains increase privacy and security risk |
| Data residency | Can data stay in your required region end to end? | Yes, including backups and support access controls | Primary storage only, but backups leave region | Residency gaps can break regulatory commitments |
| Deletion | What happens when an admin deletes content or terminates service? | Documented deletion from production and scheduled purge from backups | No backup purge commitment | Deletion is a core control for regulated data handling |
| AI logging | Are prompts, outputs, and retrieval traces stored? | Minimized, redacted, and access-controlled | Stored indefinitely for debugging | AI logs often contain the most sensitive text |
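The matrix above can also be run as a simple red-flag counter during vendor comparison. The flag phrases are taken from the table's Red Flag column; the vendor answers below are a hypothetical response, not a real product:

```python
# Sketch: match vendor answers against the matrix's red-flag phrases.
# The sample vendor response is hypothetical.
RED_FLAGS = {
    "model_training":   "we may use data to improve services",
    "retention_policy": "retained as needed",
    "subprocessors":    "no public list",
    "data_residency":   "backups leave region",
    "deletion":         "no backup purge commitment",
    "ai_logging":       "stored indefinitely",
}

def risk_rows(vendor_answers):
    """Rows whose answer contains that row's known red-flag phrase."""
    return [area for area, flag in RED_FLAGS.items()
            if flag in vendor_answers.get(area, "").lower()]

vendor = {
    "model_training": "No training by default; we may use data to improve services",
    "deletion": "Purged from production in 30 days; no backup purge commitment",
}
print(risk_rows(vendor))  # → ['model_training', 'deletion']
```

Phrase matching is deliberately crude; its value is forcing each answer into a row of the matrix so nothing is skipped.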
7. Due Diligence Process: How to Run the Review Efficiently
Start with data classification and use case scoping
Do not evaluate the vendor until you know what data will actually flow through it. Classify your use case by sensitivity: public, internal, confidential, regulated, or highly restricted. A simple e-signature workflow for sales agreements may justify a lighter review than a platform that ingests patient intake documents or employee medical leave forms. The clearer your use case, the faster security and legal can make a risk-based decision.
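The classification step can be expressed as a lookup from data types to the highest sensitivity tier they imply, which then drives review depth. Tier names and type mappings below are illustrative assumptions; unknown types default to the strictest tier:

```python
# Sketch: classify a use case by the most sensitive data type it touches.
# Type-to-tier mappings are illustrative assumptions.
TIER_BY_TYPE = {
    "medical_record":  "regulated",
    "government_id":   "regulated",
    "payroll":         "confidential",
    "sales_agreement": "internal",
    "marketing_copy":  "public",
}
ORDER = ["public", "internal", "confidential", "regulated", "highly_restricted"]

def classify(data_types):
    """Highest sensitivity tier across all data types in the workflow.

    Unknown types default to highly_restricted (fail closed).
    """
    tiers = [TIER_BY_TYPE.get(t, "highly_restricted") for t in data_types]
    return max(tiers, key=ORDER.index)

print(classify(["sales_agreement"]))            # → internal (lighter review)
print(classify(["payroll", "medical_record"]))  # → regulated (full review)
```

Failing closed on unknown types is the point: if nobody can say what a data type is, it gets the heaviest review until someone can.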
This scoping step also prevents unnecessary procurement friction. Vendors often oversell AI capabilities, but the practical question is whether you need the AI feature at all for the specific workflow. If the answer is no, disable it or choose a non-AI configuration where possible. For teams that want a broader process playbook, our procurement workflow playbook and data classification guide are useful reference points.
Use a three-layer review: security, privacy, legal
Security should validate controls, privacy should validate data use and retention, and legal should validate contract language and liability. The three reviews should not happen in silos because AI features often create overlap: a logging decision is both a security and privacy decision, and a subprocessor change is both a legal and operational issue. Establish a single shared questionnaire so vendors do not answer the same question three different ways.
The best teams also define escalation thresholds. For example, if a vendor cannot provide an audit report, if it retains prompts indefinitely, or if it uses customer content for model improvement, the deal may need executive approval or a hard no. That discipline keeps procurement from becoming a subjective debate after the demo is already sold.
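Those thresholds work best when they are written down as explicit rules rather than rediscovered in each review. A minimal sketch, with flag names that are our own shorthand:

```python
# Sketch: escalation thresholds as explicit rules. Flag names are
# our own shorthand, not a standard schema.
ESCALATION_RULES = {
    "no_audit_report": "executive approval required",
    "indefinite_prompt_retention": "executive approval required",
    "content_used_for_model_improvement": "hard no for regulated data",
}

def escalations(findings):
    """Map review findings to the escalation each one triggers."""
    return {f: ESCALATION_RULES[f] for f in findings if f in ESCALATION_RULES}

print(escalations(["no_audit_report", "slow_support"]))
```

Findings that hit no rule stay with the review team; anything the function returns leaves the room with a named owner and a defined outcome.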
Document the decision, not just the purchase
Every procurement decision should produce an internal memo or ticket that explains the rationale, accepted risks, compensating controls, and owner for periodic reassessment. This is especially important for AI-enabled tools because the product may change faster than your annual review cycle. If the vendor updates its model behavior, expands subprocessors, or changes retention defaults, your original approval may no longer be valid.
One practical approach is to schedule a 6- or 12-month reassessment and require the vendor to re-confirm key answers. That makes vendor due diligence an ongoing control instead of a one-time formality. If you are building a catalog of trusted options, consider using our verified vendor listings and vendor profiles as a starting point for shortlisting.
8. Real-World Scenarios Procurement Teams Should Expect
Healthcare workflows and patient-facing AI
In healthcare-adjacent workflows, AI may summarize intake forms, explain coverage documents, or extract details from scanned records. The risk is not only confidentiality but also accuracy, because a misleading summary can affect downstream decisions. If a tool handles medical data, ask whether it supports stricter access controls, audit logging, and role-based segregation for clinicians, administrators, and external support staff.
The BBC’s report on ChatGPT Health makes clear why this matters: data that users consider personal can quickly become operationally useful to the vendor. Procurement teams should insist on separate storage boundaries, explicit non-training terms, and a clear statement that the system is not intended for diagnosis or treatment. If your workflow touches healthcare data, our health data AI risk guide and medical record scanning resources provide more specific evaluation points.
HR, finance, and legal documents
HR and finance teams routinely process forms that include sensitive identifiers, compensation, banking details, and benefits information. AI can accelerate extraction and routing, but it can also expose more data than necessary if the platform indexes entire documents by default. Ask whether the system can redact fields, limit search permissions, and prevent AI summaries from leaking sensitive values into notifications or dashboards.
For legal documents, the issue is privilege and confidentiality. A clause extraction model that stores prompt traces or sends content to a third-party inference engine may create unacceptable exposure. The procurement question should always be: can the vendor prove that the AI feature is compatible with the confidentiality standard of the documents it processes?
Signing workflows with embedded AI
Digital signing platforms increasingly include AI to suggest fields, summarize agreements, or classify contract types. These features can be useful, but they should not reduce review rigor. Ask whether AI suggestions are editable, whether users can disable automated routing, and whether the platform preserves original document integrity after analysis. Signature workflows should also maintain a complete audit trail, including who viewed, modified, approved, and signed each document.
In many cases, the safest model is AI-assisted, human-approved. That means AI can help triage or extract, but legal or operations staff still make the final decision. For comparisons among leading tools, our digital signature tools directory and OCR comparison guide can help you shortlist vendors with the right workflow depth.
9. What Good Procurement Looks Like After the Vendor Answers
Require evidence, not promises
Strong vendors provide evidence: audit reports, architecture diagrams, DPA redlines, subprocessor lists, retention settings, and documented deletion workflows. Weak vendors provide reassurances. Procurement should reward evidence because AI and regulated data are too sensitive for generic trust claims. If a vendor is serious, it should expect these questions and have the documentation ready.
Pro Tip: Treat every AI feature as a data processor until proven otherwise. If the vendor cannot describe exactly how content is stored, segmented, logged, and deleted, assume the risk is higher than the demo suggests.
Build controls into configuration, not just contracts
Contracts matter, but configuration is what actually enforces the promise. Turn off unnecessary AI features, limit admin access, restrict export permissions, and configure retention to the shortest acceptable window. If the vendor offers customer-managed keys, regional storage, or tenant-level feature flags, use them. A well-configured lower-risk deployment is often better than a feature-rich default setup.
Procurement should coordinate with IT and operations so that the approved settings are actually implemented before launch. Too many vendors pass review and then get deployed with default settings that undermine the original risk decision. Good governance means the contract, configuration, and actual workflow all match.
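One practical way to keep contract and configuration aligned is a periodic drift check: record the approved baseline from the risk review and diff the tenant's deployed settings against it. Setting names and values below are hypothetical:

```python
# Sketch: detect drift between deployed tenant settings and the
# approved baseline from the risk review. Setting names are hypothetical.
APPROVED = {
    "ai_features_enabled": False,
    "retention_days": 90,
    "regional_storage": "eu-west",
    "customer_managed_keys": True,
}

def drift(deployed, approved=APPROVED):
    """Settings whose deployed value differs from the approved baseline.

    Returns {setting: (deployed_value, approved_value)} for each mismatch.
    """
    return {k: (deployed.get(k), v)
            for k, v in approved.items() if deployed.get(k) != v}

deployed = {"ai_features_enabled": True, "retention_days": 90,
            "regional_storage": "eu-west", "customer_managed_keys": True}
print(drift(deployed))  # → {'ai_features_enabled': (True, False)}
```

Run at deployment and again at each renewal checkpoint, a check like this catches the common failure mode where a vendor update or admin change quietly re-enables a feature the review disabled.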
Plan for change management
AI vendors change quickly. Model updates, new subprocessors, expanded features, and altered retention defaults can all shift your risk posture after procurement closes. Build a renewal checkpoint that asks the vendor to re-attest to the original answers and notify you of material changes. If the vendor cannot support that cadence, it may not be mature enough for regulated workflows.
For teams that need a broader monitoring mindset, our vendor change management guide and compliance monitoring playbook are practical complements to the initial review.
10. FAQ: ChatGPT Health and SaaS Procurement
Does “not used for training” mean a vendor is safe for regulated data?
No. It only answers one part of the question. You still need to know whether data is logged, retained, shared with subprocessors, accessed by support, or stored in backups. A safe procurement decision requires controls across the entire data lifecycle.
Should we block all AI features in document platforms?
Not necessarily. The better approach is risk-based. Low-sensitivity use cases may justify AI-assisted extraction or summarization, while highly sensitive workflows may require AI to be disabled or heavily restricted. The key is to align the feature set with the data classification.
What is the most common vendor red flag?
Ambiguous data use language is one of the biggest red flags. If a vendor cannot clearly state whether customer content is used for training, how long it is retained, and which subprocessors can access it, that uncertainty should slow the deal down.
How should we review subprocessors?
Ask for a complete list, the function of each subprocessor, and advance notice for changes. Also verify whether each subprocessor can access content or only metadata. In regulated workflows, a hidden inference or analytics provider can be a material risk.
What contract terms matter most for AI-enabled SaaS?
The most important terms are the DPA, retention/deletion commitments, subprocessor notice, breach notification, data use limitations, and audit or assurance rights. If those terms are vague, the vendor may be too risky for regulated data.
How often should we re-review an approved vendor?
At least annually, and sooner if the vendor changes model behavior, subprocessors, data residency, or retention settings. AI products evolve fast enough that a one-time approval is not enough for long-term governance.
Conclusion: Treat AI Procurement Like Regulated Data Governance
ChatGPT Health is a useful signal for procurement teams: AI products are moving deeper into sensitive workflows, and the vendor questions that once felt optional are now mandatory. If a platform can ingest medical records or other regulated data, then procurement must prove that its controls are narrow, auditable, and enforceable. That means asking about model training, retention, subprocessors, support access, incident handling, and contract terms before the pilot starts, not after the first data upload.
For teams evaluating document scanning, OCR, signing, or workflow automation tools, the goal is not to avoid AI entirely. The goal is to make AI safe enough for the use case by enforcing the right privacy, security, and legal controls. If you want to continue shortlisting vetted tools, compare options in our verified vendor listings, review integration patterns in our API integration guide, and use our security questionnaire template to standardize the next procurement cycle.
Related Reading
- Health Data AI Risk Guide - A deeper look at how to evaluate AI tools that touch medical or wellness records.
- Retention Policy Checklist - Practical questions for source files, logs, backups, and deletion timelines.
- Data Processing Agreement Guide - What to ask for before you sign an AI-enabled SaaS contract.
- Subprocessor Tracking Guide - How to monitor third-party processors and react to vendor changes.
- Verified Vendor Listings - Browse vetted document and security scanning vendors with profile-level details.
Marcus Ellison
Senior SEO Content Strategist
Senior editor and content strategist. Writing about technology, design, and the future of digital media. Follow along for deep dives into the industry's moving parts.