LLM02:2025 Sensitive Information Disclosure - Simple Explanation
Sensitive information disclosure is when an LLM reveals data it should keep private. That data could be personal information or business secrets or passwords or source code or internal instructions.
Imagine a new employee who can see customer records and HR files and internal memos. If that employee says a customer's credit card number to the wrong person by accident it is still a data leak. An LLM can do the same thing through its answers.
$ trace llm02.sensitive-disclosure
Why it happens
Sensitive data reaches an LLM in a few common ways. It may be present in training data. It may arrive through live context from a database or retrieval system. It may also be typed directly by a user who pastes private material into chat.
Once sensitive data is inside model knowledge or context a clever prompt can pull it back out. Sometimes no attacker is needed. A normal user can ask an innocent question and receive data they should never see.
Data at risk
Common vulnerabilities
The model reveals one user's personal data to another user.
The output reveals model behavior or training data that helps attackers clone or invert the model.
Confidential business information appears in a response.
Attack scenarios
Real warning stories
Samsung is the easy warning story. Engineers reportedly pasted proprietary source code and internal notes into ChatGPT while trying to solve work problems. The lesson is simple. Private code should not enter tools that company policy does not control.
Researchers also showed that asking ChatGPT to repeat a word forever could make it output memorized training data. WIRED reported that this included names and email addresses and phone numbers. OpenAI later appeared to block at least some versions of that behavior.
$ map data-leak.path
How to defend against it
Why this risk is number two
The damage is concrete and the attack surface is large. Every company that connects an LLM to customer records or internal documents or codebases creates a new path for data to escape.
Prompt injection is often the technique. Sensitive information disclosure is often the result. It can also create legal duties under GDPR and HIPAA and PCI DSS and CCPA.
Related frameworks
OWASP maps this risk to MITRE ATLAS techniques about training data membership and model inversion and model extraction.
- AML.T0024.000 - Infer Training Data Membership
- AML.T0024.001 - Invert ML Model
- AML.T0024.002 - Extract ML Model
One sentence: Sensitive information disclosure is when an LLM outputs data it was never supposed to share. The data may come from training or connected systems or user prompts.
Copyright and source notes
No third-party images are embedded in this post. The diagrams above are original HTML/CSS illustrations made for promptexploit. The factual risk description and mitigation categories are based on the official OWASP LLM02 page.
- Official OWASP LLM02 page: genai.owasp.org/llmrisk/llm022025-sensitive-information-disclosure
- WIRED on the repeat word training data leak: wired.com/story/chatgpt-poem-forever-security-roundup
- Cybernews on the Samsung ChatGPT leak: cybernews.com/news/chatgpt-samsung-data-leak