Top 10 Risks for Large Language Models in 2025

MAY NOT BE REPRODUCED WITHOUT PERMISSION

As generative AI and large language are embedded into a greater number of internal processes and customer-facing applications, the risks associated with LLMs are growing. The OWASP Top 10 list for LLM applications for 2025 details these risks based on real-world usage as a cautionary note for leaders in tech, cybersecurity, privacy, and compliance.

“Organizations are entering uncharted territory in securing and overseeing GenAI solutions. The rapid advancement of GenAI also opens doors for adversaries to enhance their attack strategies, a dual challenge of defense and threat escalation.” — OWASP

Attacks or manipulation of AI models are particularly nefarious because they are often hidden from end users, but can significantly impact outputs. When these risks are introduced by users, outputs are skewed and can be used for deliberate misinformation or other malicious activities.

The 2025 OWASP Top 10 for Large Language Models

Let’s break down each of the top 10 risks with examples and strategies for prevention and mitigation.

1. Prompt Injection

Prompt Injection occurs when user inputs alter an LLM’s behavior or output in unintended ways. This might involve bypassing safety measures, unauthorized access, or manipulation of decisions.

Examples

Injecting prompts into a chatbot to access private data
Using hidden instructions in web content to influence outputs
Modifying documents in repositories to manipulate Retrieval-Augmented Generation (RAG)
Using different languages in instructions to evade detection

Prevention and Mitigation Strategies

Integrate data sanitization to prevent user data from entering models.
Implement filtering for sensitive content on both inputs and outputs.
Apply least privilege access controls for model operations.
Limit access to external data sources.
Incorporate differential privacy to add noise to data or outputs.

Advanced techniques include the use of homomorphic encryption and tokenization to preprocess and sanitize any sensitive information.

2. Sensitive Information Disclosure

Sensitive Information Disclosure happens when a model unintentionally reveals private or confidential data through responses. This often includes information that is contained in training data and disclosed by specific user queries.

Examples

Leaking API keys or user credentials
Disclosing proprietary business strategies inappropriately
Sharing personal user data when answering queries
Revealing sensitive system details or prompts

Prevention and Mitigation Strategies

Scrub training data to remove sensitive details.
Enforce content filtering for sensitive output categories.
Eliminate outdated or vulnerable components.
Employ robust access controls to protect sensitive data from exposure.
Audit responses to identify and prevent leaks.
Implement response anonymization techniques.

3. Supply Chain Vulnerabilities

Supply Chain Vulnerabilities introduce risks when third-party components or dependencies are used. This can include malicious or unverified data, libraries, or models. It may simply be bad data or data crafted for malicious intent.