Integration and Security

Introduction

In this section, we look to dive into protection of LLM and its integration processes to prevent malicious usage by attackers. LangChain has explained some basic concepts and usages of its capabilities. It can also be used as potential attack vectors and malicious endpoint.

Example of LangChain Process Flow using `llm_math`

Credits to Nvidia

Issues with Model Inputs Using LangChain

With LangChain's rising popularity and usage to communicate with models, LangChain can be used as an offensive tool to inject malicious prompts that the model API Endpoint or the model itself may not properly sanitize. Because of rapid developmenet and deployment to public, designs of such tools may be inadequate and lack security enhancements. Recent developments have shown possibilities revolving arbitrary malicious input from attackers to control the output of the LLM.

Exploited Proof of Concepts (PoCs) using llm_math are further discussed in LLM Attacks.

Securing the Process

Recommended to use LangChain's latest version to avoid pitfalls of vulnerable APIs that malicious actors can abuse. Reduce the use of plugins from packages, create custom plugins if possible otherwise, treat external plugins with lowest level of privilege.

Data and Control (context passing) planes are inseparable and sometimes passed by user, important to always sanitize inputs on both ends. Parameterized queries can help to maintain a lower privileged context as well.

Securing LLM Systems Against Prompt Injection | NVIDIA Technical BlogNVIDIA Technical Blog

Paraphrasing can be used as possible mitigation method to prompt injection. This may break the order of the payload within the input which helps reduce malicious input execution.

Data prompt isolation to prevent malicious code within the data input to override the current instruction/context prompt. Solely treating data prompt as data input could reduce Context Ignoring attacks. Delimiters such as ''' or <xml> and other tags have to be carefully treated to prevent execution during data prompting.

Sandwich prevention prefixes an instrcution prompt to ensure the LLM do not modify its original context and prevents attacker's injected instruction to execute. However, it reduces flexibility in allowing normal users from performing custom prompts. Therefore, context should be carefully managed and a limiter could be used to detect if context has been shifted maliciously and only then, revert to its original instructions.

Interview Questions

LLM integration into existing products is a trend to boost productivity, how do you secure LLM integration and pipelines?
What defense techniques can be used to secure API functions?

Author

Zheng Jie 🍞

References

LLM Security - Defense - Fantastic resource links
Nvidia - Prompt Injection Defense
AWS - Architect Defense for Generative AI - Maybe write in separate topic
Lakera - Insights to LLM Security
Neeraj - LLM Defense Strategies - To incorporate in the future
OWASP - Top 10 LLM Vulnerabilities
Microsoft = AI/ML Security - To incorporate in the future
Paper - Prompt Injection and Defenses in LLM-Integrated Applications - Can further explore the paper

PreviousLangChain NextKeystore

Last updated 1 year ago