7 methods to secure LLM apps from prompt…

Jan 27, 2024

Practical strategies to protect language models apps (or at least doing your best)

8 Comments

Jan 27, 2024

Thank you two for using your reach to educate about this super important topic!

In my experience getting devs hands-on with attacks ALA OWASP Juice is incredibly valuable as it helps them conceptualize how these attacks work.

Some resources for that:

- Gandalf CTF: https://gandalf.lakera.ai/

- Portswigger academy web LLM attacks: https://portswigger.net/web-security/llm-attacks

- TensorTrust AI attack+defense: https://tensortrust.ai/

Also worth checking out the OWASP top 10 for LLMs: https://owasp.org/www-project-top-10-for-large-language-model-applications/assets/PDF/OWASP-Top-10-for-LLMs-2023-v1_1.pdf

Expand full comment

Reply (2)

Sahar Mor

Jan 27, 2024

Gandalf is such a fun way to get hands on experience with prompt attacks. Thanks for sharing.

Expand full comment

Devansh

Jan 27, 2024

Seems like you have insight into this. Come on for a follow up guest post if you're not too busy

Expand full comment

Reply (1)

Alex Mackie

Jan 27, 2024

Oh I'd love that, thank you! What's the best way to chat about this further?

Expand full comment

Reply (1)

Devansh

Jan 28, 2024

All my social media is in the end. Pick whatever you like the most

Expand full comment

Jasmine R.

Jan 30, 2024

Used Nemo Guardrails in a hackathon last year after it was released. Easy to implement and powerful! Interested in checking out these other approaches.

Expand full comment

Reply (1)

Sahar Mor

Jan 30, 2024

NeMo is great. The other ones are more straightforward so should be easier for you to implement.

Expand full comment

Simone Romano

Feb 7, 2024

Have you seen this https://github.com/ceterum1/llm-defender-subnet for Bittensor?

Expand full comment

Artificial Intelligence Made Simple

7 methods to secure LLM apps from prompt…