Adversarial Prompting - Tutorial + Lab

Hypnotized AI and Large Language Model Security

Prompt Injection, explained

💔Бабуся з онуком віддають честь військовослужбовиці ЗСУ

ROCK PAPER SCISSOR! (50 MLN CHALLENGE!) feat @PANDAGIRLOFFICIAL #shorts

ПОЛНАЯ ИСТОРИЯ ЭКЗОРЦИЗМА [Топ Сикрет]

Prompt Injections - An Introduction

Embrace The Red

Переглядів 4 775

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 7 чер 2024
Many courses teach prompt engineering and currently pretty much all examples are vulnerable to Prompt Injections. Especially Indirect Prompt Injections are dangerous. They allow untrusted data to take control of the LLM (large language model) and give an AI a new instructions, mission and objective.
This video aims to raise awareness of this rising problem.
Injections Lab: colab.research.google.com/dri...
Prompt Engineering Overview 0:00
Prompt Injections Explained 2:05
Indirect Prompt Injection and Examples 4:03
GPT 3.5 Turbot vs GPT-4 5:55
Examples of payloads 6:15
Indirect Injections, Plugins and Tools 8:20
Algorithmic Adversarial Prompt Creation 10:35
AI Injections Tutorials + Lab 12:22
Defenses 12:39
Thanks 14:40
Наука та технологія

КОМЕНТАРІ • 4

@ninosawas3568 5 місяців тому ⁺¹
Great video! Very informative. Interesting to see how the LLMs ability to "pay attention" is such a large exploit. I wonder if mitigating this issue would lead to LLMs being overall less effective at following user instructions
@embracethered 5 місяців тому ⁺¹
Thanks for watching! I believe you are correct, it's a double edged sword. The best mitigation at the moment is to not trust the responses. Unfortunately it's hence impossible at the moment to build a rather generic autonomous agent that uses tools automatically. It's a real bummer, because i think most of us want secure and safe agents.
@halfoflemon Рік тому ⁺¹
How about giving it a secret word that should be typed in order to unlock control, like a password? Do you think it will work? Also, does lowering the temperature reduces the chance of successful injection attack?
@embracethered Рік тому
Yes, something like that works. I have done it with image models in the past, basically train the model to respond in particular way once a certain object is present. You can check out this blog post on what is possible: embracethered.com/blog/posts/2020/husky-ai-machine-learning-backdoor-model/
Higher temperature means more "creativity", so it is probably more likely to come up with responses that could be considered insecure, but also less deterministic.

Наступне

Автоматичне відтворення

Adversarial Prompting - Tutorial + Lab

Adversarial Prompting - Tutorial + Lab

Hypnotized AI and Large Language Model Security

Hypnotized AI and Large Language Model Security

Prompt Injection, explained

Prompt Injection, explained

💔Бабуся з онуком віддають честь військовослужбовиці ЗСУ

💔Бабуся з онуком віддають честь військовослужбовиці ЗСУ

ROCK PAPER SCISSOR! (50 MLN CHALLENGE!) feat @PANDAGIRLOFFICIAL #shorts

ROCK PAPER SCISSOR! (50 MLN CHALLENGE!) feat @PANDAGIRLOFFICIAL #shorts

ПОЛНАЯ ИСТОРИЯ ЭКЗОРЦИЗМА [Топ Сикрет]

ПОЛНАЯ ИСТОРИЯ ЭКЗОРЦИЗМА [Топ Сикрет]

Історія військовослужбовця з ТЦК на Миколаївщині #shortsvideo

Історія військовослужбовця з ТЦК на Миколаївщині #shortsvideo

Using the Command Prompt for Port Forwarding on windows | Windows Tutorial | omarict

Using the Command Prompt for Port Forwarding on windows | Windows Tutorial | omarict

Real-world exploits and mitigations in LLM applications (37c3)

Real-world exploits and mitigations in LLM applications (37c3)

Python RAG Tutorial (with Local LLMs): AI For Your PDFs

Python RAG Tutorial (with Local LLMs): AI For Your PDFs

LLM Vulnerability Scanning with garak. Tutorial: Test your own chat bots!

LLM Vulnerability Scanning with garak. Tutorial: Test your own chat bots!

Prompt Injection in LLM Agents (ReAct, Langchain)

Prompt Injection in LLM Agents (ReAct, Langchain)

Bobby Tables but with LLMs: Google NotebookLM - Data Exfiltration POC

Bobby Tables but with LLMs: Google NotebookLM - Data Exfiltration POC

Почему Играть на зарядке в Samsung, можно? #Shorts

Почему Играть на зарядке в Samsung, можно? #Shorts

БІЛИЙ НАЛИВ! - народний ПК на Ryzen 7500F за 28 800 грн! Це те що ти просив зібрати. Але є нюанс...😉

БІЛИЙ НАЛИВ! - народний ПК на Ryzen 7500F за 28 800 грн! Це те що ти просив зібрати. Але є нюанс...😉

iPhone 16 Pro Max + Siri 2.0 - це буде РОЗРИВ | Новини Тижня

iPhone 16 Pro Max + Siri 2.0 – це буде РОЗРИВ | Новини Тижня

Самый мощный игровой ПК на LGA 1366 тест в играх

Самый мощный игровой ПК на LGA 1366 тест в играх

Redmi Note 14 Pro - ВАЖКИЙ ЛЮКС 🔥 Samsung: багато Новинок Літа 2024 | iPhone з 8" дисплеєм

Redmi Note 14 Pro - ВАЖКИЙ ЛЮКС 🔥 Samsung: багато Новинок Літа 2024 | iPhone з 8" дисплеєм

AMD врывается с двух ног на Computex 2024 (мама - это точно для учёбы!)

AMD врывается с двух ног на Computex 2024 (мама - это точно для учёбы!)

Мобильные Ryzen 7000 | Как выбрать ноутбук и разгадать ребус AMD

Мобильные Ryzen 7000 | Как выбрать ноутбук и разгадать ребус AMD