Masterclass in AI Threat Modeling: Addressing Prompt Injections

RSAC Gandalf Challenge: Insights from the World's Largest Red Team

Navigating 2024: Insights into AI Regulations and Standards for Enterprises

когда не обедаешь в школе // EVA mash

РЕШАЮЩИЙ РАЗГОВОР: Золкин и Карпенко нашли ее мужа / "Жди меня" отдыхает!

Кирилл Набутов. Над трупом Маслякова надругались, Патрушева прикончили, Терешкова выжила из ума

Lessons Learned from Crowdsourced LLM Threat Intelligence

Lakera AI

Переглядів 912

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 29 вер 2024
Join Václav Volhejn (Lakera), Sander Schulhoff (Learn Prompting), Marc Fischer (LVE), Sam Toyer (TensorTrust) and Eric Allen (Lakera) as they discuss insights from 4 awesome crowdsourcing projects.
Here's a brief overview of each:
👉 Gandalf: Gandalf’s capture the flag approach has spread across the world and been a part of everything from Harvard’s CS50 course, the Generative Red Team Challenge at DEF CON AI Village or the Hack.Sydney Conference. Play Gandalf here: gandalf.lakera...
👉 LVE Project: Beyond cataloging language model vulnerabilities, the Community Challenges provide an interesting look into convincing models to give misaligned responses, like identifying a person in a photo. Learn more: lve-project.org/
👉 Tensor Trust: As both an attacker and a defender, you can choose which model to use for defending your account. Your defenses can be implemented pre and post-user prompt. As you get better at attacking other players, your account becomes worth more points to compromise. Play here: tensortrust.ai/
👉 HackAPrompt: LearnPrompting adopted a strategy of getting the model to say a specific phrase, rather than trying to extract a secret. This method still aims to circumvent the model’s instructions but takes a slightly different approach. Try it here: learn-promptin...

КОМЕНТАРІ • 1

Наступне

Автоматичне відтворення

Masterclass in AI Threat Modeling: Addressing Prompt Injections

Masterclass in AI Threat Modeling: Addressing Prompt Injections

RSAC Gandalf Challenge: Insights from the World's Largest Red Team

RSAC Gandalf Challenge: Insights from the World's Largest Red Team

Navigating 2024: Insights into AI Regulations and Standards for Enterprises

Navigating 2024: Insights into AI Regulations and Standards for Enterprises

когда не обедаешь в школе // EVA mash

когда не обедаешь в школе // EVA mash

РЕШАЮЩИЙ РАЗГОВОР: Золкин и Карпенко нашли ее мужа / "Жди меня" отдыхает!

РЕШАЮЩИЙ РАЗГОВОР: Золкин и Карпенко нашли ее мужа / "Жди меня" отдыхает!

Кирилл Набутов. Над трупом Маслякова надругались, Патрушева прикончили, Терешкова выжила из ума

Кирилл Набутов. Над трупом Маслякова надругались, Патрушева прикончили, Терешкова выжила из ума

Usyk and Conor McGregor met on AJ vs Dubois fight

Usyk and Conor McGregor met on AJ vs Dubois fight

What is RAG? (Retrieval Augmented Generation)

What is RAG? (Retrieval Augmented Generation)

DecisionCAMP 2024: Sep20 "Innovations making Decision modelling easy" by Camunda

DecisionCAMP 2024: Sep20 "Innovations making Decision modelling easy" by Camunda

How Enterprises Can Secure AI Applications: Lessons from OWASP's Top 10 for LLMs

How Enterprises Can Secure AI Applications: Lessons from OWASP's Top 10 for LLMs

CIO Playbook for Enterprise AI | CXOTalk #810

CIO Playbook for Enterprise AI | CXOTalk #810

Decoding OWASP Large Language Model Security Verification Standard (LLMSVS)

Decoding OWASP Large Language Model Security Verification Standard (LLMSVS)

OpenAI o1 preview, Agentforce, AI in fantasy football, and machine unlearning

OpenAI o1 preview, Agentforce, AI in fantasy football, and machine unlearning

Lakera’s Global GenAI Security Readiness Report Deep Dive

Lakera’s Global GenAI Security Readiness Report Deep Dive

Generative AI in a Nutshell - how to survive and thrive in the age of AI

Generative AI in a Nutshell - how to survive and thrive in the age of AI

What Is an AI Anyway? | Mustafa Suleyman | TED

What Is an AI Anyway? | Mustafa Suleyman | TED

Factory assembly line, water transfer graffiti #hydrographic #craftshorts #printing #diy #shorts

Factory assembly line, water transfer graffiti #hydrographic #craftshorts #printing #diy #shorts

Мой последний стендап концерт можно посмотреть целиком на платформе specialscomedy.com

Мой последний стендап концерт можно посмотреть целиком на платформе specialscomedy.com

Usyk and Conor McGregor met on AJ vs Dubois fight

Usyk and Conor McGregor met on AJ vs Dubois fight

Bike Vs Tricycle Fast Challenge

Bike Vs Tricycle Fast Challenge

ПОЛНОЕ видео на канале. Нажми СРАЖАЮСЬ с ЗЛЫМИ РОДИТЕЛЯМИ в schoolboy ranaway

ПОЛНОЕ видео на канале. Нажми СРАЖАЮСЬ с ЗЛЫМИ РОДИТЕЛЯМИ в schoolboy ranaway

Сікорський звернувся до Небензі | Що радянські солдати робили у Польщі?

Сікорський звернувся до Небензі | Що радянські солдати робили у Польщі?

Кирилл Набутов. Над трупом Маслякова надругались, Патрушева прикончили, Терешкова выжила из ума

Кирилл Набутов. Над трупом Маслякова надругались, Патрушева прикончили, Терешкова выжила из ума

МАФИЯ в РЕАЛЬНОЙ ЖИЗНИ: Масленников, Дзюба, Полина, L'One, Даник, Мага, Братишкин, Усачев, Чернец

МАФИЯ в РЕАЛЬНОЙ ЖИЗНИ: Масленников, Дзюба, Полина, L'One, Даник, Мага, Братишкин, Усачев, Чернец