Should Generative AI Be Regulated in Healthcare? Safety, Ethics & Innovation🚨#generativeai #aiethics

@Google Multimodal Generative AI - The Holy Grail of Healthcare🏥 #multimodalai #generativeai #ai

Michio Kaku on Alien Brains | Big Think

Отзвуки прошлого / The Old Way (2022) / Драма, Вестерн, Боевик

«Такі, як Коля Тищенко, тут не живуть»: як топ з СБУ ховає елітні котеджі / hromadske

🔴 Орбан змінює плани / Деталі ракетного удару по Україні

Is Claude 2 better than PaLM2 & ChatGPT 4? An In Depth Comparison🤖🤯

Junaid Kalia MD

Переглядів 266

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 15 лип 2024
In this in-depth video, get amazing insights on Anthropic's Claude 2 and Google's GPT-4 head-to-head in a thorough comparison to see which conversational AI assistant performs better overall. 🕵️‍♀️💻 I test them on a range of criteria including usefulness, accuracy, safety, and honesty through live conversations and sample queries.🔍
📈Key factors I evaluate include:
➡️Helpfulness - Can they understand natural language questions and provide useful, relevant answers?
➡️Knowledge - How extensive is their knowledge on a range of topics and current events?
➡️Personality - Do they exhibit a human-like personality and conversational ability?
➡️Safety - How well do they avoid harmful, unethical, or inappropriate content?
➡️Truthfulness - Do they admit limitations rather than make up facts or guess?
➡️Objectivity - Can they provide objective, balanced responses free of bias?
Reveal how Claude 2 and GPT-4 compare on each criterion through back-to-back testing. See which advanced AI assistant from Anthropic or Google comes out on top in abilities like contextual conversation, multi-turn dialog, reasoning, judgment and more.🌐
Discover the strengths and weaknesses of these two cutting-edge natural language AI models.💬✨Find out if one has an edge over the other to potentially serve as a more useful, harmless and honest AI assistant. Revealing analysis!💡
TIMESTAMPS
00:00 Introduction
00:31 What is Model Card?
02:04 An Example of Model Card
03:07 Factors & Metrics
03:30 Evaluation Data
04:30 Human Feedback Evaluations
05:07 Open AI's Superalignment
06:15 Constitutional AI: Harmlessness from AI Feedback
07:19 Constitutional AI: Training for Less Harmful Systems
07:39 Harmlessness vs Helpfulness
07:53 Anthropic's Cyber Security Framework
08:16 Claude 2: Evolving Towards Helpful & Harmless Language Assistants
09:48 Claude 2 Model Card
10:58 Intended Use
11:12 Unintended Use
11:55 Ethical Considerations
13:09 Training Data
13:19 Evaluations and Red Teaming
14:26 ELO Score
15:19 Human Feedback Evaluations
15:49 BBQA Bias Scores
16:59 Truthful QA
18:16 Automated Red-Teaming Evaluation
18:38 Combined HHH Evals
19:24 Multilingual Translation Evaluations
19:52 Long Contexts
21:38 Standard Benchmarks and Standardized Tests
21:47 GRE & MBE
22:06 USMLE
23:07 Use Case Specific Improvements
23:54 Areas for Improvement
24:03 Conclusion
📚Articles discussed⏬
1️⃣www-files.anthropic.com/produ...
2️⃣dl.acm.org/doi/abs/10.1145/32...
3️⃣arxiv.org/pdf/2209.07858.pdf
4️⃣arxiv.org/pdf/2202.03286.pdf
👨‍🎓About Junaid Kalia MD
"If anyone saved a life, it would be as if he saved the life of all mankind"; this is the philosophy that drives me. I am a practicing neurologist, sub-specialized in neurocritical care, stroke & epilepsy.
I am a HealthTech expert who believes that technology, especially AI, can enhance human lives by saving lives and preserving limbs. I am a Founders' Founder who shares his journey of founder highs and lows to help others learn. As an Angel Investor, I invest in early-stage startups.
Join me as I explore digital health innovation with case studies, discussions, and topics such as entrepreneurship, startups, medicine, healthcare, & AI. 🩺🤖
🔗 Follow me
🐦 Twitter - / junaidkaliamd
💼 LinkedIn - / junaidkaliamd
📸 Instagram - / junaidkaliamd
📲 My Apps
NeuroChat iOS: apps.apple.com/us/app/neuroch...
NeuroChat Android: play.google.com/store/apps/de...
#ai #comparison #anthropic #google #anthropic #claude2 #gpt4 #palm2 #modelcard #largelanguagemodels #ai #languageai #multilingualai #techreview #artificialintelligence #airesearch #contextwindows #biasreduction #harmlessness #helpfulness #truthfulness #youtubevideo #techtalk #aiinnovation
Наука та технологія

КОМЕНТАРІ • 4

@imtalhasaeed Рік тому
Great video! Informative comparison of Claude 2, PaLM 2, and ChatGPT 4. I'm interested in Claude 2 for creative text generation. Thanks for sharing!
@user-ox5cs3by8e Рік тому
Highly informative comparison! Very informational and interesting!
@shermeenkhalid7998 Рік тому
Mind-blowing comparison! 🤯 Can't wait for more revealing analyses! 🔍🤖
@mauiiithedigitalbard 11 місяців тому
There are so many llm's that are getting released! This is exciting

Наступне

Автоматичне відтворення

Should Generative AI Be Regulated in Healthcare? Safety, Ethics & Innovation🚨#generativeai #aiethics

Should Generative AI Be Regulated in Healthcare? Safety, Ethics & Innovation🚨#generativeai #aiethics

@Google Multimodal Generative AI - The Holy Grail of Healthcare🏥 #multimodalai #generativeai #ai

@Google Multimodal Generative AI - The Holy Grail of Healthcare🏥 #multimodalai #generativeai #ai

Michio Kaku on Alien Brains | Big Think

Michio Kaku on Alien Brains | Big Think

Отзвуки прошлого / The Old Way (2022) / Драма, Вестерн, Боевик

Отзвуки прошлого / The Old Way (2022) / Драма, Вестерн, Боевик

«Такі, як Коля Тищенко, тут не живуть»: як топ з СБУ ховає елітні котеджі / hromadske

«Такі, як Коля Тищенко, тут не живуть»: як топ з СБУ ховає елітні котеджі / hromadske

🔴 Орбан змінює плани / Деталі ракетного удару по Україні

🔴 Орбан змінює плани / Деталі ракетного удару по Україні

ТАМАЕВ УНИЧТОЖИЛ CLS ВЕНГАЛБИ! Конфликт с Ахмедом?!

ТАМАЕВ УНИЧТОЖИЛ CLS ВЕНГАЛБИ! Конфликт с Ахмедом?!

Myths and Facts About Superintelligent AI

Myths and Facts About Superintelligent AI

AI Agents Explained: How This Changes Everything

AI Agents Explained: How This Changes Everything

Does IQ Really Measure How Smart You Are?

Does IQ Really Measure How Smart You Are?

15 INSANE Use Cases for NEW Claude Sonnet 3.5! (Outperforms GPT-4o)

15 INSANE Use Cases for NEW Claude Sonnet 3.5! (Outperforms GPT-4o)

How Transformer Models Will Revolutionize Medical Care? #multimodalAI #shorts #ai #healthcare

How Transformer Models Will Revolutionize Medical Care? #multimodalAI #shorts #ai #healthcare

The Next Decade in AI by Ray Kurzweil and Ilya Sutskever

The Next Decade in AI by Ray Kurzweil and Ilya Sutskever

How I "Humanize" ChatGPT AI Content...

How I "Humanize" ChatGPT AI Content...

The Exciting, Perilous Journey Toward AGI | Ilya Sutskever | TED

The Exciting, Perilous Journey Toward AGI | Ilya Sutskever | TED

ChatGPT Just Solved Chess

ChatGPT Just Solved Chess

ЧТО ЭТО За Флешки Замурованные в СТЕНЕ? #shorts

ЧТО ЭТО За Флешки Замурованные в СТЕНЕ? #shorts

Забудьте о RX 580 | Тест Nvidia P102, P106 и GTX 1650 Super

Забудьте о RX 580 | Тест Nvidia P102, P106 и GTX 1650 Super

Самый офигеть какой большой QD-Mini LED телевизор в мире - 115X955!

Самый офигеть какой большой QD-Mini LED телевизор в мире - 115X955!

Як організувати резервне живлення в квартирі чи будинку? Всі можливі варіанти (майже).

Як організувати резервне живлення в квартирі чи будинку? Всі можливі варіанти (майже).

Новый питомец! Робот с искусственным интеллектом! Он меня узнал! Anki Cozmo

Новый питомец! Робот с искусственным интеллектом! Он меня узнал! Anki Cozmo

Does Venom also like music?#desksetup #desk #desktop #venom #pickup

Does Venom also like music?#desksetup #desk #desktop #venom #pickup

Не держит нагрузку / Батарея Makita BL1016 | Перепаковка аккумуляторов

Не держит нагрузку / Батарея Makita BL1016 | Перепаковка аккумуляторов

МЫШКА КОТОРАЯ НУЖНАЯ КАЖДОМУ КИБЕРСПОРТСМЕНУ? ЗАЧЕМ НУЖНА ЭТА МЫШКА? #cs2 #игры

МЫШКА КОТОРАЯ НУЖНАЯ КАЖДОМУ КИБЕРСПОРТСМЕНУ? ЗАЧЕМ НУЖНА ЭТА МЫШКА? #cs2 #игры