Supercharging RAG with Generative Feedback Loops from Weaviate

My PhD Journey in AI / ML (while doing YouTube on the side)

MAMBA and State Space Models explained | SSM explained

Когда НАМОЧИЛ МАНТУ (смешное видео, юмор, поржать, приколы)

Лучший фокус с калькулятором + обучение! #shorts

There's no quit with this guy... Wheelz is a BEAST 💪

[Own work] On Measuring Faithfulness or Self-consistency of Natural Language Explanations

AI Coffee Break with Letitia

Переглядів 3 276

Додати в
- Мій плейлист
- Переглянути пізніше
Поділитися

Поділитися

Вставка

Розмір відео:

Показувати елементи керування програвачем

Автоматичне відтворення

Автоповтор

Опубліковано 18 жов 2024

КОМЕНТАРІ • 20

@serta5727 2 місяці тому ⁺⁴
Cool 😎 your explanation was very understandable
@MikeAirforce111 2 місяці тому ⁺⁵
Congrats Doctor!! :-) Looking forward for your future work!
@theosalmon 2 місяці тому ⁺⁶
Thank you Dr. Letitia.
@alexkubiesa9073 2 місяці тому ⁺³
This sounds very useful! LLM users tend to assume that just because it writes like a human, that it can introspect and reason about its thought processes, which of course not a given. But it’s great to see progress on measuring this ability (or at least self-consistency) so that newer models can be more ergonomic.
@MaxShawabkeh 2 місяці тому ⁺³
Congrats on the PhD! This is really valuable work! I'm currently trying to squeeze out as much reasoning capabilities as I can out of small LLMs (7-15B) for my company's product, and I'd love a longer video or recorded talk going into details of your findings, any patterns you've found that contribute to improving or reducing self-consistency, or any insights on which existing models or training corpora result in better self consistency and reasoning capabilities. If you have any pointers, I'd appreciate it!
@AICoffeeBreak 2 місяці тому ⁺²
As far as we can see with this paper's experiments, RLHF helps improve self-consistency, but we have not yet any hints for what else had this effect. Maybe size, but for what we *could* test on our infrastructure, we did not measure an effect, but it might be there, we just couldn't test far enough.
@MaxShawabkeh 2 місяці тому
@@AICoffeeBreak Thanks!
@Thomas-gk42 2 місяці тому ⁺⁶
Congratulations to your doctorate🖖
@beatrixcarroll8144 2 місяці тому ⁺⁶
Congrats Dr. Letitia!!!! Wow, YOU ROCK!!!!!!! :-D :-) P.S. We missed you!!
@DerPylz 2 місяці тому ⁺⁵
Thanks for sharing your work! Always great so see what you're up to!
@AICoffeeBreak 2 місяці тому ⁺¹
Much appreciated!
@naromsky 2 місяці тому ⁺⁴
🎉
@fingerstyledojo 2 місяці тому ⁺⁵
Yay, new video!
Thanks for letting me pass yesterday lol
@AICoffeeBreak 2 місяці тому ⁺¹
Wow, you have a channel! It's amazing, just checked it out! 🤩
@nitinss3257 2 місяці тому ⁺⁵
1 minute ago for non members ... good to see ya
@Ben_D. 2 місяці тому ⁺⁴
No ASMR? 😟
@AICoffeeBreak 2 місяці тому ⁺²
It was an entire blooper. Next time for sure. 😅
@anluifb 2 місяці тому ⁺¹
So you came up with a method, didn't have time to explain the method to us, and didn't show us that it works. Great.
If you still have time before Bangkok I would suggest rerecording and focusing on the implementation and interpretation of results rather than the context and wordy descriptions.
@AICoffeeBreak 2 місяці тому ⁺¹
Thanks for your feedback. The method is in the video, just not the tiny details.
1. Interpret with SHAP prediction and explanation. (Mentioned in the video)
2. Measure their alignment (mentioned) after:
- normalisation: to bring the values to the same range (mentioned. Did not mention that shap properties make their value very different between output tokens with different probabilities)
- aggregation: to collect the many values from many outputs. (mentioned. Did not mention we use the mean for this)
For the results I've synthesized what we see with words and the main takeaways. For lengthy tables, please check the paper and its appendix. I don't know what you mean that the video doesn't show that it works. I've also shown an individual example before the takeaways. The problem that there is no ground truth, of course exists for us as well as for previous work. But for the first time in literature, we now *compare* existing works to each other-and to our method to them.
This is why the context is important, namely to make this clear. Because our paper makes the contribution to evaluate and clarify the state of the field, and as a follow-up contribution, we have this new method by solving the shortcomings of existing tests.

Наступне

Автоматичне відтворення

Supercharging RAG with Generative Feedback Loops from Weaviate

Supercharging RAG with Generative Feedback Loops from Weaviate

My PhD Journey in AI / ML (while doing YouTube on the side)

My PhD Journey in AI / ML (while doing YouTube on the side)

MAMBA and State Space Models explained | SSM explained

MAMBA and State Space Models explained | SSM explained

Когда НАМОЧИЛ МАНТУ (смешное видео, юмор, поржать, приколы)

Когда НАМОЧИЛ МАНТУ (смешное видео, юмор, поржать, приколы)

Лучший фокус с калькулятором + обучение! #shorts

Лучший фокус с калькулятором + обучение! #shorts

There's no quit with this guy... Wheelz is a BEAST 💪

There's no quit with this guy... Wheelz is a BEAST 💪

0ЧНАЯ SТАВКА, ПР0БЛЕМН0ГО 0ФИЦЕРА РАZВЕДКИ АЛТАЯ & ХИЩНИКА @VolodymyrZolkin

0ЧНАЯ SТАВКА, ПР0БЛЕМН0ГО 0ФИЦЕРА РАZВЕДКИ АЛТАЯ & ХИЩНИКА @VolodymyrZolkin

77% Of Employees Report AI Has Increased Workloads

77% Of Employees Report AI Has Increased Workloads

The moment we stopped understanding AI [AlexNet]

The moment we stopped understanding AI [AlexNet]

State Space Models (S4, S5, S6/Mamba) Explained

State Space Models (S4, S5, S6/Mamba) Explained

Direct Preference Optimization: Your Language Model is Secretly a Reward Model | DPO paper explained

Direct Preference Optimization: Your Language Model is Secretly a Reward Model | DPO paper explained

This is the dangerous AI that got Sam Altman fired. Elon Musk, Ilya Sutskever.

This is the dangerous AI that got Sam Altman fired. Elon Musk, Ilya Sutskever.

Transformers explained | The architecture behind LLMs

Transformers explained | The architecture behind LLMs

Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution - Paper Explained

Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution – Paper Explained

Harvard Professor Explains Algorithms in 5 Levels of Difficulty | WIRED

Harvard Professor Explains Algorithms in 5 Levels of Difficulty | WIRED

Mission: Impossible language models - Paper Explained [ACL 2024 recording]

Mission: Impossible language models – Paper Explained [ACL 2024 recording]

MY HEIGHT vs MrBEAST CREW 🙈📏

MY HEIGHT vs MrBEAST CREW 🙈📏

Когда у вас с подругой чуть разные размерчики 😅🍒 #юмор

Когда у вас с подругой чуть разные размерчики 😅🍒 #юмор

Наггетс Гедагедигедагедао в лабиринте Granny , помоги ему !

Наггетс Гедагедигедагедао в лабиринте Granny , помоги ему !

МАФИЯ в РЕАЛЬНОЙ ЖИЗНИ: Киркоров, Масленников, +100500, Дава, Супер Стас, Ликс, Генсуха, Шадоукек

МАФИЯ в РЕАЛЬНОЙ ЖИЗНИ: Киркоров, Масленников, +100500, Дава, Супер Стас, Ликс, Генсуха, Шадоукек

Всегда так, когда хочу что то приготовить 🥲 #aminkavitaminka #aminak #aminokka #аминкавитаминка

Всегда так, когда хочу что то приготовить 🥲 #aminkavitaminka #aminak #aminokka #аминкавитаминка

Human vs Jet Engine

Human vs Jet Engine

Генерал СБУ Омельченко: Россию ждет полный военный разгром и капитуляция

Генерал СБУ Омельченко: Россию ждет полный военный разгром и капитуляция