Very much appreciate this video; fine-tuning seemed like a somewhat amorphous concept to me for some time, but the diagrams you showed really made it easier to understand how people fine-tune.
Thanks so much, glad these diagrams were helpful and clarified things!
Thanks for sharing, especially about Lit-GPT (I'm always interested in more tutorials as my journey with fine-tuning and LLMs needs all the help it can get). Thanks again.
Glad you are liking LitGPT!
One of the approaches I have experimented with, which is manual-labor-, time-, and compute-expensive but more reliable, is as follows:
- Use an LLM to query for outputs. Use RAG and prompt engineering to get the best possible results.
- Generate chat logs for each query. The log should include everything: the prompt, the retrieved info (if any), and the model output. Any special symbols, such as those denoting the system prompt, should also be left in, because LLMs are text-generation models with no concept of chat.
- Manually update the model outputs to better reflect the expected output. This is a data-creation task.
- Fine-tune a copy of the same LLM with PEFT on the updated chat logs (a minimal sketch follows this list).
This can also be done iteratively, as long as the chat logs are initially generated by a model that hasn't been fine-tuned yet. It works like a sort of A/B experiment: some use cases are served by the original model, which generates the data for fine-tuning, while the others are served by the fine-tuned model, whose outputs are not used for any further fine-tuning.
Expensive, but over time your model will work better on realistic inputs.
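A minimal sketch of the workflow above, using Hugging Face transformers + peft (the comment names no library, so this choice, the log file name, the base model, and the LoRA hyperparameters are all illustrative assumptions, not a prescribed recipe):

```python
# Hedged sketch of the logging-plus-PEFT loop described above.
import json

from datasets import Dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling, Trainer,
                          TrainingArguments)

# Steps 1-3: each logged record keeps the *raw* text the model saw
# (system-prompt tokens, retrieved context, everything) plus the
# manually corrected output.
with open("chat_logs.jsonl") as f:  # hypothetical log file
    records = [json.loads(line) for line in f]
texts = [r["prompt"] + r["corrected_output"] for r in records]

base_model = "meta-llama/Llama-2-7b-hf"  # placeholder: the *same* base LLM
tokenizer = AutoTokenizer.from_pretrained(base_model)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(base_model)

# Step 4: attach LoRA adapters (PEFT) so only a small set of weights trains.
model = get_peft_model(model, LoraConfig(
    r=8, lora_alpha=16, target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM"))

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=1024)

train_ds = Dataset.from_dict({"text": texts}).map(
    tokenize, batched=True, remove_columns=["text"])

Trainer(
    model=model,
    args=TrainingArguments(output_dir="finetuned", num_train_epochs=1,
                           per_device_train_batch_size=1),
    train_dataset=train_ds,
    data_collator=DataCollatorForLanguageModeling(tokenizer=tokenizer,
                                                  mlm=False),
).train()
```

Because the logs store raw text, the fine-tuned copy sees the same token stream at training time that the original model produced at inference time, which is exactly why the special symbols are left in.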
I recently listened to your latest videos, and now this one was recommended by Perplexity for my specific use case ;-) Coincidence?
Haha, looks like LLMs are coming full circle here :D
Thanks for the video, very helpful for understanding the different kinds of fine-tuning. BTW, which kind of fine-tuning does Hugging Face belong to?
Glad that it was helpful! HF itself has different tools for fine-tuning. Similarly, the LitGPT library I help develop supports full fine-tuning, LoRA, QLoRA, etc.
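To make the distinction concrete, here is a hedged sketch of a QLoRA-style setup with Hugging Face transformers + peft + bitsandbytes; the model name and hyperparameters are placeholders, not the video's or LitGPT's exact recipe:

```python
# QLoRA in a nutshell: quantize the frozen base weights to 4 bit, then train
# small full-precision LoRA adapters on top. Model name is a placeholder.
import torch
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # 4-bit base weights
    bnb_4bit_quant_type="nf4",              # NormalFloat4, as in the QLoRA paper
    bnb_4bit_compute_dtype=torch.bfloat16,  # matmuls still run in bf16
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf", quantization_config=bnb_config)
model = prepare_model_for_kbit_training(model)  # grad checkpointing, casts

# Only the LoRA adapter weights train. Drop the quantization config above
# and this is plain LoRA; train all weights instead and it's full fine-tuning.
model = get_peft_model(model, LoraConfig(
    r=8, lora_alpha=16, target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM"))
model.print_trainable_parameters()
```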
Thanks! Great overview
Awesome, thank you!
Please show us how to create a dataset for fine-tuning, not just how to download one.
I have some resources for that here: github.com/rasbt/LLMs-from-scratch/tree/main/ch07/05_dataset-generation
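For a taste of what creating (rather than downloading) a dataset looks like: a minimal sketch of hand-writing instruction data in the common Alpaca-style JSON format. The two entries are made up for illustration and are not taken from the linked repo:

```python
# Alpaca-style instruction records: instruction + optional input + output.
# Entries below are illustrative placeholders.
import json

dataset = [
    {
        "instruction": "Rewrite the sentence in passive voice.",
        "input": "The team shipped the release on Friday.",
        "output": "The release was shipped by the team on Friday.",
    },
    {
        "instruction": "Name the capital of France.",
        "input": "",
        "output": "The capital of France is Paris.",
    },
]

with open("instruction_data.json", "w") as f:
    json.dump(dataset, f, indent=2)
```

From there, each record gets rendered into a prompt template and tokenized for training; the linked dataset-generation resources go further, e.g. producing such records with an LLM instead of by hand.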
Nice video.
Saw you here on YT! Hope you remember me!
Thank you :)
Cool
I really wish people would stop putting their X link and start sharing something like Mastodon or Threads instead; as a free user, X is where you go to feel like a second-class citizen.
I hear you. On that note, I do have Threads and Mastodon accounts 😅. Just not using them much; somehow all the AI folks are still on X :(. I think the days of this type of social media are numbered ...
@SebastianRaschka Haha, I get it. I feel the "all the AI folks are still on X" topic is a bit of a "the buck starts with you" problem; if more people start sharing elsewhere, it will eventually move there, I guess.
Amazing, thanks!