You gotta tell us how we can fine-tune an open source model on a non-public PDF. Everybody on UA-cam uses hugging face toy datasets for fine-tuning that is basically useless.
Those who introduce fine-tuning on UA-cam they definitely know they are just talking nonsense.Even OpenAI and Claude they don't know how to leverage RLHF to solve the real case.for example even you used RLHF for stock trading you definitely will lose your pants!
I use script to make qna using open ai api all my pdf data into csv format, then convert my csv as jsonl for fine tune then make another version into json to upload it to vector database rag purpose, then fine tune gpt4o mini base model, still not give me good result 😅
There are hundreds of new public datasets released on Hugging Face per week and they all work for finetuning. Also if you want it for your specific usecase, everything requires some sort of custom data - e.g. RAG needs a database.
Nice work Jason. I appreciate the pace. Keep up the good work!
Fantastic job of teaching. Fast paced but great.
Thanks for the video was waiting on this one😛
Hello Jason, could you kindly add the code so that we can follow along.
You gotta tell us how we can fine-tune an open source model on a non-public PDF. Everybody on UA-cam uses hugging face toy datasets for fine-tuning that is basically useless.
Those who introduce fine-tuning on UA-cam they definitely know they are just talking nonsense.Even OpenAI and Claude they don't know how to leverage RLHF to solve the real case.for example even you used RLHF for stock trading you definitely will lose your pants!
Agree. Creating a dataset with sensible data in the cloud makes everything obsolete. But anyhow good explanation!
I use script to make qna using open ai api all my pdf data into csv format, then convert my csv as jsonl for fine tune then make another version into json to upload it to vector database rag purpose, then fine tune gpt4o mini base model, still not give me good result 😅
There are hundreds of new public datasets released on Hugging Face per week and they all work for finetuning. Also if you want it for your specific usecase, everything requires some sort of custom data - e.g. RAG needs a database.
@@yijin241Finetuning works great, if you know what you are doing.
wow
The problem with these misleading videos is that they provide partial information
Why are you shilling every video? I can no longer trust the tools you recommend