Introduction to Supervised and Reinforcement Finetuning - Sachin Dharashivkar

  • Published 20 Aug 2023
  • Sachin Dharashivkar will speak about LLM Finetuning and RLHF
    Sachin is a founder who is exploring use cases of AI agents. He enjoys training Reinforcement Learning agents and exploring novel applications of Large Language Models.
    Topics: the three steps of training ChatGPT-style models, how to perform supervised finetuning, why Reinforcement Learning from Human Feedback is important, and how to train reward and policy models.
    More at has.gy/rEcp
  • Science & Technology
