Fine-Tune Llama3 using Synthetic Data

Поділитися
Вставка
  • Опубліковано 5 тра 2024
  • how to fine tune Llama-3 model in Google Colab in this tutorial using synthetically generated data. In this video chris not only shows you how to fine tune the model but also shows you his lessons learned, such as diversity of data, why the system prompt makes a difference, generalization and fine tuning to a particular format.
    You will not only learn how to fine tune a model but also how to generate synthetic data, and learn what works and what doesn't.
    Google Colab
    colab.research.google.com/dri...
    GIthub for Dataset:
    github.com/chrishayuk/chuk-da...
    HuggingFace for Model
    huggingface.co/chrishayuk/lla...
    HuggingFace for DataSet
    huggingface.co/datasets/chris...
  • Наука та технологія

КОМЕНТАРІ • 14

  • @kenchang3456
    @kenchang3456 Місяць тому +1

    Wow, how fortunate am I?! I was looking for an example of fine-tuning to change the behavior of a model to act like a counter clerk at an auto parts store and I think I have found it and it's synthetic too. THANK YOU VERY MUCH!

  • @JonathanDeCollibus
    @JonathanDeCollibus Місяць тому +1

    chris, fantastic video. i've been looking for this exact answer.

    • @chrishayuk
      @chrishayuk  Місяць тому +1

      Super glad this was useful, this vid is a little more raw than normal as my purposely pointing out the errors in the dataset rather than fixing them, but I think it’s useful to understand

  • @suryat8848
    @suryat8848 Місяць тому

    clean, and crisp!
    brilliant video chris :)
    PS: Can you please update the tokenizer part of the code, it's a bit confusing, thanks!

  • @JonathanDeCollibus
    @JonathanDeCollibus Місяць тому +1

    subscribed.

  • @tomekatomek5694
    @tomekatomek5694 9 днів тому +1

    3:00 - Show how to do it on a local machine please

    • @chrishayuk
      @chrishayuk  9 днів тому

      yes, i need to do that video. i've been distracted by building a faster pipeline for the finetune

  • @AT-mx3hn
    @AT-mx3hn Місяць тому +1

    I like to guess accents. What is your accent?! There is a obvious primary Scottish element but there are also strong hints of American and weaker hints of possibly English and/or Australian... did you move around a lot or are you just trying to sound more American so UA-cam can understand you better?!

    • @chrishayuk
      @chrishayuk  Місяць тому +1

      i'm like a fine wine with lots of elements of different accents. i'm a scot that lives in england that used to live in ireland, spent a lot of time in india, us and travels a lot.

    • @AT-mx3hn
      @AT-mx3hn Місяць тому

      Amazing, thanks for taking the time to answer!

  • @Forwardknowlege
    @Forwardknowlege Місяць тому

    can I fine tune Llama-3 by meta as well ? example >>> meta-llama/Meta-Llama-3-8B-Instruct

    • @chrishayuk
      @chrishayuk  Місяць тому

      ummm, that is llama-3, is there something specific you're trying to do?

    • @felipeekeziarosa4270
      @felipeekeziarosa4270 6 днів тому

      @@chrishayuk non-english legislation would be interesting