Supercharge Your Coding Skills: Fine-Tuning CODE LLMs

  • Published 17 Nov 2024

COMMENTS • 18

  • @rachelshalom3249 1 year ago +2

    Can you add a video on fine-tuning StarCoder for autocompletion, not for instructions like they show in their repo?

  • @CallisterPark 1 year ago +7

    @code_your_own_AI - Hi - are you going to finish the colab and show the results to prove it works? And provide a link to it? Thank you!

  • @attilalukacs1981 1 year ago

    Amazing video, thank you for sharing all of this knowledge with us! These fine-tuned generative code systems could be the next level of programming.

  • @yuvaraj600 1 year ago +1

    Your explanation is awesome!! Thanks for sharing these videos

    • @code4AI 1 year ago +1

      You are welcome.

  • @davidbarton3361 1 year ago

    Thanks for the video, really interesting. I am fine-tuning code LLMs and this was helpful. As an aside, I find it so strange that token prediction is able to do this, since the LLM needs to be able to "plan" that a function will be needed, declare it, and then use it in exactly the right way.

  • @ruchaapte5124 1 year ago +1

    @code_your_own_AI - Thank you so much for this great video! I am in the middle of preparing the dataset for fine-tuning and wanted to refer to the dataset you had to prepare and preprocess for your PyTorch code assistant. It would really help me to know whether there is any character limit that I should be aware of while creating the dataset.
    I also wanted to know the format in which 'Instruction', 'Input', and 'Output' should be presented: JSONL, .txt, or .csv? Also, when using StarCoder, is there a character or token limit that I need to follow for each example?

    • @finnegon16 1 year ago

      Did you get a reply? I'm looking to do the same.

  • @agustinalaurenti6159 1 year ago

    Thank you for the video! Can this be used to fine-tune for other languages such as Lisp or Haskell? Or should I pre-train a new model from scratch?

  • @GesusOfYou 1 year ago

    Wow :D What a cliffhanger at the end ^^

  • @tignanshumishra591 1 year ago

    Sir, I just wanted to know: can I fine-tune with predefined code and its prompts for better code generation? If yes, how do I proceed with that?

  • @RobertJohnson-xg5kh 1 year ago +2

    I don't see any links in the description....

  • @muntahabintealam7370 1 year ago

    Can I do code refactoring tasks with StarCoder? How can I prepare my dataset for that task?

  • @rohithkyla7595 1 year ago

    Nice video! Is the outlier in part 3?

  • @haralc 1 year ago

    Will this work on M2 CPU?

    • @code4AI 1 year ago

      I see more and more solutions for Apple silicon on Reddit and Hacker News, but I have no empirical data on stability or performance.