Windows Fine Tuning Combined Streams

  • Published Oct 2, 2024

COMMENTS • 6

  • @Qenton
    @Qenton 2 months ago +1

    Thanks for this, although I had one problem that I finally found a solution to. Currently you need to specify pytorch=2.2.0, otherwise you get an error because the newer torch libraries are not compatible. (github.com/unslothai/unsloth/issues/73#issuecomment-1998826713)

    • @robkerrai
      @robkerrai  A month ago

      Thanks for contributing that tip!
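
For anyone following the pytorch=2.2.0 tip in the comment above, here is a minimal sketch of how one might confirm which torch version an environment actually ended up with. The helper names (`base_version`, `matches_pin`, `installed_torch_version`) are hypothetical and only for illustration; the sketch assumes the package is installed under the distribution name `torch` and that a local build suffix such as `+cu121` should be ignored when comparing.

```python
from importlib import metadata

REQUIRED_TORCH = "2.2.0"  # the pin reported in the comment above


def base_version(version: str) -> str:
    """Drop a local build suffix such as '+cu121' from a version string."""
    return version.split("+", 1)[0]


def matches_pin(installed: str, required: str = REQUIRED_TORCH) -> bool:
    """Return True when the installed version equals the pinned one."""
    return base_version(installed) == required


def installed_torch_version():
    """Return the installed torch version string, or None if torch is absent."""
    try:
        return metadata.version("torch")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_torch_version()
    if found is None:
        print("torch is not installed in this environment")
    elif matches_pin(found):
        print(f"torch {found} matches the pin {REQUIRED_TORCH}")
    else:
        print(f"torch {found} differs from the pin {REQUIRED_TORCH}")
```

Running this inside the fine-tuning environment before starting the script shows quickly whether the pin took effect.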

  • @tlubben972
    @tlubben972 4 months ago +1

    This really helped. I spent all weekend looking for a good video that showed how to do this and couldn't find one. I would recommend not cutting out scenes, though: people are following along while they watch, and in the time it takes to glance left and right you have already done another step that I didn't even notice. I had to watch it about 4 times, super slow, to catch all the steps. Also, I don't know Linux, so skipping the part where the code/finetune folders are created made me think I had missed something again, and that had me turned around for a while. Finally, when people pause the video, the red progress bar, the play and pause buttons, the subtitles, etc. cover the bottom of the screen, so we can't see anything you do quickly down there. Thanks again.

    • @robkerrai
      @robkerrai  4 months ago +1

      Thanks for the feedback! Sorry you got tripped up in some places, I'll work harder to make sure the flow is clean and doesn't get interrupted in the future.

  • @karthikrajanv
    @karthikrajanv 3 months ago

    Hi Rob, thank you very much for the tutorial! I followed every single one of your steps. I am stuck on the inference part, where I get the following KeyError.
    line: outputs = model.generate(input_ids = inputs, max_new_tokens = 64, use_cache = True)
    KeyError: 'Cache only has 0 layers, attempted to access layer with index 0'

    • @robkerrai
      @robkerrai  3 months ago

      Hmm...haven't seen that error. You might try going through the process in a Google Colab and see if the script you're using is working.
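
One way to narrow down the KeyError reported above is to check whether it is tied to the cache path at all. The sketch below is a diagnostic idea, not a confirmed fix: it assumes the model object exposes the same `generate(input_ids=..., max_new_tokens=..., use_cache=...)` call quoted in the comment, and the wrapper name `generate_with_cache_fallback` is hypothetical. Whether disabling the cache avoids the error in this particular setup is untested here.

```python
def generate_with_cache_fallback(model, inputs, max_new_tokens=64):
    """Call model.generate with the cache on, retrying with it off on KeyError.

    Assumption: `model` accepts the keyword arguments quoted in the comment.
    The retry only shows whether the failure is specific to use_cache=True.
    """
    try:
        return model.generate(
            input_ids=inputs, max_new_tokens=max_new_tokens, use_cache=True
        )
    except KeyError as err:
        # Report the original failure so it is not silently hidden.
        print(f"generate failed with use_cache=True: {err!r}")
        return model.generate(
            input_ids=inputs, max_new_tokens=max_new_tokens, use_cache=False
        )
```

If the second call succeeds, the problem is likely in how the installed library versions handle the cache, which points back toward checking package versions; if it fails the same way, the cause lies elsewhere.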