Get your own custom Phi-3-mini for your use cases

Поділитися
Вставка
  • Опубліковано 7 чер 2024
  • Here is how to get started with training your own version of Phi-3-mini on your own dataset. We will use Unsloth to train our own version on custom dataset.
    #llm #finetuning #phi3
    🦾 Discord: / discord
    ☕ Buy me a Coffee: ko-fi.com/promptengineering
    |🔴 Patreon: / promptengineering
    💼Consulting: calendly.com/engineerprompt/c...
    📧 Business Contact: engineerprompt@gmail.com
    Become Member: tinyurl.com/y5h28s6h
    💻 Pre-configured localGPT VM: bit.ly/localGPT (use Code: PromptEngineering for 50% off).
    Signup for Advanced RAG:
    tally.so/r/3y9bb0
    LINKS:
    Notebook: tinyurl.com/54pk9v52
    Llama-3 fine-tuning: • LLAMA-3 🦙: EASIET WAY ...
    unsloth.ai/
    TIMESTAMPS:
    [00:00] Phi-3-mini with UnSloth
    [01:05] Setting Up Your Environment for Fine Tuning
    [03:56] Understanding and Preparing Your Data
    [07:25] Setting Up the SFTTrainer and Training Process
    [11:54] Inference: Generating Responses with your trained model
    [13:41] Saving and Loading Models for Future Use
    All Interesting Videos:
    Everything LangChain: • LangChain
    Everything LLM: • Large Language Models
    Everything Midjourney: • MidJourney Tutorials
    AI Image Generation: • AI Image Generation Tu...
  • Наука та технологія

КОМЕНТАРІ • 22

  • @jpmottin
    @jpmottin Місяць тому +1

    Simple, straight to the point

  • @metanulski
    @metanulski Місяць тому +14

    Hi. I still do not unterstand how I can create my own datasets. Can you make a video about that? Its in the title "on you own dataset" ;-)

    • @sherpya
      @sherpya Місяць тому +1

      if you don't know you may not need it 😂, a dataset is mainly a series of question / answers pairs

    • @onlineinformation5320
      @onlineinformation5320 Місяць тому

      @@sherpya I have a csv of question answer pairs how should I upload it plzzz answer

    • @marilynlucas5128
      @marilynlucas5128 Місяць тому

      Use DSPy. Don't worry about creating your own datasets. It's a rabbit hole you'll never get out of.

    • @sherpya
      @sherpya Місяць тому

      @@onlineinformation5320 the are some video tutorials, you mainly format them as json lines or whatever, since you neee to read them tipically in a notebook

    • @d4138
      @d4138 Місяць тому

      Why do you suggest DSPy? Do you have experience with it, @marilynlucas5128

  • @hadebeh2588
    @hadebeh2588 Місяць тому +2

    In the code, the EOS_TOKEN is added „…, output) + EOS_TOKEN“, right? Is that all that is necessary or do I have to add the EOS_TOKEN into the dataset? I trained the model using the code you used but with my own data and my result-model never stops talking. 😢

  • @SukeshShetty1
    @SukeshShetty1 Місяць тому

    How do I create a docker image and run it as a service? Can it support concurrent requests?

  • @VerdonTrigance
    @VerdonTrigance Місяць тому +2

    Thank you man for your videos. But my most curious question is how to prepare dataset from my own data? I have a book and wanna talk with book. Obviously RAG cannot fit all the content of the book even with 128k context length. So how to train my model on that book?

    • @himansrivastava
      @himansrivastava Місяць тому

      I have the same doubt. A video on this would be very helpful.

    • @CharlesOkwuagwu
      @CharlesOkwuagwu Місяць тому

      I second that.

    • @marilynlucas5128
      @marilynlucas5128 Місяць тому

      huh!? Rag is your best bet! If you want structured outputs to enable easy and efficient state transitions, use DSPy.

    • @engineerprompt
      @engineerprompt  Місяць тому +3

      Will see what I can do here.

  • @khaledbouzaiene3959
    @khaledbouzaiene3959 5 днів тому

    but if my data inculte like dailogue how can be structured where there is one instruction for each response

  • @huuhuynguyen3025
    @huuhuynguyen3025 29 днів тому

    Is it possible to finetune on my language?

  • @marilynlucas5128
    @marilynlucas5128 Місяць тому +1

    Can HF auto-train be used here? Also, why does every keep insisting on fine-tuning when DSPy is already out? You can obtain structured outputs with DSPy without the need to fine-tune. You haven't focused much on DSPy, I think it's very important you do this. It's clearly the future of AI

    • @engineerprompt
      @engineerprompt  Місяць тому

      You can use auto-train. UnSloth gives you more efficient finetuning (memory). DSPy is on my list. Need to get a better understanding of it and will start creating content on it.

  • @superfliping
    @superfliping Місяць тому

    Whats next, show your skills?
    1. CodeCraft Duel: Super Agent Showdown
    2. Pixel Pioneers: Super Agent AI Clash
    3. Digital Duel: LLM Super Agents Battle
    4. Byte Battle Royale: Dueling LLM Agents
    5. AI Code Clash: Super Agent Showdown
    6. CodeCraft Combat: Super Agent Edition
    7. Digital Duel: Super Agent AI Battle
    8. Pixel Pioneers: LLM Super Agent Showdown
    9. Byte Battle Royale: Super Agent AI Combat
    10. AI Code Clash: Dueling Super Agents Edition

  • @onlineinformation5320
    @onlineinformation5320 Місяць тому +1

    I have a csv of question answer pairs how should I upload it plzzz answer

    • @engineerprompt
      @engineerprompt  Місяць тому

      You can read that csv file and convert it into a dictionary and the same code provided in the notebook.