What is Prompt Tuning?

Поділитися
Вставка
  • Опубліковано 15 чер 2023
  • Explore watsonx → ibm.biz/BdvxRp
    Prompt tuning is an efficient, low-cost way of adapting an AI foundation model to new downstream tasks without retraining the model and updating its weights. In this video, Martin Keen discusses three options for tailoring a pre-trained LLM for specialization, including: fine tuning, prompt engineering, and prompt tuning ... and contemplates a future career as a prompt engineer.
    Get started for free on IBM Cloud → ibm.biz/sign-up-now
    Subscribe to see more videos like this in the future → ibm.biz/subscribe-now
    #ai #watsonx #llm

КОМЕНТАРІ • 52

  • @dominikzmudziak8340
    @dominikzmudziak8340 Місяць тому +12

    Im stunned how Martin is able to write backwards on this board so efficiently

    • @pradachan
      @pradachan Місяць тому +8

      they just mirror the whole recording

  • @Gordin508
    @Gordin508 11 місяців тому +30

    Really like these summarization videos on this channel. While they do not go into depth, I appreciate the overarching concepts being outlined and put into context in a clean way without throwing overly specific stuff in the mix.

    • @johndong4754
      @johndong4754 6 місяців тому

      Which channels would you recommend that go into more depth?

  • @dharamindia563
    @dharamindia563 7 місяців тому

    Excellent broad explanation of complex AI topics. One can then deep dive once a basic understanding is achieved ! Thank you

  • @SCP-GPT
    @SCP-GPT 9 місяців тому +3

    You should make a guide on FlowGPT / Poe that delves into operators, delimiters, markdown, formatting, and syntax. I've been experimenting on these sites for a while, and the things they can do with prompts are mind-blowing.

  • @uniqueavi91
    @uniqueavi91 7 місяців тому

    crisp and informative

  • @maxjesch
    @maxjesch 10 місяців тому +12

    So how do I get to those "soft prompts"? Do you have to use prelabeled examples for that?

  • @XavierPerales-zm4xx
    @XavierPerales-zm4xx 4 місяці тому

    Excellent job explaining key AI terms!

  • @johndevan3505
    @johndevan3505 5 місяців тому

    A lot to unpack here. Great job explaining.
    I have one question about the difference between incontext learning and prompt tuning with hard prompts. Are they synonymous?

  • @datagovernor
    @datagovernor 8 місяців тому +4

    More important question, what type of smart/whiteboard are you using?? I love it!

  • @WeiweiCheng
    @WeiweiCheng 5 місяців тому +4

    Awesome content. Thanks for uploading.
    It's great that the video calls out the differences between soft prompting and hard prompting. While soft prompts offer more opportunities for performance tuning, practitioners often face the following issues:
    - Choosing between hard prompting with a more advanced, but closed, LLM versus soft prompting with an open-sourced LLM that is typically inferior in performance.
    - Soft prompting is model dependent, and hard prompting is less so.

  • @mikegioia9289
    @mikegioia9289 7 місяців тому

    How do you discover the correct soft prompts?

  • @scifithoughts3611
    @scifithoughts3611 4 місяці тому

    Could you explain labeling done in fine tuning and prompt tuning?

  • @Asgardinho
    @Asgardinho 6 місяців тому +4

    how do you get the AI to generate that tunable soft prompt?

  • @apoorvvallabh2976
    @apoorvvallabh2976 7 місяців тому +1

    What data set for supervised learning is used in prompt tuning

  • @RobertoNascimento-kw6gy
    @RobertoNascimento-kw6gy Місяць тому

    Excelente video, bom trabalho

  • @azadehesmaeili4402
    @azadehesmaeili4402 5 місяців тому

    Could you please outline the advantages and disadvantages of fine-tuning versus prompting in the context of large language models?

  • @neail5466
    @neail5466 11 місяців тому +2

    Could you please explain a little detail about the strings of numbers how those are indexed? Are those some sort of abstraction that we fully understand!
    Very informative lecture is this one... Probably everyone should have a little expertise in prompt engineering skill in near future.

    • @Chris-se3nc
      @Chris-se3nc 10 місяців тому

      There are other embedding models that can take strings of concepts and transform them into embedding vectors (string of numbers). You can store those in a number of vector databases.

  • @8eck
    @8eck 9 місяців тому

    This that soft prompt is basically a trainable parameters, which also undergoing backpropagation and its weights are updated? Just like LoRA method, where you attach new trainable parameters to the model and train only those new parameters.

  • @yt-sh
    @yt-sh 11 місяців тому +1

    funny & informative 👏👏👏

  • @user-mn6bb6gi6v
    @user-mn6bb6gi6v 11 місяців тому +5

    Hi, nice talk by the way, but what about some examples of soft turning, i understand is human unreadable, but how exactly you achieve that ? by writing some code ? extra tools ? plugins ? thanks a lot for your reply :)

    • @sheepcraft7555
      @sheepcraft7555 9 місяців тому

      These are learnable parameters added on top the base language models. This is called soft tuning one of the example is prefix tuning. These parameters are learned.

  • @marc-oliviergiguere3290
    @marc-oliviergiguere3290 6 місяців тому +1

    Very concise and information, but tell me, what technology do you use to write backwards so fast? Do you flip the board in post-production?

    • @IBMTechnology
      @IBMTechnology  6 місяців тому

      Yes, see ibm.biz/write-backwards for details

  • @TimProvencio
    @TimProvencio 7 місяців тому

    Does anyone know how they do these videos where it appears that they are writing on the screen. That is so neat!

  • @user-mo7yq9ks1g
    @user-mo7yq9ks1g 6 місяців тому

    What is unfancy design prompt?

  • @mohslimani5716
    @mohslimani5716 10 місяців тому

    Thanks for the explanation, but still how could someone succeed in prompt engineering practically

  • @manojr4598
    @manojr4598 11 місяців тому

    We are trying to create a chatbot using OpenAI API and the response should be limited to the specific topic and it should not respond to the user queries which are not related to the topic. What is the best way to achieve this ? Prompt engineering or prompt tuning ?

  • @itdataandprocessanalysis3202
    @itdataandprocessanalysis3202 11 місяців тому +2

    A joke by ChatGPT:
    Why did the Large Language Model (LLM) turn down a job as a DJ?
    Because it thought "Prompt Tuning" meant it would have to constantly change the music!

  • @arpitqw1
    @arpitqw1 5 місяців тому

    not fully understood except- prompt tuning-prompt engineering- hard tuning-soft tuning. :P

  • @rajucmita
    @rajucmita 6 місяців тому

    As a newbee how come I be pro in propmt engineering

  • @darkashes9953
    @darkashes9953 11 місяців тому

    IBM could go for the plunge and make a Quantum computer with 10 million Quantum computer chips with 1000 Qubits and optical circuits instead of just one chip.

  • @badlaamaurukehu
    @badlaamaurukehu 5 місяців тому +1

    Nomenclature is it's own problem.

  • @rongarza9488
    @rongarza9488 3 місяці тому +1

    I learned Python in 2 months, great language. Then, I learned the SQLs that Python plays well with. Then, it hit me: AI is doing most of this work! So what is there for me and you to do? "My career may be over before it's begun". Yes, indeed UNLESS we can start using Python for regular business processing, like Accounts Receivable/Payable, Inventory Management, Order Processing, etc. In other words, we can't all be doing AI, especially when it, itself, is doing AI, cheaper, faster, and better.

  • @avinashpradhan5030
    @avinashpradhan5030 2 місяці тому

    🙂

  • @kaiskermani3724
    @kaiskermani3724 Місяць тому

    "A string of numbers is worth a thousand words" tf does that even mean?

  • @NK-ju6ns
    @NK-ju6ns 3 місяці тому

    Soft peompring is confusing

  • @samgoodwin89
    @samgoodwin89 6 місяців тому +1

    Is he writing backwards

  • @DK-ox7ze
    @DK-ox7ze 6 місяців тому +4

    This is too abstract. Some concrete examples would have helped.

  • @generichuman_
    @generichuman_ 6 місяців тому +12

    Wow, you managed to make an 8 minute video on prompt tuning without actually talking about what it is or how one would even begin to implement it. All I gleaned from this is that it has something to do with embeddings... Do better IBM...

    • @scifithoughts3611
      @scifithoughts3611 4 місяці тому +3

      I agree it’s a little obscure. I gave this a second watch through because your comment made me realize that I too wasn’t clear.
      Here is what I’ve noted:
      First step:
      Model creation: A model is created by training it from tons of data (very expensive to do)
      Because a model alone doesn’t work consistently at this point (racist, errors, hallucinations, toxic,…) it needs more work to be ready for the public. To make it ready one of the three strategies are used: fine tuning, prompt engineering, or prompt tuning with soft prompts. (All three could be used as well, I’ve read papers about such cases.)
      Fine tuning :
      Give you have a model, now you create examples about the domain the LLM will represent. The examples are labeled to help the model know what’s is going on. This strategy is labor intensive. (Labeling is another whole area to read up on.)
      Prompt engineering:
      Humans design prompts in a human language (explain to the model how to behave). Example: when I tell you a word in English, you respond with the word in French.
      Prompt tuning using soft prompts:
      Soft prompts are created by the AI using fine tuning data. These prompts are encoded (not human readable) into a vector.
      The above is the first six minutes of the video. Next the lecturer show these three applications by adding them to the box picture. This is confusing because it seems like he is applying all three strategies but then concludes that prompt tuning gets the best results. So I guess he is saying use prompt tuning.
      Since AiML is a new field, I think people will be applying many different strategies in order to get their models to work properly. And this is just scratching the surface. Every few months, people will come up with other strategies that improve the situation. 10 years from now a bunch of these strategies will be discarded and there will be other new ones. The field of ML is defining their design patterns. Pattern books will be written as solutions mature. Prompt engineering and prompt tuning are the two patterns he talks about.
      I hope that helps. Thinking this through has certainly helped me so thanks for the prompt. 😊

    • @chavruta2000
      @chavruta2000 3 місяці тому

      yes. this is incredibly generic and communicates very little considering this is supposed to be from a communication theory expert.

    • @RajatKumar-oy9mw
      @RajatKumar-oy9mw 2 місяці тому

      Totally agrees..

  • @bongimusprime7981
    @bongimusprime7981 11 місяців тому +1

    ChatGPT is not an LLM lol