How to Run Google's New FREE AI Model on Your PC - But is it Any Good?

Поділитися
Вставка
  • Опубліковано 22 лют 2024
  • Google has released Gemma, a family of lightweight open LLM models, built from the same research and technology used to create its Gemini models. You can run them in the cloud, but you can also run them on your PC using LMStudio or llama.cpp. In this video I test them out for speed, accuracy, and capabilities.
    ---
    LMStudio video: • Run Your Own ChatGPT-l...
    llama.cpp video: • Run a ChatGPT-like AI ...
    Twitter: / garyexplains
    #garyexplains
  • Наука та технологія

КОМЕНТАРІ • 34

  • @AlwaysCensored-xp1be
    @AlwaysCensored-xp1be 3 місяці тому +2

    Been trying LLMs on my Raspberry Pi5 8GB, not fast but they work. Have not tried these LLMs. Will add them to my collection. Ollama makes it easy to try them.

  • @issiewizzie
    @issiewizzie 3 місяці тому +4

    At some point, we will be able to train this models on our own data I hope

    • @levesquejean-francois3287
      @levesquejean-francois3287 3 місяці тому

      Could it really get significantly better if it has so few parameters compared to Chatgpt?

    • @onlyeyeno
      @onlyeyeno 3 місяці тому +1

      The "point" of training it on Your own data is ((I suspect)) that You want it to be able to answer questions that is "specifically about Your own data" !!! Which a "general LLM" will not be able to do.
      Best regards

  • @levesquejean-francois3287
    @levesquejean-francois3287 3 місяці тому

    Could it really get significantly better owver time if it has so few parameters compared to Chatgpt?

  • @paulbarnett227
    @paulbarnett227 3 місяці тому

    In you previous AI video - I was surprised by The Princess Bride link but the model went on to explain its reasoning which was sound so there you go.
    As for these 'on device' models, they are going to be limited because they are not referring to the cloud for information and context. They could still be useful though in specialist applications.

    • @GaryExplains
      @GaryExplains  3 місяці тому

      Even the cloud LLMs are offline in that they don't refer to the internet for more information. The exception is Bing which Microsoft built to act like a half way between an LLM and a search engine. Try asking ChatGPT about something that happened last week and it can't tell you.

  • @JSroid
    @JSroid 3 місяці тому

    Maybe you covered this is another video, but are there security risks with running models shared on the internet?

    • @GaryExplains
      @GaryExplains  3 місяці тому +2

      None that I am aware of or can think of. The LLM model doesn't have access to your system. The underlying runtime code is based on llama.cpp which is open source.

  • @MW-mn1el
    @MW-mn1el 3 місяці тому +1

    Did try it, Gemma is not good when compare to llama 2, don't understrand simple phase, change subject/question and context. Side by side, Llama2 is clear better, 7B vs 7B.

  • @zoenagy9458
    @zoenagy9458 3 місяці тому

    is it censured? why is it so slow?

  • @alwin2024
    @alwin2024 3 місяці тому

  • @eddrake5290
    @eddrake5290 3 місяці тому

    Test algebra, calculus, and code generation on these different models.

    • @GaryExplains
      @GaryExplains  3 місяці тому +1

      Little point. The big LLMs struggle with maths. These little ones have no chance.

  • @scrollop
    @scrollop 3 місяці тому +3

    I've seen the model in action on youtube and it looks... awful

  • @MW-mn1el
    @MW-mn1el 3 місяці тому

    And it insists Empire State Building is in Tokyo and not NYC. 😂

  • @Ta2dwitetrash
    @Ta2dwitetrash 3 місяці тому

    You know why its free?
    You should find out.

  • @mikldude9376
    @mikldude9376 3 місяці тому

    To be kind , they are still pretty crap aren't they 😄.
    Hopefully they will get better.

  • @sslaia
    @sslaia 3 місяці тому

    You are too hard on Gemma with the spelling. There are a lot of people I know who do the same spelling mistake. I swear.

  • @DingoAteMeBaby
    @DingoAteMeBaby 3 місяці тому +8

    They giving it away because its basically unusable