UPDATED: CPU Vicuna | POWERFUL Local ChatGPT 🤯 Mindblowing Unrestricted GPT-4

  • Published 29 Sep 2024

COMMENTS • 404

  • @lance3301
    @lance3301 1 year ago +39

    Vicuna is not unrestricted or uncensored...another clickbait title making false claims. Alpaca is uncensored, but slow as a dog to run. Hopefully there will be an "unrestricted" model released soon that can be run locally and isn't horribly slow. Haven't found it yet though.

    • @goo_tx
      @goo_tx 1 year ago +9

      I was looking for this in the comments. I had a sneaking suspicion. I was asking myself why I'm paying for gpt-4 with 25 messages every 3 hours when this dude is rockin it for free and unrestricted.

    • @asory
      @asory 1 year ago +1

      Koala model is vicuna but uncensored

    • @fl4mzy709
      @fl4mzy709 1 year ago

      @goo_tx at least it’s free

    • @SagiQuarion
      @SagiQuarion 1 year ago

      @goo_tx ditto. I also had a gut feeling this was hype. Even so, I’m still gonna go through the install process and play around for education.

    • @Upgrayedddd
      @Upgrayedddd 1 year ago +1

      Thanks for this comment. You saved me a lot of time. Idk wtf I'm doing tbh and many dislike the model I'm using but I don't have the same problems they do. It doesn't code but if you're looking for a fast uncensored model to ask questions, chat, roleplay, etc the Alpaca 7b full and the only other 7b full download for FreedomGPT open source has answered everything I've thrown at it. It generates answers when chatgpt gives a lecture. Sometimes it's impressive and other times it has the same problems I see here. It gets confused and talks to itself, gets stuck in a loop, and is lacking in roleplay. It will however assume the role of anything you tell it to, crack hit-or-miss jokes no matter how offensive, and provide data for any controversial topic which has been surprisingly helpful. It's not ideal but hopefully will be improved if it catches traction.
      Has anyone tried Pygmalion?

  • @BernhardRutzen
    @BernhardRutzen 1 year ago +3

    The best tutorial ever, what a good quality video, and perfectly explained, my respect!

    • @AIEinstein
      @AIEinstein 1 year ago

      So true, I enjoyed watching it

  • @nathantheodorus
    @nathantheodorus 1 year ago +2

    Hi, does anyone know any way it can run using AMD GPU ? I have RX 6800 with 16gb VRAM, would like to use that instead of my CPU.

    • @nodewizard
      @nodewizard 1 year ago

      It only supports cuda-based GPUs, which are Nvidia (team green) GPUs. If you're in the AI generative art or llama business, ditch AMD and get the Nvidia GPU. There will never be good support for AMD or Intel GPUs when it comes to AI.

  • @darkowl9
    @darkowl9 1 year ago

    Actually, seems my last idea was wrong; I could fix the warnings and get it to start, but it would crash out as soon as it interacted with the agent. It seems since this script was made, something's broken in GPTQ-for-LLaMa. It keeps trying to find VS' `cl.exe` to compile C from source? Something really messy going on, like maybe Python/pip has cached a bad package, or some commits have been made that cause unhappiness. Anyway, yeah, it's all very fragile and hacky. Maybe I should come back in a month or two when people way smarter than me have made it better. Funny, I used to think npm in the JS world was not a great package manager... I was clearly wrong, because I'd not used Python at the time. Heh.

  • @nexus-roleplay-leeric
    @nexus-roleplay-leeric 1 year ago

    Your website is blocked by Kaspersky and also Windows doesn't allow me to download anything from it

    • @TroubleChute
      @TroubleChute 1 year ago

      Yeah, for some reason my domain got outright flagged as malware. I've sent emails to all the AV companies that have false positives and it's been removed from most of them.
      No clue why. I assume the previous owner of the website...
      Kaspersky said they have removed the listing but "it could take up to 72 hours"... Fun

  • @alexandru2882
    @alexandru2882 1 year ago

    Thank you for this wonderful video series. Unfortunately, the script from this video has got an issue: the Vicuna model doesn't download. It throws this error:
    Test-Path : Cannot bind argument to parameter 'Path' because it is null.
    At D:\Ai Vicuna\oobabooga-windows\vicuna.ps1:111 char:19
    + ... (Test-Path $(Get-Command aria2c -ErrorAction SilentlyContinue).Sourc ...
    + ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    + CategoryInfo : InvalidData: (:) [Test-Path], ParameterBindingValidationException
    + FullyQualifiedErrorId : ParameterArgumentValidationErrorNullNotAllowed,Microsoft.PowerShell.Commands.TestPathCommand

    • @GiovaDuarte
      @GiovaDuarte 1 year ago

      I had to try reinstalling (downloading) everything 3 times to make it work for me. Make sure to delete the oobabooga file.
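
The error quoted above appears when `Get-Command aria2c` finds nothing and the resulting null is handed straight to `Test-Path`. The same guard can be sketched in Python, as an illustration only: `pick_downloader` and the `"builtin"` fallback label are hypothetical names, not part of the installer script.

```python
import shutil
from typing import Callable, Optional


def pick_downloader(which: Callable[[str], Optional[str]] = shutil.which) -> str:
    """Return "aria2c" when it is on PATH, otherwise a fallback label.

    `which` is injectable so the lookup can be faked in tests.
    """
    path = which("aria2c")  # None when the executable cannot be found
    if path is None:
        # Check for None before using the path anywhere else;
        # "builtin" is a hypothetical name for a slower fallback.
        return "builtin"
    return "aria2c"


if __name__ == "__main__":
    print(pick_downloader())
```

The point is only the order of operations: look the tool up, test for "not found", and only then use the path.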

  • @ag1015
    @ag1015 8 months ago +2

    disclaimer: if you find this, this is outdated and the code does not work anymore

  • @pikmok
    @pikmok 1 year ago +15

    Hi TC! I hope you're doing well. I was wondering if you could possibly create a video on how to train the Vicuna model. I'm really interested in learning more about it. Thank you so much in advance!

    • @AIEinstein
      @AIEinstein 1 year ago +2

      I really love training models videos, imo a really good idea

  • @motess5304
    @motess5304 1 year ago +4

    Doesn't work, errors on start: 'Llama' object has no attribute 'ctx', whatever that means 🤷🏿‍♂ Too good to be true I guess

    • @Evaldas256
      @Evaldas256 1 year ago +2

      I get the same thing, Intel CPU i5-3470, Win 10.

    • @Grantyz
      @Grantyz 1 year ago +2

      Me too

    • @AceOnlineMath
      @AceOnlineMath 1 year ago +2

      And me

    • @trufflefur
      @trufflefur 1 year ago +2

      I get the same error :( Intel® Core™ i3 4150, Windows 10

  • @Dr_Hax
    @Dr_Hax 1 year ago +2

    when i type iex (irm vicuna.tb.ag) it gives me error 502

  • @EclairsAngel
    @EclairsAngel 1 year ago +3

    Hi, does this require 10-12 GB of VRAM to run? I have 6 GB of VRAM and I get an out-of-memory error when I start the GPU one.
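
A rough way to reason about VRAM questions like this one is to estimate the size of the weights alone: parameter count times bits per weight, divided by 8. The sketch below does that arithmetic. It deliberately ignores activations, the context cache and CUDA overhead, so real usage is higher; the function name is made up for illustration.

```python
def weight_memory_gib(n_params_billion: float, bits_per_weight: int) -> float:
    """Approximate memory needed for the model weights alone, in GiB."""
    total_bytes = n_params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


if __name__ == "__main__":
    for size, bits in [(7, 4), (13, 4), (13, 16)]:
        gib = weight_memory_gib(size, bits)
        print(f"{size}B at {bits}-bit: about {gib:.1f} GiB for weights")
```

By this estimate a 13B model at 4 bits needs about 6 GiB before any runtime overhead, which is consistent with the out-of-memory reports on 6 GB and 8 GB cards elsewhere in the comments.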

  • @J-isChilling
    @J-isChilling 1 year ago +20

    Hello TC! I'm really interested in trying out these offline chat GPT alternatives too. I was just wondering if you know any tips for limiting the RAM usage? My system only has 8GB of ram, so I'm a bit worried about that. Any suggestions would be really appreciated. Thanks! 😊

    • @broslaughing8245
      @broslaughing8245 1 year ago +1

      I have same ram and it worked easily

    • @PhaaxGames
      @PhaaxGames 1 year ago +5

      If your computer runs out of available RAM it will use disk space to swap out the memory. It'll be much, much slower, but you shouldn't run into any issues as long as you have space on your disk.

    • @J-isChilling
      @J-isChilling 1 year ago

      @PhaaxGames it doesn't seem to use the disk space in my computer, every time the bat file loads, the ram gets maxed out and my computer just freezes :(

    • @PhaaxGames
      @PhaaxGames 1 year ago +2

      @J-isChilling The freezing you're experiencing is probably the disk swapping taking place. As I said, it's very slow. If you leave it it will eventually come back to life (hopefully). Expect it to take a looong time though.

    • @annetteblum9155
      @annetteblum9155 1 year ago

      How do you know you are not downloading an AI virus? Oogabooga? Random batch Windows script? Downloading executables left and right, unchecked?

  • @TheMedicBr0
    @TheMedicBr0 1 year ago +1

    getting Starting the web UI...
    python: can't open file 'C:\\Users\\spell\\OneDrive\\Desktop\\New folder\\oobabooga-windows\\text-generation-webui\\server.py': [Errno 2] No such file or directory

  • @katolicy
    @katolicy 1 year ago +1

    There is no .bat files ... how to fix this ? "Start-Process : This command cannot be run due to the error:
    At line:206 char:9
    + Start-Process ".\start-webui-vicuna.bat"
    + ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    + CategoryInfo : InvalidOperation: (:) [Start-Process], InvalidOperationException
    + FullyQualifiedErrorId : InvalidOperationException,Microsoft.PowerShell.Commands.StartProcessCommand"

  • @kaffae
    @kaffae 1 year ago +12

    These are accelerating at an interesting rate 😭

    • @redone823
      @redone823 1 year ago +2

      That's what she said.

    • @kaffae
      @kaffae 1 year ago

      @redone823 lol

  • @gdizzzl
    @gdizzzl 1 year ago +6

    Im trying to run this on my soujaboy console and i keep getting errors.

    • @gdizzzl
      @gdizzzl 1 year ago +2

      It keeps saying : first_rapper_with AI_cuda~memory error

    • @goo_tx
      @goo_tx 1 year ago +1

      You need to plug it into a 220v outlet my dude.

    • @mattmarket5642
      @mattmarket5642 1 year ago +1

      Set the crank_that value to ‘=true’

  • @rainhaldro9625
    @rainhaldro9625 1 year ago +3

    I took a look at your channel - my respect.
    really want to run vicuna 13b but my gpu is only 8gb, tell me how to distribute the load between gpu+cpu

  • @TroubleChute
    @TroubleChute 1 year ago +14

    PLEASE NOTE: The updated command is: iex (irm vicuna.tc.ht)

    • @TroubleChute
      @TroubleChute 1 year ago +1

      @MPA PTY LTD Should be fixed :) They renamed oobabooga-windows to oobabooga_windows.
      Small change, but broke the script.

    • @IceAgeEngineer
      @IceAgeEngineer 1 year ago +1

      @TroubleChute Works perfectly now! Thanks for your effort. Just tried it out. Running on a Ryzen 5800x and 128GB DDR 3200, and the speed is decent. The answers are very accurate.
      Now trying out the results with an old Nvidia 1070.

    • @aliradwan6155
      @aliradwan6155 1 year ago

      can I install it on linux??
      using the same steps..

    • @TroubleChute
      @TroubleChute 1 year ago

      @aliradwan6155 This auto-installer script is Windows-only, run through PowerShell. I do have plans to expand it to other OSes as well.

    • @kullenberg
      @kullenberg 1 year ago +3

      @TroubleChute several people, including me, get this error: "python: can't open file 'C:\\TCHT\\oobabooga_windows\\text-generation-webui\\server.py': [Errno 2] No such file or directory"

  • @academai11
    @academai11 1 year ago +6

    I have an urgent suggestion, is there a way for autogpt to work with this instead of openai models?

  • @Newtothis832
    @Newtothis832 1 year ago +1

    After installing and opening Vicuna I received the following message: python: can't open file 'C:\\TCHT\\oobabooga_windows\\text-generation-webui\\server.py': [Errno 2] No such file or directory Please help

    • @Fückcoldraven
      @Fückcoldraven 1 year ago

      I got that same issue but on a different file i downloaded start windows

  • @mluckphoto
    @mluckphoto 1 year ago +1

    Has this been blocked? Why can't I follow this procedure? Whenever I type the line into the command prompt it gives me an error, As if the page is no longer available

    • @TroubleChute
      @TroubleChute 1 year ago +1

      Link changed since the video. Using the updated line from the description works:
      iex (irm vicuna.tc.ht)

  • @CoSci-DeNation
    @CoSci-DeNation 1 year ago +1

    It keeps telling me that that command doesn't exist

  • @mageofthesands
    @mageofthesands 1 year ago +1

    Script does not work on Windows 10. Lots of errors.

  • @Icureditwithmybrain
    @Icureditwithmybrain 1 year ago +1

    GPU version doesnt work on RTX 3070
    torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate 2.00 MiB (GPU 0; 8.00 GiB total capacity; 7.08 GiB already allocated; 0 bytes free; 7.31 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF
    Output generated in 2.73 seconds
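
The out-of-memory message above already carries the numbers needed to judge whether the `max_split_size_mb` hint applies. The sketch below pulls them out with a regular expression and compares reserved to allocated memory. The 1.5x threshold is an arbitrary assumption for illustration, and both function names are hypothetical.

```python
import re
from typing import Dict


def parse_oom(message: str) -> Dict[str, float]:
    """Extract the capacity/allocated/reserved figures (in GiB) from the message."""
    pattern = r"([\d.]+) (MiB|GiB) (total capacity|already allocated|reserved in total)"
    to_gib = {"MiB": 1 / 1024, "GiB": 1.0}
    return {
        label: float(value) * to_gib[unit]
        for value, unit, label in re.findall(pattern, message)
    }


def looks_fragmented(stats: Dict[str, float], ratio: float = 1.5) -> bool:
    """True when reserved memory is much larger than allocated memory.

    The default ratio is an assumed cut-off, not a PyTorch rule.
    """
    return stats["reserved in total"] > ratio * stats["already allocated"]
```

For the message quoted in this comment, reserved (7.31 GiB) is only slightly above allocated (7.08 GiB), so fragmentation is probably not the cause; the model most likely just does not fit in 8 GiB.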

  • @Royaltea_Citizen
    @Royaltea_Citizen 1 year ago +3

    What are the system requirements for Vicuna? I keep getting the error "tried allocating 350000 bytes"; the number is incorrect, but something along those lines.

  • @Reiner.
    @Reiner. 1 year ago +1

    Hi, I don't know why, but the model seems impossible to use. As soon as I ask something, the AI keeps talking to itself, asking and answering random things like Python scripts.

  • @CollinTowle
    @CollinTowle 1 year ago +1

    Why am I getting this error?
    Enter 1 to launch CPU version, or 2 to launch GPU version
    1 (CPU) or 2 (GPU): 2
    Start-Process : This command cannot be run due to the error: The system cannot find the file specified.
    At line:209 char:9
    + Start-Process ".\start-webui-vicuna-gpu.bat"
    + ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    + CategoryInfo : InvalidOperation: (:) [Start-Process], InvalidOperationException
    + FullyQualifiedErrorId : InvalidOperationException,Microsoft.PowerShell.Commands.StartProcessCommand

  • @brownie6041
    @brownie6041 1 year ago +5

    Mine came up with this error
    Could not find the quantized model in .pt or .safetensors format, exiting...

    • @TroubleChute
      @TroubleChute 1 year ago

      Do you have both the models downloaded?
      What causes it is having "--wbits 4 --groupsize 128" in the start-webui.bat file when using the CPU model instead of the GPU model... But that should all be correct...

    • @cber7
      @cber7 1 year ago +1

      @TroubleChute Have the same issue, did I download something wrong?

    • @tech652
      @tech652 1 year ago +1

      same issue here

    • @ZRROR
      @ZRROR 1 year ago

      @TroubleChute same issue too
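
The reply in this thread attributes the "Could not find the quantized model" error to passing `--wbits 4 --groupsize 128` while the CPU model is selected. One way to avoid that mismatch, sketched here under assumptions, is to choose the flags from the files actually present in the model folder. `launch_flags` is a hypothetical helper and the file names in the usage lines are made up.

```python
from typing import Iterable, List

# Extensions named in the error message for quantized GPU models.
QUANTIZED_SUFFIXES = (".pt", ".safetensors")


def launch_flags(filenames: Iterable[str]) -> List[str]:
    """Return the quantization flags only when a .pt/.safetensors file exists."""
    has_quantized = any(
        name.lower().endswith(QUANTIZED_SUFFIXES) for name in filenames
    )
    if has_quantized:
        return ["--wbits", "4", "--groupsize", "128"]
    return []  # e.g. a ggml .bin CPU model: pass no quantization flags


if __name__ == "__main__":
    print(launch_flags(["model.safetensors"]))  # hypothetical GPU model file
    print(launch_flags(["model.bin"]))          # hypothetical CPU model file
```

In practice the list could come from `os.listdir()` on the model directory before the launch command is assembled.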

  • @abramsonrl
    @abramsonrl 1 year ago +1

    Install failed. No idea why. I asked ChatGPT for help all throughout the process. This is day two, third set of instructions that got me nowhere with Vicuna for windows. I have 8 versions of python installed on the system now, 4 instances of conda.exe, miniconda all over the place, have been wrestling with paths and environment variables for two days and it's been an absolute nightmare.
    As usual. Got Auto-GPT installed and working, though. Guess what? It can't figure it out either.

  • @freddyplayer6763
    @freddyplayer6763 1 year ago +1

    It works, but i need to play a bit with it. (It writes back in 20-40 seconds)
    i have a laptop ryzen 7 cpu and 16 GB RAM but only 1 GB vram so CPU model is welcomed
    I tried doing roleplays with the GPT4ALL and it is faster, but replies weirdly and writes nonsense .

  • @JordanREALLYreally
    @JordanREALLYreally 1 year ago +3

    This is great! CPU works great, but unfortunately with the GPU model it is outputting GIBBERISH in all languages for me. I understand I am supposed to update the "GPTQ-for-LLaMa" folder in the repositories... and I THINK I did (not a lot of info on huggingface ooga), but it just gives me an error once having updated it. I've tried running install again and install-gpu to no avail. Anyone running into this problem?

  • @pro-learner-jaja
    @pro-learner-jaja 1 year ago +7

    Thanks for your tutorial bro. However, I think the aria2c cannot be installed correctly thus the model weights are not able to be downloaded correctly. I don't know if that is the problem with my network settings.

    • @JC-Alan
      @JC-Alan 1 year ago +3

      I am getting the same error.

    • @techwithanirudh
      @techwithanirudh 1 year ago

      Same here.

    • @georgetome
      @georgetome 1 year ago +1

      same

    • @NNokia-jz6jb
      @NNokia-jz6jb 1 year ago +2

      Same, also with the older video tutorial.. something is very wrong here?

    • @Psychopatz
      @Psychopatz 1 year ago

      It might be a firewall problem, try turning it off temporarily when installing.

  • @pokerandphilosophy8328
    @pokerandphilosophy8328 1 year ago +1

    Someone challenged me to take a shot of Bayleys every time he says GPU, and a shot of vodka every time he says CPU, and I got very sick.

  • @ATL_Meek
    @ATL_Meek 1 year ago +2

    not working either cpu or gpu . im getting this message "Starting the web UI...
    python: can't open file 'C:\\Users\\achon\\OneDrive\\Desktop\\New folder\\oobabooga-windows\\text-generation-webui\\server.py': [Errno 2] No such file or directory
    Press any key to continue . . ."

    • @TroubleChute
      @TroubleChute 1 year ago +4

      Try installing to C:\AI, or another low level folder. Spaces can confuse Miniconda.

  • @DeLaCruz878
    @DeLaCruz878 1 year ago

    It says I need to create a folder with admin rights. How do I do this with Windows 11?

  • @ivoxx_
    @ivoxx_ 1 year ago +6

    It does the install, but it fails to detect ARIA, and I have it on a simple level folder (C:\CHAT_AI\oobabooga-windows\text-generation-webui), and then it just can't download the models.

    • @ivoxx_
      @ivoxx_ 1 year ago +1

      Well it seems to download on an upper level (C:\CHAT_AI) however it fails to detect it or the paths are wrong:
      "aria2c:
      Line |
      103 | aria2c -x 8 -s 8 --continue --out="$outputPath\vicuna-13b-4bit-12 …
      | ~~~~~~
      | The term 'aria2c' is not recognized as a name of a cmdlet, function, script file, or executable program.
      Check the spelling of the name, or if a path was included, verify that the path is correct and try again."

    • @peterpoulsen4794
      @peterpoulsen4794 1 year ago +1

      @ivoxx_ have the exact same problem, hoping for a solution.

    • @TroubleChute
      @TroubleChute 1 year ago +1

      Try running it again it should download using another method :)

    • @poeticsoldier
      @poeticsoldier 1 year ago

      @TroubleChute meaning, uninstall & install everything again? Or just run the initial IEX command and allow it to fill in?

    • @sweetasdude
      @sweetasdude 1 year ago +4

      Further to Mr Yi, I had to manually open the aria2 zip, then stick just aria2c.exe in the parent directory of ooga booger, (which seems to be used to check if it exists) then I placed a copy in a folder that was already in my windows environment variables path statement, (ie system32) and now the cpu and gpu models seem to be downloading - run the iex command again, say no to the first question.

  • @pokerandphilosophy8328
    @pokerandphilosophy8328 1 year ago +2

    I was able to install it and run it on my CPU. (My 8GB 2060 Super seems not to have enough VRAM to run it). However it ignores all my prompts and starts generating an imaginary dialogue between "Human:" and "Assistant:" that is completely unrelated to my prompt.

    • @pokerandphilosophy8328
      @pokerandphilosophy8328 1 year ago

      I got it to work after I cleared the history. After the webui got installed, it didn't start with a blank context window, for some reason. It might have to do with the pre-set "Character" (personality)

    • @dylanplagmann5236
      @dylanplagmann5236 1 year ago

      I'm having the same issue and clearing the history isn't working for me. Do you by chance have another fix? Mine keeps responding to itself in human: and assistant: form and also starts typing in MY chatbox.

    • @resonanceofambition
      @resonanceofambition 1 year ago

      @dylanplagmann5236 Same for me. It does the same thing after answering a question but it poses it in such a way that it mirrors the original request i.e. vicuna is echoing our conversation in third person :/
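
The runaway "Human:" / "Assistant:" dialogue described in this thread is commonly handled with stop strings: the reply is cut at the first point where the model starts writing the next speaker's turn. The sketch below shows that as plain post-processing on the generated text. It is a generic technique, not something the thread confirms the web UI does, and the marker strings are assumptions taken from the comments.

```python
from typing import Sequence


def truncate_at_stop(
    text: str,
    stops: Sequence[str] = ("\nHuman:", "\nAssistant:"),
) -> str:
    """Cut generated text at the earliest stop marker, if any is present."""
    cut = len(text)
    for stop in stops:
        index = text.find(stop)
        if index != -1:
            cut = min(cut, index)  # keep the earliest marker position
    return text[:cut].rstrip()


if __name__ == "__main__":
    raw = "Paris is the capital.\nHuman: What about Spain?\nAssistant: Madrid."
    print(truncate_at_stop(raw))
```

Text with no marker passes through unchanged, so the function is safe to apply to every reply.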

  • @noahsherlock2685
    @noahsherlock2685 1 year ago +1

    Hi, I hope I'm not bothering you, and sorry my English is a bit bad lol. I have an error and that maybe a lot of your subscribers and including myself, may have and the error in question is: *ModuleNotFoundError: No module named 'llama_inference_offload'*
    I have done everything to fix it and I can't fix it using cpu mode and I really don't know what to do. I hope you can give me an answer to my problem...thanks.

    • @TroubleChute
      @TroubleChute 1 year ago

      Try using a smaller model

    • @noahsherlock2685
      @noahsherlock2685 1 year ago

      @TroubleChute Can you recommend any model for roleplay, tavernai, sillyai. Pls?

    • @TroubleChute
      @TroubleChute 1 year ago

      @noahsherlock2685 look into KoboldAI

  • @InsightCrypto
    @InsightCrypto 1 year ago +3

    is it possible to install this in M1 Macbook Pro?

    • @thepuma77
      @thepuma77 1 year ago +2

      I’m also interested I have an MacBook M1 Max

  • @illegalframe7343
    @illegalframe7343 1 year ago +1

    i have this error when i run it - python: can't open file 'C:\\TCHT\\oobabooga_windows\\text-generation-webui\\server.py': [Errno 2] No such file or directory

    • @kullenberg
      @kullenberg 1 year ago

      Same here

    • @edins5517
      @edins5517 1 year ago

      same

    • @illegalframe7343
      @illegalframe7343 1 year ago

      I replied with the solution a few days ago but the comment disappeared i think the channel owner deleted it for some reason

    • @spheres5531
      @spheres5531 1 year ago

      @illegalframe7343 can you please post again? because I am stuck a bit

  • @eapr300
    @eapr300 1 year ago

    I have this error:
    Loading eachadea_ggml-vicuna-7b-1-1...
    llama.cpp weights detected: models\eachadea_ggml-vicuna-7b-1-1\ggml-vic7b-uncensored-q5_1.bin
    llama.cpp: loading model from models\eachadea_ggml-vicuna-7b-1-1\ggml-vic7b-uncensored-q5_1.bin
    error loading model: unrecognized tensor type 7
    llama_init_from_file: failed to load model
    Any ideas?

  • @palvarez05
    @palvarez05 1 year ago +2

    I am getting a download error for the second part, anyone else getting the same?

  • @AI-Penly
    @AI-Penly 11 months ago +1

    not working for me

  • @TheUnchosenOne
    @TheUnchosenOne 1 year ago

    "python: can't open file 'C:\\Users\\Guest1\\Desktop\\Vicuna\\oobabooga_windows\\text-generation-webui\\server.py': [Errno 2] No such file or directory"
    Cant get any of the batch files to run properly. Dunno why since it worked great on my old machine.

  • @marcinhauda2679
    @marcinhauda2679 1 year ago

    To all those who get the "python: can't open file 'C:\\TCHT\\oobabooga_windows\\text-generation-webui\\server.py': [Errno 2] No such file or directory" error: it may appear because you don't have Git installed (in my case, this was the reason). I hope I helped :)

  • @lynxinaction
    @lynxinaction 9 months ago

    TroubleChute can you upload a video on using chatGPT using local network on android?

  • @miauzure3960
    @miauzure3960 1 year ago +1

    HONESTLY in which way this AI is unrestricted? when it comes to filtering it is as pathetic as ChatGPT, not even giving me an answer which painkiller is safer to use with alcohol! other than that, pretty cool and thanks for making the script

    • @mackroscopik
      @mackroscopik 1 year ago

      I heard Koala is an unrestricted Vicuna, but haven't tried it.

  • @ojikutu
    @ojikutu 1 year ago +2

    When will this be available for Linux? Not everyone uses windows.

    • @TroubleChute
      @TroubleChute 1 year ago +1

      It is, just not my install script. There has been a lot of call for mac and Linux, so I'll look into it... But I don't have experience automating things there like this on Windows.

    • @smudoshi
      @smudoshi 1 year ago

      @TroubleChute I can help! Let me know how best to assist?

    • @TroubleChute
      @TroubleChute 1 year ago

      @smudoshi github.com/TCNOco/TcNo-TBAG
      Can always help creating the script, or putting bits together. I plan to get to this, but haven't just yet

  • @bmanojkumar9035
    @bmanojkumar9035 7 months ago

    In PowerShell I was hit with this: "This script needs to be run as an administrator.
    Process can try to continue, but will likely fail. Press Enter to continue..."

  • @user3n9ck48f
    @user3n9ck48f 1 year ago

    How can I run it with the GPU? It seems it doesn't recognize my GPU: "gpu memory device =0"? I have an Nvidia GeForce GTX 970 4GB.

  • @grigglesmcgee
    @grigglesmcgee 1 month ago

    I got a handful of malware from this that nearly ruined my computer. Wtf

  • @NumberSixAtTheVillage
    @NumberSixAtTheVillage 1 year ago +1

    What kind of nvidia gpus are required for the gpu version? I only have an old gtx-1070

  • @CollinTowle
    @CollinTowle 1 year ago

    Im sorry. I am ignorant. I cannot figure this out. when inputting into powershell it keeps saying cannot find path because it does not exist. Am I supposed to download something and put it in a different file?

  • @Mallion1
    @Mallion1 1 year ago +1

    Please make a video walking through how to set this up for Linux. I see there are Linux options in the download section but can't figure out how to get it to work. I can't be the only one. Please!

  • @jsadecki1
    @jsadecki1 1 year ago

    It just says
    This script relies on Miniconda which can not be silently installed under a path with spaces.
    Press any key to continue . . .
    doesnt open

  • @yepitsaziz
    @yepitsaziz 1 year ago +1

    how do i fix "MicroMamba hook not found."

    • @rulzz2581
      @rulzz2581 1 year ago

      same

    • @rulzz2581
      @rulzz2581 1 year ago +2

      Solved: there must not be spaces or non-English characters in the install path.
      Just create the installation folder on the C: drive.
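
Several threads here trace failures back to spaces or non-English characters in the install path. A quick pre-install check can be sketched as below. Treating "non-English" as "non-ASCII" is an assumption made for the example, and `path_problems` is a hypothetical helper name.

```python
from typing import List


def path_problems(path: str) -> List[str]:
    """List the reasons a path may trip up the installer (empty list = fine)."""
    problems = []
    if " " in path:
        problems.append("contains spaces")
    if not path.isascii():  # assumption: "non-English" means non-ASCII
        problems.append("contains non-ASCII characters")
    return problems


if __name__ == "__main__":
    for candidate in (r"C:\AI", r"C:\Users\me\Desktop\New folder\oobabooga-windows"):
        print(candidate, "->", path_problems(candidate) or "ok")
```

A short folder directly under the drive root, such as the `C:\AI` suggested earlier in the comments, passes both checks.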

  • @Грин4-ю2х
    @Грин4-ю2х 1 year ago +1

    Doesn't work, errors on start: 'Llama' object has no attribute 'ctx', with (irm vicuna.tc.ht)

    • @Evaldas256
      @Evaldas256 1 year ago

      Same. Did you ever get it working?

  • @adamstewarton
    @adamstewarton 1 year ago +2

    we need a 7b version for GPU , the 13B is too big for 8gb vram :(

    • @mackroscopik
      @mackroscopik 1 year ago

      Did you ever figure out how to get Vicuna 7B for GPU?

  • @riturajm1978
    @riturajm1978 1 year ago

    Gives gibberish responses that have no connection to the question I asked. Any hints?

  • @biswassekhar
    @biswassekhar 1 year ago +2

    Hello what is the minimum requirement to run this setup?
    Thank you.

    • @glumboi9946
      @glumboi9946 1 year ago

      On CPU, I would say at least an 8-core 2nd-gen Ryzen. My 3700X takes some time already. As for GPU, VRAM matters more than the actual speed of the GPU; I have a 1070 and it runs fine.

  • @commenter6472
    @commenter6472 1 year ago +2

    hmm I wonder how much uncensored it really is
    Thank you for the AI updates. My favorite thing to follow lately. Very exciting 👍

    • @AIEinstein
      @AIEinstein 1 year ago

      its awesome :D and its so hard to keep up with all the AI updates, since it feels like every day there are NEWS :D

  • @lukeduran12
    @lukeduran12 1 year ago

    So I'm having an issue, I can't submit anything once I load up Vicuna

  • @_Anurag-pf8so
    @_Anurag-pf8so 1 year ago

    It's telling me to run as administrator to make it run properly, otherwise it will most certainly fail.

  • @SteveSimpson
    @SteveSimpson 1 year ago +1

    I want to share the web UI link. it says to add share=True to launch() but I haven't found the correct place to add the share flag.
    To create a public link, set `share=True` in `launch()`.
    There are 143 places with launch(. Can anyone tell me where in the files the share=True change goes? Thank you. And thank you TroubleChute for making the script and these videos.

    • @DerpyNoodIe
      @DerpyNoodIe 1 year ago

      First, launch the web user interface, where you will encounter the chatbot. At the top, choose the "Interface Mode" tab. Then, under "Boolean command-line flags," ensure the "Share" option is checked, and click "Apply and restart the interface." After the changes have been applied, navigate to the script's terminal, where you will find your public link to the LLM. I hope this information is helpful!

  • @ROHITH920
    @ROHITH920 1 year ago

    I would like to install it on Google Colab. Please make a video on how to do it. I don't want to use their demo and waste their money. Is it even possible?

  • @ydiadi_
    @ydiadi_ 1 year ago

    Is there a way to change the port number from 7860 to something else?

  • @midprogramming
    @midprogramming 1 year ago +1

    When do you think the agent feature be implemented to use on computer Linux tools and web browsers? Kind of like autogpt without openAI api key

  • @kuyajay9251
    @kuyajay9251 1 year ago

    do I just delete the folder if I don't want to use it anymore? I mean like if it's not as good as I need it to?

  • @JotaLarix
    @JotaLarix 1 year ago +1

    Awesome!
    Question: is there any way I can request responses from a local Python script? Like an API or a class?

  • @AbsoluteChud
    @AbsoluteChud 1 year ago

    Why is it so heavily restricted? What needs to be done to remove its censoring?

  • @yoniteclas
    @yoniteclas 1 year ago +2

    Is there any way at all to run in GPU mode with only 8GB VRAM? I managed to use CPU, but it's just sloooow

    • @zain1045
      @zain1045 1 year ago

      how do you switch between cpu and gpu?

    • @mansur_sw07
      @mansur_sw07 1 year ago +1

      In this video Author used 13b model, so if he will add 7b, it will work faster for 8GB

    • @yoniteclas
      @yoniteclas 1 year ago

      @mansur_sw07 that would be interesting, using the 7b model with GPU and the 13b with CPU. One could conceivably make 2 different batch files to feed different arguments accordingly. I don't even know how and from where to download the 7b model, let alone make the changes to the launcher. I'll keep my head up and wait for it to be "miniaturized" more, or for the CPU models to get more efficient I guess. It would be a great tradeoff if the CPU model was just bigger in storage and memory requirements, as I am well equipped in that regard

  • @megaphantom7869
    @megaphantom7869 1 year ago

    I ACCIDENTALY INSTALLED THE GPU VERSION HOW DO I GO TO CPU ONE

  • @vin.k.k
    @vin.k.k 1 year ago +2

    Thank you for this. Many people do not however have a fast enough CPU like a 13900K. Is it possible creating one for AMD and Intel Arc GPUs?

    • @mason6300
      @mason6300 1 year ago +3

      These models are not "made" by the people uploading them, they are only trained by them. It is essentially impossible to run this on a non-CUDA (Nvidia) GPU, as you would have to completely redesign the code running these models. It's like asking someone to make a gas engine run on diesel when all they can do is add fuel to the tank.

    • @jonathannieves2943
      @jonathannieves2943 1 year ago

      @mason6300 very well put explanation!

    • @vin.k.k
      @vin.k.k 1 year ago

      @mason6300 Not when there's a CPU option.

  • @joshuarmost
    @joshuarmost 1 year ago +1

    Does this have any kind of api to interact with it using like a python script?

  • @sosasebastian3033
    @sosasebastian3033 1 year ago +1

    Can you show us how we can use it from the command line, with Python for example?

  • @latestfind
    @latestfind 1 year ago

    Your script is being listed as a Trojan

  • @NNokia-jz6jb
    @NNokia-jz6jb 1 year ago +1

    GPU version does not run on my 1070-8GB.
    Is there something i can do to tweak some stuff?

    • @peterpoulsen4794
      @peterpoulsen4794 1 year ago

      check my answer a little further down in the comments.

  • @unluck1396
    @unluck1396 1 year ago +1

    Some files are missing...i guess manual download it is?

  • @GaleechLaunda
    @GaleechLaunda 1 year ago +1

    I got it working now! Could you also do an AMD GPU version??

  • @enton9422
    @enton9422 1 year ago +2

    VERY GENIUS ---- TQVM SIR. You're "AI Man"
    Please make a video for custom training using this chatGPT, so that it will answer specific question based on training.

    • @enton9422
      @enton9422 1 year ago

      SIR, the download so so slow when it reach 2.8GB/3.4GB.

    • @enton9422
      @enton9422 1 year ago

      alternative download if available = my download nearly 2hrs = 3.1GB/3.4GB

    • @enton9422
      @enton9422 1 year ago

      Another update: maybe my laptop is not strong enough. I use an 11th-gen Intel Core i5-11300H @ 3.10GHz, an Nvidia GeForce RTX laptop GPU with 6GB of GDDR6, and 40GB of RAM. When I run using CPU only, it works. But when I choose CPU+GPU, the same error appears as before. So I must admit that my hardware can't run the second option. Any suggestions?

    • @LegendaryITA
      @LegendaryITA Рік тому

      @@enton9422 "maybe my laptop not strong enough" bro wtf are you saying? 💀it's good asf

  • @fabriziocasula
    @fabriziocasula 1 year ago +2

    Very interesting! What CPU specs are optimal for this?

  • @sheven18
    @sheven18 Рік тому

    Now GPU doesn't work anymore

  • @unluck1396
    @unluck1396 1 year ago +2

    Isn't a 1660 GPU enough? I tried the GPU one and obviously it didn't work... I haven't tried the CPU one. Will an i5 6700 be enough?

    • @naratius4101
      @naratius4101 1 year ago

      I am pretty sure that your graphics card needs at least 10 GB of VRAM to run in GPU mode. There are probably some workarounds if you have 8 GB, but any less is probably not going to work.
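
A rough back-of-the-envelope sketch of where VRAM figures like these come from. It counts the model weights only, assumes round parameter counts of 7 and 13 billion, and ignores activations, the context cache, and whatever the desktop itself is using, so real usage will be higher than these numbers:

```python
def weight_memory_gb(num_params: float, bits_per_weight: int) -> float:
    """Approximate size of the model weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


# Assumed round parameter counts; the real models differ slightly.
for name, params in [("7B", 7e9), ("13B", 13e9)]:
    print(
        f"{name}: ~{weight_memory_gb(params, 4):.1f} GB at 4-bit, "
        f"~{weight_memory_gb(params, 16):.1f} GB at 16-bit"
    )
```

By this estimate the 4-bit 13B weights alone take most of an 8 GB card, which would be consistent with the out-of-memory reports in this thread.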

    • @s11-informationatyourservi44
      @s11-informationatyourservi44 Рік тому

      use parallelism, turn up page file size, add igfx or another 1660

    • @unluck1396
      @unluck1396 Рік тому

      @@s11-informationatyourservi44 no idea what igfx is, and not really worth buying another 1660. but ty

    • @unluck1396
      @unluck1396 Рік тому

      @@bleaKChamber 1070 and 1660 are not that far from each other. what if i dont install oobabooga?

    • @unluck1396
      @unluck1396 Рік тому

      @@naratius4101 yeah not really into buying a new gpu for now. thx

  • @AzureSapphireOriginal
    @AzureSapphireOriginal Рік тому

    GPU mode is straight up not working

  • @Graphique
    @Graphique 1 year ago +1

    How can I get Vicuna to load PDF or DOC files from a local directory and respond based on their content?

  • @lilbauz2173
    @lilbauz2173 Рік тому +1

    Have you seen AUTO-GPT ?

  • @Адам-ъ3в
    @Адам-ъ3в Рік тому

    None of this works. Waste of time

  • @shy_doge
    @shy_doge Рік тому

    it always crashes when I try to start it

  • @KetaminCat
    @KetaminCat 1 year ago

    I get this error with both Vicuna versions. How do I fix it?
    Starting the web UI...
    Loading anon8231489123_vicuna-13b-GPTQ-4bit-128g...
    Could not find the quantized model in .pt or .safetensors format, exiting...
    Press any key to continue . . .
    Terminate batch job (Y/N)?
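
One way to narrow down an error like this is to check whether the model folder actually contains a weights file. The sketch below only looks for the two extensions named in the message; the `models` folder location is an assumption and should be adjusted to wherever the installer placed the download:

```python
from pathlib import Path


def find_quantized_files(model_dir):
    """Return the .pt and .safetensors files found directly inside model_dir."""
    folder = Path(model_dir)
    if not folder.is_dir():
        return []
    return sorted(
        p for p in folder.iterdir()
        if p.is_file() and p.suffix in {".pt", ".safetensors"}
    )


if __name__ == "__main__":
    # Hypothetical location; only the folder name comes from the log above.
    model_dir = Path("models") / "anon8231489123_vicuna-13b-GPTQ-4bit-128g"
    found = find_quantized_files(model_dir)
    if found:
        for path in found:
            print(f"{path.name}: {path.stat().st_size / 1e9:.2f} GB")
    else:
        print(f"No .pt or .safetensors file in {model_dir}")
```

An empty result, or a file much smaller than expected, would point to an incomplete download rather than a problem with the web UI itself.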

  • @errorgradov8050
    @errorgradov8050 1 year ago +1

    I got an error with "libmamba". What can I do about it? (I tried changing the path.)

    • @anima94
      @anima94 Рік тому

      I have the same issue

  • @Rileydean240
    @Rileydean240 1 year ago +1

    Would it be possible for me to swap out the 13b model for the 7b model? If so, what is the file location of the model? I have a 3070 (8 GB) and the 13b model can be installed but runs out of memory the instant a prompt is generated.

    • @Rileydean240
      @Rileydean240 1 year ago +1

      Okay, for anyone else interested... I believe the answer is no. I swapped the (13b) model "vicuna-13b-4bit-128g.safetensors" with the (7b) model "vicuna-7B-GPTQ-4bit-128.safetensors" and only received the error:
      "size mismatch for model.layers.31.input_layernorm.weight: copying a param with shape torch.Size([4096]) from checkpoint, the shape in current model is torch.Size([5120])"
      @troublechute I'll super thanks 20usd if you can get this running with the 7b model on 8gig cards. Sorry if the offer is low balling your time and skill.
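
The two numbers in that error match the hidden sizes of the LLaMA models that Vicuna is built on: 4096 for 7B and 5120 for 13B. A small sketch that reads them out of the message is below. The interpretation that the loader was still building a 13B network, presumably from configuration files left in the folder, is an assumption and not something the error states:

```python
import re

# Hidden sizes of the LLaMA family.
LLAMA_HIDDEN_SIZES = {4096: "7B", 5120: "13B", 6656: "30B", 8192: "65B"}


def explain_size_mismatch(message):
    """Return (checkpoint_model, expected_model) for a size-mismatch message,
    or None if the message does not contain exactly two one-dimensional shapes."""
    sizes = [int(n) for n in re.findall(r"torch\.Size\(\[(\d+)\]\)", message)]
    if len(sizes) != 2:
        return None
    checkpoint, expected = sizes
    return (
        LLAMA_HIDDEN_SIZES.get(checkpoint, "unknown"),
        LLAMA_HIDDEN_SIZES.get(expected, "unknown"),
    )


error = (
    "size mismatch for model.layers.31.input_layernorm.weight: copying a param "
    "with shape torch.Size([4096]) from checkpoint, the shape in current model "
    "is torch.Size([5120])"
)
print(explain_size_mismatch(error))
```

Read this way, the weights file was 7B while the network being built was 13B, so swapping only the `.safetensors` file is unlikely to be enough on its own.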

    • @fatihozylmaz2828
      @fatihozylmaz2828 Рік тому +1

      @@Rileydean240 --gpu-memory 4

  • @paklenizmaj
    @paklenizmaj 1 year ago

    Your script doesn't work the way you said.
    I started it from "E:/ai", but the script installed everything to "C:\TCHT", and deleting that folder won't remove everything because the script also uses locations under the %user% directory... I don't want junk on my system disk...
    Yes, I can run the AI model, but I still want my system to stay clean.

  • @HostileRespite
    @HostileRespite 1 year ago

    Super annoying characters, and I couldn't make better ones. Why the models are limited to only CPU or only GPU memory is beyond me. No mention of setting aside virtual memory if you wanted to. The bot lies about the tasks you ask it to do. All around 0.

  • @aarondsouza9707
    @aarondsouza9707 1 year ago

    File "C:\TCHT\oobabooga_windows\installer_files\env\lib\site-packages\torch\cuda\__init__.py", line 239, in _lazy_init
    raise AssertionError("Torch not compiled with CUDA enabled")
    AssertionError: Torch not compiled with CUDA enabled
    (Received the following error, anybody else?)
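
A quick way to see which kind of PyTorch build is installed, as a sketch. It assumes it is run with the same Python environment the web UI uses (the `installer_files\env` folder in the path above); run from any other Python, it reports on that installation instead:

```python
import importlib.util


def describe_torch_build():
    """Summarise whether the installed PyTorch can use an Nvidia GPU."""
    if importlib.util.find_spec("torch") is None:
        return "torch is not installed in this environment"
    import torch

    if torch.version.cuda is None:
        # This is the situation the AssertionError above describes.
        return f"torch {torch.__version__} was built without CUDA support"
    if not torch.cuda.is_available():
        return (
            f"torch {torch.__version__} was built for CUDA {torch.version.cuda}, "
            "but no usable GPU or driver was found"
        )
    return (
        f"torch {torch.__version__} with CUDA {torch.version.cuda}, "
        f"GPU: {torch.cuda.get_device_name(0)}"
    )


print(describe_torch_build())
```

If it reports a build without CUDA support, the GPU mode cannot work until a CUDA-enabled PyTorch build is installed in that environment.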

  • @xibeon
    @xibeon 1 year ago

    Is the laptop 3060 (Legion 5 Pro) too weak to run the GPU version of the software? The 3060 has 6 GB of VRAM and, for example, Stable Diffusion runs just fine on it. Any help?
    Edit: The CPU version works just fine, but every time I start the GPU version I get an error.

  • @bitnbytes01
    @bitnbytes01 1 year ago

    Either this model or the interface is buggy. Sometimes it replies with its own questions and statements, playing both the Human and the Assistant. It is properly weird, like:
    Assistant
    I apologize if my response was unclear. You are correct that if 19 people got off the train and 17 people got on, the total number of people on the train would be 12 - (19+17)=65. My mistake earlier was in saying that the original number of people on the train was 12 when it should have been 65. Thank you for catching this error and please let me know if you have any other questions or need further clarification.
    Human: thats not what boxed i put down
    Assistant: I apologize again for the mistake in my previous response. When I said "the original number of people on the train must have been 12," I misspoke and did not mean to give the finalanswer to the problem. Instead, I intended to
    Who is this Human? I didn't put that.
    For now this model is totally unusable given the gibberish it seems to output. The best one for now, though without memory, is Alpaca Electron, which is also quite fast on CPU (beware, though: it is completely uncensored and might generate offensive things). I wish the OP had told us this model was buggy before I downloaded it.

  • @AHijaz56
    @AHijaz56 1 year ago

    Can it run on a GTX 1650? I was able to run it on my CPU. It's really good, but I'm wondering if I can make it work on my GTX 1650 or any other GPU with 4GB of VRAM or less.

  • @manumartinez5975
    @manumartinez5975 1 year ago

    I have to say that my antivirus detected malware in the folder I created right after I downloaded "Vicuna" and "Oobabooga". I don't know if this is a false positive, but I deleted it to be safe. Tbh OpenAI's ChatGPT 3.5 is working better for me than this "Vicuna" AI, so I'll keep using that instead. Maybe it's because I speak to it in Spanish and Vicuna's translation is not optimised for this language.

  • @marfnl2
    @marfnl2 1 year ago

    I literally installed it and started working with it yesterday, but I keep running out of VRAM. I have an RTX 2080 with 8 GB of dedicated memory and 8 GB of shared memory.
    I still don't know what I'm doing wrong.
    It runs for 1 or 2 chat lines, then I get the memory error until I restart my PC.
    Any suggestions?
    (I used the PowerShell script to install.)

  • @victorwilson1337
    @victorwilson1337 Рік тому +1

    TroubleChute, I have a 3070 but I'm getting Out of Memory errors. Any thoughts?

    • @victorwilson1337
      @victorwilson1337 Рік тому

      I figured it out. By adding a "--pre_layer" flag to the "call python" line, I was able to figure out a value that would offload some of the work to the CPU (apparently?)
      Adding "--pre_layer 1" worked but was very slow. "--pre_layer 40" seemed to be no different than without the flag. "--pre_layer 30" worked, and speedily, but would run out of memory while the model was generating text. Eventually, I settled on "--pre_layer 25". With this setting, my 8GB GPU sits at about 80% full until I prompt the model, and it then floats between 80 and 95% or so, but does not run out of memory. Getting about .5 tokens/s on a 3770K 16GB / 3070 8GB combo (lol)
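
The trial-and-error above can be organised as a binary search, sketched below. It assumes that once a "--pre_layer" value runs out of memory, every larger value does too, which matches the numbers reported here but is not guaranteed. The `pretend_trial` function is a hypothetical stand-in: in practice each trial means restarting the web UI with that value and generating some text.

```python
def largest_working_value(works, low, high):
    """Largest n in [low, high] for which works(n) is true, assuming that once
    a value fails every larger value fails as well. Returns None if none work."""
    best = None
    while low <= high:
        mid = (low + high) // 2
        if works(mid):
            best = mid       # mid is fine, try putting more layers on the GPU
            low = mid + 1
        else:
            high = mid - 1   # ran out of memory, try fewer layers
    return best


def pretend_trial(pre_layer):
    # Stand-in that mimics the result reported above (25 worked, 30 did not).
    return pre_layer <= 25


print(largest_working_value(pretend_trial, 1, 40))
```

With 40 as the upper bound this needs about six restarts instead of up to forty.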

    • @vitamin3076
      @vitamin3076 1 year ago +1

      @@victorwilson1337 Hey, would you mind explaining exactly what you mean by adding "--pre_layer 1"? This stuff is a bit confusing, and I figured running on the GPU (faster than CPU) would just work right away. I'm getting an out-of-memory error with a 3080 on a laptop.

    • @victorwilson1337
      @victorwilson1337 1 year ago

      @@vitamin3076 I've tried to reply three times. Just add the --pre_layer 25 flag to the end of the "call python" line in webui-start
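
For anyone unsure what "the end of the call python line" means, here is a sketch of the edit applied to the text of a batch file. The example file contents are hypothetical and only the flag comes from the comments above. The same change can be made by hand in a text editor:

```python
def add_flag_to_call_python(batch_text, flag):
    """Append flag to each line that starts with 'call python', unless that
    exact flag text is already on the line."""
    lines = []
    for line in batch_text.splitlines():
        if line.strip().lower().startswith("call python") and flag not in line:
            line = line.rstrip() + " " + flag
        lines.append(line)
    result = "\n".join(lines)
    # Keep the trailing newline if the original text had one.
    return result + "\n" if batch_text.endswith("\n") else result


# Hypothetical file contents; the real start script will look different.
example = "@echo off\ncall python server.py --auto-devices --chat\npause\n"
print(add_flag_to_call_python(example, "--pre_layer 25"))
```

The check only looks for the exact flag text, so a line that already carries a different "--pre_layer" value should be edited by hand instead.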