Creating J.A.R.V.I.S.

Поділитися
Вставка
  • Опубліковано 15 тра 2024
  • A sneak peek of voice-to-voice chat assistant.
    🦾 Discord: / discord
    ☕ Buy me a Coffee: ko-fi.com/promptengineering
    |🔴 Patreon: / promptengineering
    💼Consulting: calendly.com/engineerprompt/c...
    📧 Business Contact: engineerprompt@gmail.com
    Become Member: tinyurl.com/y5h28s6h
    💻 Pre-configured localGPT VM: bit.ly/localGPT (use Code: PromptEngineering for 50% off).
    Signup for Advanced RAG:
    tally.so/r/3y9bb0
    All Interesting Videos:
    Everything LangChain: • LangChain
    Everything LLM: • Large Language Models
    Everything Midjourney: • MidJourney Tutorials
    AI Image Generation: • AI Image Generation Tu...
  • Наука та технологія

КОМЕНТАРІ • 50

  • @MeinDeutschkurs
    @MeinDeutschkurs 23 дні тому +2

    Wooohooo!! Yeah, can‘t wait for it! ⭐️

  • @barackobama4552
    @barackobama4552 22 дні тому +2

    Impressive, thanks!

  • @3choff
    @3choff 22 дні тому

    Very interesting project! Do you use any VAD to detect the end of the request?

  • @comfyuiadrian
    @comfyuiadrian 22 дні тому

    Wahooo..really looking forward to your new project!

  • @Techonsapevole
    @Techonsapevole 22 дні тому +1

    it's fast which TTS and STT did you use ?

  • @RickySupriyadi
    @RickySupriyadi 23 дні тому

    yes please is it going open source?

  • @aa-xn5hc
    @aa-xn5hc 22 дні тому

    Great looking forward

  • @themax2go
    @themax2go 21 день тому

    should edit title to add "using openai"

  • @user-jq1gc8lt7s
    @user-jq1gc8lt7s 23 дні тому

    I LIKE IT GREAT JOB

  • @GroqSummarizer
    @GroqSummarizer 21 день тому

    Nice!

  • @GetzAI
    @GetzAI 23 дні тому

    EXCITED!

  • @Thorin632
    @Thorin632 21 день тому

    Please make beginner friendly tutorial, step by step guide on how to integrate this with localgpt 🙏🙏

  • @brianpereira7757
    @brianpereira7757 23 дні тому +2

    That doesnt sound like Jarvis, I want the real Jarvis voice!!!

    • @engineerprompt
      @engineerprompt  22 дні тому +1

      Good point, I think elevanlabs have that. Will try to integrate that :)

    • @sayantandas7544
      @sayantandas7544 22 дні тому

      ​@@engineerprompt How about you add a little UI also? And maybe add a button to take continuous screenshot with a regular interval as well. In that way, you will be releasing the OpenAI's demo app before OpenAI.

  • @joepropertykey3612
    @joepropertykey3612 22 дні тому

    Right on Bro, RIGHT ON. ......... but we need the voice of Cortana for this, for when we are sitting around in our Mark V Armor and coding...:)

  • @KiyotokaAyanakoji-ss1gn
    @KiyotokaAyanakoji-ss1gn 23 дні тому +2

    What TTS are you using and is it running locally

    • @engineerprompt
      @engineerprompt  23 дні тому +3

      Whisper but via the api. Nothing is running locally in this video. Local version will be coming soon.

    • @KiyotokaAyanakoji-ss1gn
      @KiyotokaAyanakoji-ss1gn 23 дні тому

      @@engineerprompt loved it 👍

    • @Gun_ForFun
      @Gun_ForFun 23 дні тому +1

      @@engineerprompt but Whisper is ASR, not TTS??

    • @snapman218
      @snapman218 22 дні тому

      Gross.

    • @themax2go
      @themax2go 21 день тому

      someone already made a fully local version and works w/ little latency and with voice training. there already exist projects on github for continuous speech using a keyword to trigger recording, and a version with a ptt implementation instead of keyword

  • @borisrusev9474
    @borisrusev9474 22 дні тому

    I don't get it, how's that different from GPT-4o?

    • @engineerprompt
      @engineerprompt  22 дні тому +1

      You are right, very similar in functionality. In fact, this version is using GPT-4o for text generation. But the voice functionality is not available in GPT-4o yet.

  • @RickySupriyadi
    @RickySupriyadi 23 дні тому

    also i request a video about this vs gpt-4o

  • @im-notai
    @im-notai 22 дні тому

    Idk know, why there is a folder on my desktop named Jarvis-v6 since 5 months and surprisingly that's also doing the same job 😮

    • @engineerprompt
      @engineerprompt  22 дні тому

      Would love to see what's in the folder :D I am v0 now

    • @im-notai
      @im-notai 22 дні тому

      @@engineerprompt it's gonna become interesting. I thought I was the one who was able to crack speech while streaming to reduce the latency.

  • @smoofwah3552
    @smoofwah3552 23 дні тому

    Is there a way to speed it up?

    • @engineerprompt
      @engineerprompt  23 дні тому

      Yes, Groq has whisper support now. Going with that but the issue is the rate limit!

    • @alx8439
      @alx8439 20 днів тому

      To use rhasspy3 as a base. It streams audio directly to asr model

  • @danieldjinishiandebriquez1858
    @danieldjinishiandebriquez1858 23 дні тому

    What apis are being used?

    • @engineerprompt
      @engineerprompt  23 дні тому

      currently everything is openai. Just got access to whisper from Groq, will update it and hope will be much faster!

    • @danieldjinishiandebriquez1858
      @danieldjinishiandebriquez1858 23 дні тому

      @@engineerprompt great! Looking forward the tutorial or git repo. Literally yesterday I was searching about Jarvis haha

  • @Soniboy84
    @Soniboy84 22 дні тому

    how it's different than gpt4o voice?

  • @temp911Luke
    @temp911Luke 22 дні тому

    Nice but would be great without that annoying 2-3 sec delay.

    • @engineerprompt
      @engineerprompt  22 дні тому

      I agree, I just got access to Groq Whisper. Will be interesting to see how that works.

    • @fontende
      @fontende 22 дні тому

      ​@@engineerpromptGeorge Hotz on stream called groq a scam...

  • @themax2go
    @themax2go 21 день тому +2

    not local. not the jarvis voice. misleading title. disappointed

    • @javiergimenezmoya86
      @javiergimenezmoya86 21 день тому

      Why do you think that is not local? The only bad thing is that he do not use voice streaming for make it faster (I did it so)