- 71
- 57 488
NewGenAI
India
Приєднався 5 січ 2024
🚀 Welcome to StableAIHub - Your Gateway to AI Innovation! 🤖✨ Dive into the forefront of artificial intelligence and explore the fascinating world of Stable Diffusion with us. Uncover the magic where stability meets creativity, as we unravel the secrets of generating stunning images from text prompts. Whether you're an AI enthusiast, a tech explorer, or a creative mind seeking inspiration, you're in the right place. Join our community, stay updated on the latest breakthroughs, and embark on a journey of discovery in the ever-evolving landscape of AI. Subscribe now and let's shape the future together! 🌐🔍 #StableDiffusion #AIInnovation #TechExploration
The Beginner's Guide to Creating Your Own Talking-Head / Lip sync videos using EchoMimic
Forge
github.com/lllyasviel/stable-diffusion-webui-forge
EchoMimic tutorial
ua-cam.com/video/WtHdvSSQlWo/v-deo.html
Extract frames
ffmpeg -i video.mp4 -vf fps=30 input\%d.png
Combine frames after post-processing
ffmpeg -framerate 30 -i %d.png -vcodec libx264 -crf 1 video.mp4
#AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #lipsync #talkinghead #audio2video
How to create a talking-head video
Easy guide to making talking-head videos
Talking-head video tutorial for beginners
EchoMimic talking-head tutorial
Best software for talking-head videos
How to animate a static image into a talking-head
Post-process talking-head video to improve quality
How to sync audio with a talking-head video
Best lip sync tools for talking-head videos
How to add lip sync to a talking-head video
0:00 Introduction
0:05 Step 1: Generate a head for talking-head
2:34 Step 2: Breathing life into our head with EchoMimic
9:35 Final Step: post-processing
github.com/lllyasviel/stable-diffusion-webui-forge
EchoMimic tutorial
ua-cam.com/video/WtHdvSSQlWo/v-deo.html
Extract frames
ffmpeg -i video.mp4 -vf fps=30 input\%d.png
Combine frames after post-processing
ffmpeg -framerate 30 -i %d.png -vcodec libx264 -crf 1 video.mp4
#AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #lipsync #talkinghead #audio2video
How to create a talking-head video
Easy guide to making talking-head videos
Talking-head video tutorial for beginners
EchoMimic talking-head tutorial
Best software for talking-head videos
How to animate a static image into a talking-head
Post-process talking-head video to improve quality
How to sync audio with a talking-head video
Best lip sync tools for talking-head videos
How to add lip sync to a talking-head video
0:00 Introduction
0:05 Step 1: Generate a head for talking-head
2:34 Step 2: Breathing life into our head with EchoMimic
9:35 Final Step: post-processing
Переглядів: 283
Відео
Ctrl-X: Revolutionizing Text-to-Image Control Without Guidance
Переглядів 113День тому
Ctrl-X github.com/genforce/ctrl-x Installation guide drive.google.com/file/d/1KdxQkjWQaPvgBTS4YGBV3ewMUjL477E2/view?usp=drive_link #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #CtrlX #T2IGeneration #StructureControl #Appearanc...
CtrLoRA Explained: Next-Level Control for Your Text-to-Image Creations!
Переглядів 289День тому
CtrLoRA github.com/xyfJASON/ctrlora Installation guide drive.google.com/file/d/14fwXYLkbEcd1FHjOOPxMunpIkCW9zDTK/view?usp=sharing #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #CtrLoRA #ImageGeneration #EfficientAI #Controllabl...
Meissonic: Lightning-Fast 1B T2I Model for Jaw-Dropping 1024x1024 Images on Consumer GPUs!
Переглядів 27914 днів тому
Meissonic github.com/viiika/Meissonic Installation guide drive.google.com/file/d/1qTiJm_4az_ud4rCKxM6xZFzTLkwDnFx6/view?usp=sharing Gradio WebUI drive.google.com/file/d/1cgFhMKpDicF-lUV8xzRDZMhemXQ49oEd/view?usp=sharing #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthro...
The BEST voice cloning app ever? Clone Any Voice with F5-TTS: The Most Accurate TTS Yet!
Переглядів 2,7 тис.14 днів тому
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching github.com/SWivid/F5-TTS Fix NUMPY package version pip install force-reinstall -v "numpy 1.25.2" Quick installation guide 1. Clone and navigate inside the folder 2. Create virtual environment python -m venv venv 3. Activate virtual environment venv\scripts\activate 4. Install Wheel pip install wheel 5. Install require...
From Low to Pro: Frame Interpolation with REAL-Video-Enhancer on Windows
Переглядів 19214 днів тому
REAL-Video-Enhancer github.com/TNTwise/REAL-Video-Enhancer #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #VideoEnhancer #FrameInterpolation #Upscaling #REALVideoEnhancer #VideoEditing #RIFEESRGAN #AIUpscaling #AiVideoInterpolat...
Think 8GB VRAM Can't Handle Controllable AI Generation? Naaaaaah! Introducing ControlNeXT SVD
Переглядів 1,8 тис.14 днів тому
ControlNeXT github.com/dvlab-research/ControlNeXt/ ControlNeXt-SVD-v2 for Low VRAM systems (atleast 8 GB VRAM ) 8 GB shared github.com/newgenai79/ControlNeXt-SVD-v2 #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #ControlNeXT #AI...
Makeine Magic: Create Reels & Shorts from Just a Text Prompt!
Переглядів 11621 день тому
Makeine github.com/Kither12/Makeine Updated files for Windows drive.google.com/file/d/1hhqBADXnufZzbTfROl92dxv-6fDE9QSK/view?usp=sharing ImageMagick for Windows imagemagick.org/script/download.php#windows #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearc...
Deep Live Cam: Face Swaps for Live camera, Images, Videos, and Multiple Faces!
Переглядів 56721 день тому
Deep-Live-Cam github.com/hacksider/Deep-Live-Cam Fix for transparent window github.com/hacksider/Deep-Live-Cam/issues/668 #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #DeepLiveCam #FaceSwap #RealTimeFaceSwap #ImageToVideo #Liv...
SadTalker: Audio-Driven Single Image Talking Face Animation on Windows
Переглядів 72928 днів тому
SadTalker github.com/OpenTalker/SadTalker In requirements.txt file replace gradio with gradio 3.41.2 before installing #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology realistic ai voice generator new trending ai animation video a...
OOTDiffusion: The Future of Virtual Try-ons with AI Fashion
Переглядів 587Місяць тому
Installation guide drive.google.com/file/d/1dHlaYY-P_wGx5W_6phQNo2Jwyw7M96Th/view?usp=sharing #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #OOTDiffusion #VirtualTryOn #AIFashion #FashionTech #OutfitFusion #LatentDiffusion #Vir...
ResShift: Lightning-Fast Super-Resolution & Face Restoration
Переглядів 270Місяць тому
ResShift github.com/zsyOAOA/ResShift Additional files drive.google.com/file/d/1I4j1bwGakMREmg7Q8PYMvIKmCPbvbFrU/view?usp=sharing #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #ResShift #SuperResolution #FaceRestoration #AIUpsca...
Master Voice Cloning with CosyVoice: Multilingual AI for Realistic Speech Generation
Переглядів 674Місяць тому
CosyVoice github.com/FunAudioLLM/CosyVoice Additional files drive.google.com/file/d/13imjTSVqXcu1SWsy2ptR2hyBRh_E6ued/view?usp=sharing #AI #StableDiffusion #TechInnovation #ArtificialIntelligence #DeepLearning #AIExploration #TechEnthusiast #CreativityInAI #StableAIHub #AICommunity #InnovationHub #TechBreakthroughs #AIResearch #futuristictechnology #VoiceCloning #AI #TextToSpeech #CosyVoice #Vo...
Unlock Emotions in Talking-head Videos with EDTalk
Переглядів 9482 місяці тому
Unlock Emotions in Talking-head Videos with EDTalk
AniTalker: Lightning-Fast Talking Head Animations with Unique Facial Motion Encoding
Переглядів 8552 місяці тому
AniTalker: Lightning-Fast Talking Head Animations with Unique Facial Motion Encoding
Ultimate Vocal Remover: Effortless Vocal Extraction with Deep Neural Networks
Переглядів 1952 місяці тому
Ultimate Vocal Remover: Effortless Vocal Extraction with Deep Neural Networks
Make Backgrounds Disappear: Quick and Easy Transparent Background Tool | Powered by InSPyReNet
Переглядів 2223 місяці тому
Make Backgrounds Disappear: Quick and Easy Transparent Background Tool | Powered by InSPyReNet
AICoverGen: Create Song Covers with RVC v2 AI Voices!
Переглядів 4543 місяці тому
AICoverGen: Create Song Covers with RVC v2 AI Voices!
EchoMimic Magic: Audio and Landmarks Bring Portraits to Life! The BEST talking head generation app.
Переглядів 2,2 тис.3 місяці тому
EchoMimic Magic: Audio and Landmarks Bring Portraits to Life! The BEST talking head generation app.
How to Create Perfect Lipsync Videos with LipSick
Переглядів 4533 місяці тому
How to Create Perfect Lipsync Videos with LipSick
FSRT: AI-Powered Next-Gen Face Reenactment Technology
Переглядів 4393 місяці тому
FSRT: AI-Powered Next-Gen Face Reenactment Technology
LivePortrait: Create Hilarious Portrait Animations Effortlessly!
Переглядів 3,5 тис.3 місяці тому
LivePortrait: Create Hilarious Portrait Animations Effortlessly!
MimicMotion: Revolutionizing Human Motion Videos
Переглядів 2,6 тис.3 місяці тому
MimicMotion: Revolutionizing Human Motion Videos
Hallo: Breakthrough in Audio-Driven Portrait Animation
Переглядів 1,6 тис.3 місяці тому
Hallo: Breakthrough in Audio-Driven Portrait Animation
FaceSwapLab for Stable Diffusion: Seamless Face-Swapping in Automatic1111
Переглядів 2,3 тис.3 місяці тому
FaceSwapLab for Stable Diffusion: Seamless Face-Swapping in Automatic1111
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion
Переглядів 1,1 тис.3 місяці тому
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion
Face-Adapter: The Ultimate Tool for Perfect Face Reenactment & Swapping
Переглядів 4743 місяці тому
Face-Adapter: The Ultimate Tool for Perfect Face Reenactment & Swapping
Transforming Images into Lifelike Conversations: V-Express installation & demo on windows 11
Переглядів 7274 місяці тому
Transforming Images into Lifelike Conversations: V-Express installation & demo on windows 11
Stable Diffusion 3 Medium: The Future of AI Art is Here! Installation and quick demo on Windows 11
Переглядів 1,1 тис.4 місяці тому
Stable Diffusion 3 Medium: The Future of AI Art is Here! Installation and quick demo on Windows 11
Boost Image Diversity: Discover CADS for Automatic1111 WebUI
Переглядів 1724 місяці тому
Boost Image Diversity: Discover CADS for Automatic1111 WebUI
Followed every step on your previous installation video but getting ValueError: DownEncoderBlock2D does not exist with your current setup? Do you know what might be the issue? Thanks 🙏
Please post complete command prompt log here github.com/BadToBest/EchoMimic/issues Also check package version against github.com/BadToBest/EchoMimic/issues/185
白白浪费1个小时
Don't worry it keep happening with everyone as installation is complex for AI tools
错误:找不到conformer==0.3.2的匹配分布
Please post the issue here github.com/FunAudioLLM/CosyVoice/issues
I have followed the instructions but running the webui shows below errors Traceback (most recent call last): File "webgui.py", line 15, in <module> from diffusers import AutoencoderKL, DDIMScheduler File "D:\pinokio\bin\miniconda\envs\echomimic\lib\site-packages\diffusers\__init__.py", line 5, in <module> from .utils import ( File "D:\pinokio\bin\miniconda\envs\echomimic\lib\site-packages\diffusers\utils\__init__.py", line 38, in <module> from .dynamic_modules_utils import get_class_from_dynamic_module File "D:\pinokio\bin\miniconda\envs\echomimic\lib\site-packages\diffusers\utils\dynamic_modules_utils.py", line 28, in <module> from huggingface_hub import HfFolder, cached_download, hf_hub_download, model_info ImportError: cannot import name 'cached_download' from 'huggingface_hub' (D:\pinokio\bin\miniconda\envs\echomimic\lib\site-packages\huggingface_hub\__init__.py)
please, create a New - installation video, can't make it work
I checked few days back, it was working. Please follow the video and do not skip any step. ua-cam.com/video/WtHdvSSQlWo/v-deo.html
I checked EchoMimic, the installation is working. Please check video description for link.
Thank you, this is amazing.
new project MimicTalk
new project MimicTalk
I checked. It's for Linux only.
Wow! This looks better than sad talker and Halo. Thank you for this update
Agreed. Very good quality. I wish it supported paste-back for full body like SadTalker or LivePortrait.
This is what I was waiting for
ERROR: Could not find a version that satisfies the requirement torch==2.1.2+cu121 (from versions: none) ERROR: No matching distribution found for torch==2.1.2+cu121
Please check if you have python 3.10 installed.
Love this can this work with a cpu?
Please check here, it may work. You can check with developer by posting in Issues section github.com/hacksider/Deep-Live-Cam
yes it can
I hate not having a good computer, what type of computer do you need? mine doesn't have a beefy GPU , is 10 years old ;'( and lastly, can this run in any Linux distro?
What is the budget? NVIDIA GPU with 16 GB VRAM is good starting. You can also try on Google Collab
Unfortunately, the project is now dead. The models deleted by stability AI broke the tool completely. Any suggestions or alternatives?
Diffsynth studio
Не могли бы вы добавить модель с русским языком? Что установлено не понимает русский язык
You will have to fine tune for which you will need Russian dataset. Please check the github repo for instructions.
What happened to the DreamTalk video? Is there a reason why you are deleting some of your videos?
Which DreamTalk video. I did deleted some old videos as there were updates and I recorded new. I don't remember doing any video for DreamTalk.
Perfect. It worked in 1 go without any errors. 👍
Thank you for posting solution for cached_download. I wasn't able to make it run because of this error.
How to use lower body config in the code?
I don't think it's possible. The model is trained only on upper body AFAIK.
Thank you 👍
Amazing results. Where all do you get to know all these AI tool releases? Thank you for creating these amazing tutorials.
is there a ComfyUI workflow ??
Sorry I don't use ComfyUI.
Hi, I followed all your steps correctly, but I'm running into an issue at the last stage. When I try to run the Gradio file, I get the following error : "Traceback (most recent call last): File "e:\virtual trial room\OOTDiffusion un\gradio_ootd.py", line 14, in <module> from preprocess.openpose.run_openpose import OpenPose ModuleNotFoundError: No module named 'preprocess.openpose'; 'preprocess' is not a package" same for the following: from preprocess.openpose.run_openpose import OpenPose from preprocess.humanparsing.run_parsing import Parsing from ootd.inference_ootd_hd import OOTDiffusionHD from ootd.inference_ootd_dc import OOTDiffusionDC Any idea how to resolve this? Thanks in advance!
I never faced this error and I checked github too no one reported this error. Can you try checking if all models downloaded fine?
Great tutorial, thank you. I had one question, how did you create the girl's lip-synced video at the start
Tutorial coming soon.
The voice does not match the face
Yeah it won't. Just to demonstrate the audio2LipSync capabilities.
Thanks for your support! We will add this tutorial to our official GitHub repository README.
Thank you. I am working on few updates and will create pull requests.
I have posted 3 updates to repo, please review and merge is all OK github.com/viiika/Meissonic/pull/13
Here here we have the full working version. Thank you for sharing.
You're welcome
This is really good app. I tried it after following your tutorial. There are so many updates happening, you may need to create the video again.
Let me check
Thank you
You're welcome
Appreciate all these easy to follow tutorials.
Glad you like them!
Thanks for sharing! Amazing video! The authors updated their repo to fix some issues. Would you mind updating the demos in the video? Thank you very much!
Will try to update. For the time being use command line inference.
@@StableAIHub Thanks for your support! We will add this tutorial to our official GitHub repository README.
Thanks for your support! We will add this tutorial to our official GitHub repository README.
Very good tutorial. Thank you
Glad you liked it.
why is it showing intel gpu?
Multiple GPU's are supported by this tool.
is echomimic software can generates lipssync on video? thanks
It does but I could never get it to work on Windows.
amazing
Thank you! Cheers!
From which tool you generate talking head at starting of your video?
EchoMimic ua-cam.com/video/WtHdvSSQlWo/v-deo.html
@@StableAIHub okay thanks can you please tell me how much time it takes to generate 10 second video on Tesla T4 gpu
Don't know about Tesla T4 but on 8 GB VRAM 11 sec took 48 minutes
@@StableAIHub and how much time it took to edit the whole thing and upload. Is it less than 48minutes?
@@aadityaaryan3063 It take several hours. I will create video for the complete process.
tutorial to train our own language ? tribal non english language
Unfortunately it it very complex to train. You need huge amount to data and several thousand dollars to train. Here is the excerpt >----------------------------------------------------------------- It was trained on an extensive dataset of 95,000 hours, utilizing 8 A100 GPUs over the course of more than a week. >----------------------------------------------------------------- Which means atleast 3000 dollars spent on training
Thank you!
Thank you! Finally something relevant. However, I tried through tensorRT and it didn't load the video card much, but more the processor. in Flowframes implementation through VapourSynth even worked faster it seems. Still, the main thing is that there are actual RIFE models here. 👍👍
Can I install it in i3 11th gen laptop
Which Graphics card do you have and VRAM? Your laptop seems too old, I highly doubt it will work.
Be careful bro, you leaked your ip at 6:42
Thanks for letting me know. It is remote server IP or am I missing something?
It is your public ip bro Be careful
@@CringeGPT-gnt I just checked again, you talking about this IP, right whatismyipaddress.com/ip/52.89.167.149
Works fine. Good thing is it is updated frequently.
Enjoy!!
Awesome, thanks.
u r welcome
I come from where we just download installer and install. These tools never made sense to me. Appreciate these easy to follow guides. Very helpful. Thank you.
I know. I have seen that struggle myself.
Amazing, thank you. One question, I have this error "Cannot load audio from file: `ffprobe` not found. Please install `ffmpeg` in your system to use non-WAV audio file formats and make sure `ffprobe` is in your PATH." how can I install ffmpeg and ffprobe??
Follow this to install FFMpeg ua-cam.com/video/SIwfW3MAp6w/v-deo.html
@@StableAIHub Thank you man, you're the best, I have to restart the gradio and works really well
@@Huguillon Glad to know it worked.
It seems there are some new updates in repo.
He pushed the updates in repo. Also there is WebUI for FP8
Works fine. Thanks for sharing. 👍
Thanks for sharing another great video tutorial! Is there a list of what languages it supports? or it doesn't matter and ANY language should work?
Only English and Chinese languages supported. They are working on other languages. If you want support for any other language, create a topi in Discussion requesting for support. github.com/SWivid/F5-TTS/discussions
English and Chinese
-r requirements_gradio.txt seems to be missing from the link now. But I persevered and installed things separately. I got stuck 'no module named 'datasets'. EDIT i just grinded through pip install [insert missing module] Probably brute forced my way through in absence of the requirements_gradio.txt missing. It's installing the model.safetensors stuff now
hey thanks for the vid, however when i try to downgrade numpy i get the error AttributeError: module 'pkgutil' has no attribute 'ImpImporter'. Did you mean: 'zipimporter'? error: subprocess-exited-with-error any clue why?
Try without downgrading numpy. Also refer this thread. github.com/SWivid/F5-TTS/issues/59