Create Your "Small" Action Model with GPT-4o
- Published 12 Jul 2024
👊 Become a member and get access to GitHub and Code:
/ allaboutai
🤖 Great AI Engineer Course:
scrimba.com/learn/aiengineer?...
🔥 Open GitHub Repos:
github.com/AllAboutAI-YT/easy...
📧 Join the newsletter:
www.allabtai.com/newsletter/
🌐 My website:
www.allabtai.com
I try to create my own "small" action model based on Python and the GPT-4o API. Will it work? Let's find out.
00:00 Small Action Model GPT-4o Intro
01:48 GPT-4o Action Model Code
05:54 Testing the Model
- Science & Technology
This is actually really impressive. GPT-4o watches you act, understands what was done, then writes code to reproduce it, which can then be run and automated.
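Roughly, that capture-then-reproduce loop could be sketched like this in Python. The prompt wording, function name, and use of pyautogui are assumptions for illustration, not the video's actual code:

```python
import base64


def build_replay_request(screenshot_paths, model="gpt-4o"):
    """Build a chat-completion payload asking GPT-4o to write a script
    that reproduces the actions shown in a series of screenshots.
    (Prompt text and pyautogui choice are guesses, not the repo's code.)"""
    content = [{
        "type": "text",
        "text": ("These screenshots show a user performing a task, in order. "
                 "Write a Python pyautogui script that reproduces the task."),
    }]
    for path in screenshot_paths:
        with open(path, "rb") as f:
            b64 = base64.b64encode(f.read()).decode()
        # GPT-4o accepts images as base64 data URLs in the message content
        content.append({
            "type": "image_url",
            "image_url": {"url": f"data:image/png;base64,{b64}"},
        })
    return {"model": model, "messages": [{"role": "user", "content": content}]}


# The payload would then be sent with the OpenAI client, e.g.:
# from openai import OpenAI
# resp = OpenAI().chat.completions.create(**build_replay_request(paths))
# The returned script could be saved and executed to replay the action.
```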
Very clever flow, OpenAI should definitely hire you.
Anyone can do this with a powerful language model; it's not much. The Rabbit is just overrated.
Cool experimental project and idea 👍 The whole process could be scripted further: continuously store the most recent screenshots at 2-second intervals (the commenter suggests VRAM via PyTensor), then trigger a call at any time with a keyword through mic input or a keyboard shortcut to send them to GPT-4o, retrieve the "replay last action" script, and execute it automatically to save time on mundane tasks 👍👍
Thanks, this is interesting. I was wondering about this as well and had a thought about adding log data of user interactions to give the model more telemetry. So it's not just vision but also the actual logs of all the interactions happening in the background.
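That telemetry idea amounts to serializing an event log into text the model can read alongside the screenshots. A minimal sketch, with a hypothetical event schema (in practice a library like pynput could produce the raw events):

```python
def format_action_log(events):
    """Turn raw interaction events into plain text for the prompt.

    `events` uses a hypothetical schema: dicts with a "type" key of
    "click" (with x, y, and optional target) or "key" (with keys).
    Unknown event types are skipped.
    """
    lines = []
    for e in events:
        if e["type"] == "click":
            lines.append(f"click at ({e['x']}, {e['y']}) on {e.get('target', '?')}")
        elif e["type"] == "key":
            lines.append(f"typed {e['keys']!r}")
    return "\n".join(lines)
```

The resulting text would be appended to the prompt as a second `"type": "text"` content part, so the model sees both what the screen looked like and exactly where the user clicked.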
Are there any local LLMs this might work with?
Maybe; LLaVA 13B can.
This is awesome!
So good to see you getting on board the Rabbit R1. It's seriously going to change lives. Enjoyed the video, man.
I can think of so many uses for this. Great work.
Useful information. Thank you!👍👍👍
Looks great
Bro, please create a video on real-time vision and response.
Whoa, look who's here! Bro, do you know me, or do you remember me?
Interesting project as always.
So where is the code for this project? Looks fun.
Very interesting. I think it could also be useful to provide it with the mouse positions between different frames.
To go further, we could create multiple actions and then implement a RAG that allows the model to choose the correct snapshot and execute it.
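That snapshot-retrieval idea is a small RAG loop: store each recorded action with a description, then pick the best match for a request. The sketch below uses bag-of-words cosine similarity to stay self-contained; a real setup would likely use embedding vectors (e.g. an embeddings API). All names here are illustrative:

```python
import math
from collections import Counter


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two token-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def pick_snapshot(query, snapshots):
    """Choose the stored action snapshot whose description best matches
    the user's request. `snapshots` is a list of dicts with a
    "description" key plus whatever payload (e.g. the saved script)."""
    q = Counter(query.lower().split())
    return max(
        snapshots,
        key=lambda s: cosine(q, Counter(s["description"].lower().split())),
    )
```

The chosen snapshot's saved script would then be executed, closing the record-retrieve-replay loop.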
Thanks for this video.
I've been thinking Recall and omni screenshots were ways to create large practical datasets to train LAMs. Do you think that's what's happening? You seem to be doing a smaller version of this.
Great start. What's the GH url for subscribers?
Learning how to be a data scientist 80% from you, bro, haha.
Honestly more legit than scammer Jesse Lyu and the Rabbit R1 garbage hardware scam, after his NFT game scam.
How does it know where to click though? Does
Humane and Rabbit watching this and raising another round of funding
Hello sir can u recreate gemini vision fake demo in real life
The GitHub link is always the same repo, btw. It would be easier to make a new repo for each project and put the project link in the description.
I think you can link to Git subfolders. The repo is pretty messy, but keep in mind, this is free. Though I am also not able to find the code for some projects in that repo.
So literally Open Interpreter…