Teaching a computer to strafe jump in Quake with reinforcement learning
- Published 1 Jun 2024
- In this video, I use RLLib to train a reinforcement learning agent to strafe jump in the classic video game Quake
Chapters
00:00 Introduction
01:11 What is Reinforcement Learning?
02:42 Quake RL environment
04:41 Results
08:22 Comparison with human WR
09:54 Future direction
Using low difficulty defrag maps might let you focus more on the movement than dealing with enemies and figuring out how to press buttons
I am so pleased to have found your channel through this video; please keep it up! This is my favorite content I've absorbed recently. As a long-time speedrunner of Quake-based games, it's really wonderful to get raw insight, down to the basic physics and code, into why I'm doing the actions I'm doing.
This video is amazing! Your creativity and practicality with the production and visuals is so helpful for viewer comprehension. For example, I had never understood powerbunnying before other than as a strange engine quirk that made Quakeguy go faster, but getting to see the graph displaying the vectors frame by frame totally illuminated why this tactic works and lets players optimize their runs. But of course, the real treat: watching the software complete a quicker 100m demo than crashfort, one of the Quake/GoldSrc/Source engines' most well-respected and skilled movement players, despite some limitations on keypresses and mouse speed. Truly awesome.
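The frame-by-frame vector diagrams described above come down to Quake's air-acceleration rule (`PM_AirAccelerate` in the released Quake source): while airborne, only the component of velocity along the wish direction is compared against a ~30 ups cap, so angling the wish direction away from the current velocity keeps the cap from biting and lets total speed grow past the 320 ups ground limit. A simplified Python sketch of that rule (constants taken from the Quake defaults; the always-perpendicular strafe strategy is illustrative, not the optimal angle):

```python
import math

AIR_CAP = 30.0   # wishspeed cap while airborne (PM_AirAccelerate)
ACCEL = 10.0     # sv_accelerate default
DT = 1.0 / 72.0  # classic Quake physics tick

def air_accelerate(vel, wishdir, wishspeed=320.0):
    """One tick of Quake-style air acceleration for a 2D velocity."""
    wishspd = min(wishspeed, AIR_CAP)
    # Only the projection of velocity onto wishdir is checked against the cap.
    current = vel[0] * wishdir[0] + vel[1] * wishdir[1]
    addspeed = wishspd - current
    if addspeed <= 0:
        return vel  # already "fast enough" along wishdir: no gain
    accelspeed = min(ACCEL * wishspeed * DT, addspeed)
    return (vel[0] + accelspeed * wishdir[0],
            vel[1] + accelspeed * wishdir[1])

# Strafe for one second, always wishing perpendicular to the velocity,
# so the projection stays 0 and the 30 ups cap never applies.
vel = (320.0, 0.0)
for _ in range(72):
    speed = math.hypot(*vel)
    vel = air_accelerate(vel, (-vel[1] / speed, vel[0] / speed))

print(math.hypot(*vel))  # well above the 320 ups ground cap
```

Each tick adds a velocity component the cap cannot see, which is exactly why turning while strafing gains speed.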
genuinely hope your channel grows
Your Quake videos are excellent. Many thanks for the informative content.
Incredible channel, just finding it now. Subbed!
This was very interesting to watch, thank you for sharing it!
Wonderful suggestion about the potential for reinforcement learning to find new techniques! I'm stoked!
Fantastic video Matt! Would love to see how far you can take this and really push the engines limits.
The video is, oddly enough, very well composed and polished. Subscribed, please provide more content XD
Amazing video. Super detailed information!
This is nuts dude, good job, I love it
thank you for your work, great job.
Awesome video. I wish you had shown how fast it could go without the restrictions.
Great job man! I hope to see more
I studied AI at university and played a lot of QW, so I find your work very interesting. I had been writing a bot, but my enthusiasm was exhausted after 4 months of work. I hadn't really thought of making the bot move through AI, and instead just took moves from pro players' demos (I had 16 GB of them). I wanted to use some AI for more common behaviour such as attack, chase, retreat, camp, etc.
We want more of this stuff!
This is super neat. I searched "ai bhopping" and found your video. Really enjoyed it. Do you think this could be repurposed for other games that allow bhopping / strafe jumping?
I LIKE THE PART WHERE HE JUMPS
This is sick, you should really upload more; this stuff is really cool. Definitely a subscribe from me.
Amazing work
a new matt video?
on easter?!
my year is complete
Hello! Impressive work. I was wondering which algorithm you are using? Is it PPO2 with Baselines, and did you tweak the hyperparameters? Also, which policy, and how many timesteps did it take? Which Python version and other versions (gym, TensorFlow, baselines/stable-baselines, etc.)? I would really like to know more! :)
This is awesome!
Any chance you'll make your code publicly available? ^^
Yes please
I've now cleaned up my code and put it here: github.com/matthewearl/q1physrl
@@MattsRamblings Matt, first of all I'd like to say, I hope you are doing great and have a Merry Christmas and Happy New Year! I have a few questions.
1. Do you think it would be possible to create an AI bot with reinforcement learning for *Quake Live* Duel?
2. If so, would it be possible to write the code to limit it to only shoot around a certain % of LG and machine gun? Any gun with tracking aim. Obviously, it wouldn't be too fair if the AI bot could shoot 100% LG or have completely locked-on aim, so it would be interesting to see how it would learn to take fights and get frags if it could only shoot 30% LG, or up to 40 or 50.
Personally, I think the lower the tracking accuracy the AI bot had, the more interesting it would be to watch, because it could teach you combat techniques that could work for ordinary humans, since only the top 0.1% of players were able to shoot a super-high 40 to 50% LG during duels.
If you could create an AI bot like this, I would love to buy or rent it off you so I could plug it into my servers and play against it to sharpen my game in duel, because right now I'm only around 2100 elo and would like to get to around 2500-2600 elo if possible and be at a higher level.
Interesting project! What kind of visualization library did you use for the diagram at 6:10? It doesn't look like standard matplotlib.
nice video!
your voice is very relaxing
this is total insanity, I like it
This is super interesting! I'd love to see more videos like this, being a developer myself.
Do you have a recommendation for learning this lib?
Also, alternative title: "Computer teaches Quake novice to strafe jump" lol
Check out gym.openai.com/docs/ , if you are interested in the theory behind it search for "RL Course by David Silver"
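For what it's worth, the Gym API those docs describe boils down to a `reset()`/`step()` pair returning observation, reward, done flag, and info. A dependency-free sketch of that interface shape (the toy "reach a target speed" task here is invented for illustration, and is not the author's actual q1physrl environment):

```python
class ToySpeedEnv:
    """Minimal Gym-style environment: accelerate a point to a target speed.

    Observation: current speed. Actions: 0 = coast, 1 = accelerate.
    Mirrors gym's reset()/step() contract without importing gym itself.
    """
    TARGET = 10.0

    def reset(self):
        self.speed = 0.0
        self.steps = 0
        return self.speed  # initial observation

    def step(self, action):
        assert action in (0, 1)
        if action == 1:
            self.speed += 1.0
        self.steps += 1
        reward = self.speed  # reward faster movement, as in the video
        done = self.speed >= self.TARGET or self.steps >= 100
        return self.speed, reward, done, {}  # obs, reward, done, info

# The standard agent loop against this interface:
env = ToySpeedEnv()
obs, done, total = env.reset(), False, 0.0
while not done:
    action = 1  # a trivial always-accelerate "policy"
    obs, reward, done, info = env.step(action)
    total += reward
print(obs, total)
```

An RL library like RLlib or Stable Baselines drives exactly this loop, just with a learned policy choosing `action` instead of a constant.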
I would love to see this ML approach applied to a full run of Quake or Half Life!
awesome
Content like this is fascinating because regular developers created a video game and people are learning how to get the best results out of the rules that the developers have written. Humans working within other humans' expressed boundaries.
Is it necessary to use learning and such? SJ consists of a combination of buttons and mouse movement. Program one strafe jump, then program how to chain them together; wouldn't that be easier?
Cool
I wonder how this can discover new tech in AM2R
It's crazy that the human WR is almost the same as the RL's.
Interesting. You've said it would be interesting to see how a hand-crafted agent would compare; I've got a video of the theoretically perfect completion of 100m here: ua-cam.com/video/rejtyHCf120/v-deo.html No demos on that video unfortunately, but I can ask a friend of mine (@Jukspa) to make them using his version of the tools: github.com/Jukspa/TASQuake/
Got someone on Q2DM8 once.. bunnyhopping from bottom stairs, through the gap and into the tunnel under the dropdown with megahealth..
As I'm passing through the gap between boxes, I hear someone spawn to the top right.. kept bouncing into the megahealth tunnel and flicked up, behind right, about.. there..
and back to the bouncing.. guessing he just saw railgun come out my ass as I was going past 🤣🤣🤣🤣🤣🤣🤣🤣🤣
Matt!!! Please pleeeeeeeeeease.. There's this immensely addictive web game I've been playing called "agario". It's simple: just the W key and spacebar + mouse movement. It's a super simple and at the same time super complex game: you start as a little blob and you need to eat pellets spread around the field in order to grow, and eat smaller blobs to keep growing, while running away from bigger blobs, which are always faster than smaller ones. Finally, there are viruses, fixed blobs that explode you if you touch them; you can also shoot pellets (using W) at those viruses in the direction of other players so they explode into smaller pieces that you can eat. Players have invented multiple complex movements over the past many years. Could you implement an AI agent to play this game?? (There are already many "bots" in this game, but they are all dumb.)
Can you post a link to the track map?
speeddemosarchive.com/quake/maps/100m.zip
The link for downloading this map?
speeddemosarchive.com/quake/maps/100m.zip
have you ever looked into csgo hns
May I know why you used reinforcement learning instead of NEAT?
:)
All i can remember is phoon from cs 1.6
He is doing it sideways!
phoon was from CS: Source, not 1.6
@@vabalokis he also played 1.6
A new kind of fuzzing 🤔
I grew up on Quake.
holy crap, the compression YouTube does makes it look like the video had smallpox
Tame your plosives and employ a de-esser; I feel like I am watching an ASMR channel...
pop filter