Curiosity-Driven Learning of Joint Locomotion and Manipulation Tasks
Вставка
- Опубліковано 1 лис 2023
- Learning complex locomotion and manipulation tasks presents significant challenges, often requiring extensive engineering of, e.g., reward functions or curricula to provide meaningful feedback to the Reinforcement Learning (RL) algorithm. This paper proposes an intrinsically motivated RL approach to reduce task-specific engineering. The desired task is encoded in a single sparse reward, i.e., a reward of “+1” is given if the task is achieved. Intrinsic motivation enables learning by guiding exploration toward the sparse reward signal. Specifically, we adapt the idea of Random Network Distillation (RND) to the robotics domain to learn holistic motion control policies involving simultaneous locomotion and manipulation. We investigate opening doors as an exemplary task for robotic applications. A second task involving package manipulation from a table to a bin highlights the generalization capabilities of the presented approach. Finally, the resulting RL policies are executed in real-world experiments on a wheeled-legged robot in biped mode. We experienced no failure in our experiments, which con- sisted of opening push doors (over 15 times in a row) and manipulating packages (over 5 times in a row).
Paper: openreview.net/pdf?id=QG_ERxt...
Authors: Clemens Schwarke, Victor Klemm, Matthijs van der Boon, Marko Bjelonic, and Marco Hutter
Learn more: www.swiss-mile.com/
Disclaimer: Robot from ANYbotics; customized by ETH Zürich; strictly for research purposes. - Наука та технологія
I love the way it throws the boxes around, it obviously ready for package delivery service! :D
That is what comes out when you train on human data. :D
LOL !
So true...just like my regular delivery guy. Add in 'run back to the delivery truck' and the robot would be all set.
He's like a jump-out boy. He doesn't drive, but just hefts packages when you tell 'im. Then, if he goes bad, he jumps out for the wrong reason and grabs the package and runs.
1:44 it passend the hot potato test!
haha
Bedarf geweckt, Bedarf gedeckt! Awesome job! Bigweld would be proud! :)
Danke 🙏
Letsss gooooo
lets roll!
He moves like he's alive
Embodied AI at its best.
You guys made big steps!
Thank you!
NICE
Thanks!
Bursting through that door like Kramer
As if it’s about life and death 🫡
Been following you guys for years. The most underrated robot platform in the world. And they keep showing that crap from Tesla, arg...
Thank you for this honor!
👍💪✌
thanks!
It is pretty dramatic with packages. Love this machine. Fabulous channel.
Glad you enjoy it!
It's super interesting - I am guessing that the task was specified as "put item in box" and as such, it found a very efficient way to do it.
Had the task been "stack the box neatly", I assume it would find a more conservative way to put the box down
@@kzalesak4 You are probably right. I think we'll get exactly what we ask for so it will be a good idea to make sure we ask for the right thing. Specificity will be king for a little while. Then the machines will get smarter.
Wow this is fantastic!
Thank you 🙏
really cool stuff guys would be cool if you touched on curiosity driven learning and the challenges you had with distractions or as I call it ADHD behaviour
Need some rewards for not scaring the living crap of people on the other side of the door.
The industrial version will try not to destroy the door :)
Could this technology be extended to autonomous vehicles as well? I am imagining that we can also have scenarios where we can have an AV and we supply a curiosity state that relates to navigating some given environment, or managing control in a way that minimises the energy requirements from the battery. Is this a correct understanding? I am asking this since I am working on a presentation project related to locomotion modalities and curiosity-based navigation from your lab's paper
Can be done. What is the presentation about?
@@leggedrobotics we're just discussing locomotion modalities and showcasing the curiosity mechanism, we're just explaining them and describing the potential use cases
btw, are you still using isaacgym? or already transferred to IsaacSim?
Excuse me, what robot simulation environment is used here?
Pretty sure it's Nvidia Isaac Gym
Yes :)
I love animal
We love all robots
"Recently the robot learned how to stand up"
So it begins...
What begins?
@@leggedroboticsI think he was reffering to a scene from The Planet of Apes, with standing up being the sign of it evolving
Attach saws for the front wheels and you've got yourself a terminator!!! Sweeettt!!! Gemini will kill us all.
Robots need to be helpers! 🤘
Holy shoot it's a GoBot
GoBots are coming
ZABORGAR!
We need to watch this movie.
The fact that it goes thru doors freaks me the hell out (esp if it had circular saws attached instead of front wheels - please make this upgrade). Robots aren't supposed to go thru doors! Haven't you played Generation Zero?!
Haha will check it out now
look I understand we're purely showcasing a technology here, but visually speaking, could this thing be any more terrifying?
please don't answer that lol
This is human perception:)
Amazon will order 200,000. This bot knows how to handle his packages!!!
Let’s hope they do!
Put socks on the rear wheels and it will turn into Tom Cruise in Risky Business. 🎉😎
I will check out the movie
The guy lets the robot go through then pulls the robot back; go through, pull back. Eventually the robot figures it out and wounds the guy.
I look forward to seeing how good these robots can get with a basketball.
Basketball… challenge accepted
Good, but expectation is nothing inside the boxes will fall out. The moment the boxes collapses and breaks all you have is an empty box or nothing at all. Equip it with a functional arm and hand.
Maybe a kick stand for the wheels to keep it in place.
Unfortunately, I wouldn't buy this if I had a company due to the amount of unpredictability and the way it handles boxes.
This is great as a robot that delivers drinks and food while remaining on four wheels. It could work in an airport to handle lost luggage or delivery of luggages at an additional price. A carrier service or service for the disabled or seniors.
All these points can be fixed over time but we have to start somewhere.
Wheels make very bad hands.
Depending on the task ;) but in general, you are right.
recently the robot was "Programed how to stand up" 100% guarantee it didnt learn how to do it.
What do you mean?
A task where standing up would be beneficial is sufficient
It looks awesome but for some reason it seems evil to my brain.
It’s a very nice robot and does exactly what you tell it. :)
We're doomed😂
Robot are coming … to help us
You are too quiet I cant hear you.
Haha
Now let's check for dog poop on this door....
robots do not poo :D
@@leggedrobotics But they do drive on dog poo....
too spergyyy
😂
The loser at the channel @TwoMinutePapers stole your video and didn't even mention the source.
You mentioned this on Reddit as well, right? :)
@@leggedroboticsNope. I think a lot of people are fed up with that channel copy-pasting their work.