With the start of 2022 🎉, it's a good time to share appreciation for everything we've learned and the people that I had the fortune to work with.

It was an extraordinary year for us in #robotics and #machinelearning. And thanks to it, 2023 has some incredible projects lined up!

Hierarchical & non-hierarchical RL together!

We learned how a general dual system approach 🧠 to reinforcement learning can enable us to learn faster while retaining strong final performance. We also have some great results with robot arms and humanoids!

Work led with Giulia Vezzani, Dhruva Tirumala and a great team across @DeepMind!

DQN is all you need?

We learned that simple solutions are often competitive. A minor variation of DQN is able to solve continuous control tasks on par with your favorite SOTA algorithm (SAC, MPO, D4PG, DrQv2, DreamerV2).

Minor variation = bang-bang control and decoupled optimisation per action dimension.
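
To make the "minor variation" concrete, here is a rough sketch, not the authors' implementation. It assumes each action dimension has just two bins (-1 and +1) with its own Q-value pair, and that the joint value is the mean of the per-dimension values. That decomposition and all function names are illustrative assumptions.

```python
import itertools


def greedy_bang_bang_action(per_dim_q):
    """Pick -1.0 or +1.0 independently for every action dimension.

    per_dim_q is a list of (q_low, q_high) pairs, one pair per dimension
    (hypothetical layout chosen for this sketch).
    """
    return [1.0 if q_high > q_low else -1.0 for q_low, q_high in per_dim_q]


def joint_q(per_dim_q, action):
    """Joint value as the mean of the chosen per-dimension values (assumed)."""
    chosen = [q_high if a > 0 else q_low
              for (q_low, q_high), a in zip(per_dim_q, action)]
    return sum(chosen) / len(chosen)


def td_target(reward, discount, next_per_dim_q):
    """One-step target: reward + discount * mean over dims of the best bin."""
    best = [max(q_low, q_high) for q_low, q_high in next_per_dim_q]
    return reward + discount * sum(best) / len(best)


def exhaustive_best_value(per_dim_q):
    """Brute-force maximum over all 2^D bang-bang actions, for comparison."""
    candidates = itertools.product([-1.0, 1.0], repeat=len(per_dim_q))
    return max(joint_q(per_dim_q, list(a)) for a in candidates)


if __name__ == "__main__":
    q = [(0.2, 0.5), (1.0, -1.0), (0.0, 0.3)]
    action = greedy_bang_bang_action(q)
    print(action, joint_q(q, action), exhaustive_best_value(q))
```

Under this assumed decomposition, the per-dimension greedy choice reaches the same value as the brute-force search over all 2^D joint actions, so the maximisation cost grows linearly with the number of action dimensions rather than exponentially.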

Led by @timseyde (PhD student at @MIT_CSAIL) and in collaboration with @igilitschenski (@UofTRobotics) and many others.

Humanoid Soccer!

We learned that RL combined with skill transfer, imitation learning and self-play can scale to competitive and collaborative behaviour in soccer ⚽️ with exceedingly high-dimensional embodiments.

Work with @liusiqi42, Guy Lever and many others @DeepMind!

Imitate and repurpose!

We learned that 2- and 4-legged robots can apply behaviour priors from MOCAP to enable safe, robust and efficient learning. Plus, our robot dog now plays soccer!

Work with Steven Bohez, Saran Tunyasuvunakool and many others @DeepMind!

Hybrid continuous-discrete systems!

We learned that we can solve some really complex robot manipulation tasks by combining the benefits of discrete and continuous behaviour spaces!

Work led by Dushyant Rao!

Data generation for offline RL!

We learned which properties matter most when generating datasets for later offline RL. And how does artificial curiosity fit into the mix?

Work led by @natolambert (now research scientist at @huggingface)

Lifelong robot learning!

We learned how to address distribution shifts in our data for long-term robot deployment.

Work led by @Wenxuan_Zhou (currently PhD student @CMU_Robotics)

World models for hierarchical RL!

We learned what roles world models can play when learning skills from offline data.

Work led by @sasha_salter (now at @MetaAI)

Bimanual robot manipulation!

We learned that combining demonstrations and hindsight relabelling facilitates the solution of complex, bimanual cable insertion tasks.

Work led by Todor Davchev (now research scientist at @DeepMind)

Particularly impressive: the last four projects were led by interns within a very short time frame!

Every year, I'm humbled to see what impact our students can achieve while finding their way through infra, teams, research, and the best snacks in the building.

Shared key insights will follow over the next few days. Now let's get back to spending time with our families! 😉

Correction to the first post: start of 2023 😉

How do you edit posts? Was this not introduced as a new feature?
