Discover and read the best of Twitter Threads about #reinforcementlearning

Most recents (14)

1/ 🧠🌌 Embarking on a journey to explore AI, consciousness, and the cosmos, we'll dive into research and knowledge shaping our understanding of the universe. Are you ready for the fascinating world of AI, consciousness, and cosmic connections? #AI #Consciousness #Cosmology
2/ 🎇🔬 From Einstein's theory of relativity to quantum mechanics discoveries, our understanding of the universe has evolved significantly. These advancements set the stage for exploring AI and consciousness. #Einstein #QuantumMechanics #Physics
3/ 🧬🤖 AI research has progressed since Turing's days. Today, we're making breakthroughs in deep learning, neural networks, and reinforcement learning, pushing the boundaries of AI and consciousness. #DeepLearning #NeuralNetworks #ReinforcementLearning
Read 11 tweets
Revolutionizing the World: 20 AI & Machine Learning Startups You Need to Know. 🧵...
1/20: In this thread i will try and explore the fascinating world of AI and machine learning startups! Discover some of the most innovative companies pushing the boundaries in this space. #AIStartups #MachineLearning
2/20: First up is @OpenAI, the team behind the groundbreaking GPT series.

With GPT-4, they're developing even more advanced natural language processing capabilities to revolutionize human-computer interactions. #NLP #OpenAI
Read 22 tweets
Thanks ICP for publishing this essay on #Ritamic Decision Policy. Into year-3 of this series, this post summarizes manthan, research and study of the Vedic origins of sustainable decision policy and strategy, with contemporary examples all can relate to.
Post focuses on sustainable policy to make not one, but a series of interconnected decisions over time.

Can ideas of Vedanta be applied here?
short ans: Y.

Content is of interest to those making gov or private policy, designers, startups, #Ganita/stem students, young parents.
The post is divided into 7 sections with links to each at the top. Those who simply want a easy-to-remember idea of Ritamic decision policy can go to the examples- read how Air India altered its DEL-SFO route in harmony with Ritam.…
Read 38 tweets
"#Imitation vs #Innovation: Large #Language and Image Models as Cultural #Technologies"

Today's Seminar by SFI External Prof @AlisonGopnik (@UCBerkeley)

Streaming now:

Follow our 🧵 for live coverage.
"Today you hear people talking about 'AN #AI' or 'THE AI.' Even 15 years ago we would not have heard this; we just heard 'AI.'"
@AlisonGopnik on the history of thought on the #intelligence (or lack thereof) of #simulacra, linked to the convincing foolery of "double-talk artists":
"We should think about these large #AI models as cultural technologies: tools that allow one generation of humans to learn from another & do this repeatedly over a long period of time. What are some examples?"

@AlisonGopnik suggests a continuity between #GPT3 & language itself:
Read 14 tweets
The First-Time Machine Learning Playbook.

(Read this if you want to efficiently learn machine learning, avoid frustration from searching for resources, and build a career in ML.)

#Ship30For30 #ML #MachineLearning
What people think you need:

Most people think you need to be a math and stats expert to learn ML. ML can be math intensive, but it’s not a barrier to entry.

What you need:

• The Ability to code
• An Open and Curious Mind
• A Good starting point

Which leads me to …
Where to Start:

If you can code, then start here:

The @fastdotai Course is a fantastic place to start. Instead of going from the basics towards an application, it starts from an application and breaks it down piece by piece.

Did I mention it’s free?
Read 9 tweets
Why has reinforcement learning not been adapted to the design process?

I believe the RL problem and the design process are pretty similar, and the RL community should embrace the design process as a potential application for RL algorithms. (1/N)
In this thread, I will shortly elaborate on why these two are similar in my eyes.

Design is a crucial step in making things, but it is not easy to find a single definition for it. In the context of architectural design, ... (2/N)
... one might define the design process as a series of steps followed by the designer to iteratively find a solution for a given design scenario.

In his book, Notes on the Synthesis of Form, in 1964, Christopher Alexander wrote:

"The ultimate object of design is form". (3/N)
Read 12 tweets
Gefühlte 4 Jahre habe ich an einer naturwissenschaftlichen Fragestellung aus dem Fach Lebensmitteltechnik gebrütet.
Wissenschaftliche Werkzeuge:
- #Hermeneutik & #Introspektion in die love affair von Wasser, Teig und heißer Luft
- trial-and-error-error-smallererror
Weil: ich war im musischen Profil.
Und unser Physiklehrer meinte, man könne uns wenig beibringen wenn wir nur Kasperletheater im Kopf hätten.
Und da ich auch noch lesefaul bin,
blieb also nur:
#HI #ReinforcementLearning
Also ...
Die lebensmitteltechnische Fragestellung ist:

Sind #Zeitreisen für sächsische #Bäckerbrötchen möglich?

Leg#1: von warm & frisch in der Backstube an einen anderen Zeitort in der Raumzeit und
Leg#2: von da zurück in den Moment wo sie geboren wurden.

Und was soll ich sagen:
Read 8 tweets
The first-ever #NeurIPS2021 workshop on the Political Economy of Reinforcement Learning Systems starts in just under 12hrs! Come join me, @sociotiose @FrankPasquale @mireillemoret @salome_viljoen_ @natashajaques @jakusg @FinaleDoshi @mlittmancs @math_rachel @jonathanstray ...
...@ivanadusparic @in4dmatics @NCPtarmigan and others for a discussion of how #ReinforcementLearning re-shapes societal institutions and disrupts power, money, and political forces. ...
... P.S. contact myself or @sociotiose if you are interested in attending!
Read 4 tweets
Daily Bookmarks to GAVNet 06/09/2021…
China’s Hot Summer Is Latest Test of Its Carbon-Neutrality Drive…

#china #ClimateChange #CarbonNeutrality #consequences
Quantum Computing and Reinforcement Learning Are Joining Forces to Make Faster AI…

#QuantumComputing #ReinforcementLearning #ArtificialIntelligence
Read 10 tweets
Here are my experiences learning about RL over the past 3 months! ♥️ Hopefully, this will be the most beginner-friendly guide out there.


@DeepMind @OpenAI Image
I've tried to give you all of the tips and tricks I could think of both for more productive learning in general and stuff specific to the RL field.
The structure of the blog:
1) Intro
2) RL 101 (getting you exposed to the terminology)
3) Cool things about RL (awesome RL apps!)
4) RL is not just roses
5) Getting started with RL
6) Going deeper - reading papers
7) Implementing an RL project from scratch
8) Related subfields
Read 4 tweets
Me hace muchísima ilusión presentaros el curso de verano que organizamos @andortizg y un servidor para @UNIAuniversidad:

Introducción práctica a la Inteligencia Artificial y el Deep Learning.

Procedo a vender la moto :)
Inteligencia Artificial… Redes Neuronales…


Dos campos permean las ciencias en el siglo XXI: la #estadística y la #computación. Combinados, dan lugar al Aprendizaje Automático, el Machine Learning. O como hacer que las máquinas "aprendan"
Como cuento en los monólogos, las máquinas más bien nos quitan "trabajo".

El Machine Learning es herramienta fundamental para automatizar procesos, y hoy en día no se puede entender sin las redes neuronales (la base del Deep Learning #DL).
Read 20 tweets
My friend @orian_sharoni and I are watching David Silver’s UCL course on #ReinforcementLearning
I’ll be posting notes, thoughts and randomness as we watch the lectures.
You can find the slides and video lectures here if you want to dive deeper:
Read 12 tweets
How many random seeds are needed to compare #DeepRL algorithms?

Our new tutorial to address this key issue of #reproducibility in #reinforcementlearning




#machinelearning #neuralnetworks
Algo1 and Algo2 are two famous #DeepRL algorithms, here tested
on the Half-Cheetah #opengym benchmark.

Many papers in the litterature compare using 4-5 random seeds,
like on this graph which suggests that Algo1 is best.

Is this really the case? Image
However, more robust statistical tests show there are no differences.

For a very good reason: Algo1 and Algo2 are both the same @OpenAI baseline
implementation of DDPG, same parameters!

This is what is called a "Type I error" in statistics.
Read 11 tweets
My #bostondynamics #spotmini is almost ready for it's machine learning boot camp , on the left…
spent part of the day baking textures + aggregating meshes + setting up the rigid bodies & colliders in Unity so this thing will be able to walk the walk
Read 39 tweets

Related hashtags

Did Thread Reader help you today?

Support us! We are indie developers!

This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3.00/month or $30.00/year) and get exclusive features!

Become Premium

Too expensive? Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal Become our Patreon

Thank you for your support!