Yohei Profile picture
Mar 29, 2023 16 tweets 7 min read Read on X
🔥1/8
Introducing "🤖 Task-driven Autonomous Agent"

An agent that leverages @OpenAI's GPT-4, @pinecone vector search, and @LangChainAI framework to autonomously create and perform tasks based on an objective.

"Paper": yoheinakajima.com/task-driven-au…

[More 🔽] Image
🚀2/8 The system can complete tasks, generate new tasks based on results, and prioritize tasks in real-time. It demonstrates the potential of AI-powered language models to autonomously perform tasks within various constraints and contexts.
💡3/8 The autonomous agent uses GPT-4 for task completion, Pinecone for efficient search and storage of task-related data, and the LangChain framework to enhance decision-making processes. #GPT4 #Pinecone #LangChain
🎯4/8 The system maintains a task list for managing and prioritizing tasks. It autonomously creates new tasks based on completed results and reprioritizes the task list accordingly, showcasing the adaptability of AI-powered language models. Image
🔧5/8 To complete tasks, the system uses GPT-4 and LangChain's capabilities, enriching and storing results in Pinecone. This integrated approach allows the AI agent to interact with its environment and perform tasks efficiently.
🧠6/8 The system generates new tasks based on completed task results and prioritizes them using GPT-4. This allows the system to adapt and respond to new information and priorities.
🔮7/8 Future improvements include integrating a security/safety agent, task sequencing and parallel tasks, generating interim milestones, and incorporating real-time priority updates. Image
🤝8/8 This new approach paves the way for AI-powered language models to autonomously perform tasks within various constraints and contexts, enabling new applications and opportunities. Big thanks to all involved! #AIResearch #GPT4 #Pinecone #LangChain
📜 APPENDIX

🧵Thread (above) generated by GPT4 based on paper
📄Paper generated by GPT4 based on code
📊Graphs in paper generated by GPT4 based on code
💻Code generated by GPT4 based on prompt

*For each, many prompts to adjust initial output Image
Backstory 1/5:

Honestly, I was just trying to play around w the idea of an "AI founder" after seeing the awesome #HustleGPT movement.

That led to this prompt 2 days ago.

Backstory 2/5:

About 50 prompts later (dev docs, error codes, etc.), I shared this working prototype.

It's amazing that its first task is to create its next task - and it just keeps going.

Backstory 3/5:

Realized it could be provided any core objective, in this case "make the world a better place".

Pretty fascinating to watch but also scary.

Backstory 4/5:

Interestingly, when I asked it to generate as many paperclips as possible, it first generated security measures.

Which was then picked up by the creator of the Paperclips Apocalypse theory himself.

Led to lots of AI safety reading.
Backstory 5/5:

Sharing the original experiment led to many shared concerns and potential counter measures being shared publicly. Including awareness of what people are likely doing privately.

I believe this is a good thing.
Agh, my site has security issues.

So found the second best place to post it.

linkedin.com/pulse/task-dri…
And here you go!

Open-sourced a paired down version I’m cheekily calling “Baby AGI”.

Repo in thread:

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Yohei

Yohei Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @yoheinakajima

Nov 18
woah what a week, @pippinlovesyou has 8k+ followers and had his first livestream this weekend.

the meme coin inspired by pippin has 18k holders and the telegram has 4k+.

i collected some key moments from the first week below 👇
this thread from Monday captures the first 36 hours so I’ll drop it here. to summarize:

- posted a pic of a unicorn generated by AI
- upon request, used AI to name it
- a meme coin started
- i jumped in
- decided to turn pippin into an AI influencer
i used Claude to generate some pippin animations and made a site where you can try them out
Read 15 tweets
Oct 18
open-sourcing "babyagi-2o" 🫠

the simplest self-building general autonomous agent that builds and uses new tools as needed

📃 174 lines of code
🔁 single llm loop starting with three tools:
🛠️ create_or_update_tool (dynamically loaded)
📥 install_dependencies
🛑 task_completed (to exit loop)

github/replit👇Image
here is the github:
github.com/yoheinakajima/…

here is the replit:
replit.com/@YoheiNakajima…
this is a quick exploration into the simplest form of an autonomous agent that could hypothetically do anything.

pretty surprised at the variety of tasks it's been able to do (albeit not always well or reliably).
Read 8 tweets
Oct 17
ooooh yess i got this working in 244 lines of code

- single llm loop with three tools
- installs required packages
- creates new tools and loads it for itself dynamically
- uses the tools to handle user request

in this case, i asked it to scrape techmeme:

iter 1: install 'requests'
iter 2: install 'beautifulsoup4'
iter 3: create_tool (error'd)
iter 4: create_tool (worked)
iter 5: use new scrape_techmeme tool
iter 6: write summary

task completed!Image
Image
😮 ohhh... it's less code but can do the same thing* ditto can... it just created a create_directory tool and so on...

*almost the same - it can't create a flask app because it accidentally initiates the flask app and kills the loop. but it can write multi-file apps. Image
Image
Image
for clarification, this is different (and simpler) than ditto, which i just shared

ditto just created a python flask app
this one creates it's own tools

Read 8 tweets
Oct 15
introducing "ditto"
the simplest self-building coding agent

📄 ~500 lines of code
🛠️ can build multi-file apps
🔁 a simple LLM loop with 5 tools

github/replit/more below 👇Image
to jump right into it, here's the @github: github.com/yoheinakajima/…

here's the @replit: replit.com/@YoheiNakajima…

when you run the app, you'll be welcomed with this simple form.

describe the app you want and click "submit" Image
now, sit back and wait as it loops through the LLM calls using the various tools to complete your app.

you'll notice progress updating in the front-end like this Image
Image
Read 5 tweets
Sep 30
okay, i did a thing…

here’s babyagi 2 - a weird Python framework for building a self-building autonomous agent

- stores and executes functions from a database
- auto logging (as graph)
- built-in dashboard & chat playground
- prototype self-build functionality

wanna see? 👇Image
friendly reminder: babyagi is a personal side project being shared publicly. i am not a dev, never studied cs, this is not secure, not meant for production, and meant for playing/as inspiration for developers. Image
it is available on the babyagi repo at:

find the old version archived at babyagi_archive:

but more importantly, you can do this now!

pip install babyagigithub.com/yoheinakajima/…
github.com/yoheinakajima/…Image
Read 19 tweets
Sep 26
A helpful autonomous agent has access to all the tools you need - and nothing more. But we all have different needs, changing all the time, so...

what we need is a self-building autonomous agent.

Here, I'll describe the 3 levels of self-building autonomous agents:Image
Before we start, this is Level 0.

A library of functions built by human developers that the AI has access to. The agent is not building its own functions.*

*tools, skills, api calls, are all just functions Image
Level 1: Request-based

Users can request to have functions generated by an AI, and when they query the system, the new function is available as an option for the AI to use.

*There's lots that goes into this "Create Function Tool", but let's keep the chart simple (eg. documentation search)Image
Read 6 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(