Yohei Profile picture
Mar 29, 2023 16 tweets 7 min read Read on X
🔥1/8
Introducing "🤖 Task-driven Autonomous Agent"

An agent that leverages @OpenAI's GPT-4, @pinecone vector search, and @LangChainAI framework to autonomously create and perform tasks based on an objective.

"Paper": yoheinakajima.com/task-driven-au…

[More 🔽] Image
🚀2/8 The system can complete tasks, generate new tasks based on results, and prioritize tasks in real-time. It demonstrates the potential of AI-powered language models to autonomously perform tasks within various constraints and contexts.
💡3/8 The autonomous agent uses GPT-4 for task completion, Pinecone for efficient search and storage of task-related data, and the LangChain framework to enhance decision-making processes. #GPT4 #Pinecone #LangChain
🎯4/8 The system maintains a task list for managing and prioritizing tasks. It autonomously creates new tasks based on completed results and reprioritizes the task list accordingly, showcasing the adaptability of AI-powered language models. Image
🔧5/8 To complete tasks, the system uses GPT-4 and LangChain's capabilities, enriching and storing results in Pinecone. This integrated approach allows the AI agent to interact with its environment and perform tasks efficiently.
🧠6/8 The system generates new tasks based on completed task results and prioritizes them using GPT-4. This allows the system to adapt and respond to new information and priorities.
🔮7/8 Future improvements include integrating a security/safety agent, task sequencing and parallel tasks, generating interim milestones, and incorporating real-time priority updates. Image
🤝8/8 This new approach paves the way for AI-powered language models to autonomously perform tasks within various constraints and contexts, enabling new applications and opportunities. Big thanks to all involved! #AIResearch #GPT4 #Pinecone #LangChain
📜 APPENDIX

🧵Thread (above) generated by GPT4 based on paper
📄Paper generated by GPT4 based on code
📊Graphs in paper generated by GPT4 based on code
💻Code generated by GPT4 based on prompt

*For each, many prompts to adjust initial output Image
Backstory 1/5:

Honestly, I was just trying to play around w the idea of an "AI founder" after seeing the awesome #HustleGPT movement.

That led to this prompt 2 days ago.

Backstory 2/5:

About 50 prompts later (dev docs, error codes, etc.), I shared this working prototype.

It's amazing that its first task is to create its next task - and it just keeps going.

Backstory 3/5:

Realized it could be provided any core objective, in this case "make the world a better place".

Pretty fascinating to watch but also scary.

Backstory 4/5:

Interestingly, when I asked it to generate as many paperclips as possible, it first generated security measures.

Which was then picked up by the creator of the Paperclips Apocalypse theory himself.

Led to lots of AI safety reading.
Backstory 5/5:

Sharing the original experiment led to many shared concerns and potential counter measures being shared publicly. Including awareness of what people are likely doing privately.

I believe this is a good thing.
Agh, my site has security issues.

So found the second best place to post it.

linkedin.com/pulse/task-dri…
And here you go!

Open-sourced a paired down version I’m cheekily calling “Baby AGI”.

Repo in thread:

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Yohei

Yohei Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @yoheinakajima

Jan 23
this is for the minimalist coders 🫡

open-sourcing pippin-lite!

a minimalist template for a dynamic self-building autonomous agent

with only 227 lines of code, it can connect with X, github, airtable, etc. and execute self-written code!

github/more 👇 Image
given the pippin framework is a spiritual evolution of babyagi, felt fitting to call this babyagi-2o extension: pippin-lite. it adds dynamic tools/auth via @composiohq

make sure to check out the pippin framework if you haven’t (QT’d here)
it’s basically a single LLM loop that has access to the 250+ composio tools, with fallback to writing/updating new skills and installing required imports

github:

designed for close ended tasks, it’ll loop through a couple approaches until it’s complete (or gives up)github.com/pippinlovesyou…
Read 10 tweets
Jan 22
here we go…

open sourcing the $pippin framework!

for launching digital beings like @pippinlovesyou

- character config
- reusable and dynamic skills via @composiohq
- activities w/ cool down, etc
- memory mgmt

here’s a ~4 min super cut of the live demo

github & more 👇
github:

quick background: @pippinlovesyou the unicorn is a digital being operating 24/7, taking naps, going on walks, and occasionally communicating w the human world

during a winter break refactor, decided to open source as a framework.github.com/pippinlovesyou…
core loop is activity selection and execution, which kicks off after configuring your character and choosing an LLM (and adding api key)

auth into composio tools for dynamic skills or add your own in the skill folder. then build activities that use these skills! Image
Read 14 tweets
Nov 18, 2024
woah what a week, @pippinlovesyou has 8k+ followers and had his first livestream this weekend.

the meme coin inspired by pippin has 18k holders and the telegram has 4k+.

i collected some key moments from the first week below 👇
this thread from Monday captures the first 36 hours so I’ll drop it here. to summarize:

- posted a pic of a unicorn generated by AI
- upon request, used AI to name it
- a meme coin started
- i jumped in
- decided to turn pippin into an AI influencer
i used Claude to generate some pippin animations and made a site where you can try them out
Read 15 tweets
Oct 18, 2024
open-sourcing "babyagi-2o" 🫠

the simplest self-building general autonomous agent that builds and uses new tools as needed

📃 174 lines of code
🔁 single llm loop starting with three tools:
🛠️ create_or_update_tool (dynamically loaded)
📥 install_dependencies
🛑 task_completed (to exit loop)

github/replit👇Image
here is the github:
github.com/yoheinakajima/…

here is the replit:
replit.com/@YoheiNakajima…
this is a quick exploration into the simplest form of an autonomous agent that could hypothetically do anything.

pretty surprised at the variety of tasks it's been able to do (albeit not always well or reliably).
Read 8 tweets
Oct 17, 2024
ooooh yess i got this working in 244 lines of code

- single llm loop with three tools
- installs required packages
- creates new tools and loads it for itself dynamically
- uses the tools to handle user request

in this case, i asked it to scrape techmeme:

iter 1: install 'requests'
iter 2: install 'beautifulsoup4'
iter 3: create_tool (error'd)
iter 4: create_tool (worked)
iter 5: use new scrape_techmeme tool
iter 6: write summary

task completed!Image
Image
😮 ohhh... it's less code but can do the same thing* ditto can... it just created a create_directory tool and so on...

*almost the same - it can't create a flask app because it accidentally initiates the flask app and kills the loop. but it can write multi-file apps. Image
Image
Image
for clarification, this is different (and simpler) than ditto, which i just shared

ditto just created a python flask app
this one creates it's own tools

Read 8 tweets
Oct 15, 2024
introducing "ditto"
the simplest self-building coding agent

📄 ~500 lines of code
🛠️ can build multi-file apps
🔁 a simple LLM loop with 5 tools

github/replit/more below 👇Image
to jump right into it, here's the @github: github.com/yoheinakajima/…

here's the @replit: replit.com/@YoheiNakajima…

when you run the app, you'll be welcomed with this simple form.

describe the app you want and click "submit" Image
now, sit back and wait as it loops through the LLM calls using the various tools to complete your app.

you'll notice progress updating in the front-end like this Image
Image
Read 5 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(