Yohei
Mar 29, 2023 · 16 tweets · 7 min read
🔥1/8
Introducing "🤖 Task-driven Autonomous Agent"

An agent that leverages @OpenAI's GPT-4, @pinecone vector search, and @LangChainAI framework to autonomously create and perform tasks based on an objective.

"Paper": yoheinakajima.com/task-driven-au…

[More 🔽]
🚀2/8 The system can complete tasks, generate new tasks based on results, and prioritize tasks in real-time. It demonstrates the potential of AI-powered language models to autonomously perform tasks within various constraints and contexts.
💡3/8 The autonomous agent uses GPT-4 for task completion, Pinecone for efficient search and storage of task-related data, and the LangChain framework to enhance decision-making processes. #GPT4 #Pinecone #LangChain
🎯4/8 The system maintains a task list for managing and prioritizing tasks. It autonomously creates new tasks based on completed results and reprioritizes the task list accordingly, showcasing the adaptability of AI-powered language models.
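A minimal sketch of the task list described in 4/8, assuming a plain in-memory queue. `Task`, `TaskList`, and the `rank` callback are hypothetical names chosen for illustration, not taken from the original code. In the real system the ranking comes from GPT-4; here any function that scores a task can stand in for it.

```python
from collections import deque
from dataclasses import dataclass
from typing import Callable, Deque, List


@dataclass
class Task:
    task_id: int
    name: str


class TaskList:
    """Hypothetical in-memory task queue (illustration only)."""

    def __init__(self) -> None:
        self._tasks: Deque[Task] = deque()
        self._next_id = 1

    def add(self, name: str) -> Task:
        # New tasks go to the back until the list is reprioritized.
        task = Task(self._next_id, name)
        self._next_id += 1
        self._tasks.append(task)
        return task

    def pop_next(self) -> Task:
        # The task at the front is the one to execute next.
        return self._tasks.popleft()

    def reprioritize(self, rank: Callable[[Task], float]) -> None:
        # Lower rank means more urgent. `rank` is a stand-in for the
        # model-driven prioritization the thread describes.
        self._tasks = deque(sorted(self._tasks, key=rank))

    def names(self) -> List[str]:
        return [task.name for task in self._tasks]

    def __len__(self) -> int:
        return len(self._tasks)
```

Calling `reprioritize` after every completed task keeps the front of the queue aligned with whatever the ranking function currently considers most urgent.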
🔧5/8 To complete tasks, the system uses GPT-4 and LangChain's capabilities, enriching and storing results in Pinecone. This integrated approach allows the AI agent to interact with its environment and perform tasks efficiently.
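To make the "store results, then search them" idea in 5/8 concrete, here is a rough sketch that uses an in-memory store in place of Pinecone. It does not use Pinecone's API: `ResultStore`, `embed`, and `cosine` are hypothetical, and the bag-of-words "embedding" is a toy substitute for a real embedding model.

```python
import math
from collections import Counter
from typing import Dict, List, Tuple


def embed(text: str) -> Counter:
    """Toy bag-of-words vector (assumption: stands in for a real embedding)."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[word] * b[word] for word in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ResultStore:
    """Hypothetical in-memory stand-in for a vector index."""

    def __init__(self) -> None:
        self._items: Dict[str, Tuple[Counter, str]] = {}

    def upsert(self, key: str, task: str, result: str) -> None:
        # Index the task text together with its result.
        self._items[key] = (embed(task + " " + result), result)

    def query(self, text: str, top_k: int = 3) -> List[str]:
        # Return the stored results most similar to the query text.
        q = embed(text)
        ranked = sorted(
            self._items.values(),
            key=lambda item: cosine(q, item[0]),
            reverse=True,
        )
        return [result for _, result in ranked[:top_k]]
```

Results retrieved this way can be passed as context when the next task is executed, which is one plausible reading of "enriching" in the tweet above.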
🧠6/8 The system generates new tasks based on completed task results and prioritizes them using GPT-4. This allows the system to adapt and respond to new information and priorities.
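The execute, create, prioritize cycle from 2/8 through 6/8 can be sketched as a short loop. This is an illustration under stated assumptions, not the author's code: `run_agent` and the three callbacks are hypothetical, and the `fake_*` functions are stubs standing in for GPT-4 calls. The `max_steps` cap is also an assumption, added so the loop cannot run forever.

```python
from collections import deque
from typing import Callable, Deque, List, Tuple


def run_agent(
    objective: str,
    first_task: str,
    execute: Callable[[str, str], str],
    create_tasks: Callable[[str, str, str], List[str]],
    prioritize: Callable[[str, List[str]], List[str]],
    max_steps: int = 5,
) -> List[Tuple[str, str]]:
    """Run the loop and return (task, result) pairs in execution order."""
    tasks: Deque[str] = deque([first_task])
    history: List[Tuple[str, str]] = []
    for _ in range(max_steps):
        if not tasks:
            break
        task = tasks.popleft()
        result = execute(objective, task)                    # 1. complete the task
        history.append((task, result))
        tasks.extend(create_tasks(objective, task, result))  # 2. derive new tasks
        tasks = deque(prioritize(objective, list(tasks)))    # 3. reorder the list
    return history


# Stubs standing in for model calls (hypothetical behavior).
def fake_execute(objective: str, task: str) -> str:
    return f"done: {task}"


def fake_create(objective: str, task: str, result: str) -> List[str]:
    # Produce one follow-up per original task, none for follow-ups.
    return [] if task.startswith("follow up") else [f"follow up on {task}"]


def fake_prioritize(objective: str, tasks: List[str]) -> List[str]:
    return sorted(tasks)


history = run_agent("demo objective", "draft plan",
                    fake_execute, fake_create, fake_prioritize)
for task, result in history:
    print(task, "->", result)
```

With real model calls in place of the stubs, each result can keep generating further tasks, so a step limit or similar stop condition is worth keeping.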
🔮7/8 Future improvements include integrating a security/safety agent, task sequencing and parallel tasks, generating interim milestones, and incorporating real-time priority updates.
🤝8/8 This new approach paves the way for AI-powered language models to autonomously perform tasks within various constraints and contexts, enabling new applications and opportunities. Big thanks to all involved! #AIResearch #GPT4 #Pinecone #LangChain
📜 APPENDIX

🧵Thread (above) generated by GPT4 based on paper
📄Paper generated by GPT4 based on code
📊Graphs in paper generated by GPT4 based on code
💻Code generated by GPT4 based on prompt

*Each step took many prompts to adjust the initial output.
Backstory 1/5:

Honestly, I was just trying to play around w the idea of an "AI founder" after seeing the awesome #HustleGPT movement.

That led to this prompt 2 days ago.

Backstory 2/5:

About 50 prompts later (dev docs, error codes, etc.), I shared this working prototype.

It's amazing that its first task is to create its next task - and it just keeps going.

Backstory 3/5:

Realized it could be provided any core objective, in this case "make the world a better place".

Pretty fascinating to watch but also scary.

Backstory 4/5:

Interestingly, when I asked it to generate as many paperclips as possible, it first generated security measures.

Which was then picked up by the creator of the Paperclips Apocalypse theory himself.

Led to lots of AI safety reading.
Backstory 5/5:

Sharing the original experiment led to many concerns and potential countermeasures being discussed publicly, including awareness of what people are likely doing privately.

I believe this is a good thing.
Agh, my site has security issues.

So found the second best place to post it.

linkedin.com/pulse/task-dri…
And here you go!

Open-sourced a pared-down version I’m cheekily calling “Baby AGI”.

Repo in thread:
