Prompt engineers everywhere are busy testing out OpenAI's newly released text-davinci-003. A few observations (not criticisms or benchmarks) as I play with it, a 🧵
It's somewhat more up-to-date with the world, probably from instruction finetuning.
It still needs CoT prompting to solve problems.
It still can't do addition for large numbers with naive prompts
Not quite good at algebra yet (answer x=-1 and x=2)
Taking a page out of @goodside's book with malicious inputs, it's still exploitable.
One of the challenges I've run into is getting GPT to incorporate feedback when rewriting drafts of stories. It tends to just repeat the original draft with minor edits. text-davinci-003 isn't any better unfortunately.
Can you tell the difference between 002 and 003?
CoT still required for moving chess pieces around.
It writes better poetry that actually rhymes!
003 writes better job descriptions than 002
• • •
Missing some Tweet in this thread? You can try to
force a refresh
OpenAI released their ChatGPT. Damn, it is good. This might be GPT4. Starting a 🧵 with observations...
First, it has a memory, something a lot of folks have been working on.
OpenAI has created a great UX for collecting a lot of great human feedback, a necessary ingredient for continual improvement. They are winning the data flywheel game.
It's responses are fast and high quality. The conversation feels fluid.
I'm excited to share that I'm joining @ai2incubator to build an AI company!
A thread on why.
I fell in love with AI in 2008 when my soon-to-be Ph.D. advisor, Robert Hecht-Nielsen, gave me a copy of his new book on AI. I read it and knew AI would change the world and that building an AI company was my calling.
I tried in 2011 and 2014, but the technology wasn't ready. I decided instead to build a company with less technology risk and co-founded @GroundworkBank which was acquired last year.