Tweet

@LangChainAI

@brunotorious

@brunotorious

More from @hwchase17

Harrison Chase

@hwchase17

Dec 26, 2022

@LangChainAI

⚠️Evaluating language models can be tricky⚠️

One new idea of how to do this is to have language models evaluate themselves

@LangChainAI has now added some examples for doing so. Mostly focused on question/answering, but expanding soon (hopefully with community help!)

Thread 🧵

If you are using language models for text generation, you've probably realized that traditional metrics like accuracy and f1 score don't work that well

The common way to evaluate text generation tasks is manually (with human experts), which is slow and not scalable

@EthanJPerez

New research, like the below by @EthanJPerez, @karinanguyen_ et all at @AnthropicAI, propose ways to generate evaluations with LMs themselves

The capabilities in @LangChainAI are not nearly as detailed as those examples, but are a step in that direction

https://twitter.com/AnthropicAI/status/1604883576218341376?s=20&t=iUyEKzCXVTHyepvVCYacHA

Read 12 tweets

Harrison Chase

@hwchase17

Dec 8, 2022

✨Data Augmented Generation✨

langchain.readthedocs.io/en/latest/expl…

🌻Summarization over a specific text
❓Question Answering over private data
🖨️WebGPT

All these use cases involve chains combining LLMs with external sources of data.

🧵Here's a break down of some methods for how to do so:

Some background:

⭐️LLMs are great at generating text generically, but they do that based on the data they were trained on

It is very common to want to generate text based on other sources of data besides what the model was trained on.

@LangChainAI

Common use cases include:

🌻Summarization
❓Question Answering
ℹ️Question Answering with Sources

As of @LangChainAI version 0.0.32, there are now wrappers for all of these use cases, with multiple different underlying types of chains.

Notebook links below 👇

Read 13 tweets

Harrison Chase

@hwchase17

Dec 6, 2022

@OfirPress

A 🧵on results of a little investigation I did over the past week

❓How does text-davinci-003 do on agent-like tasks

TLDR: Displays superior understanding and ability to take multi-step actions towards original goal

h/t to @OfirPress and @momusbah for help/feedback with this

@blennon_

✨text-davinci-003 came out a week ago

Lots of people (like @blennon_ below) provided some great analysis of how it compared to `text-davinci-002`

This is my contribution to that, focused on agent-like tasks

https://twitter.com/blennon_/status/1597374826305318912?s=20&t=6jiirRJBE5ecsevnXvLGQA

@OfirPress

What are agent-like tasks?

I focus mainly on exploring situations where the LLM is used as agent that has access to some tools, and needs to answer a question using the tools it has available. For example:

@OfirPress Self Ask with Seach
@ShunyuYao12 ReAct with Wikipedia

Read 21 tweets

Harrison Chase

@hwchase17

Dec 5, 2022

@subby_tech

Some really cool stuff added by @subby_tech (👏👏👏) to @LangChainAI over the weekend:

⚕️APIChain

A general framework for interacting with an API in natural language

🧵See below for a more in depth explanation + examples

At a high level, the flow is:

1⃣ Format a prompt with API docs + a question
2⃣ Have an LLM generate API query to run to get an answer
3⃣ Run said API query
4⃣ Have LLM interpret API response and answer original question in natural language

Note that the LLM is doing in context learning (via the API docs) to figure out how to call the API

For popular APIs, the LLM may(?) be able to generate the correct API call without that context... but this methodology allows it to work on smaller, newer, or private APIs

Read 10 tweets

Harrison Chase

@hwchase17

Dec 5, 2022

@danielgross

With lots of "unofficial" ChatGPT APIs popping up (most based on @danielgross's code), there's been a lot of asks to hook this into LangChain.

Here's how to do so:

@taranjeetio

1. Step up an unofficial ChatGPT API

Here's one example from @taranjeetio: github.com/taranjeet/chat…

Here's another example from @kylejohnmorris: github.com/kylejmorris/ch…

@LangChainAI

2. Write a CustomLLM @LangChainAI wrapper

Here's an example wrapper for @taranjeetio's API implementation: gist.github.com/hwchase17/af22…

Although since most implementations are based on @danielgross's implementation, this wrapper should work for most

Read 4 tweets

Harrison Chase

@hwchase17

Dec 1, 2022

@LangChainAI

A new chain we introduced in @LangChainAI (with a lot of help from @johnjnay 👏👏):

❓Question Answering with Sources❓

github.com/hwchase17/lang…

This takes in a question and a list of documents, and uses those documents to answer that question, citing its sources

There are several cool things about this chain

1⃣ It's a general chain that can be applied to lots of problems

2⃣ It runs over each document individually, and then combines the answers, meaning it can work on longer documents

3⃣ Citing sources is an extremely important UX!

@johnjnay

One potential use case (from @johnjnay at @CodeXStanford )

LLM understanding of legal reasoning / legal language.

Given that legal documents are long and complicated, they are developing approaches for LLMs to recursively analyze them in sequential chains of LLM interactions.

Read 7 tweets

Share this page!

Harrison Chase

People who liked this thread also liked...

Try unrolling a thread yourself!

More from @hwchase17

Harrison Chase

Harrison Chase

Harrison Chase

Harrison Chase

Harrison Chase

Harrison Chase

Did Thread Reader help you today?

Don't want to be a Premium member but still want to support us?

Send Email!