trust me, i've trained models and deployed production applications that serve >350M requests a day.
just need `pip install` and some good naming conventions
1. jinja - prompting frameworks (sketch below)
2. numpy - vector search
3. sqlite - evals, one row per exp
4. boto3 - data management, s3 and some folder structure
5. google sheets ;) - experiment tracking w/ a link to the artifacts saved in S3/GCS.
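concretely, the glue is tiny. a minimal sketch of items 1 and 3 (the table schema, the `exp-001` id, and the `s3://my-bucket` path are made-up examples):

```python
import sqlite3
from jinja2 import Template

# jinja as the prompting framework: a prompt is just a template string
prompt = Template(
    "You are a classifier. Label the text as one of: {{ labels | join(', ') }}.\n\n{{ text }}"
).render(labels=["spam", "ham"], text="win a free cruise!!!")

# sqlite as the eval store: one row per experiment
conn = sqlite3.connect("evals.db")
conn.execute(
    "CREATE TABLE IF NOT EXISTS evals "
    "(exp_id TEXT, prompt TEXT, prediction TEXT, correct INTEGER, artifact_uri TEXT)"
)
conn.execute(
    "INSERT INTO evals VALUES (?, ?, ?, ?, ?)",
    ("exp-001", prompt, "spam", 1, "s3://my-bucket/exp-001/"),  # artifacts live in s3
)
conn.commit()
```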
Disagree?
I've been training models in @PyTorch and deploying them via @FastAPI since the library came out!
we did large-scale image classification where the folder structure reflected the class labels, with a config.json in each directory.
our early a/b tests exported to google sheets, and we served similar-item recommendations via numpy brute force over 3M SKUs with 40 dimensions per vector (UMAP over ResNet and matrix factorization machines).
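that brute-force search is less code than people expect. a minimal sketch (random vectors standing in for the real UMAP/ResNet embeddings; 3M x 40 float32 is only ~480MB):

```python
import numpy as np

# one 40-dim embedding per SKU; normalize once so dot product = cosine similarity
embeddings = np.random.rand(3_000_000, 40).astype(np.float32)
embeddings /= np.linalg.norm(embeddings, axis=1, keepdims=True)

def similar_items(sku_idx: int, k: int = 10) -> np.ndarray:
    # brute force: one matrix-vector product scores every SKU
    scores = embeddings @ embeddings[sku_idx]
    # argpartition is O(n); only the top k+1 candidates get fully sorted
    top = np.argpartition(-scores, k + 1)[: k + 1]
    top = top[np.argsort(-scores[top])]
    return top[top != sku_idx][:k]  # drop the query SKU itself

print(similar_items(42))
```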
I have nothing to sell you, and sometimes I tweet about applied machine learning with @arnaudai (ex Google Brain), so give us a follow!
0/ Any real AI engineer knows that streaming REALLY improves the UX.
Today, I'm landing a change that defines a reliable way to stream out multiple @pydantic objects from @OpenAI.
Take a look! By the end, you'll know how to do streaming extraction and why it matters.
1/ Streaming is critical when building applications where the UI is generated by the AI.
Notice in the screenshot that the first item was returned in 560ms but the last one in almost 2000ms! That's nearly a 4x difference in time to first content.
How do we do this?
2/ By using `pip install openai_function_call` we can do the following:
1) Use MultiTask to dynamically build a schema from a `cls` we define
2) Set stream=True to unlock the latency win
3) Use `from_streaming_response` to parse the completion into a `Generator[cls]` (sketch below)
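put together, it looks something like this (a sketch assuming the pre-1.0 openai SDK and the API names from the steps above; the `User` model and the messages are made up, so check the repo for the exact signatures):

```python
import openai
from pydantic import BaseModel
from openai_function_call import MultiTask

class User(BaseModel):
    name: str
    age: int

# 1) MultiTask dynamically builds a "return a list of User" schema from our cls
MultiUser = MultiTask(User)

# 2) stream=True is what unlocks the time-to-first-content win
completion = openai.ChatCompletion.create(
    model="gpt-3.5-turbo-0613",
    stream=True,
    functions=[MultiUser.openai_schema],
    function_call={"name": MultiUser.openai_schema["name"]},
    messages=[
        {"role": "user", "content": "Extract all users: Jason is 25 and Sarah is 30."},
    ],
)

# 3) from_streaming_response yields each User as soon as its tokens arrive
for user in MultiUser.from_streaming_response(completion):
    print(user)
```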
If you followed me from the last @LangChainAI webinar, I wanted to share the repo that contains the code examples. Contributions of other ideas, evals, or examples are totally welcome. If you want to help, check the issues!