Charly Wargnier Profile picture
Jul 26, 2021 10 tweets 7 min read Read on X
🥁 New @streamlit app! 🎈

WTFaq leverages the power of @huggingface Transformers & @Google T5 to generate quality question & answer pairs from URLs!

Select your best Q&As on the fly & export them to CSV!

🎲App bit.ly/3zzS4us
📬Post bit.ly/3kUz59O

#SEO

↓🧵 Image
First things first, pick a URL!

The app will crawl the URL's content & retrieve it back in the app.

Server side only for now. I'm planning to add the ability to crawl client-side rendered URLs soon, stay tuned! 🤗

2/10 Image
Once a URL is crawled, you can check its scraped content in the toggleable section (see below👇)

As we're still in beta and want to monitor memory spikes closely, built Q&A pairs are based on the first thousand scraped characters only.

We will increase that limit soon!😉

3/10 Image
Wait a few seconds for the results to be populated.

Results will be displayed in a matrix of 5 to 20 question & answers pairs.

You can select your favourite Q&A pairs on the fly simply by ticking them, as shown on the video below 👇

4/10
Final step!

Your selected Q&A pairs will be displayed in the bottom table.

If you wish to download them, click that download button - Voilà!

5/10 Image
Many cool content & #SEO use cases! 🔥

✓ Use it for content generation purposes
✓ Map-out Q&A pairs with your product, service or brand
✓ Research any topic and get Q&A pairs from that seed topic
✓ Differentiate your pages!

6/10
🧰 The stack is 100% #Python! 🐍🔥

✓ Web framework: @streamlit! 🎈
✓ Scraping tasks: Requests
@Google T5 via @huggingface Pipelines - huggingface.co/transformers/m…
✓ Not to forget @thiago's mighty Component for coloured labels! github.com/tvst/st-annota… 🙌

7/10 Image
🛠️ Still To-Do’s:

✓ Optimise code to increase speed ⚡
✓ Increase RAM capacity to mitigate bumps and allow for more content to be analysed
✓ Provide additional Q&A algorithms

Kudos to @huggingFaces and @Streamit DevOps for their support so far! 🙌

8/10
📂 About open-sourcing the code:

That code currently lies in a private repo. I should be able to make it public soon for you to re-use it in your own apps and creations!

Keep your eyes peeled! 🙌

9/10
WTFaq is still in beta, head-off to my Gitter page for bug reports, Qs or suggestions:
▶️ gitter.im/DataChaz/what-…

This app is free! You can buy me a ☕ to support my work if it's useful to you!
▶️ buymeacoffee.com/cwar05

🎲 Check my other apps! charlywargnier.com/my-public-web-…

10/10 Image

• • •

Missing some Tweet in this thread? You can try to force a refresh
 

Keep Current with Charly Wargnier

Charly Wargnier Profile picture

Stay in touch and get notified when new unrolls are available from this author!

Read all threads

This Thread may be Removed Anytime!

PDF

Twitter may remove this content at anytime! Save it as PDF for later use!

Try unrolling a thread yourself!

how to unroll video
  1. Follow @ThreadReaderApp to mention us!

  2. From a Twitter thread mention us with a keyword "unroll"
@threadreaderapp unroll

Practice here first or read more on our help page!

More from @DataChaz

Dec 18
Google's MCP Toolbox for Databases is now open source 🔥

A single backend for connections, security, and schema-aware SQL tools, built for AI agents.

Compatible with Python, JS, Go, LangChain, and more.

Repo in 🧵 ↓ Image
1/

Features include:

→ Declarative tool definitions (<10 LOC)
→ Secure by default with integrated authentication
→ Auth, pooling & fast queries out of the box
→ Native observability (OpenTelemetry)
→ Postgres, MySQL, Cloud SQL & more supported Image
Read 6 tweets
Dec 18
Google is quietly changing how devs work.

These 5 AI agents automate GitHub (via Gemini CLI), BigQuery, Looker, Database migrations and more, from data ingestion to PR reviews 🔥

Here’s what each agent actually does 🧵 ↓
1/ GitHub Agent (via Gemini CLI)

Bring Gemini to your terminal, and your GitHub workflow!

→ Auto triages issues & reviews PRs with contextual AI
→ Handles tasks via natural language
→ Terminal-first, open source, and script-friendly

Link:
github.com/google-gemini/…Image
2/ BigQuery Data Agent

Build & manage data pipelines with AI, using plain language!

→ Ingests, transforms & validates data from Cloud Storage
→ Generates SQLx, enforces quality, and updates pipelines
→ Scales across use cases via CLI, UI, or API

cloud.google.com/blog/products/…
Read 7 tweets
Dec 17
Your own AI coding platform in one click.

ICYMI @Cloudflare opensourced VibeSDK, letting anyone spin up their own vibe coding environment 🤯

→ LLM code gen + fixes
→ Safe runs in CF Sandboxes
→ Infinite scale via `Workers for Platforms`
→ GitHub export

Demo + links in 🧵↓ Image
1/ Try it for yourself here:
build.cloudflare.devImage
2/ Check Cloudflare's announcement:
blog.cloudflare.com/deploy-your-ow…
Read 4 tweets
Nov 21
Wild.

Postman's AI Agent Builder lets you turn any API (from over 100,000!) into an MCP server in seconds, no code required 🤯

Your custom MCP server, ready to use in Cursor, Windsurf, Claude Desktop, Docker, plus a lot more! 🧵↓ Image
1/

First, start here →

You’ve got literally 100,000+ APIs to check out.

1. mix and match any endpoints you want
2. download your custom zip file
3. that’s it! postman.com/explore/mcp-ge…
Mind = blown.

That zip file has EVERYTHING:
↳ a readme with setup instructions
↳ your selected endpoints
↳ all the files to run your MCP server locally, on Cursor, Windsurf… even Docker!

You also get an .env file with your prefilled variables → just add your API keys! 🔥 Image
Read 9 tweets
Nov 17
MIT and Oxford just released their $2,500 agentic AI curriculum on GitHub at no cost.

15,000 people already paid for it.

Now it's on GitHub!

It covers patterns, orchestration, memory, coordination, and deployment.
A strong roadmap to production ready systems.

Repo in 🧵 ↓ Image
10 chapters:

Part 1. What agents are and how they differ from plain generative AI.
Part 2. The four agent types and when to use each.
Part 3. How tools work and how to build them.
Part 4. RAG vs agentic RAG and key patterns.
Part 5. What MCP is and why it matters.
Part 6. How agents plan with reasoning models.
Part 7. Memory systems and architecture choices.
Part 8. Multi agent coordination and scaling.
Part 9. Real world production case studies.
Part 10. Industry trends and what is coming next.
Here's the repo:
github.com/aishwaryanr/aw…Image
Read 5 tweets
Nov 13
If you’re still sending raw JSON into your LLMs, you’re burning tokens, latency, and budget!

Try TOON (Token-Oriented Object Notation).

Clear like YAML, compact like CSV:

• 30–60% fewer tokens
• Up to 50% lower costs
• Shines for tabular data.

Free and Open source 🧵↓ Image
💡 Benchmark tip.

Check out @curiouslychase’s ace Format Tokenization Playground:


It lets you compare token counts for various formats:
- CSV
- JSON
- YAML
- and TOON

... all with your own sample data 🔥 curiouslychase.com/playground/for…Image
Read 4 tweets

Did Thread Reader help you today?

Support us! We are indie developers!


This site is made by just two indie developers on a laptop doing marketing, support and development! Read more about the story.

Become a Premium Member ($3/month or $30/year) and get exclusive features!

Become Premium

Don't want to be a Premium member but still want to support us?

Make a small donation by buying us coffee ($5) or help with server cost ($10)

Donate via Paypal

Or Donate anonymously using crypto!

Ethereum

0xfe58350B80634f60Fa6Dc149a72b4DFbc17D341E copy

Bitcoin

3ATGMxNzCUFzxpMCHL5sWSt4DVtS8UqXpi copy

Thank you for your support!

Follow Us!

:(