Charly Wargnier
Jul 26, 2021 · 10 tweets
🥁 New @streamlit app! 🎈

WTFaq leverages the power of @huggingface Transformers & @Google T5 to generate quality question & answer pairs from URLs!

Select your best Q&As on the fly & export them to CSV!

🎲App bit.ly/3zzS4us
📬Post bit.ly/3kUz59O

#SEO

↓🧵
First things first, pick a URL!

The app will crawl the URL & pull its content back into the app.

Server-side rendered pages only for now. I'm planning to add the ability to crawl client-side rendered URLs soon, stay tuned! 🤗

2/10
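A rough sketch of what that crawl step could look like, assuming plain server-rendered HTML. The thread only says scraping is done with Requests; the helper names (`html_to_text`, `fetch_text`) and the choice to skip script/style contents are illustrative assumptions, not the app's actual code.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects visible text and skips <script>, <style> and <noscript> contents."""

    SKIPPED = {"script", "style", "noscript"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only text that sits outside the skipped elements.
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def html_to_text(html: str) -> str:
    """Turn an HTML document into whitespace-normalised visible text."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(" ".join(parser.chunks).split())


def fetch_text(url: str, timeout: float = 10.0) -> str:
    """Download a server-rendered page with Requests and return its text."""
    # Third-party dependency, imported here so html_to_text works without it.
    import requests

    response = requests.get(url, timeout=timeout)
    response.raise_for_status()
    return html_to_text(response.text)
```

Because only the raw HTML response is parsed, content injected by JavaScript never shows up, which matches the "server-side only" limitation above.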
Once a URL is crawled, you can check its scraped content in the toggleable section (see below👇)

As we're still in beta and want to monitor memory spikes closely, Q&A pairs are built from the first thousand scraped characters only.

We will increase that limit soon!😉

3/10
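The thousand-character cap can be sketched in a few lines. The limit comes from the tweet above; the function name and the word-boundary trimming are assumptions added for illustration, since the thread does not say how the app cuts the text.

```python
MAX_CHARS = 1000  # "first thousand scraped characters", per the thread


def truncate_for_model(text: str, limit: int = MAX_CHARS) -> str:
    """Return at most `limit` characters of `text`."""
    if len(text) <= limit:
        return text
    clipped = text[:limit]
    if text[limit].isspace():
        # The cut already falls between two words.
        return clipped.rstrip()
    # Assumption: back up to the last space so no word is cut in half.
    head, sep, _ = clipped.rpartition(" ")
    return head if sep else clipped
```

Capping the input this way keeps the model's memory use predictable, at the cost of ignoring everything further down the page.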
Wait a few seconds for the results to be populated.

Results will be displayed in a matrix of 5 to 20 question & answer pairs.

You can select your favourite Q&A pairs on the fly simply by ticking them, as shown in the video below 👇

4/10
Final step!

Your selected Q&A pairs will be displayed in the bottom table.

If you wish to download them, click the download button. Voilà!

5/10
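For the export step, a minimal sketch of turning the ticked pairs into CSV text with the standard library. The function name and the `question`/`answer` column headers are assumptions; the thread does not describe the app's actual CSV layout.

```python
import csv
import io
from typing import Iterable, Tuple


def pairs_to_csv(pairs: Iterable[Tuple[str, str]]) -> str:
    """Serialise (question, answer) tuples to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["question", "answer"])  # assumed column names
    writer.writerows(pairs)
    return buffer.getvalue()
```

In a Streamlit app, the resulting string could be handed to `st.download_button("Download CSV", data=csv_text, file_name="qa_pairs.csv", mime="text/csv")`; whether WTFaq wires up its button this way is not stated in the thread.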
Many cool content & #SEO use cases! 🔥

✓ Use it for content generation purposes
✓ Map out Q&A pairs with your product, service or brand
✓ Research any topic and get Q&A pairs from that seed topic
✓ Differentiate your pages!

6/10
🧰 The stack is 100% #Python! 🐍🔥

✓ Web framework: @streamlit! 🎈
✓ Scraping tasks: Requests
✓ @Google T5 via @huggingface Pipelines - huggingface.co/transformers/m…
✓ Not to forget @thiago's mighty Component for coloured labels! github.com/tvst/st-annota… 🙌

7/10
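To make the T5 part of the stack concrete, here is a hedged sketch of question generation through a Hugging Face pipeline. The thread does not name the checkpoint or prompt format, so the model (`valhalla/t5-base-e2e-qg`), the `generate questions: ` prefix and the `<sep>` separator are assumptions based on how that community checkpoint is commonly used. Check its model card before relying on them. The sketch only produces questions, not the full Q&A pairs the app shows.

```python
from typing import Callable, List

# Assumed conventions of the "valhalla/t5-base-e2e-qg" checkpoint;
# the thread does not confirm which model or prompt WTFaq uses.
PROMPT_PREFIX = "generate questions: "
SEPARATOR = "<sep>"


def split_questions(generated: str, separator: str = SEPARATOR) -> List[str]:
    """Split the model's single output string into individual questions."""
    return [part.strip() for part in generated.split(separator) if part.strip()]


def generate_questions(text: str, generator: Callable[..., list]) -> List[str]:
    """Run a text2text-generation pipeline (or any stand-in) over `text`."""
    # Such pipelines return a list of dicts with a "generated_text" key.
    outputs = generator(PROMPT_PREFIX + text, max_length=256)
    return split_questions(outputs[0]["generated_text"])


# Usage (not run here; needs `pip install transformers` and a model download):
#   from transformers import pipeline
#   qg = pipeline("text2text-generation", model="valhalla/t5-base-e2e-qg")
#   questions = generate_questions(page_text, qg)
```

Passing the pipeline in as an argument keeps the parsing logic easy to try with a stub, without downloading any weights.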
🛠️ Still to do:

✓ Optimise code to increase speed ⚡
✓ Increase RAM capacity to mitigate bumps and allow for more content to be analysed
✓ Provide additional Q&A algorithms

Kudos to @huggingface and @streamlit DevOps for their support so far! 🙌

8/10
📂 About open-sourcing the code:

The code currently sits in a private repo. I should be able to make it public soon so you can re-use it in your own apps and creations!

Keep your eyes peeled! 🙌

9/10
WTFaq is still in beta. Head over to my Gitter page for bug reports, questions or suggestions:
▶️ gitter.im/DataChaz/what-…

This app is free! You can buy me a ☕ to support my work if it's useful to you!
▶️ buymeacoffee.com/cwar05

🎲 Check my other apps! charlywargnier.com/my-public-web-…

10/10

