Shubham Saboo Profile picture
Daily tips and tutorials on LLMs, RAG and AI Agents | Author of books on GPT-3 & Neural Search in Production | DM open for collaboration

May 9, 2024, 6 tweets

Document OCR in 90+ languages using python (100% free and without internet):

For document OCR, we'll use opensource toolkit Surya.

Python toolkit that does:
• OCR in 90+ languages
• Line-level text detection in any language
• Layout analysis (table, image detection)
• Reading order detection
• Outperforms Tesseract, on par with Google Cloud vision

1. Install the Python library

Run the following command from your terminal. You'll need python 3.9+ and PyTorch.

2. Try out the OCR (text detection) boilerplate code

3. Try out the built-in interactive demo with Streamlit.

Install Streamlit and run the following command in your terminal: 'surya_gui'

🌟 Support the opensource project:

8000+ readers receive my AI newsletter everyday and you are not one of them. Join @_unwind_ai to stay on top of the latest AI developments and tools: github.com/VikParuchuri/s…
unwindai.substack.com

Share this Scrolly Tale with your friends.

A Scrolly Tale is a new way to read Twitter threads with a more visually immersive experience.
Discover more beautiful Scrolly Tales like this.

Keep scrolling