Andy Marlow
Jun 19 · 5 tweets
Spent more time cleaning PDF output than building the actual AI workflow.

So I tested @nutrientdocs PDF-to-Markdown CLI.

Here's what happened ↓
If you're building with RAG, AI agents, or knowledge bases, you've probably hit the same problem:

PDFs are full of messy layouts, broken tables, and formatting issues.

Before feeding documents into an LLM, you usually spend time cleaning everything manually.
I tried @nutrientdocs PDF-to-Markdown CLI to see how well it handles the conversion process.

The tool takes a PDF and converts it into structured Markdown that's much easier to work with in:

• RAG pipelines
• LLM ingestion
• Documentation systems
• AI workflows

No complicated setup required.
What stood out was the reduction in cleanup work.

Instead of spending time fixing formatting issues after extraction, the Markdown output was already organized enough to drop into my workflow with minimal editing.

That's especially useful when processing large document collections.
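
To make "drop into my workflow" a bit more concrete, here's a rough sketch of one common next step: splitting converted Markdown into heading-scoped chunks before indexing them for RAG. This is only an illustration under my own assumptions. It doesn't call the CLI, and the `chunk_markdown` name and the sample text are hypothetical, not part of the tool.

```python
import re

# ATX-style Markdown heading: 1-6 '#' characters, whitespace, then the title.
HEADING = re.compile(r"^(#{1,6})\s+(.*\S)\s*$")


def chunk_markdown(text):
    """Split Markdown into one chunk per heading section (hypothetical helper).

    Text before the first heading becomes a chunk with an empty title.
    """
    chunks = []
    title, body = None, []

    def flush():
        # Keep a chunk if it has a heading or any non-blank body text.
        if title is not None or any(line.strip() for line in body):
            chunks.append({"title": title or "", "text": "\n".join(body).strip()})

    for line in text.splitlines():
        match = HEADING.match(line)
        if match:
            flush()
            title, body = match.group(2), []
        else:
            body.append(line)
    flush()
    return chunks


if __name__ == "__main__":
    # Made-up sample standing in for converted output.
    sample = "Intro text\n\n# Title\n\nBody one.\n\n## Sub\n\n| a | b |\n|---|---|\n"
    for chunk in chunk_markdown(sample):
        print(repr(chunk["title"]), len(chunk["text"]))
```

This sketch ignores fenced code blocks, so a line starting with `#` inside one would be treated as a heading. It's fine for a quick test, but worth handling before running it over a large collection.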
If you work with PDFs and AI tools, this is worth testing yourself.

Check out the open-source repo: github.com/PSPDFKit/pdf-t…

Curious to see how it performs on different document types and real-world datasets. 🚀
