How to get URL link on X (Twitter) App
Many papers, including deepseek-math and nvidia nemotron, have concluded that math pretraining is critical for general LLM reasoning: 
PDFs have a font map that tells you what actual character is connected to each rendered character, so you can copy/paste. Unfortunately, these maps can lie, so the character you copy is not what you see. If you're unlucky, it's total gibberish.



Get with `pip install --pre -U surya-ocr`. Use with marker: `pip install --pre -U marker-pdf`, then pass the `format_lines` marker option.

Find marker at 


Find it at .


Find it here - github.com/VikParuchuri/t…



Find Surya here - .

I used a modified version of efficientvit from MIT - - which was then adapted by @wightmanr . I made some small modifications, including adding a segmentation head. Thanks for much for the architecture/code!github.com/mit-han-lab/ef…
Surya was trained on a diverse set of documents, including scientific papers. It works with every language that I've tried.
Nougat is an amazing model, but is slow and hallucination-prone (1.5% of pages in arXiv, 5%+ outside) due to autoregressive decoding.