@mikko Profile picture
Researcher and a best-selling author. Keynote talks at RSA, Black Hat & DEF CON. TED Speaker. Chief Research Officer at Sensofusion.

Sep 14, 2020, 6 tweets

«We’ve extracted text data from tens of thousands of PDF documents. In the process, we have seen how every single assumption we had about how PDF files are structured was proven incorrect»

The PDF file format specification has 971 pages.

iso.org/standard/63534…

The ISO specification for PDF costs $220. Does anybody have a link to a pirated copy?

"Research spanning 20 years proves PDFs are problematic for online reading. Yet they’re still prevalent and users continue to get lost in them. They’re unpleasant to read and navigate and remain unfit for digital-content display." nngroup.com/articles/pdf-u…

Who knew? You can embed sounds and music into PDFs.

Does anybody have an example of a PDF file with an embedded video?

Share this Scrolly Tale with your friends.

A Scrolly Tale is a new way to read Twitter threads with a more visually immersive experience.
Discover more beautiful Scrolly Tales like this.

Keep scrolling