What I Read: Classifying pdfs