PDF is a terrible format, but it is the standard in academia, despite some perfunctory efforts to make like the rest of the world and get with ebook formats. That would be nice; but in the academic world, few academic communiques are kindle-compatible, and generally if you convert PDFs to ebooks the equations and graphs etc turn into 💩, so this is only a solution for people who survive without equations or tables or graphs, which does not resemble my job description.
Now, how will I read all those PDFs and annotate them without losing track and going crazy? Paper is the best reading experience, but it's just too damn heavy. Bonus points if I can sync my annotations to my citation management software. More bonus points if I can also synchronise to a convenient e-reader so I don't have to have my distracting laptop to read every sodding thing. Bonus points if the solution involves not putting all my notes in some obscure opaque commercial database with no guarantee of existing next week.
Zotero can sync annotations and store PDFs with citation metadata conveniently. See citation management for the details. It knows how to capture and store journal article metadata really well. (open source)
Calibre isn't a general sync solution, but it does manage ebooks well, especially ones that are real books and have ISBNs etc. And it does synchronise with various ebook readers and convert to their local dialect of whatever. (open source)
If I only read books or I only read papers and I had time, I could probably hack one of these into being a general purpose document annotation-and-metadata-and-ebook-reader-and-desktop-synchronisation system. As it is, I swap awkwardly between two systems depending on, basically, whether the PDF I am reading is short (Zotero) or long (Calibre).
e.g. iPad, kindle fire, Onyx boox. A device for reading things.
Good reading apps
- koreader is open-source and fancy, for Andoird
- marginnote seems well-considered and sexy, including flashcard integration but the bloody thing only runs on Macs and iOs, so us e-Ink people are out in the cold.
How can I get my incoming articles on it?
I use Zotero + syncthing for my journal papers, and Calibre for my textbooks.
This works, if not seamlessly, then at least smoothly.
How can I get my notes off it and into something useful?
Fucked if I know. kindle has a hack called clipbook
My particular e-reader.
Max2 isn't well documented, but
there is a bulletin board <http://bbs.onyx-international.com/c/MAX2-SERIES/q-a-max2>.
Apparently Amazon code B078WLP9L8 is
Onyx max2 compatible <http://bbs.onyx-international.com/t/wacom-pen-compatibility/118>.
firmware updates cannot be directly linked to, because the Onyx International website is a shit show, but
you can click through <http://www.onyx-international.com/>_.
Paper reading and discovery
In recent years, a highly interesting pattern has emerged: Computer scientists release new research findings on arXiv and just days later, developers release an open-source implementation on GitHub. This pattern is immensely powerful.[…]
GitXiv is a space to share links to open computer science projects. Countless Github and arXiv links are floating around the web. It's hard to keep track of these gems. GitXiv attempts to solve this problem by offering a collaboratively curated feed of projects. Each project is conveniently presented as arXiv + Github + Links + Discussion. Members can submit their findings and let the community rank and discuss it. A regular newsletter makes it easy to stay up-to-date on recent advancements. It's free and open.
In terms of things that I will actually use, this source-code requirement idea is good.
Arxiv Sanity Preserver
Built by @karpathy to accelerate research. Serving last 26179 papers from cs.[CV|CL|LG|AI|NE]/stat.ML
includes twitter-hype sorting, TF-IDF clustering, and other such basic but important baby steps towards web2.0 style information consumption.
Keep track of arXiv papers and the tweet mini-commentaries that your friends are discussing on Twitter.
Because somehow some researchers have time for twitter and the opinions of such multitasking prodigies are probably worthy of note. I, however, will never contribute to such discourses. Anyway, great hack, good luck.
Extract references and metadata from a given PDF
Detect pdf, url, arxiv and doi references
Fast, parallel download of all referenced PDFs
Output as text or JSON (using the
Extract the PDF text (using the
Use as command-line tool or Python package
Works with local and online pdfs
A Chrome extension that enhances arXiv papers. Get direct links to references, BibTeX extraction and comments on all arXiv papers.
Baldur Bjarnason, Neither Paper Nor Digital Does Active Reading Well:
Catching up on usability research throughout the years makes you want to smash your laptop agains the wall in anger. And trying to fill out forms online makes you scream ‘it doesn’t have to be this way!’ at the top of your lungs.[…]
Software developer inattention to research makes sense when you think of it as a pop culture that—in the 33% of the time where its projects don’t flame out—occasionally has a productive side effect.
The same applies to reading software. When you read up on research and papers on skills development, memory formation, and active reading, frustration with existing tools inevitably follows.
At least with paper, we can teach people to hack their tools—extend the printed book with post-its, commonplace books, bookmarks, and inline annotation. Doing the same in digital is incredibly hard without programming skills (see the low success rate above) or expensive tools, even when the closed silos allow it.[…]
The cognitive effort to actively and intelligently read a text in depth is, if not equal to, then on the same order of magnitude as the effort to write about a complex subject.
But we only have full-featured tools to help us with writing. Ulysses, Tinderbox, Scrivener, etc. all make managing and writing a complex writing task much easier. Even code-oriented text editing workflows […] with their steeper learning curves are a major improvement over paper-based writing workflows.
We can also use paper-based writing tactics in tandem with the digital ones, to the point of going back and forth between the two. You can’t do the same easily with reading.
Which leads us to the current situation: our ability to handle complex writing tasks is increasing while our default reading toolset is stagnating at best.
He ends up giving an extensive advertisement for liquidtext, an ipad app with actual UI development.
Select text to annotate. Add tags and post publicly or save privately.
Reply to or share any annotation. Link to notes or whole pages.
Annotate together in groups. Collaborate privately with others.
Search your notes. Explore all public annotations and profiles.
The have documented a recommended workflow.