Thursday 1st March, 2012
4:50pm to 5:30pm
Come learn how the Mendeley team built the largest crowdsourced database of research literature, scaled to handle 120M uploaded documents, and how they’re using technologies such as Hadoop, Apache Mahout and Thrift to generate daily statistics and recommendations on over 7 TB of academic research data. Jan Reichelt, Mendeley co-founder, will talk about the lessons learned in building the service and how this is shaking up the stodgy old field of academic publishing.
In addition to the technical story, Jan will also show how Mendeley’s real-time data on content usage provides never-before-seen insight into how academics collect, read, share, and annotate academic research. Why should you care about academic publishing? It’s a fascinating story… while you’re using Github and Google+ to share information, the best that all the world’s big brains can come up with is swapping PDFs!
Academic publishing is facing many of the same stressors as other kinds of publishing as their content moves online, but since academic publishing has typically derived revenue from institutional purchases as opposed to individual ones and ad sales don’t contribute as much to revenues, the business models have diverged to where academic publishing has had until now very little end-user focus. Academic content is also read more intensively, curated more carefully by end users, and managed with specialized tools, which gives us a unique opportunity to look at content usage at a level of detail not possible in any other industry and distill some insights that are relevant across all of publishing.
Sign in to add slides, notes or videos to this session