Sessions at MongoBoston 2011 about Case Studies and Hadoop on Monday 3rd October

  • Text Analysis Using MongoDB

    by Aaron Cordova

    In this talk we present our experiences building a text analysis system on top of MongoDB. Interllective is building research tools to discover... conceptually related documents in technical collections (patents, journal articles, etc.). We use sharded MongoDB to scale out our document collections. We discuss our experiences performing text clustering, along with collection design and indexing strategies for running Hadoop MapReduce over MongoDB. We also discuss building a real-time distributed relevance scoring engine on top of MongoDB. Since our data is too large to fit into memory, we address the tradeoffs to consider when scaling with a disk-based approach.

    At 4:15pm to 5:15pm, Monday 3rd October