Get Lanyrd on your mobile (iPhone, Android and more) - check it out here

Text Analysis Using MongoDB

A session at MongoBoston 2011

In this talk we present our experiences building a text analysis system on top of MongoDB. The Interllective, is building research tools to discover... conceptually related documents in technical collections (patents, journal articles, etc.). We use sharded MongoDB to scale out our document collections. We discuss experiences performing text clustering, with collection design and indexing strategies for using Hadoop MapReduce over MongoDB. We also discuss building a real-time distributed relevance scoring engine on top of MongoDB. And since our data is too large to fit into memory, we address tradeoffs to consider when scaling using a disk based approach.

About the speaker

This person is speaking at this event.
Aaron Cordova

Software Engineer, The Interllective, Inc

Sign in to add slides, notes or videos to this session

Tell your friends!

When

Time 4:15pm5:15pm EST

Date Mon 3rd October 2011

Short URL

lanyrd.com/skbmp

Official event site

www.10gen.com/…ston-2011

View the schedule

Topics

See something wrong?

Report an issue with this session