Your current filters are…
Apache Lucene is used for large variety of usecases from Mobile Phonesto massive scale applications like Twitter. Within the last decadeLucene has grown to the most widely used OpenSource Search Library andstill growing. This session introduces Apache Lucene and itseco-system from a technical point of view. I will introduce the mainimprovements in the upcoming release, the main building blocks used tobuild flexible & large scale search components and explain how theLucene Community maintains its high-quality, stability andperformance.
Keywods: Java, Lucene, Enterprise Search, Framework, Testing, Information Retrieval
Target Audience: This talk should be of interest to engineersinterested in open-source search technology, frameworks and testing.
by Isabel Drost
Storing ever growing amounts of data is getting cheaper each year. Building great applications requires data driven application design: Starting with clear guidelines on which data to gather, building new features based on usage data up to discovering deficiencies based on implicit user feedback.
Several projects at the ASF make it possible to jump start your data driven applications with existing projects for data analysis, search, storage and data mining. This talk shows how to combine the various tools, helps with understanding what pigs, elephants and mahouts can do for you.
Keywords: data mining, machine learning, data analysis, hadoop, mahout, lucene
Target Audience: should be somewhat knowledgeable in hadoop, can be beginner in data mining/analytics
At Twitter we serve more than 1.5 billion queries per day from Lucene indexes, while appending more than 200 million tweets per day in realtime. Additionally we recently launched image, video and relevance search on the same engine.
This talk will outline Twitter's search architecture and explain the modifications we made to Lucene to support realtime search.
Keywords: lucene, realtime search, twitter
Target Audience: advanced, technical