
Collaborative Filtering using MapReduce

A session at Strata 2012

Wednesday 29th February, 2012

1:30pm to 2:10pm (PST)

Collaborative filtering is a method of making predictions about a user’s interests based on the preferences of many other users. It’s used to make recommendations on many Internet sites, including LinkedIn. For instance, there’s a “Viewers of this profile also viewed” module on a user’s profile that shows other covisited pages. This “wisdom of the crowd” recommendation platform, built atop Hadoop, exists across many entities on LinkedIn, including jobs, companies, etc., and is a significant driver of engagement.

During this talk, I will build a complete, scalable item-to-item collaborative filtering MapReduce flow in front of the audience. We’ll then get into some performance optimizations, model improvements, and practical considerations: a few simple tweaks can yield an order-of-magnitude performance improvement and a substantial increase in clickthroughs over the naive approach. This simple covisitation method gets us more than 80% of the way to the more sophisticated algorithms we have tried.
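To make the covisitation idea concrete, here is a minimal sketch of a naive item-to-item flow, written as two map/reduce steps simulated in memory with plain Python. It is not the flow built in the talk: the function names (`map_views`, `shuffle`, `reduce_to_pairs`, `covisit_counts`, `top_k`), the `(viewer, item)` log format, and the raw-count scoring are all assumptions made for illustration, and a real job would run on Hadoop rather than in a single process.

```python
from collections import defaultdict
from itertools import combinations


def map_views(view_log):
    """Map step 1: key each (viewer, item) view event by viewer."""
    for viewer, item in view_log:
        yield viewer, item


def shuffle(pairs):
    """Group values by key, standing in for the MapReduce shuffle."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_to_pairs(items):
    """Reduce step 1: emit every ordered pair of distinct items one viewer saw."""
    for a, b in combinations(sorted(set(items)), 2):
        yield (a, b), 1
        yield (b, a), 1


def covisit_counts(view_log):
    """Run both steps: group views by viewer, then count viewers per item pair."""
    by_viewer = shuffle(map_views(view_log))
    pair_events = (
        kv for items in by_viewer.values() for kv in reduce_to_pairs(items)
    )
    by_pair = shuffle(pair_events)
    # Reduce step 2: sum the 1s emitted for each (item, other_item) key.
    return {pair: sum(ones) for pair, ones in by_pair.items()}


def top_k(counts, item, k=3):
    """Rank the items most often covisited with `item` (ties broken by name)."""
    scored = [(other, n) for (src, other), n in counts.items() if src == item]
    scored.sort(key=lambda pair: (-pair[1], pair[0]))
    return scored[:k]


# Hypothetical view log: (viewer, item viewed). The repeat view by u3 is deduplicated.
view_log = [
    ("u1", "A"), ("u1", "B"),
    ("u2", "A"), ("u2", "B"), ("u2", "C"),
    ("u3", "A"), ("u3", "C"), ("u3", "A"),
]
counts = covisit_counts(view_log)
print(top_k(counts, "B"))
```

Emitting every pair per viewer is quadratic in the number of items a viewer touched, which is one reason a naive version like this leaves room for the kind of performance tuning the abstract mentions.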

This is a practical talk that is accessible to all.

About the speaker

Sam Shah

LinkedIn



Strata 2012

Santa Clara, United States

28th February to 1st March 2012


When

Time 1:30pm to 2:10pm PST

Date Wed 29th February 2012

Where

Ballroom CD, Santa Clara Convention Center

Short URL

lanyrd.com/smmww
