Lessons learned with Spark at the US Patent & Trademark Office

A session at Spark Summit 2015

Monday 15th June, 2015

2:00pm to 2:30pm

This case study concerns moving large amounts of patent data from Cassandra to Solr. How we approached the problem, the introduction of Spark as a solution, and how to optimize the Spark job. I will cover:

  • Understanding the parts of a Spark Job. Which components run where and common issues
  • Adding metrics to show where pain points are in your code
  • Comparing various methods in the API to achieve more performant code
  • How we saved time and made a repeatable process with Spark

About the speaker

This person is speaking at this event.
Christopher Bradford

Search & Big Data Architect at OpenSource Connections bio from LinkedIn

Sign in to add slides, notes or videos to this session

Tell your friends!


Time 2:00pm2:30pm PST

Date Mon 15th June 2015

Short URL


Official event site


View the schedule


See something wrong?

Report an issue with this session