Saturday 14th November, 2015
11:30am to 12:15pm
HathiTrust is currently building a metadata registry for the comprehensive corpus of United States federal government documents produced since 1789. This work includes the identification and matching of duplicate or related records as well as the identification of gaps in metadata coverage. This presentation will focus on the challenges that come with determining whether or not two MARC records are for duplicate or related items when the data is of varying quality, there is often no common identifier, and the contributed data may be minimal. Speakers will also address how the detection mechanism has evolved over time.
Government Documents Registry Analyst at University of Michigan bio from LinkedIn
Library Systems programmer, U of Michigan Libraries, and connoisseur of off-brand Dr. Pepper knockoffs. bio from Twitter
Sign in to add slides, notes or videos to this session