“Unique”, “Descriptive”, and other damned lies: The Challenges of Identifying Related Records

A session at LITA Forum

Saturday 14th November, 2015

11:30am to 12:15pm (CST)

HathiTrust is currently building a metadata registry for the comprehensive corpus of United States federal government documents produced since 1789. This work includes the identification and matching of duplicate or related records as well as the identification of gaps in metadata coverage. This presentation will focus on the challenges that come with determining whether or not two MARC records are for duplicate or related items when the data is of varying quality, there is often no common identifier, and the contributed data may be minimal. Speakers will also address how the detection mechanism has evolved over time.

About the speakers

This person is speaking at this event.
Valerie Glenn

Government Documents Registry Analyst at University of Michigan bio from LinkedIn

This person is speaking at this event.
Bill Dueber

Library Systems programmer, U of Michigan Libraries, and connoisseur of off-brand Dr. Pepper knockoffs. bio from Twitter

Next session in Nicollet D3

1:15pm API Authority Control: Leveraging Programmatic Access to Legacy Metadata by Kate Flynn and Andy Weidner

4 attendees

  • Bill Dueber
  • Rebecca Ganzel
  • Phil Feilmeyer
  • Valerie Glenn

Sign in to add slides, notes or videos to this session

Sign in to track this session

Tell your friends!


Time 11:30am12:15pm CST

Date Sat 14th November 2015


Nicollet D3, Hyatt Regency Minneapolis

Short URL


Official event site


View the schedule


See something wrong?

Report an issue with this session