Midwinter 2014: Large-Scale Metadata Aggregation Strategies and Workflows

ALCTS Metadata Interest Group at ALA Midwinter 2014
Date: Sunday, January 26, 2014
Time: 8:30am to 10am
Location: Pennsylvania Convention Center, 102A

Add this meeting to your schedule: http://alamw14.ala.org/node/12762

We have two exciting programs that will discuss strategies and workflows for and challenges associated with large-scale metadata aggregation.

“The Other Side of Linked Data: Managing Metadata Aggregation,” presented by Diane Hillman.

Most of the current activity in the library LOD world has been on publishing library data out of current silos. But part of the point of linked data for libraries is that it opens up data built by others for use within libraries, and has the potential for greater integration of library data within the larger data world. The sticking point for most librarians is that data building and distribution outside the familiar world of MARC seems like a black box, the key held by others.¬†¬†Traditionally, libraries have relied on specialized system vendors to build the functionality they needed to manage their data. But the discussions I’ve heard too often result in librarians wanting vendors to tell them what they’re planning, and vendors asking librarians what they need and want. In the context of this stalemate, it behooves both library system vendors and librarians to explore the issues around management of more fine-grained metadata so that an informed dialogue around requirements can begin. As part of this dialogue, there are a number of questions about goals that could be addressed:

* Will expression in MARC (and/or RDA and/or BibFrame) be part of the requirements?
* How does non-library data fit in (dbpedia, nytimes, amazon, onix)?
* How does schema.org and RDFa fit into the picture?
* Will some data be indexed and not displayed, and vice-versa?
* Who will decide what pieces of available data will be valued and what pieces required?
* Will there need to be an aggregation workflow in addition to a cataloging workflow, or are they best integrated?

To assist in discussion about what happens after those basic decisions, Diane will discuss a framework for managing aggregation of atomic level (fine grained) metadata. Drawing on experience aggregating metadata for the National Science Digital Library, she will describe specific tasks, workflow, data improvement strategies and other issues.

 

“Harvesting and Normalization at the Digital Public Library of America: Lessons from a Diverse Aggregation,” presented by Kristy Berry Dixon (Digital Library of Georgia), Sandra McIntyre (Mountain West Digital Library) and Amy Rudersdorf (Digital Public Library of America).

The Digital Public Library of America currently works with more than 21 digital collections hubs to crosswalk, enrich, and normalize their metadata to align with the DPLA Metadata Application Profile (dp.la/info/map). Metadata is shared in a variety of formats, standards, and readiness and is ingested and made available through the DPLA JSON-LD API (dp.la/info/developers/codex/). In developing the DPLA data model, DPLA staff worked closely with metadata designers from the Europeana Digital Library and from leading U.S. institutions, and has refined the model since launch in April 2013 in response to the experience of working with diverse hubs.

This talk will introduce and outline the challenges of aggregating disparate metadata flavors from the perspective of both DPLA staff and representative hubs. We will review next steps and emerging frontiers as well, including improvements to normalization at the hub level and wider adoption of controlled vocabularies and formats for geospatial metadata and usage rights statements. Finally, we will share plans for implementing Linked Data throughout the aggregated national network and discuss how that will expand opportunities for DPLA and its partners.

We hope to see you in Philadelphia!

On behalf of the Metadata IG,
Ivey Glendon, Program Co-Chair
Santi Thompson, Program Co-Chair

About Jason Kovari

Jason Kovari is Head of Metadata Services and Web Archivist at Cornell University. Prior to this role, Kovari worked as Metadata Librarian for Humanities and Special Collections at Cornell University and Special Collections Digital Initiatives Librarian for the University of Mississippi (updated June 2014).
This entry was posted in ALA Midwinter 2014, Conferences. Bookmark the permalink.

Leave a Reply

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>