Metadata Blog

Write a post for the Metadata Interest Group Blog!

Posted on May 29, 2020 by Anne Washington

The Metadata Interest Group wants to hear from you! The ALCTS Metadata Interest Group is soliciting submissions to our blog. Ideas for posts include:

Transitions you have made during the COVID-19 crisis
Work that metadata workers can do remotely
Support that you need as a metadata worker
Reflections on ALA cancellation
Reflections on CORE

We would also love for you to share metadata news, projects, or metadata-related job postings.

This invitation is open-ended. Submissions are welcome at any time.

Please submit your post to our blog via https://www.alcts.ala.org/metadatablog/share-your-metadata-news-or-project/

Please e-mail if you have any questions!

Posted in Metadata Blog

CFP – Special Issue of KULA: Knowledge Creation, Dissemination, and Preservation Studies: “The Metadata Issue: Metadata as Knowledge”

Posted on December 9, 2020 by Samantha MacFarlane

Guest edited by Stacy Allison-Cassin (Associate Librarian, Department of Student Learning and Academic Success, Scott Library, York University) and Dean Seeman (Head, Metadata, University of Victoria Libraries), this special issue of KULA: Knowledge Creation, Dissemination, and Preservation Studies explores metadata as knowledge.

Metadata plays a powerful role in our lives. It describes, tracks, and annotates all manner of things including resources, websites, products, communication, and even people. Metadata governs the circulation of information and has the power to name, broadcast, normalize, oppress, and exclude. Although rarely an object of notice or scrutiny by its users, metadata nevertheless acts as knowledge. In its role as comment or surrogate it carries substantial weight, depth, and power.

Metadata can also function separately from the thing it describes and become knowledge in its own right. On a macro scale, aggregations of metadata about resources – catalogues, bibliographies, and datasets – can function as bodies of knowledge. On a micro scale, technologies such as linked open data break metadata down into discrete statements that can be extracted from their original purpose and re-used elsewhere in a variety of unexpected ways in order for new connections to be made and new knowledge created. Open knowledge projects such as Wikidata offer a platform for metadata to be converted to, or function as, knowledge. These technologies and platforms allow for a flow of information between metadata and community-created knowledge in which each enhances the other’s environments with additional knowledge and, less desirably, mirrors their biases.

This special issue of KULA takes up the critical relationship between metadata and knowledge: how metadata acts as knowledge in its descriptive role; metadata as knowledge abstracted and transformed from its original purpose; and the role of open knowledge projects to facilitate this transformation.

We encourage submissions on a wide range of work, projects, and ideas in relation to metadata and knowledge including, but not limited to, the following:

• How metadata functions as knowledge and/or creates meaning
• The use of linked open data to facilitate the interaction between metadata and bodies of knowledge
• Critiques of metadata and what is “knowable”
• The role of metadata and open knowledge in addressing, or not addressing, issues of under- and misrepresentation of traditionally marginalized groups and knowledge
• The role of metadata and open knowledge projects in addressing human rights issues and inequality
• The creation of tools and technologies that allow metadata and open knowledge platform data to interact and flow into one another
• Open knowledge projects that re-purpose metadata created elsewhere
• Cultural heritage organization (libraries, archives, galleries, and museums) and academic projects that contribute to or leverage open knowledge platforms such as Wikidata
• Reports of practical and technical elements of the contribution and reception of open and community-contributed knowledge in cultural heritage organization and academic project metadata

We are seeking contributions in diverse formats: short- to medium-length scholarly articles; project or technical reports; and creative (visual and/or audio) representations of projects or ongoing work. Please submit abstracts of 300-500 words through the journal’s website at https://kula.journals.publicknowledgeproject.org by January 31, 2021. Based on these abstracts, we will then invite authors to submit full pieces for editorial consideration and, if applicable, peer review.

KULA is an open-access journal requiring no author publication charges (APCs). Authors retain full copyright to their works, which will be published under a Creative Commons license: https://kula.journals.publicknowledgeproject.org/index.php/kula/about/submissions.

Posted in User Submitted | Tagged CFPs, Publications

Working from Home

Posted on December 9, 2020 by Shu Wan

My name is Shu Wan. I am an MLIS student currently enrolled in the School of Library and Information Science at the University of Iowa (UI). I also serve as a student cataloger in the UI Main Library’s Department of Cataloging. Like most metadata and cataloging librarians in the United States, I underwent a significant transition from onsite work to remote cataloging. The process was difficult and challenging for me. In this blog post, I will offer a brief reflection on my transition and the lesson I took from it.

Prior to this transition in mid-March at my institution, my primary job duty in the Department of Cataloging was to assist my supervisor Catherine with processing a large number of books authored and published by individuals and institutions in South Asian countries. My workflow consisted of checking the bibliographic information of the physical copies of those books and then creating their cataloging information electronically. However, the outbreak of the COVID-19 pandemic suddenly interrupted the routine and forced me to move to the cloud-based platform Ex Libris Esploro for cataloging.

However, the transition to cataloging collections remotely was not easy for me. Initially, I had difficulty using the platform because I had no experience with the software before mid-March. Thanks to the patience and kindness of my supervisors, Catherine, Brenda, and Wendy, I was encouraged to practice cataloging on the new platform, and eventually I became adept at cataloging remotely and efficiently. Still, the first week of the transition to working online was the hardest time of my life. I was upset about having to adapt to the new system. With no experience in cataloging remotely, I was frustrated and disappointed by my awkwardness in learning how to catalog on Ex Libris Esploro. Thanks to my supervisors’ consistent encouragement, I became skilled in cataloging remotely despite being unprepared in the first few days.

Looking back on the pandemic-forced transition from traditional onsite work to cataloging online, I would encourage my peers to look at its bright side. As an international student of Chinese origin, my home language and culture enable me to view this transition from a different perspective. As John F. Kennedy said in a 1960 speech, “In the Chinese language, the word ‘crisis’ is composed of two characters, one representing danger and the other, opportunity.” In other words, by resolving a crisis appropriately, we can transform it from a danger into an opportunity. This understanding also applies to our transition during the pandemic. In the face of the disastrous consequences of the pandemic and its ramifications across the world, I had to work at home and adjust to a new normal of working offsite and communicating with my supervisors and colleagues virtually. However, this crisis may provide an opportunity to improve our ability to work at home. One day, when we return to the office and work onsite again, “work from home” may become an integral part of the workflow of cataloging and metadata librarians. Hence, in spite of the frustration we may encounter when beginning to work remotely, we should still embrace the change. This may be one of the most significant lessons I have learned during the transition in the past few months.

Posted in User Submitted | Tagged cataloging, Remote work

Safer-at-home metadata work

Posted on July 15, 2020 by Xiying Mi

COVID-19 is like nothing that has ever happened to the world before. Just a few weeks after the outbreak reached the country, we were told to start working from home. No one expected to work from home for so long under such strict rules.

Right before the library closed, the metadata librarians came together for a couple of hours to brainstorm what work we could do at home and what we would need to support that work. There are three librarians in our unit: one metadata librarian, one chief cataloger, and one digital initiative metadata librarian. We almost immediately came up with a list of things that could be brought home, such as batch loading MARC records, cataloging digital collections, creating metadata records for digital collections, etc. And of course we can always spend time on our research agenda items.

Then we started to think about our staff members. We have two staff members who maintain the e-books database to support the Textbook Affordability Project. It is very easy for them to take the work home. On the copy cataloging team, there is one staff member who mainly works on print materials. He needed new tasks to work on in this situation. He is very familiar with MARC records and OCLC Connexion, so we quickly decided he could work on correcting the holdings information in our local ILS. The project will get our catalog records clean and ready to be migrated into the new ILS (yes, we are moving to a new ILS soon). On our preservation team, one member is busy with the same project as well, which will keep her covered for a substantial amount of time. The other member, though, needed new tasks and needed to be trained. One of the digital collections projects required moving data from a spreadsheet to a standardized metadata worksheet. We thought this would be a good task for him since it requires neither advanced computer skills nor advanced metadata/cataloging skills. It is basically a data crosswalking project that the team member can feel comfortable taking over. At this point, we had established a good starting point for all of our team members in the metadata/catalog unit.

As for our collections, 99% of the collection is now e-resources, and only a small amount of print material comes in. In an emergency, the onsite essential staff members will mail those materials to us, and they have done so once.

At that point, we had covered all of our team members with tasks. Then we started to think about what support they would need. The university provides an official communication tool, Microsoft Teams, in addition to the email system. This tool is used to support online meetings, quick chats, and team collaboration. The university also provides cloud-based file storage and sharing through Box. Some of our team members needed to install OCLC Connexion, the library ILS (Aleph), MarcEdit, etc. on their individual machines. With all the software installed, we all felt confident that we could work from home and still be productive. Before we left, we updated our phone tree to make sure that we would stay in touch.

Transitioning to work-from-home mode did take us a little time. We are so used to the office working environment that working remotely felt strange. But we tried to be flexible with ourselves, gave everyone some time to settle in, and adjusted our working styles to fit the situation. After about two weeks, we all started to feel more comfortable with the new working style.

More and more voices are saying that the new normal after COVID-19 will be forever different from what came before. I wonder whether the workflow in metadata/cataloging will be different afterwards, with more and more tasks done remotely. Will it affect how we hire and organize our team in the future? Will it affect the skill sets we look for in a metadata/cataloging team?

Establishing and transitioning to the new normal will be our next big question.

Posted in User Submitted | Tagged Metadata tasks, Remote work

Transitioning roles

Posted on July 1, 2020 by Tomeka Jackson

My current position at Kennesaw State University Library is Catalog and Metadata Assistant in the Technical Services Unit. In this role I perform copy cataloging and metadata work for print materials as well as the physical processing of books. Recently, due to COVID-19, I took on another role: selecting ebooks through the GOBI fund account. I had never worked with ebooks nor selected any. Since our print book orders were put on hold, we had to convert entirely to ebooks and online classes, like other universities. When the Interim Director of Collection Development got in touch with my director about selecting the rest of KSU’s year-end ebook selections, I was excited yet nervous about it.

The Interim Director provided my co-worker and me with several training sessions on creating GOBI accounts and navigating the website. We also had selectors’ meetings on Microsoft Teams with the rest of the Collection Development Department staff. After all the trainings and meetings, I felt a bit more confident about selecting ebooks. We were given the choice of selecting ebooks for any undergraduate college. I chose the nursing program for a personal reason: my mother is a nurse. Learning to select ebooks for the first time is, in a sense, like riding a bike. The training wheels slowly come off as you become more comfortable. We looked at the upcoming course schedule for our selected colleges and used the Choice Reviews online database, as well as the GOBI spotlight features, to find the best-reviewed related titles. Through this experience, I had the opportunity to work closely with the Acquisitions unit in my department as well as the Collection Development Department.

What problems did I run into? Checking for duplicate titles in our library’s catalog. I did not know how to properly check the catalog. Usually, I work on the back end of the catalog but not the front user end. This opportunity allowed me to navigate our catalog efficiently and use previous subject headings as keywords for searching titles. The other issue I encountered was selecting the appropriate fund account name and selector acknowledgment status. After the second time, I began to get the hang of the complexities of selecting ebooks. I was grateful to the other co-worker who works in the Acquisitions unit of my department for updating me on how much money was left to spend with the GOBI fund account.

In conclusion, ebook selecting is a new journey for me, as my primary role centers on working with print materials. However, the journey was exciting, and I learned a lot more about what other units do and their roles with ebooks. We selected all ebooks for the upcoming semester. After the project, a shadow program will be established for anyone who wants to learn how to select ebooks with GOBI. Outside of work, I am excited that fall 2020 will be my last semester in the MLIS program. I hope everyone gets the opportunity to cross-train with other departments; it will give you a greater appreciation for your co-workers and those in different units.

Posted in User Submitted | Tagged ebook selecting, ebooks

Shelf Reading the Catalog

Posted on June 17, 2020 by Graeme Williams

If shelf reading is a good way to locate misshelved books, what about shelf reading the online catalog? It has a couple of benefits: you see what the patrons see, and you can do it from home without any special access.

Here are two ways I look for mistakes in the online catalog: one similar to shelf-reading and one a little different. {I’ll give details, in braces, for some of the steps for BiblioCommons catalogs because that’s what I’m familiar with.}

The first step is just to do a search of the catalog. You can do a search which returns the whole catalog {for BiblioCommons, I use “OnOrder:FALSE”}, or you can look at a slice of the catalog — such as a particular format.

Once you have the search results, sort them by title and then start reading! If you have more than one person doing this “shelf reading”, you can look at the URL of the search results page to see if it includes a page number. If so, you can edit the URL so that different people can start at different places in the search results.
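To make the URL-editing step concrete, here is a minimal Python sketch that hands each reader a different starting page. It assumes the results URL carries the page number in a query parameter called "page" and that you know roughly how many pages there are; the parameter name and the example.org address are made up for illustration, so check what your own catalog's URLs look like.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode


def start_urls(results_url, total_pages, readers, page_param="page"):
    """Return one starting URL per reader, spaced evenly through the results.

    page_param is an assumption: it is whatever query parameter your
    catalog uses for the page number.
    """
    parts = urlsplit(results_url)
    # Keep every query parameter except the existing page number.
    query = [(k, v) for k, v in parse_qsl(parts.query, keep_blank_values=True)
             if k != page_param]
    urls = []
    for i in range(readers):
        first_page = 1 + (i * total_pages) // readers
        new_query = urlencode(query + [(page_param, str(first_page))])
        urls.append(urlunsplit(parts._replace(query=new_query)))
    return urls


# Hypothetical results URL; three readers sharing 300 pages of results.
for url in start_urls(
        "https://catalog.example.org/search?query=OnOrder%3AFALSE&sort=title&page=1",
        total_pages=300, readers=3):
    print(url)
```

Each reader then reads forward from their own URL until they reach the next reader's starting page.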

What you’re looking for — other than errors which just jump out at you — are titles which you hold in different formats. Because you’ve sorted the results by title, these items will appear one after the other, making it easy to check that the author’s name is spelled the same each time (a difference that might indicate duplicate Name Authority Records) and that there are no typos in the title.
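If you can copy or export the sorted results into a list, the same comparison can be roughed out in Python. This is only a sketch: the (title, author, format) rows below are invented sample data, and it will also flag genuinely different works that happen to share a title, so a person still has to look at each hit.

```python
from collections import defaultdict


def author_variants(records):
    """Group (title, author, format) rows by title and return the titles
    whose author strings are not all identical."""
    by_title = defaultdict(set)
    for title, author, _format in records:
        # Compare titles case-insensitively, ignoring stray spaces.
        by_title[title.casefold().strip()].add(author.strip())
    return {title: sorted(authors)
            for title, authors in by_title.items() if len(authors) > 1}


# Invented sample rows, as if read from title-sorted search results.
sample = [
    ("Dune", "Herbert, Frank", "Book"),
    ("Dune", "Herbert, Frank", "eBook"),
    ("Emma", "Austen, Jane", "Book"),
    ("Emma", "Austin, Jane", "Audiobook"),
]
print(author_variants(sample))
```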

The other check you can do is a lot less labor-intensive — you can get the catalog to tell you what is in the catalog.

First, do a search which returns the whole catalog. Then, on the search results page, go to the search facets — sometimes called filters, sometimes on the left-hand side and sometimes on the right. The great thing about these search facets is that the facets will normally be pre-populated with values from the catalog. You may need to open up the facet to see all the values. {In a BiblioCommons catalog, you may also need to click on “See More” to see the whole list of values.}

Look at facets like publication date or language. What you’re looking for is values that don’t make sense. When you find an incorrect value, or just a suspicious one, select it so that the catalog displays the item(s) that match the suspicious value. Just this morning I found a biography of Elon Musk which was published in 1632 — I’m pretty sure that was an error.
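If the facet list is long, a few lines of Python can do a first pass over the values once you have copied them out. The sketch below assumes you have typed or pasted the publication-date facet into a dictionary of value and count; the 1800 cutoff is an assumption that might suit a typical public library and would need changing for a collection with rare books.

```python
from datetime import date


def suspicious_years(facet_counts, earliest=1800, latest=None):
    """Return (value, count) pairs from a publication-date facet that are
    not plausible years for this collection.

    earliest and latest are local assumptions, not catalog rules.
    """
    if latest is None:
        latest = date.today().year + 1  # allow for pre-publication records
    flagged = []
    for value, count in facet_counts.items():
        text = str(value).strip()
        if not text.isdigit() or not (earliest <= int(text) <= latest):
            flagged.append((value, count))
    return flagged


# Invented facet values, as copied from a search results page.
facet = {"2015": 120, "1632": 1, "20015": 2, "n.d.": 3}
print(suspicious_years(facet, latest=2020))
```

Anything the function returns is only a candidate; selecting the value in the catalog, as described above, is still how you find out whether it is really an error.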

There’s one extra thing you might be able to do depending on which OPAC you have. When you do an author search, some OPACs show you a list of matching authors, and then allow you to page backwards and forwards through the entire author list. If your OPAC provides this “feature”, scanning this list is another way to look for authors with duplicate entries (n.b., not counting “See” entries).

If you follow these steps, you will have checked the two most important item attributes (author and title) as well as some of the other attributes that patrons use to select items.

Posted in User Submitted | Tagged Remote work

Work that metadata workers can do remotely

Posted on June 3, 2020 by Bela Gupta

Metadata workers can work remotely in a cloud-based Integrated Library System (ILS). This can be done in MARC, Dublin Core, etc. Remote work entails its own set of challenges because physical materials may not be present. However, bibliographic records can be edited in the catalog even in the absence of physical resources.

As you embark on working remotely, make sure that you have a laptop issued by your institution that can be connected to the VPN so that you can access shared files.

Some of the work that can be done remotely is:

Physical and electronic theses and dissertations. The physical ones can be edited if they already have a bibliographic record in the ILS. If they lack classification numbers and subject headings those can be added. Faceted subject headings (FAST) can be added. This can be done on the basis of the summary or abstract that may already be in the record.
Similarly if there is already a bibliographic record and representation for ETDs (Electronic Theses and Dissertations) then subject headings and classification numbers can be added to the records.
MARC fields can also be added to monograph records that have an initial bibliographic record. For example, if a local note (TAG 590) for a gift note is present, then TAG 797 (Local Added Entry – Corporate Name) can be added. If the series is in a TAG 440, then it can be edited to TAG 490 and TAG 830. Similarly, subject headings can be provided and the classification number checked for its uniqueness in the catalog.
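The 440 to 490/830 edit mentioned above can be sketched in Python. This is an illustration under stated assumptions, not a production conversion: fields are represented as plain dictionaries (a made-up layout, not the API of any MARC library), the subfields are copied unchanged to both new fields, and the 830 is appended at the end of the record. It relies on the MARC 21 conventions that 490 first indicator 1 means the series is traced and that the nonfiling-characters indicator of the 440 corresponds to the second indicator of the 830. Punctuation and authorized forms still need a cataloger's review.

```python
def convert_440(fields):
    """Replace each 440 with a 490 (traced) in place and append a matching
    830 at the end of the record. Fields are plain dicts (hypothetical layout)."""
    converted, added_830 = [], []
    for field in fields:
        if field["tag"] != "440":
            converted.append(field)
            continue
        subfields = list(field["subfields"])
        converted.append({"tag": "490", "ind1": "1", "ind2": " ",
                          "subfields": subfields})
        # Carry the nonfiling-characters indicator over to the 830.
        added_830.append({"tag": "830", "ind1": " ", "ind2": field["ind2"],
                          "subfields": list(subfields)})
    return converted + added_830


# Invented example record.
record = [
    {"tag": "245", "ind1": "1", "ind2": "0", "subfields": [("a", "Example title")]},
    {"tag": "440", "ind1": " ", "ind2": "4",
     "subfields": [("a", "The example series ;"), ("v", "v. 3")]},
    {"tag": "590", "ind1": " ", "ind2": " ", "subfields": [("a", "Gift of a donor.")]},
]
print([f["tag"] for f in convert_440(record)])
```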

MARC fields can be checked in https://www.loc.gov/marc/bibliographic/

AACR2 records do not need conversion to RDA. However, records in AACR2 as well as RDA can be checked for errors in the 100, 240, 245, 260/264, 300, 5XX, etc.
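Some of these checks can be given a quick first pass in Python before a person reviews the records. The sketch below only looks at which tags are present in a record: it flags a missing 245, a repeated 100, 240, or 245 (all non-repeatable in MARC 21), and, as a local-policy assumption rather than a MARC rule, a record with no 260/264 or no 300. It does not look inside the fields at all.

```python
from collections import Counter

# Non-repeatable fields among those named above.
NON_REPEATABLE = ("100", "240", "245")


def basic_problems(tags):
    """Return a list of problems found from the list of tags in one record."""
    counts = Counter(tags)
    problems = []
    if counts["245"] == 0:
        problems.append("missing 245")
    for tag in NON_REPEATABLE:
        if counts[tag] > 1:
            problems.append(f"repeated {tag}")
    # The next two checks reflect an assumed local policy.
    if counts["260"] == 0 and counts["264"] == 0:
        problems.append("missing 260/264")
    if counts["300"] == 0:
        problems.append("missing 300")
    return problems


print(basic_problems(["100", "100", "260", "300"]))
```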

Perform cataloging maintenance tasks:

Delete duplicate items of a continuing resource/serial or monograph
If physical items are attached to an electronic bibliographic record then replace the electronic bibliographic record with a physical one so that it matches the items held by the library.
If there are electronic portfolios attached to the physical bibliographic record then overlay the physical bib. with an electronic one so that the electronic portfolios align with the electronic bib. record.

Cataloging maintenance on a set of records.

Create an analytics report to identify a set of erroneous records that may have physical holdings and items attached to an electronic bibliographic record. Create an Excel spreadsheet and work through the set to overlay the electronic bibliographic record with a physical one to match the items.
Review each of the bibliographic records to check that there are no electronic portfolios. If there are electronic portfolios then add an electronic bibliographic record and delete the physical bib.
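If the analytics report can be exported as CSV, the worklist can be built with a short Python script rather than by eye. This is a sketch: the column names (record_id, bib_type, physical_items, portfolios) are hypothetical and would need to match whatever your report actually exports.

```python
import csv
import io


def mismatched_bibs(report_csv):
    """Return (record_id, reason) for bibs whose type does not match what
    is attached to them. Column names are assumptions about the export."""
    problems = []
    for row in csv.DictReader(io.StringIO(report_csv)):
        items = int(row["physical_items"] or 0)
        portfolios = int(row["portfolios"] or 0)
        if row["bib_type"] == "electronic" and items > 0:
            problems.append((row["record_id"], "physical items on electronic bib"))
        elif row["bib_type"] == "physical" and portfolios > 0:
            problems.append((row["record_id"], "portfolios on physical bib"))
    return problems


# Invented report export.
sample = """record_id,bib_type,physical_items,portfolios
1001,electronic,2,0
1002,physical,1,0
1003,physical,0,3
"""
for record_id, reason in mismatched_bibs(sample):
    print(record_id, reason)
```

The output is only the list to work through; the overlay itself is still done record by record in the ILS.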

Activation of electronic books that the library may purchase or acquire freely.

Activate the electronic portfolios by providing the permalink. Attach the electronic portfolios to their electronic collection. Test access for each activated electronic portfolio.

Duplication checking of electronic portfolios in an electronic collection.

Delete duplicates after checking and verification. There may be thousands of duplicates in one collection and this can be a clean-up project that can be done remotely.
Organize standalone/orphan electronic portfolios by attaching them to their correct electronic collection. This can be done as a set or individually.
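For a collection with thousands of portfolios, candidate duplicates can be listed with a small Python script before anyone checks and verifies them. The sketch assumes each portfolio has been exported with a portfolio ID, a collection name, and an ISBN; those field names are hypothetical, and matching on ISBN alone is a simplification that will miss portfolios without one.

```python
from collections import defaultdict


def duplicate_portfolios(portfolios):
    """Group portfolios by (collection, ISBN) and return the groups that
    contain more than one portfolio ID."""
    groups = defaultdict(list)
    for p in portfolios:
        isbn = p["isbn"].replace("-", "").strip()
        if isbn:  # skip portfolios with no ISBN to match on
            groups[(p["collection"], isbn)].append(p["portfolio_id"])
    return {key: ids for key, ids in groups.items() if len(ids) > 1}


# Invented export rows.
sample = [
    {"portfolio_id": "P1", "collection": "Ebook Package A", "isbn": "978-0-00-000000-2"},
    {"portfolio_id": "P2", "collection": "Ebook Package A", "isbn": "9780000000002"},
    {"portfolio_id": "P3", "collection": "Ebook Package B", "isbn": "9780000000002"},
]
print(duplicate_portfolios(sample))
```

Note that the same title in two different collections (P3 above) is not reported, since the cleanup described here is within one electronic collection.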

Other work that can be done remotely is:

Work on statistics, create reports and review them.
Update or create LibGuides.
Plan cataloging weeding projects, clean-up projects and others via Zoom with colleagues.
Catalog digital objects that have their representations loaded into the ILS.
Attend professional training webinars and read articles and books related to your work.

Posted in User Submitted | Tagged Metadata tasks, Remote work

Metadata Interest Group Meeting

Posted on May 29, 2020 by Rachel Tillay

Tuesday, June 9, 1:00 p.m. – 2:00 p.m. CT | Sign up to Attend

Description:

The ALCTS Metadata Interest Group facilitates active conversation among librarians and information professionals about projects, ideas, and practical use cases related to library metadata.

While the interest group usually plans a program of presentations, given the unprecedented situation facing us all this year, the program will follow a more open, town hall format. The aim is to encourage discussion about any work or ideas related to metadata, with a special emphasis on topics/questions related to the impact of the current COVID-19 crisis on metadata work. This open format is an opportunity to encourage community participation, provide a platform to express concerns and ask questions, and offer solutions to support each other.

Discussion may include:

How does telecommuting affect your workflow on metadata?
Has your library been working on a reopening plan? How does that affect your metadata work?
Have any new projects come out of the need to work from home?
What have you learned from this situation that you will be able to leverage in future work?
Other metadata creation, management, evaluation, and maintenance topics

Presenters:

The IG members will moderate the discussion: Rachel Turner, Mingyan Li (Programming co-chairs), Darnelle Melvin, Anne Washington, Charlie Tillay, and Jacky Hart

Interest Group Chairs:

Darnelle Melvin
Anne Washington

Posted in ALA Annual 2020 | Tagged collaboration, COVID-19, metadata, Metadata tasks, Remote work, tools

ALCTS Metadata Interest Group Midwinter presentation Bringing Everyone to the Table: Collaborative Ontology Development

Posted on April 2, 2020 by Anne Washington

The ALCTS Metadata Interest Group met during the ALA Midwinter Meeting in Philadelphia on Sunday, January 26th, 8:30-10:00 a.m., at the Pennsylvania Convention Center, Room 113-A.

Melanie Wacker, Metadata Coordinator at the Columbia University Libraries, presented on metadata collaboration. This post includes the presentation abstract, slides, and author bio.

Posted in ALA Midwinter 2020 | Tagged ArtFrame, BIBFRAME, collaboration, metadata

ALA Annual 2019 Presentation and Q&A

Posted on July 26, 2019 by Rachel Tillay

Hello ALCTS Metadata Interest Group blog! Rick Fitzgerald and Grace Thomas here – we are librarians at the Library of Congress and recently gave a talk about describing web archives at the ALA Annual ALCTS Metadata Interest Group Meeting!

We first want to thank Anna Neatrour and the ALCTS Metadata Interest Group board for having us and, second, everyone who took time out of their busy conference schedule to attend our talk. Web archives are becoming increasingly prevalent in libraries, archives, and scholarly research, so we are excited about the interest in our work. Anna and Tillay invited us to share our slides for anyone who would like to review them or who missed the session.

Additionally, we wanted to address some questions at the end for anyone who wasn’t able to attend. Please forgive us for paraphrasing the questions and also for paraphrasing our own answers!

Q&A:

Q: Quality review for web archives is challenging because of the scale. How does your program approach it?

A: We can’t look at everything, so we do as much as we can. This is an issue throughout the web archiving community and there have been efforts to explore automated quality review, including a workshop held by the International Internet Preservation Consortium (IIPC) this year dedicated solely to brainstorming quality review solutions. If those tactics advance into some kind of software, we would love to implement it, but for now we look at reports from the crawler and click through as much as we can.

Q: You mentioned datasets, and there is an issue among the community of retaining provenance information and scope notes. How does your program handle this?

A: This is a community-wide issue, also, and we have varying levels of provenance information. First, our curatorial data in Digiboard is the record of selection for a particular URL (in it, the selecting librarian must assign the URL to a collection and provide a justification). Second, once a URL is approved to go to crawl, our team assigns scopes telling the crawler what it is allowed to crawl as part of this URL (for example: social media or CDNs which might host embedded content found through the main URL) and what it is restricted from crawling (we don’t want to crawl all of social media, perhaps just a particular related profile or page). Third, we have the crawl logs from the crawler, which have very rich metadata showing the path of how a certain page came to be captured: response codes from the server at the time of crawl, the MIME type of the crawled resource, capture timestamp, and size of the resource, for example. Since we do not have a legal mandate to crawl, our (very) complicated permissions process makes releasing the crawl logs publicly impossible right now. However, with time, perhaps more of this provenance data can become part of publicly released datasets. For now, check out the ones we have publicly available here.

Q: Is Digiboard open-source?

A: Unfortunately, no. Digiboard is a home-grown tool that works specifically for our scale (tens of thousands of URLs crawled at varying frequencies), complicated permissions process, complicated selection process (with over 200 potential selecting librarians), organizational structure, and our current method of quality review. If you wish to begin web archiving, there are subscription-based services which take care of all behind-the-scenes technical work (maintaining and running the crawler, indexing the content, maintaining the indexes, maintaining a version of the Wayback Machine and specific access points for collections, etc.). Many national and regional libraries, archives, and university libraries throughout the world successfully use these kinds of services to perform web archiving!

Q: How will the sidecar records relate to the minimal records?

A: The sidecar MODS XML files will sit on the same server as the minimal MODS XML files (separate files). During the ETL (Extract Transform Load) process to convert the information from MODS XML into the Library’s Solr index for loc.gov, the two files will be merged into the pages you see on https://www.loc.gov/websites/ based on identical ID numbers.
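For readers who want a picture of what merging on identical ID numbers means, here is a small Python illustration. It is not the Library's ETL code: the records are plain dictionaries standing in for parsed MODS files, the field names and values are invented, and the choice to let sidecar values override the minimal record where both supply the same field is an assumption.

```python
def merge_by_id(minimal, sidecar):
    """Combine minimal and sidecar records that share the same ID.

    Both arguments map an ID to a dict of fields. Sidecar values win when
    both records supply the same field (an assumption for this sketch).
    """
    merged = {}
    for record_id, base in minimal.items():
        combined = dict(base)
        combined.update(sidecar.get(record_id, {}))
        merged[record_id] = combined
    return merged


# Invented stand-ins for parsed MODS records.
minimal = {
    "rec001": {"title": "Example Site", "url": "https://example.org/"},
    "rec002": {"title": "Another Site", "url": "https://example.com/"},
}
sidecar = {
    "rec001": {"summary": "Added description.", "subject": "Web archives"},
}
print(merge_by_id(minimal, sidecar))
```

A minimal record with no sidecar (rec002 here) simply passes through unchanged.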

For more information about the backlog we released last year, please see the Library of Congress Signal blog post: More Web Archives, Less Process, written by Grace. Also, if you are interested in getting updates on our work as we write about them or any other digital library news from the Library of Congress, bookmark The Signal!

For any other questions, please do not hesitate to send us an email; you can find our addresses at the end of the slide deck. Thank you again for giving us a platform to share our work, and best of luck with future interest group activities!

Posted in ALA Annual 2019 | Tagged crawling, mods, tools, web archives, web scraping