MetadataTools
Metadata Management & Added Value Tools
(see also Best Practices for Shareable Metadata)
Re-Formatting Metadata
- Metadata Migrator Tool
- Presentation: Web-based program for migrating local files (formatted as .csv, .tab, or .dbf) into simple Dublin Core XML files. Available for free from the Emory University MetaScholar Initiative. Register to receive a login and password.
- Features (version 1.0): Allows harvesting using resumptionTokens. Does not yet support multiple metadata formats, deleted records, automatic updating, or set harvesting.
- OAITransform
- Presentation: Open-source transformation tool written in Perl, a component of the DLXS product.
- Features: Converts harvested simple Dublin Core metadata into the DLXS BibClass format. Normalizes some fields, checks for UTF-8 and XML character encoding problems, and excludes metadata that does not contain a DC identifier.
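The filtering step described for OAITransform (checking encoding and excluding records without a DC identifier) can be illustrated with a minimal Python sketch. This is not OAITransform's Perl code; the function names `is_valid_utf8`, `has_identifier`, and `filter_records` are hypothetical, and the sketch assumes each harvested record arrives as a separate byte string of simple Dublin Core XML.

```python
import xml.etree.ElementTree as ET

# Namespace of the simple Dublin Core element set.
DC_NS = "http://purl.org/dc/elements/1.1/"


def is_valid_utf8(raw: bytes) -> bool:
    """Return True if the raw bytes decode cleanly as UTF-8."""
    try:
        raw.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def has_identifier(raw: bytes) -> bool:
    """Return True if the record has a non-empty dc:identifier element."""
    root = ET.fromstring(raw)
    for element in root.iter(f"{{{DC_NS}}}identifier"):
        if element.text and element.text.strip():
            return True
    return False


def filter_records(raw_records):
    """Keep records that are valid UTF-8, well-formed XML, and carry an identifier.

    Hypothetical helper: a real pipeline would probably log the rejected
    records instead of silently dropping them.
    """
    kept = []
    for raw in raw_records:
        if not is_valid_utf8(raw):
            continue
        try:
            if has_identifier(raw):
                kept.append(raw)
        except ET.ParseError:
            # Not well-formed XML; treat it as an encoding/markup problem.
            continue
    return kept
```

Only the exclusion logic is shown here; the field normalization and the conversion to BibClass that OAITransform also performs are outside this sketch.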
Normalizing & Enhancing Metadata Values
The effort to develop efficient tools for automatically normalizing and enhancing metadata is ongoing. The 2006 Metadata Enhancement & OAI Workshop (MEOW) at Emory University provided information on a number of the following tools, many of which are still in development.
- CDL Date Normalizer
- Presentation: Created by the California Digital Library to help manage metadata for the American West project.
- Features: Searches DC elements (<date> first, then <title> and <description>) for date-like strings. Normalizes matches to a four-digit year (YYYY) only; month (MM) and day (DD) would be easy to add later.
- Topic Model Browser (Clustering / Classification of Metadata)
- Presentation: Pilot test for clustering and classifying science, engineering and math metadata. Being further developed as a prototype for the OAI DLF Portal.
- Features: Uses University of California Irvine's Topic Mapper, a tool for clustering minimal amounts of text such as bibliographies, newspaper articles and metadata.
- UMichigan's DC Type Normalization Table
- Presentation: Allows the DC Type field to be mapped to a set of normalized values. These values are then offered as search limiters in OAIster and other UMichigan OAI portal interfaces (MODS Portal, Aquifer Portal, OAI DLF Portal).
- Features: A text table, read by Perl code, that maps each DC Type value to one of five normalized values, which is then inserted into the record.
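Two of the normalizations listed above, the year extraction of the CDL Date Normalizer and the table lookup used for DC Type, can be sketched in a few lines of Python. This is an illustration under stated assumptions, not either institution's code: the record is assumed to be a dict mapping DC element names to lists of strings, the year pattern only covers 1000 through 2099, and the tab-separated table contents (including the five target values) are hypothetical placeholders.

```python
import re
from typing import Dict, List, Optional

# Assumption: a "date-like string" is a standalone four-digit year 1000-2099.
YEAR_RE = re.compile(r"(?<!\d)(1[0-9]{3}|20[0-9]{2})(?!\d)")

# Search order taken from the description above: date, then title, then description.
SEARCH_ORDER = ("date", "title", "description")


def normalize_year(record: Dict[str, List[str]]) -> Optional[str]:
    """Return the first YYYY found, checking fields in SEARCH_ORDER."""
    for field in SEARCH_ORDER:
        for value in record.get(field, []):
            match = YEAR_RE.search(value)
            if match:
                return match.group(1)
    return None


# Hypothetical mapping table: raw DC Type value, a tab, then the normalized value.
TYPE_TABLE = """\
text\ttext
article\ttext
still image\timage
photograph\timage
sound\taudio
moving image\tvideo
dataset\tdataset
"""


def load_type_table(text: str) -> Dict[str, str]:
    """Parse the tab-separated table into a case-insensitive lookup dict."""
    table = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        raw, normalized = line.split("\t")
        table[raw.strip().lower()] = normalized.strip()
    return table


def normalize_type(value: str, table: Dict[str, str]) -> Optional[str]:
    """Return the normalized type for a raw DC Type value, or None if unmapped."""
    return table.get(value.strip().lower())
```

For example, `normalize_year({"date": ["undated"], "title": ["Map of Utah, 1872"]})` falls through the date field and picks up the year from the title. Returning `None` for unmatched values leaves the decision about a default to the caller.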
Added Value Tools
- ThumbGrabber
- Grabs already-harvested thumbnails from UIUC's servers for inclusion in search results.
XSL Stylesheets
- UMichigan has some generic stylesheets that it uses to map one metadata format to another.
