Database Library Metadata
General Information
- Description
-
This data set describes the list of databases and e-resources curated and made available to library patrons through library subscriptions.
- Purpose
-
This data set contains the core metadata used in our discovery platform to support searching and browsing for e-resources.
- Quick Facts
-
The data consists of approximately 1,300 records. Each record is stored in an individual file. The combined size of all records is approximately 7.2 MB.
Data Classifications
- Campus
-
- Public: This data describes library collections and is publicly available.
- Library
-
- Descriptive: This data set forms the core descriptive metadata that drives discovery of e-resources.
Data Contacts
- Data Owner/Trustee
- Lee Konrad, Associate University Librarian - Digital Strategy lee.konrad@wisc.edu
- Data Steward
- Coordinated Discovery Team forward-lib@lists.wisc.edu
- Data Custodian
- Aimee Glassel aimee.glassel@wisc.edu
- Data Custodian
- Steve Meyer, Data Strategist stephen.meyer@wisc.edu
- Data Manager
- Aimee Glassel aimee.glassel@wisc.edu
- Data Consumer
- Library Patrons
- Internal Data Client
- Shared Development Group
- Data Subject Matter Expert
- Aimee Glassel aimee.glassel@wisc.edu
Risk Assessment
Score | Risk Type | Details | Evaluation Date |
---|---|---|---|
2 | Library Impact | If the representative record on the shared drive were lost, we could recover the data from tape backups. Recovery could be delayed if the latest backup were out of sync with the latest records, because some records or recent changes would need to be recreated. That would require a small amount of time. | February 28, 2019 |
3 | Data | This data is backed up, but if all copies were lost, we would need to engage in a time-consuming process of recreating it. | February 28, 2019 |
1 | Institutional Knowledge | This data is used operationally to support the library discovery process but does not document the history of the university. | February 28, 2019 |
Technical Details
- Specifications
-
The data files are serialized as XML. They use an OAI-PMH schema to wrap the entire record. Within this container, the primary metadata is expressed within a MARC/XML record. There is also a custom metadata section containing information about a local subject classification, which is a critical part of the data.
Note that the MARC/XML is not fully valid: it contains both control fields and data fields with invalid alphabetic MARC tags.
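As an illustration of the layering described above, the following is a minimal sketch of reading one record with Python's standard library. The OAI-PMH and MARC/XML namespace URIs are the published ones. Everything else is an assumption made for the example: the sample record, the `urn:example:local` namespace, the `local:subjects` element names, the `STA` field, and the reading of "alphabetic" as "tag is not all digits". The actual files may be laid out differently.

```python
import xml.etree.ElementTree as ET

NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",   # published OAI-PMH 2.0 namespace
    "marc": "http://www.loc.gov/MARC21/slim",        # published MARC/XML namespace
    "local": "urn:example:local",                    # hypothetical, for illustration only
}

# Hypothetical record: an OAI-PMH wrapper, a MARC/XML record, and a custom
# section carrying local subject classifications.
SAMPLE = """<record xmlns="http://www.openarchives.org/OAI/2.0/">
  <header><identifier>example-0001</identifier></header>
  <metadata>
    <marc:record xmlns:marc="http://www.loc.gov/MARC21/slim">
      <marc:controlfield tag="001">example-0001</marc:controlfield>
      <marc:datafield tag="245" ind1="1" ind2="0">
        <marc:subfield code="a">Example Database</marc:subfield>
      </marc:datafield>
      <marc:datafield tag="STA" ind1=" " ind2=" ">
        <marc:subfield code="a">ACTIVE</marc:subfield>
      </marc:datafield>
    </marc:record>
    <local:subjects xmlns:local="urn:example:local">
      <local:subject>Chemistry</local:subject>
      <local:subject>Physics</local:subject>
    </local:subjects>
  </metadata>
</record>"""


def summarize(xml_text):
    """Pull the title, non-numeric field tags, and local subjects from one record."""
    root = ET.fromstring(xml_text)
    marc = root.find("oai:metadata/marc:record", NS)
    if marc is None:
        raise ValueError("no MARC/XML record inside the OAI-PMH wrapper")
    title = marc.findtext(
        "marc:datafield[@tag='245']/marc:subfield[@code='a']", namespaces=NS
    )
    # Only control and data fields carry a tag attribute (the leader does not).
    tags = [f.get("tag") for f in marc if f.get("tag") is not None]
    subjects = [
        s.text
        for s in root.findall("oai:metadata/local:subjects/local:subject", NS)
    ]
    return {
        "title": title,
        "nonnumeric_tags": [t for t in tags if not t.isdigit()],
        "subjects": subjects,
    }
```

Calling `summarize(SAMPLE)` on the hypothetical record returns the title from field 245, the list of tags that are not purely numeric, and the local subject values.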
- Correctness
-
The data must be valid XML to be parsed correctly. The MARC data must be parseable using a standard MARC record library, though this requires first stripping out the invalid MARC/XML fields noted under Specifications. The subject categories must contain values from an approved list of subjects.
Basic validation steps are performed on data records when they are indexed for discovery.
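The three correctness requirements could be checked along the following lines. This is a sketch under stated assumptions, not the validation performed at indexing time: the approved subject list, the `urn:example:local` namespace, and the `subject` element name are hypothetical, and "invalid" fields are assumed to be control or data fields whose tag is not all digits. Handing the cleaned record to a MARC library is left out.

```python
import xml.etree.ElementTree as ET

MARC_NS = "http://www.loc.gov/MARC21/slim"     # published MARC/XML namespace
LOCAL_NS = "urn:example:local"                 # hypothetical namespace
APPROVED_SUBJECTS = {"Chemistry", "Physics"}   # hypothetical approved list

FIELD_ELEMENTS = {
    f"{{{MARC_NS}}}controlfield",
    f"{{{MARC_NS}}}datafield",
}


def strip_alphabetic_fields(marc_record):
    """Remove control/data fields with non-numeric tags in place; return their tags."""
    removed = []
    for field in list(marc_record):  # copy the child list so removal is safe
        tag = field.get("tag", "")
        if field.tag in FIELD_ELEMENTS and not tag.isdigit():
            marc_record.remove(field)
            removed.append(tag)
    return removed


def validate_record(xml_text, approved=APPROVED_SUBJECTS):
    """Return a list of problems; an empty list means the record passed."""
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError as exc:
        return [f"not well-formed XML: {exc}"]

    problems = []
    marc_record = root.find(f".//{{{MARC_NS}}}record")
    if marc_record is None:
        problems.append("no MARC/XML record found")
    else:
        # After this step the record could be passed to a MARC library (not shown).
        strip_alphabetic_fields(marc_record)

    for subject in root.iter(f"{{{LOCAL_NS}}}subject"):
        if subject.text not in approved:
            problems.append(f"unapproved subject: {subject.text!r}")
    return problems
```

Returning a list of problems rather than raising on the first one makes it possible to report every issue in a record in a single pass.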
- Representative Record
-
The authoritative instance of the data is stored in a shared network drive location managed by the Library Technology Group (LTG). It is managed by two LTG staff who make changes to the data (create, update, archive and index).
- Dependencies
-
The current data set originates from an export from MetaLib, which was the prior system used for both management and patron use of the data.
This data is required to support the indexing process that runs our patron discovery user interface.
Access & Use
- Delivery Modalities
-
The data is delivered to users via the Databases "bucket" within our discovery platform. This web-based user interface lets patrons search and browse the list of databases purchased and/or subscribed to by the Libraries.
- Lifecycle
-
New records typically originate through the e-resource ordering process within the NERO application. Within a NERO request, selectors and bibliographers provide the first version of the data required to create an XML record. This data is captured within the NERO database and transcribed into new XML files by LTG staff.
As selectors and catalogers request changes, LTG staff update the records, and the data is re-indexed at irregular intervals.
When an e-resource is no longer needed within the public interface, the XML file is archived by relocating it to a special folder/directory within the shared drive location.
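The archiving step amounts to moving a file, which could be scripted roughly as below. This is a hedged sketch: the function name and the directory arguments are hypothetical, and the real archive location on the shared drive is not specified here.

```python
import shutil
from pathlib import Path


def archive_record(record_path, archive_dir):
    """Move one XML record into the archive directory and return its new path."""
    record_path = Path(record_path)
    archive_dir = Path(archive_dir)
    if not record_path.is_file():
        raise FileNotFoundError(record_path)

    archive_dir.mkdir(parents=True, exist_ok=True)
    target = archive_dir / record_path.name
    if target.exists():
        # Refuse to overwrite a previously archived record with the same name.
        raise FileExistsError(target)

    shutil.move(str(record_path), str(target))
    return target
```

Refusing to overwrite an existing archived file is a design choice for the sketch; it keeps an earlier archived version from being silently replaced.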
- Disposition
-
The XML files are stored on a network storage drive. This drive/filesystem is subject to backup.
- Relevant Processes
-
The data is based on a prescribed template. Changes are made in support of the user experience within the discovery interface. These changes should be vetted by the appropriate web team, the Coordinated Discovery Team.
- Constraints
-
This data is publicly available and open. There are no regulations that govern its use or retention.