Login

App Data & Preservation Inventories

Search indexes

General Information

Description

Search indexes are derivative metadata structured for and stored in Apache Solr index stores.

Purpose

Provides search functionality.

Quick Facts

A distinct Solr index is created for the Catalog, Databases, and UWDC categories in the Coordinated Discovery Platform and for each citation database implemented in Blacklight.

These are transitory application data sets. The size of the dataset vary from 1000s of records for citation database to more 12 million records for the catalog dataset.

Data Classifications

Campus
  • Internal: Used for internal discovery operations. Elements of the content are public.
Library
  • Operational This is transitory operational data.

Data Contacts

Data Custodian
Library Platforms and Applications Group
Data Owner/Trustee
Library Platforms and Applications Group
Data Consumer
Indirectly, patrons who use library discovery platforms.

Risk Assessment

Score Risk Type Details Evaluation Date
1 Library Impact These are transitory derivative datasets; there is no risk of permanent loss of data. March 1, 2019
1 Data These are transitory derivative datasets; there is no risk of permanent loss of data. March 1, 2019
1 Institutional Knowledge These are transitory derivative datasets; they have no historic value. March 1, 2019

Technical Details

Specifications

NA

Correctness

Mappings between source data records in MARC, MODS, or local schemas and the semantics of Solr record schema must preserve the semantic intent of source data relative to the discovery and display requirements of the consuming application.

Representative Record

NA These are derivative records. There is no authoritative instance of this data.

Dependencies

Source MARC, MODS, and local schema records.

Access & Use

Delivery Modalities

Via Solr APIs to consuming applications: Coordinated Discovery Platform and Blacklight instances.

Lifecycle

The indexes are periodically rebuilt and incrementally updated based on changes detected in the source data. There is no preservation or archiving.

Disposition

Stored on LCB machines hosting Solr, their associated storage, and backups.

Relevant Processes

The data are derived through ETL workflows from source data sets.

Constraints

No applicable constraints,.