Creates a provenance-aware recordset_df from observational filesystem snapshots, using the same contextual reconstruction workflow as snapshot_to_reconstruction_context().

The function preserves observed filesystem resources while adding:

  • contextual Record Set assertions (human-defined grouping of related archived digital resources);

  • dataset-level provenance metadata (information about how, when, and from which observations the dataset was created);

  • preservation-oriented semantic context (structured contextual information supporting archival, audit, and long-term reconstruction workflows).

Unlike snapshot_to_reconstruction_context(), which is optimized for analytical and forensic workflows, this function creates a stable contextual preservation object suitable for:

  • contextual digital preservation;

  • audit reconstruction;

  • heritage and archival workflows;

  • provenance-aware digital collections;

  • human-in-the-loop semantic enrichment.

Usage

snapshot_to_recordset_df(
  snapshot_files,
  roots,
  record_set_id,
  record_set_title = NULL,
  person = utils::person("Jane", "Doe"),
  exclude_patterns = c("\\\\.Rcheck")
)

Arguments

snapshot_files

Character vector of paths to .rds snapshot files.

roots

Character vector of contextual root paths used for observational selection.

record_set_id

Character scalar giving the asserted identifier of the resulting Record Set.

record_set_title

Optional human-readable title of the Record Set. Defaults to NULL.

person

A utils::person() object describing the creator of the semantic Record Set assertion.
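
As a minimal sketch, a creator object can be built with named arguments to utils::person(). The email address and the role code below are placeholders chosen for illustration; whether this function reads anything beyond the name is an assumption not confirmed here.

```r
# Build the creator with named arguments so that the given and family
# names are unambiguous. All values are illustrative placeholders.
creator <- utils::person(
  given  = "Jane",
  family = "Doe",
  email  = "jane.doe@example.org",  # placeholder address
  role   = "cre"                    # MARC relator code for "creator"
)

# The object can then be passed as the `person` argument.
inherits(creator, "person")
```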

exclude_patterns

Character vector of exclusion patterns passed to subset_snapshot().
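
The sketch below illustrates how an exclusion pattern could filter paths, assuming the patterns are treated as regular expressions in the way grepl() treats them. How subset_snapshot() applies them internally is not shown here, and the paths and the pattern are hypothetical.

```r
# Hypothetical observed paths; not taken from a real snapshot.
paths <- c(
  "project/R/analysis.R",
  "project.Rcheck/00check.log",
  "project/data/raw.csv"
)

# "\\.Rcheck" is the R string for the regular expression \.Rcheck,
# which matches a literal ".Rcheck" anywhere in the path.
exclude_patterns <- c("\\.Rcheck")

# A path is excluded if it matches at least one pattern.
is_excluded <- Reduce(`|`, lapply(exclude_patterns, grepl, x = paths))
kept <- paths[!is_excluded]
kept
```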

Value

A semantically enriched recordset_df object inheriting from dataset_df.

Details

The function intentionally reuses snapshot_to_reconstruction_context() to preserve:

  • identical observational reconstruction logic;

  • stable contextual identifiers;

  • reproducible reconstruction workflows.

The resulting object keeps observational rows intact while adding a lightweight semantic preservation layer based on:

  • contextual Record Set assertions;

  • provenance metadata;

  • RiC-aligned (Records in Contexts) contextual semantics.
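
A call might look like the following sketch. The snapshot paths, root path, identifier, and title are hypothetical placeholders. The call is guarded so that it only runs when the function is available and the snapshot files exist.

```r
# Hypothetical inputs: these paths and identifiers are placeholders.
snapshot_files <- c("snapshots/2024-01-01.rds", "snapshots/2024-02-01.rds")
roots <- c("projects/my-archive")

# Only attempt the call when the function and the input files are present.
can_run <- exists("snapshot_to_recordset_df", mode = "function") &&
  all(file.exists(snapshot_files))

if (can_run) {
  rs <- snapshot_to_recordset_df(
    snapshot_files   = snapshot_files,
    roots            = roots,
    record_set_id    = "recordset-001",
    record_set_title = "Example archive",
    person           = utils::person("Jane", "Doe")
  )
  # Per the Value section, the result should inherit from dataset_df.
  inherits(rs, "dataset_df")
}
```

Leaving exclude_patterns unset keeps the documented default.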