User Tools

Site Tools


data_quality_dimension:uniqueness

Uniqueness

Definition(s)

  1. Uniqueness of data records is the degree to which data records occur only once in a data file.
  2. Uniqueness of the mapping of an object in a data file is the degree to which objects (in the real world) occur as one data record in a data file.
  3. Uniqueness of primary keys is the degree to which primary keys occur only once in a data file.

Note

In general, double data records should be deduplicated.

Relation(s)

Uniqueness is a characteristic of data records
Uniqueness is a characteristic of the mapping of objects in a data file
Uniqueness is a characteristic of primary keys

Example(s)

  1. This article appears twice in a data file under the same article number. This is easy to detect by checking on double article numbers.
  2. This article appears twice in a data file under a different article number. This can only be discovered by comparison of other data elements than the article number.
  3. Two different articles appear under the same article number in a data file. This can only be discovered by checking double article numbers and then compare other data elements than article number before deduplicating.

Reference(s)

data_quality_dimension/uniqueness.txt · Last modified: 2023/10/21 19:11 by peter