User Tools

Site Tools


data_quality_dimension:uniqueness

This is an old revision of the document!


Uniqueness

Definition(s)

1) Uniqueness of data records is the degree to which data records occur only once in a data file.
2) Uniqueness of the appearance of an object in a data file is the degree to which objects (in the real world) occur only once as a data record in a data file.
3) Uniqueness of primary keys is the degree to which primary keys occur only once in a data file.

Note(s)

In general, duplicates should be deduplicated.

Relation(s)

Uniqueness can be a characteristic of data records, the appearance of objects in a data file or primary keys.

Example(s)

1) This article appears twice in a data file under the same article number. This is the easiest to detect.
2) This article appears twice in a data file under a different article number. This can only be discovered by comparison of other fields than the article number.
3) Two different articles appear under the same article number in a data file. This can only be discovered by checking double article numbers and then compare other fields than article number.

Reference(s)

data_quality_dimension/uniqueness.1677009669.txt.gz · Last modified: 2024/03/08 13:33 (external edit)