Data Management
“The beginning of wisdom is the definition of terms.”
1) Uniqueness of data records is the degree to which data records occur only once in a data file.
2) Uniqueness of the appearance of an object in a data file is the degree to which objects (in the real world) occur only once as a data record in a data file.
3) Uniqueness of primary keys is the degree to which primary keys occur only once in a data file.
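The definitions above speak of a "degree" without fixing a formula. One possible reading, offered here only as a minimal sketch, is the share of distinct values among all values, so that 1.0 means fully unique. The function name `uniqueness_degree`, the sample records, and the choice to treat an empty file as fully unique are assumptions made for illustration, not part of the source definitions.

```python
def uniqueness_degree(values):
    """Share of distinct values among all values (1.0 = fully unique).

    Assumption: an empty input is treated as fully unique.
    """
    values = list(values)
    if not values:
        return 1.0
    return len(set(values)) / len(values)


# Hypothetical article file: (article_number, description)
records = [
    ("A100", "Steel bolt M8"),
    ("A100", "Steel bolt M8"),  # exact duplicate of the record above
    ("A200", "Copper washer"),
    ("A300", "Nylon nut"),
]

# Definition 1: uniqueness of whole data records
record_uniqueness = uniqueness_degree(records)

# Definition 3: uniqueness of primary keys (here: the article number)
key_uniqueness = uniqueness_degree(number for number, _ in records)

print(record_uniqueness, key_uniqueness)
```

Definition 2 cannot be measured this way from the file alone, because it requires knowing which records refer to the same real-world object.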
In general, duplicate records should be removed or merged (deduplicated).
Uniqueness can be a characteristic of data records, the appearance of objects in a data file or primary keys.
1) An article appears twice in a data file under the same article number. This is the easiest case to detect.
2) An article appears twice in a data file under different article numbers. This can only be discovered by comparing fields other than the article number.
3) Two different articles appear under the same article number in a data file. This can only be discovered by checking for duplicate article numbers and then comparing fields other than the article number.
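The three cases above could be detected roughly as follows. This is a sketch under a strong simplifying assumption: two records describe the same real-world article exactly when their description fields match character for character. Real files usually need fuzzier comparison across several fields. The function name `classify_duplicates` and the sample records are hypothetical.

```python
from collections import defaultdict

# Hypothetical records: (article_number, description)
records = [
    ("A100", "Steel bolt M8"),
    ("A100", "Steel bolt M8"),   # case 1: same article, same number
    ("A200", "Copper washer"),
    ("A250", "Copper washer"),   # case 2: same article, different number
    ("A300", "Nylon nut"),
    ("A300", "Rubber gasket"),   # case 3: different articles, same number
]


def classify_duplicates(records):
    """Return the three kinds of uniqueness violations as sorted lists.

    Assumption: an identical description means the same real-world article.
    """
    descriptions_by_number = defaultdict(set)
    numbers_by_description = defaultdict(set)
    record_counts = defaultdict(int)

    for number, description in records:
        descriptions_by_number[number].add(description)
        numbers_by_description[description].add(number)
        record_counts[(number, description)] += 1

    # Case 1: the identical record occurs more than once.
    repeated_records = sorted(r for r, n in record_counts.items() if n > 1)
    # Case 2: one description is filed under several article numbers.
    renumbered_articles = sorted(
        d for d, nums in numbers_by_description.items() if len(nums) > 1
    )
    # Case 3: one article number carries several descriptions.
    reused_numbers = sorted(
        num for num, descs in descriptions_by_number.items() if len(descs) > 1
    )
    return repeated_records, renumbered_articles, reused_numbers


repeated_records, renumbered_articles, reused_numbers = classify_duplicates(records)
print(repeated_records)
print(renumbered_articles)
print(reused_numbers)
```

Case 1 needs only a count of identical records, while cases 2 and 3 both depend on comparing fields other than the article number, which matches the relative difficulty described above.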
DAMA NL (2020). Dimensions of Data Quality (DDQ). Research paper. https://www.dama-nl.org/wp-content/uploads/2020/09/DDQ-Dimensions-of-Data-Quality-Research-Paper-version-1.2-d.d.-3-Sept-2020.pdf