User Tools

Site Tools


data_quality_dimension:uniqueness

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
data_quality_dimension:uniqueness [2023/02/21 20:01]
peter
data_quality_dimension:uniqueness [2024/03/08 13:33] (current)
Line 2: Line 2:
  
 ===Definition(s)=== ===Definition(s)===
-1) Uniqueness of data records is the degree to which [[data_concept:data record|data records]] occur only once in a [[data_concept:data file|data file]].\\  +  - Uniqueness of data records is the degree to which [[data_concept:data record|data records]] occur only once in a [[data_concept:data file|data file]].\\  
-2) Uniqueness of the appearance of an object in a data file is the degree to which [[data_concept:object|objects]] (in the real world) occur only once as [[data_concept:data record|data record]] in a [[data_concept:data file|data file]].\\  +  Uniqueness of the mapping of an object in a data file is the degree to which [[data_concept:object|objects]] (in the real world) occur as one [[data_concept:data record|data record]] in a [[data_concept:data file|data file]].\\  
-3) Uniqueness of primary keys is the degree to which [[data_concept:primary key|primary keys]] occur only once in a [[data_concept:data file|data file]].+  Uniqueness of primary keys is the degree to which [[data_concept:primary key|primary keys]] occur only once in a [[data_concept:data file|data file]].
  
-===Note(s)=== +===Note=== 
-In general, duplicates should be deduplicated.+In general, double [[data_concept:data record|data records]] should be deduplicated.
  
 ===Relation(s)=== ===Relation(s)===
-Uniqueness can be a [[data_quality_dimension:characteristic|characteristic]] of [[data_concept:data record|data records]]the appearance of [[data_concept:object|objects]] in a [[data_concept:data file|data file]] or [[data_concept:primary key|primary keys]].+|Uniqueness| is a [[general_term:characteristic|characteristic]] of |[[data_concept:data record|data records]]
 +|Uniqueness| is a [[general_term:characteristic|characteristic]] of the mapping of [[data_concept:object|objects]] in a |[[data_concept:data file|data file]]
 +|Uniqueness| is a [[general_term:characteristic|characteristic]] of |[[data_concept:primary key|primary keys]]|
  
 ===Example(s)=== ===Example(s)===
-1) This article appears twice in a [[data_concept:data file|data file]] under the same article number. This is the easiest to detect.\\  +  - This article appears twice in a [[data_concept:data file|data file]] under the same article number. This is easy to detect by checking on double article numbers.\\  
-2) This article appears twice in a [[data_concept:data file|data file]] under a different article number. This can only be discovered by comparison of other fields than the article number.\\  +  This article appears twice in a [[data_concept:data file|data file]] under a different article number. This can only be discovered by comparison of other [[data_concept:data element|data elements]] than the article number.\\  
- 3) Two different articles appear  under the same article number in a [[data_concept:data file|data file]]. This can only be discovered by checking double article numbers and then compare other fields than article number.+  -  Two different articles appear  under the same article number in a [[data_concept:data file|data file]]. This can only be discovered by checking double article numbers and then compare other  [[data_concept:data element|data elements]] than article number before deduplicating.
  
 ===Reference(s)=== ===Reference(s)===
 DAMA NL (2020). Dimensions of Data Quality (DDQ). Research paper. https://www.dama-nl.org/wp-content/uploads/2020/09/DDQ-Dimensions-of-Data-Quality-Research-Paper-version-1.2-d.d.-3-Sept-2020.pdf DAMA NL (2020). Dimensions of Data Quality (DDQ). Research paper. https://www.dama-nl.org/wp-content/uploads/2020/09/DDQ-Dimensions-of-Data-Quality-Research-Paper-version-1.2-d.d.-3-Sept-2020.pdf
 +
 +{{tag>All DataQualityDimension}}
data_quality_dimension/uniqueness.1677009669.txt.gz · Last modified: 2024/03/08 13:33 (external edit)