This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
data_quality_management_system:data_cleansing [2023/03/01 16:39] peter |
data_quality_management_system:data_cleansing [2024/03/08 13:33] (current) |
||
---|---|---|---|
Line 1: | Line 1: | ||
===== Data cleansing ===== | ===== Data cleansing ===== | ||
- | |||
- | === Introduction === | ||
- | This factsheet describes knowledge about data cleansing in a nutshell. Data cleansing is highlighted from different angles in a structured way. | ||
=== Definition === | === Definition === | ||
- | Data cleansing is the process of detecting and correcting [[data_quality_management_system/data_issue|data issues]] to improve the quality of data to an acceptable level. | + | Data cleansing is the [[general_term/ |
===Notes=== | ===Notes=== | ||
- | What an acceptable data quality level is for an organization should be defined | + | An organization should define |
Dimensions of data quality that can be improved by data cleansing are: | Dimensions of data quality that can be improved by data cleansing are: | ||
Line 18: | Line 15: | ||
* [[data_quality_dimension/ | * [[data_quality_dimension/ | ||
* [[data_quality_dimension/ | * [[data_quality_dimension/ | ||
- | * Currency of data values | + | * [[data_quality_dimension/ |
* [[data_quality_dimension/ | * [[data_quality_dimension/ | ||
* [[data_quality_dimension/ | * [[data_quality_dimension/ | ||
Line 27: | Line 24: | ||
* [[data_quality_dimension/ | * [[data_quality_dimension/ | ||
- | === Synonyms | + | === Synonyms === |
* Data Cleaning | * Data Cleaning | ||
* Data Remediation | * Data Remediation | ||
Line 34: | Line 31: | ||
=== Purpose === | === Purpose === | ||
- | To detect and correct data issues and inconsistencies. | + | To detect and correct |
=== Life cycle === | === Life cycle === | ||
Line 45: | Line 42: | ||
=== Methods === | === Methods === | ||
- | The next methods of correcting data issues can be distinguished: | + | The following |
^ Method | ^ Method | ||
| Abbreviation expansion | | Abbreviation expansion | ||
Line 91: | Line 88: | ||
* Documentation is the key to good data quality | * Documentation is the key to good data quality | ||
- | === Characteristics | + | === Characteristics ==== |
^ Characteristic ^ Requirements ^ | ^ Characteristic ^ Requirements ^ | ||
| Effectiveness of data cleansing | Data Cleansing improve the data quality and meets the norm of the data quality dimension(s). | | Effectiveness of data cleansing | Data Cleansing improve the data quality and meets the norm of the data quality dimension(s). | ||
| Cost-effectiveness of data cleansing | Data cleansing must lead to a positive business case, i.e. the benefits must be bigger than the costs. | | Cost-effectiveness of data cleansing | Data cleansing must lead to a positive business case, i.e. the benefits must be bigger than the costs. | ||
- | === Relationships | + | === Relations |
- | * **Data Cleansing** resolves **Data Issues**. | + | |Data cleansing| is child of |[[general_term/ |
- | * **Data Quality Monitoring** can be follow-up by **Data Cleansing**. | + | |Data cleansing| is an element of a |[[data_quality_general: |
- | * Purpose of **Data Cleansing** | + | |Data cleansing| resolves |[[data_quality_management_system: |
- | * A **Data Cleansing** Process can be performed using one of the different **Data Cleansing Methods**. | + | |Data cleansing|is the successor of|[[data_quality_management_system/ |
- | * **Critical | + | |Data cleansing|uses|data cleansing methods| |
+ | |Data cleansing|wil | ||
+ | |Data cleansing|improves|[[data_quality_general/ | ||
+ | |Data cleansing|needs|[[data_quality_management_system/ | ||
{{: | {{: | ||
Line 164: | Line 165: | ||
What is data cleansing? Guide to data cleansing tools, services and strategy. (2020, August 13). Talend Real-Time Open Source Data Integration Software. https:// | What is data cleansing? Guide to data cleansing tools, services and strategy. (2020, August 13). Talend Real-Time Open Source Data Integration Software. https:// | ||
+ | |||
+ | {{tag> | ||
+ |