This shows you the differences between two versions of the page.
data_quality_management_system:data_cleansing [2023/06/11 21:08] peter |
data_quality_management_system:data_cleansing [2024/05/26 19:33] (current) peter |
||
---|---|---|---|
Line 2: | Line 2: | ||
=== Definition === | === Definition === | ||
- | Data cleansing is the process of detecting and correcting [[data_quality_management_system/data_issue|data issues]] to improve the quality of data to an acceptable level. | + | Data cleansing is the [[general_term/ |
===Notes=== | ===Notes=== | ||
Line 13: | Line 13: | ||
* [[data_quality_dimension/ | * [[data_quality_dimension/ | ||
* [[data_quality_dimension/ | * [[data_quality_dimension/ | ||
- | * [[data_quality_dimension/ | + | * [[data_quality_management_system: |
* [[data_quality_dimension/ | * [[data_quality_dimension/ | ||
* [[data_quality_dimension/ | * [[data_quality_dimension/ | ||
Line 31: | Line 31: | ||
=== Purpose === | === Purpose === | ||
- | To detect and correct [[data_quality_management_system/data_issue|data issues]] and inconsistencies. | + | To detect and correct [[data_quality_management_system: |
=== Life cycle === | === Life cycle === | ||
Line 42: | Line 42: | ||
=== Methods === | === Methods === | ||
- | The next methods of correcting [[data_quality_management_system/data_issue|data issues]] can be distinguished: | + | The following |
^ Method | ^ Method | ||
| Abbreviation expansion | | Abbreviation expansion | ||
Line 65: | Line 65: | ||
| Type conversion | | Type conversion | ||
  | Edit rules | Edit Rules, a new class of data quality rules, are rules that tell how to fix errors, i.e. which attributes are wrong and what values they should take. | | | Edit rules | Edit Rules, a new class of data quality rules, are rules that tell how to fix errors, i.e. which attributes are wrong and what values they should take. | | ||
- | | Data lifecycle management | + | | Data lifecycle management |
Note 4: Data issue prevention is far superior to data issue detection and cleansing, as it is cheaper and more efficient to prevent issues than to try and find them and correct them later. | Note 4: Data issue prevention is far superior to data issue detection and cleansing, as it is cheaper and more efficient to prevent issues than to try and find them and correct them later. | ||
Line 94: | Line 94: | ||
=== Relations === | === Relations === | ||
+ | |Data cleansing| is child of |[[general_term/ | ||
|Data cleansing| is an element of a |[[data_quality_general: | |Data cleansing| is an element of a |[[data_quality_general: | ||
- | |Data cleansing| resolves |[[data_quality_management_system/data_issue|data issues]]| | + | |Data cleansing| resolves |[[data_quality_management_system: |
- | |Data cleansing|is the successor of|[[data_quality_management_system/ | + | |Data cleansing|is the successor of|[[data_quality_management_system/ |
- | |Data cleansing|can be performed by using|data cleansing methods| | + | |Data cleansing|uses|data cleansing methods| |
- | |Data cleansing|will be applied firstly to|[[data_quality_management_system/ | + | |Data cleansing|will be applied firstly to|[[data_quality_management_system/ | ||
- | |Data cleansing|improves|[[data_quality_general/ | + | |Data cleansing|improves|[[data_quality_general/ |
+ | |Data cleansing|needs|[[data_quality_management_system/ | ||
{{: | {{: | ||
Line 163: | Line 165: | ||
What is data cleansing? Guide to data cleansing tools, services and strategy. (2020, August 13). Talend Real-Time Open Source Data Integration Software. https:// | What is data cleansing? Guide to data cleansing tools, services and strategy. (2020, August 13). Talend Real-Time Open Source Data Integration Software. https:// | ||
+ | |||
+ | {{tag> | ||
+ |
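
Three of the methods named in the Methods table (abbreviation expansion, type conversion, edit rules) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and does not come from the page: the abbreviation table, the record fields, the `EditRule` structure and the example rule are all hypothetical. The edit rule follows the description in the table: it states which attribute is wrong and what value it should take.

```python
from typing import Callable, NamedTuple, Optional

# Hypothetical abbreviation table; a real one would come from reference data.
ABBREVIATIONS = {"st.": "street", "ave.": "avenue"}


def expand_abbreviations(text: str) -> str:
    """Abbreviation expansion: replace known abbreviations word by word."""
    return " ".join(ABBREVIATIONS.get(word.lower(), word) for word in text.split())


def to_int(value: object) -> Optional[int]:
    """Type conversion: parse a value as an int, or return None if it cannot be parsed."""
    try:
        return int(str(value).strip())
    except ValueError:
        return None


class EditRule(NamedTuple):
    """Hypothetical edit rule: which attribute is wrong and what value it should take."""
    attribute: str                    # the attribute the rule corrects
    is_wrong: Callable[[dict], bool]  # condition under which the attribute is wrong
    fix: Callable[[dict], object]     # the value the attribute should take


def apply_edit_rules(record: dict, rules: list) -> dict:
    """Return a corrected copy of the record; the input record is left unchanged."""
    cleaned = dict(record)
    for rule in rules:
        if rule.is_wrong(cleaned):
            cleaned[rule.attribute] = rule.fix(cleaned)
    return cleaned


# Assumed example rule: a record with city "Amsterdam" should have country "NL".
RULES = [
    EditRule(
        attribute="country",
        is_wrong=lambda r: r.get("city") == "Amsterdam" and r.get("country") != "NL",
        fix=lambda r: "NL",
    )
]

record = {"street": "Main St.", "age": " 42 ", "city": "Amsterdam", "country": "DE"}
cleaned = apply_edit_rules(record, RULES)
cleaned["street"] = expand_abbreviations(cleaned["street"])
cleaned["age"] = to_int(cleaned["age"])
```

Returning a corrected copy rather than changing the record in place keeps the original values available for comparison, which helps when a correction has to be reviewed or reverted.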