- Completeness
- definition: Ensures all required data is present and no essential parts are missing
- checks: missing values, null, % of populated fields
- importance: Missing data can lead to inaccurate analyses and insights
- Consistency
- definition: Ensures data values are consistent across datasets and do not contradict each other
- checks: cross-field validation, comparing data from different sources or period
- importance: Inconsistent data can cause confusion and result in incorrect conclusions
- Accuracy
- definition: Ensures data values are correct, reliable and represents what it is supposed to
- checks: Comparing with trusted sources, validation against known standards or rules
- importance: Inaccurate data can lead to false insights and poor decision-making
- Integrity
- definition: Ensures data maintains its correctness and consistency over its lifecycle and across systems
- checks: Referential integrity, relationship validation
- importance: Ensures relationships between data elements are preserved, and data remains trustworthy over time