
Metadata is the information describing the data, rather than the data itself. In this blog, I will describe validity testing, breakdown the concept of accuracy testing, and review the testing frameworks available. In our case, asking if a data set is OK is equal to asking “Is it valid and accurate?”.

Integrity: Can different data sets be joined correctly to reflect a larger picture? Are relations well defined and implemented?.Validity (aka Conformity): Is the information in a specific format, type, or size? Does it follow business rules/best practices?.

Timeliness: Is your information available when you need it?.Consistency: Does information stored in one place match relevant data stored elsewhere?.Completeness: Does it fulfill your expectations of what’s comprehensive?.Accuracy: How well does a piece of information reflect reality?.Traditionally data quality is split into 6 dimensions:
