Numeric Distribution
Numeric Distribution Validations detect changes in the numeric distribution of values, including outliers, variance, skew and more
Average
Average Validations gauge performance in transitional databases and search engines, offering valuable insights into overall effectiveness.
Example
Minimum
Minimum Validations ensure consistency across transitional databases and search engines, enhancing data quality and retrieval accuracy.
Example
Maximum
Maximum Validations gauge the highest values within datasets, helping identify outliers and understand data distribution’s upper limits for quality assessment.
Example
Sum
Sum Validations measure the total of all values within a dataset, indicating the overall size of a particular dataset to help understand data quality.
Example
Variance
Variance in data quality measures the degree of variability or dispersion in a dataset, indicating how spread out the data points are from the mean.
Example
Standard Deviation
Standard deviation Validations measure the amount of variation or dispersion of a set of values from the mean, indicating how spread out the data points are from the mean.
Example
Percentile Validations
20th Percentile
The 20th Percentile Validation checks the value below which 20% of the data points fall, offering insight into the lower end of the data distribution.
Example
40th Percentile
The 40th Percentile Validation identifies the value below which 40% of the data points fall, providing insight into the data distribution’s lower-middle range.
Example
60th Percentile
The 60th Percentile Validation checks the value below which 60% of the data points fall, helping understand the distribution of values around the middle of the dataset. Example
80th Percentile
The 80th Percentile Validation examines the value below which 80% of the data points fall, offering insights into the upper-middle range of the dataset.
Example
90th Percentile
The 90th Percentile Validation identifies the value below which 90% of the data points fall, focusing on the upper range of the dataset.
Example
Zero value validations are used to identify and validate fields that contain zero values, which are important for datasets where zero values might have specific implications, such as indicating missing or invalid data, or representing real-world conditions.
Zero Count
COUNT_ZERO
is used to count the number of rows where the specified field contains a zero value. It can be useful for detecting cases where zero might represent missing data or special conditions.
Example:
Zero Percentage
PERCENT_ZERO
is used to calculate the percentage of rows where the specified field contains a zero value. This helps assess the proportion of zero values in a column, allowing the user to enforce percentage-based thresholds for data quality.
Example:
The Numeric Negative Value Validations detect negative values in numeric fields within a dataset and ensure that they do not exceed or fall below a specified threshold.
Negative Count
This validation counts the number of negative values present in a given numeric field.
Example
Negative Percentage
This validation calculates the percentage of negative values in a numeric field, relative to the total number of records.
Example