Quantitative Variable Distributions: Multi Site, Exploratory, Cross-Sectional Analysis


Created

Last Modified

Click on the thumbnail above to preview images.

Domain

Category

Parameters

Publisher

PEDSnet

Abstract

This check provides raw data and visualizations to aid a user in evaluating whether the distribution of quantitative variables aligns with clinical expectations. It can summarize the distribution of a quantitative variable (like lab result values) or patient counts (like number of patients with an outpatient visit).

Probe

Clinical Assessment

Access Package

# install.packages("devtools") devtools::install_github('ssdqa/https://github.com/ssdqa/quantvariabledistribution')

Visualization Output

This check outputs boxplots displaying the distribution of each value type at each site. The diamond reflects the mean value of the distribution. The user can control whether outliers are displayed or not, and they can set a minimum frequency with which a value should occur to be included in the plot. In the example plot, outliers are displayed and a minimum frequency of 5 occurrences is used.

Raw Output

This check produces a raw data output containing 10 columns:

Column Data Type Definition
site character the name of the site being targeted OR “combined” if multiple sites were provided
value_col numeric the quantitative value of the variable of interest
value_freq numeric the frequency with which the quantitative value occurs
value_type character the type of value being measured
mean_val numeric the mean of value_col
median_val numeric the median of value_col
sd_val numeric the standard deviation of value_col
q1_val numeric the first quantile of value_col
q3_val numeric the third quantile of value_col
output_function character a string indicating the type of visualization that should be generated by qvd_output

Funder(s)

This research was made possible through the generous support of Patient-Centered Outcomes Research Institute. The statements presented in this work are solely the responsibility of the author(s) and do not necessarily represent the views of PCORI, its Board of Governors, or its Methodology Committee.

Provenance

Description

Clinical Subjects Headings

Related Data Quality Result

Quantitative Variable Distribution Study Results Part II: SSDQA Comparison
Created:2025-09Affiliation:PEDSnet Data Coordinating Center
The results of a Quantitative Variable Distribution check using the Multi-Site, Exploratory, Cross-Sectional parameters. This check investigates the distribution of lab results, stratified by lab type & result unit, to ensure results are plausible.
Quantitative Variable Distribution Study Results Part IV: SSDQA Comparison
Created:2026-02-05Affiliation:PEDSnet Data Coordinating Center
The results of a Quantitative Variable Distribution check using the Multi-Site, Exploratory, Cross-Sectional parameters. This check investigates the value distributions of drug metadata fields (days_supply, refills, quantity) relating to hydroxyurea drug exposures.
Quantitative Variable Distribtion Study Results II: PRESERVE
Created:2025-04-08Affiliation:PEDSnet Data Coordinating Center
The results of a Quantitative Variable Distribution check using the Multi-Site, Exploratory, Cross-Sectional parameters. This check evaluates blood pressure, eGFR, and urine protein distributions at each institution.

Related Person

Related Code

Study-Specific Quality, Utility, and Breadth Assessment
Created:2025-11Affiliation:PEDSnet Data Coordinating Center
This suite of R packages allows one to investigate multiple facets of data quality and customize analyses based on your study-specific needs. Each module allows up to 8 different analyses in either the OMOP or PCORnet CDM, all aimed at taking a different view of the data while still addressing the same data quality probe.

##### [View pkgdown summary here.](https://ssdqa.github.io/squba/)

Related Data Quality Check

Related Publications

Creative Commons license

Except where otherwised noted, this item's license is described as a CC-BY Attribution 4.0 License.

Cite this Data Quality Check

PEDSnet Data Coordinating Center., Wieand, K., & Dickinson, K. (2025, July). Quantitative Variable Distributions: Multi Site, Exploratory, Cross-Sectional Analysis. [D Q Check]. PEDSpace Knowledge Bank. https://doi.org/10.24373/pdsp-472