Quantitative Variable Distributions: Multi Site, Exploratory, Cross-Sectional Analysis
Created
Last Modified
Files
Domain
Category
Parameters
Publisher
Abstract
This check provides raw data and visualizations to aid a user in evaluating whether the distribution of quantitative variables aligns with clinical expectations. It can summarize the distribution of a quantitative variable (like lab result values) or patient counts (like number of patients with an outpatient visit).
Data Requirements
Probe
Clinical Assessment
Access Package
# install.packages("devtools")
devtools::install_github('ssdqa/https://github.com/ssdqa/quantvariabledistribution')Visualization Output
This check outputs boxplots displaying the distribution of each value type at each site. The diamond reflects the mean value of the distribution. The user can control whether outliers are displayed or not, and they can set a minimum frequency with which a value should occur to be included in the plot. In the example plot, outliers are displayed and a minimum frequency of 5 occurrences is used.
Raw Output
This check produces a raw data output containing 10 columns:
| Column | Data Type | Definition |
|---|---|---|
site |
character | the name of the site being targeted OR “combined” if multiple sites were provided |
value_col |
numeric | the quantitative value of the variable of interest |
value_freq |
numeric | the frequency with which the quantitative value occurs |
value_type |
character | the type of value being measured |
mean_val |
numeric | the mean of value_col |
median_val |
numeric | the median of value_col |
sd_val |
numeric | the standard deviation of value_col |
q1_val |
numeric | the first quantile of value_col |
q3_val |
numeric | the third quantile of value_col |
output_function |
character | a string indicating the type of visualization that should be generated by qvd_output |

