Expected Variables Present: Multi-Site, Exploratory, Longitudinal Analysis


Created

Last Modified

Click on the thumbnail above to preview images.

Domain

Category

Parameters

Publisher

PEDSnet

Abstract

This check provides analyses to examine multiple sites. It generates an exploratory analysis summary across time of the proportion of patients who have a given variable to evaluate whether expected concepts are present in a dataset of interest across time.

Probe

Clinical Assessment

Access Package

# install.packages("devtools") devtools::install_github('ssdqa/https://github.com/ssdqa/conceptsetdistribution')

Visualization Output

This check outputs a line graph that displays the proportion of patients belonging to a given variable & site within each site across a user provided time span. Hovering over a line will display the exact proportion for that site, variable, and time point. Lines can be isolated by clicking on the legend.

Raw Output

The raw data output of this check produces ten columns of data:

Column Data Type Definition
site character the name of the site being targeted OR “combined” if multiple sites were provided
time_start date the start of the time period being examined
time_increment character the length of each time period
total_pt_ct numeric the total number of patients from the cohort in the domain table
total_row_ct numeric the total number of rows associated with patients from the cohort in the domain table
variable_pt_ct numeric the number of patients with evidence of the variable
variable_row_ct numeric the number of rows with evidence of the variable
prop_pt_variable numeric the proportion of patients with evidence of the variable
prop_row_variable numeric the proportion of rows with evidence of the variable
variable character the name of the variable

Funder(s)

This research was made possible through the generous support of Patient-Centered Outcomes Research Institute. The statements presented in this work are solely the responsibility of the author(s) and do not necessarily represent the views of PCORI, its Board of Governors, or its Methodology Committee.

Provenance

Description

Clinical Subjects Headings

Related Data Quality Result

Expected Variables Present Study Results Part IV: SSDQA Comparison
Created:2026-02-05Affiliation:PEDSnet Data Coordinating Center
The results of an Expected Variables Present check using the Multi-Site, Exploratory, Longitudinal parameters. This check investigates temporal trends in the population of drug metadata fields related to hydroxyurea drug exposure.

Related Person

Related Code

Study-Specific Quality, Utility, and Breadth Assessment
Created:2025-11Affiliation:PEDSnet Data Coordinating Center
This suite of R packages allows one to investigate multiple facets of data quality and customize analyses based on your study-specific needs. Each module allows up to 8 different analyses in either the OMOP or PCORnet CDM, all aimed at taking a different view of the data while still addressing the same data quality probe.

##### [View pkgdown summary here.](https://ssdqa.github.io/squba/)

Related Data Quality Check

Related Publications

Creative Commons license

Except where otherwised noted, this item's license is described as a CC-BY Attribution 4.0 License.

Cite this Data Quality Check

PEDSnet Data Coordinating Center. (2024, June). Expected Variables Present: Multi-Site, Exploratory, Longitudinal Analysis. [D Q Check]. PEDSpace Knowledge Bank. https://doi.org/10.24373/pdsp-468