Expected Variables Present Study Results III: PAQS Query 3


Created

Loading...
Thumbnail Image

Last Modified

Findings


Tags



Domain

Publisher

PEDSnet

Abstract

The results of an Expected Variables Present check using the Single Site, Anomaly Detection, Longitudinal parameters. This check evaluates the annual distributions of key variables related to diabetes: stroke, second-line antidiabetics, ketoacidosis, an Hba1c > 8%, elevated blood pressure, and CKD.

Funder(s)

Provenance

This data quality check was run on a data extracted from the PAQS Query 3 Project. These results were not used for the query, but rather run on the query data after analysis was conducted as part of an SSQDA demonstration project.

Description

NOTE This is a single site analysis, but results for all sites are included in the output attached to this record for ease of review & reuse. General observations are recorded below, and details about specific institutions can be discovered in the raw output.

  • Most anomalies are generally observed in the time periods before 2015 and after 2020
  • The pre-2015 anomalies are likely explainable by small patient populations in that time period and less stable data as EHRs were in the process of being onboarded in the broader medical community.
  • The later anomalies are observed less frequently. Some can be considered an artifact of the drop in utilization due to the pandemic and a slower return to baseline after the fact. Others are anomalies above expectation, which could be due to an influx of new data in recent years or a change in coding practices.
  • Some institutions are more likely to have anomalies in the time series if there is a large range between the earliest and latest data point. For example, if there is very low capture pre-2015, this can cause the better capture in recent years to appear as anomalous.

Response to Findings

Study team did not conduct further investigation or data intervention in response to these results because this was a demonstration project.

Clinical Subjects Headings

Development Code

Vocabulary

Related Data Quality Results

Expected Variables Present Study Results I: PAQS Query 3
Created:2025-05-30Affiliation:PEDSnet Data Coordinating Center
The results of an Expected Variables Present check using the Multi-Site, Exploratory, Cross-Sectional parameters. This check probes the presence of key variables related to diabetes: stroke, second-line antidiabetics, ketoacidosis, an Hba1c > 8%, elevated blood pressure, and CKD.
Expected Variables Present Study Results II: PAQS Query 3
Created:2025-05-30Affiliation:PEDSnet Data Coordinating Center
The results of an Expected Variables Present check using the Multi-Site, Anomaly Detection, Longitudinal parameters. This check evaluates the annual distributions of key variables related to diabetes: stroke, second-line antidiabetics, ketoacidosis, an Hba1c > 8%, elevated blood pressure, and CKD.

Data Source

Institutions

Related Data Quality Check

Expected Variables Present: Single Site, Anomaly Detection, Longitudinal Analysis
Created:2024-06-05Affiliation:PEDSnet Data Coordinating Center
This check provides raw data and visualizations to aid a user in evaluating whether expected concepts are present in a dataset of interest. It summarizes the proportion of patients with co-occurring variables. This check promotes the identification of anomalous data for a single site data across time (years).

Related Study

Related Publications

Creative Commons license

Except where otherwised noted, this item's license is described as a CC-BY Attribution 4.0 License.

Cite this Data Quality Result

Wieand, K. (2025, May). Expected Variables Present Study Results III: PAQS Query 3. [D Q Result]. PEDSpace Knowledge Bank. https://doi.org/10.24373/pdsp-679