Study-Specific Quality, Utility, and Breadth Assessment


SQUBA

Date Last Modified

Tags




Publisher

PEDSnet

About

This suite of R packages allows one to investigate multiple facets of data quality and customize analyses based on your study-specific needs. Each module allows up to 8 different analyses in either the OMOP or PCORnet CDM, all aimed at taking a different view of the data while still addressing the same data quality probe.

View pkgdown summary here.

Access Package

devtools::install_github('ssdqa/squba')

Provenance

Developer Affiliation(s)

Description

This is a “verse” package that will download all currently available modules and the support function package. Each module is encapsulated in its own package, so they are also available to download in isolation if you would rather not install the full suite.

Browse individual checks in PEDSpace, or, browse available modules in Github:
  • Cohort Fitness
    • Patient Facts: Assesses the availability of patient clinical data per year of follow-up as a factor of visit type
    • Patient Event Sequencing: Evaluates the plausibility of the temporal relationship between two clinical events
    • Patient Record Consistency: Checks for consistency within a patient’s clinical record to ensure the information is confirmatory and complete
  • Variable Testing
  • Concept-Set Testing
  • Dataset Fitness
  • Cohort Identification
    • Cohort Attrition: Examine each step of a study’s attrition criteria to identify potential irregularities in cohort construction
    • Sensitivity to Selection Criteria: Compare demographics, utilization patterns, and clinical fact makeup of a base cohort definition to alternate cohort definitions

Clinical Subject Headings

Related Code

Related Concept Sets

Related Studies

Semantic Data Quality Standards for Multi-Center Clinical Research Studies and Networks
Affiliation:PEDSnet Data Coordinating Center
Study aimed to define specific guidelines for data quality testing to allow users to assess fitness of data for a given use; to design analytic components to address data quality issues using a stakeholder-responsiveness process; to develop freely-available assessment and reporting tools for these guidelines.

Related Phenotypes

Related Resources

Related Data Quality Check

Source and Concept Vocabularies: Single Site, Exploratory, Cross-Sectional Analysis
Created:2024-06-05Affiliation:PEDSnet Data Coordinating Center
This check provides exploratory analyses at the level of a single site. It generates a single snapshot of a high-level summary of how the source system mappings may impact the data representation. This check may only be executed if both the source code and the represented code are provided.
Patient Records Consistency: Single Site, Anomaly Detection, Longitudinal Analysis
Created:2024-12-17Affiliation:PEDSnet Data Coordinating Center
This check provides analyses to identify anomalous data across time at the level of a single site. The Patient Record Consistency module, part of the larger SSDQA ecosystem, tests the consistency of clinical data representation within a patient's record. The goal is to ensure that the patient's information is confirmatory and complete, such that two events that are expected to co-exist do both occur within the same patient (i.e. a leukemia diagnosis and chemotherapy).
Patient Records Consistency: Multi Site, Exploratory, Longitudinal Analysis
Created:2024-12-17Affiliation:PEDSnet Data Coordinating Center
This check provides exploratory analyses across time to examine multiple sites. The Patient Record Consistency module, part of the larger SSDQA ecosystem, tests the consistency of clinical data representation within a patient's record. The goal is to ensure that the patient's information is confirmatory and complete, such that two events that are expected to co-exist do both occur within the same patient (i.e. a leukemia diagnosis and chemotherapy).
Patient Records Consistency: Multi Site, Anomaly Detection, Longitudinal Analysis
Created:2024-12-17Affiliation:PEDSnet Data Coordinating Center
This check provides analyses to identifiy anomalous data across time among multiple sites. The Patient Record Consistency module, part of the larger SSDQA ecosystem, tests the consistency of clinical data representation within a patient's record. The goal is to ensure that the patient's information is confirmatory and complete, such that two events that are expected to co-exist do both occur within the same patient (i.e. a leukemia diagnosis and chemotherapy).
Patient Records Consistency: Single Site, Anomaly Detection, Cross-Sectional Analysis
Created:2024-12-17Affiliation:PEDSnet Data Coordinating Center
This check provides analyses to identify anomalous data at the level of a single site. The Patient Record Consistency module, part of the larger SSDQA ecosystem, tests the consistency of clinical data representation within a patient's record. The goal is to ensure that the patient's information is confirmatory and complete, such that two events that are expected to co-exist do both occur within the same patient (i.e. a leukemia diagnosis and chemotherapy).

Creative Commons license

Except where otherwised noted, this item's license is described as a CC-BY Attribution 4.0 license.

This license applies to the metadata and documentation of this digital package. The source code and software components may be governed by different licensing terms. Please refer to the GitHub repository link for specific software licensing information.

Cite this Code

Razzaghi, H., Wieand, K., Dickinson, K., & Bailey, C. (2025, October). Study-Specific Quality, Utility, and Breadth Assessment. [Code]. PEDSpace Knowledge Bank. https://hdl.handle.net/20.500.14642/1232