Expected Variables Present: Multi-Site, Anomaly Detection, Longitudinal Analysis
| dc.contributor | Patient-Centered Outcomes Research Institute |
| dc.contributor.author | PEDSnet Data Coordinating Center |
| dc.contributor.other | PEDSnet Data Coordinating Center |
| dc.date.accessioned | 2024-09-09T17:20:49Z |
| dc.date.created | 2024-06-05 |
| dc.description.abstract | This check provides raw data and visualizations to aid a user in evaluating whether expected concepts are present in a dataset of interest. It summarizes the proportion of patients with co-occurring variables. The output helps identify anomalous data by comparing results across sites. |
| dc.identifier.uri | https://hdl.handle.net/20.500.14642/780 |
| dc.identifier.uri | https://doi.org/10.24373/pdsp-465 |
| dc.publisher | PEDSnet |
| dc.relation.uri | https://github.com/ssdqa/expectedvariablespresent |
| dc.rights | CC-BY Attribution 4.0 License |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0 |
| dc.subject | Multi-Site Analysis |
| dc.subject | Data Anomaly Method |
| dc.subject | Longitudinal Analysis |
| dc.subject | Person-Level Analysis |
| dc.title | Expected Variables Present: Multi-Site, Anomaly Detection, Longitudinal Analysis |
| dspace.entity.type | DQCheck |
| local.code.package | # install.packages("devtools") <br> devtools::install_github('ssdqa/expectedvariablespresent') |
| local.description.raw | The raw data output of this check produces nine columns of data: |

| Column | Data Type | Definition |
|--------|-----------|------------|
| `site` | character | the name of the site being targeted |
| `time_start` | date | the start of the time period being examined |
| `variable` | character | the name of the variable |
| `prop_pt_variable` / `prop_row_variable` | numeric | the proportion of patients or rows (based on user selection) with evidence of the variable |
| `mean_allsiteprop` | numeric | the average patient/row proportion across sites |
| `median` | numeric | the median patient/row proportion across sites |
| `date_numeric` | numeric | the numeric equivalent of `time_start` |
| `site_loess` | numeric | the patient/row proportion with Loess regression applied |
| `dist_eucl_mean` | numeric | the Euclidean distance of `site_loess` from `mean_allsiteprop` |

{.dqcheck-table}

| local.description.viz | This check outputs three visualizations to display the Euclidean distance between two time series: the smoothed (Loess) proportion of a user-selected variable for a given site, and the average proportion of all sites. Two line graphs (one smoothed, one raw) represent the proportion of the variable at each site over time. Sites are differentiated by color, and a thick red line represents the All Site Average. A circular bar graph displays the Euclidean distance from the all-site mean, with color representing the average Loess proportion over time. |
| local.dqcheck.category | Consistency |
| local.dqcheck.clinicalprobe | Confirmatory Clinical Data |
| local.dqcheck.clinicalprobe | Clinical Follow-Up |
| local.dqcheck.clinicalprobe | Clinical Complexity |
| local.dqcheck.clinicalprobe | Clinical Consistency |
| local.dqcheck.measurement | Euclidean Distance |
| local.dqcheck.probe | Data Representation Errors |
| local.dqcheck.probe | Misclassification Detection |
| local.dqcheck.probe | Temporality Consistency Check |
| local.dqcheck.probe | External Benchmarking |
| local.dqcheck.probe | Missing Required Data |
| local.dqcheck.requirement | cohort |
| local.dqcheck.requirement | omop_or_pcornet |
| local.dqcheck.requirement | evp_variable_file |
| local.dqcheck.requirement | multi_or_single_site |
| local.dqcheck.requirement | anomaly_or_exploratory |
| local.dqcheck.requirement | output_level |
| local.dqcheck.requirement | age_groups |
| local.dqcheck.requirement | p_value |
| local.dqcheck.requirement | time |
| local.dqcheck.requirement | time_span |
| local.dqcheck.requirement | time_period |
| local.dqcheck.type | Variable Testing |
| local.dqcheck.viz | Line Graph |
| local.dqcheck.viz | Radial Bar Graph |
| relation.isCodeOfDQCheck | 929c8dfc-2c8b-4e62-8e1d-0fa06c542832 |
| relation.isCodeOfDQCheck.latestForDiscovery | 929c8dfc-2c8b-4e62-8e1d-0fa06c542832 |
| relation.isDQResultOfDQCheck | d872868e-af25-4fdc-9f60-717320d82f3b |
| relation.isDQResultOfDQCheck | 7b0495f2-e8b7-481d-8416-8d0d6e29f404 |
| relation.isDQResultOfDQCheck.latestForDiscovery | d872868e-af25-4fdc-9f60-717320d82f3b |
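
The `mean_allsiteprop`, `site_loess`, and `dist_eucl_mean` columns described under `local.description.raw` can be illustrated with a minimal base R sketch. This is not the package's implementation: the toy data, the site names, the default `loess()` settings, and the choice to average the raw (unsmoothed) proportions across sites are all assumptions made for illustration, and the distance is collapsed to one value per site.

```r
# Toy data: 24 monthly proportions for three hypothetical sites (values are made up).
set.seed(1)
sites  <- c("site_a", "site_b", "site_c")
months <- seq(as.Date("2020-01-01"), by = "month", length.out = 24)
toy <- data.frame(
  site       = rep(sites, each = length(months)),
  time_start = rep(months, times = length(sites)),
  stringsAsFactors = FALSE
)

# Assumed baseline proportion per site, plus a little noise.
site_level <- c(site_a = 0.60, site_b = 0.62, site_c = 0.40)
toy$prop_pt_variable <- unname(site_level[toy$site]) + rnorm(nrow(toy), sd = 0.02)

# Numeric equivalent of time_start, used as the Loess predictor.
toy$date_numeric <- as.numeric(toy$time_start)

# All-site mean proportion at each time point (assumption: mean of raw proportions).
toy$mean_allsiteprop <- ave(toy$prop_pt_variable, toy$date_numeric, FUN = mean)

# Loess-smoothed proportion, fitted separately for each site with default settings.
toy$site_loess <- NA_real_
for (s in unique(toy$site)) {
  idx <- toy$site == s
  fit <- loess(prop_pt_variable ~ date_numeric, data = toy[idx, ])
  toy$site_loess[idx] <- predict(fit)
}

# Euclidean distance between each site's smoothed series and the all-site mean.
dist_by_site <- sapply(split(toy, toy$site), function(d) {
  sqrt(sum((d$site_loess - d$mean_allsiteprop)^2))
})
print(round(dist_by_site, 3))
```

In this toy data `site_c` is deliberately set well below the other two sites, so it ends up with the largest distance from the all-site mean, which is the kind of site the visualizations are meant to surface.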
