Expected Variables Present: Single Site, Anomaly Detection, Longitudinal Analysis


dc.contributorPatient-Centered Outcomes Research Institute
dc.contributor.authorPEDSnet Data Coordinating Center
dc.contributor.otherPEDSnet Data Coordinating Center
dc.date.accessioned2024-09-09T17:20:49Z
dc.date.created2024-06-05
dc.description.abstractThis check provides raw data and visualizations to aid a user in evaluating whether expected concepts are present in a dataset of interest. It summarizes the proportion of patients with co-occurring variables. This check promotes the identification of anomalous data for a single site data across time (years).
dc.identifier.urihttps://hdl.handle.net/20.500.14642/776
dc.identifier.urihttps://doi.org/10.24373/pdsp-467
dc.publisherPEDSnet
dc.relation.urihttps://github.com/ssdqa/expectedvariablespresent
dc.rightsa CC-BY Attribution 4.0 License.
dc.rights.urihttp://creativecommons.org/licenses/by/4.0
dc.subjectSingle Site Analysis
dc.subjectData Anomaly Method
dc.subjectLongitudinal Analysis
dc.subjectPerson-Level Analysis
dc.titleExpected Variables Present: Single Site, Anomaly Detection, Longitudinal Analysis
dspace.entity.typeDQCheck
local.code.package# install.packages("devtools") devtools::install_github('ssdqa/https://github.com/ssdqa/conceptsetdistribution')
local.description.rawThe raw data output of this check produces ten columns of data for analysis over annual time intervals: <br> | Column | Data Type | Definition | |-------------------|-----------|-----------------------------------------------------------------------------------------| |`site` | character | the name of the site being targeted OR "combined" if multiple sites were provided | |`time_start` | date | the start of the time period being examined | |`time_increment` | character | the length of each time period | |`total_pt_ct` | numeric | the total number of patients from the cohort in the domain table | |`total_row_ct` | numeric | the total number of rows associated with patients from the cohort in the domain table | |`variable_pt_ct` | numeric | the number of patients with evidence of the variable | |`variable_row_ct` | numeric | the number of rows with evidence of the variable | |`prop_pt_variable` | numeric | the proportion of patients with evidence of the variable | |`prop_row_variable` | numeric | the proportion of rows with evidence of the variable | |`variable` | character | the name of the variable | {.dqcheck-table} The raw data output of this check produces eleven columns of data for analysis in monthly or weekly time intervals: <br> | Column | Data Type | Definition | |-------------------|-----------|--------------------------------------------------------------------------------------------------------------------| |`observed` | numeric | the original proportion of patients/rows | |`season` | numeric | the seasonal component of the time series | |`trend` | numeric | the trend component of the time series | |`remainder` | numeric | the residual component after "season" and "trend" are removed from "observed" - target of anomaly detection | |`seasadj` | numeric | the adjusted seasonal component | |`anomaly` | character | a flag to indicate whether the proportion is an anomaly | |`anomaly_direction` | numeric | the direction of the anomaly (upper or lower) | |`anomaly_score` | numeric | the distance between the anomaly and the centerline | |`recomposed_l1` | numeric | the lower level bound of the processed time series used to identify lower outliers | |`recomposed_l2` | numeric | the upper level bound of the processed time series used to identify upper outliers | |`observed_clean` | numeric | the original proportion after the season and trend components have been removed and anomalies have been detected | {.dqcheck-table}
local.description.vizThis check output varies based on the time increment input by the user. For yearly time increments, the check outputs a control chart displaying the number of pair mappings across time. The user is limited to one `concept_id` or CDM code per graph A tooltip provides each point's exact coordinates upon hover. Anomalous visits are distiguished by an orange point while non-anomalous visits are blue points. For smaller time increments (by month or smaller) the check outputs two graphs to visualize anomalies while ignoring seasonality. The first is a time series line graph with anomalies indicated by red dots. The second graph is a four-facet time series line graph that demonstrates the decomposition of the anomalies to clarify how eash anomaly was identified. For each output, a tooltip provides each point's exact coordinates upon hover. Both graphs represent data for one user-specified specialty at a time.
local.dqcheck.categoryConsistency
local.dqcheck.clinicalprobeConfirmatory Clinical Data
local.dqcheck.clinicalprobeClinical Follow-Up
local.dqcheck.clinicalprobeClinical Complexity
local.dqcheck.clinicalprobeClinical Consistency
local.dqcheck.measurementSeasonal-Trend Decomposition Using LOESS
local.dqcheck.measurementTime Series Anomalies
local.dqcheck.probeData Representation Errors
local.dqcheck.probeMisclassification Detection
local.dqcheck.probeTemporality Consistency Check
local.dqcheck.probeMissing Required Data
local.dqcheck.requirementcohort
local.dqcheck.requirementomop_or_pcornet
local.dqcheck.requirementevp_variable_file
local.dqcheck.requirementmulti_or_single_site
local.dqcheck.requirementanomaly_or_exploratory
local.dqcheck.requirementoutput_level
local.dqcheck.requirementage_groups
local.dqcheck.requirementp_value
local.dqcheck.requirementtime
local.dqcheck.requirementtime_span
local.dqcheck.requirementtime_period
local.dqcheck.typeVariable Testing
local.dqcheck.vizControl Chart
relation.isCodeOfDQCheck929c8dfc-2c8b-4e62-8e1d-0fa06c542832
relation.isCodeOfDQCheck.latestForDiscovery929c8dfc-2c8b-4e62-8e1d-0fa06c542832
relation.isDQResultOfDQCheck304ff83c-5cb2-4e9a-8f50-d312f4d6e8c7
relation.isDQResultOfDQCheck.latestForDiscovery304ff83c-5cb2-4e9a-8f50-d312f4d6e8c7

Files

Original bundle

Now showing 1 - 3 of 3
Loading...
Thumbnail Image
Name:
evp_ss_anom_la_month1.png
Size:
240.07 KB
Format:
Portable Network Graphics
Loading...
Thumbnail Image
Name:
evp_ss_anom_la_month2.png
Size:
274.12 KB
Format:
Portable Network Graphics
Loading...
Thumbnail Image
Name:
evp_ss_anom_la_year.png
Size:
87.32 KB
Format:
Portable Network Graphics