Patient Records Consistency: Single Site, Anomaly Detection, Cross-Sectional Analysis
Created
Last Modified
Files
Domain
Category
Parameters
Publisher
Abstract
This check provides analyses to identify anomalous data at the level of a single site. The Patient Record Consistency module, part of the larger SSDQA ecosystem, tests the consistency of clinical data representation within a patient’s record. The goal is to ensure that the patient’s information is confirmatory and complete, such that two events that are expected to co-exist do both occur within the same patient (i.e. a leukemia diagnosis and chemotherapy).
Data Requirements
Probe
Clinical Assessment
Access Package
# install.packages("devtools")
devtools::install_github('ssdqa/patientrecordconsistency')Visualization Output
This check outputs a bar graph displaying the Jaccard similarity index for categories of F/U time.
The length of F/U “bins” are provided by the user via a vector of integers for breaks. Each bin includes the starting integer and up to but NOT including the ending integer. For example, [0,1) includes patients with F/U between 0 and 0.99 years.
Raw Output
This check produces a raw data output containing 11 columns of data:
| Column | Data Type | Definition |
|---|---|---|
site |
character | the name of the site being targeted |
fu_bin |
character | the categorical bin of follow up time defined by the user in the fu_breaks parameter |
concept1 |
character | the name of the first event being compared in the similarity index |
concept2 |
character | the name of the second event being compared in the similarity index |
cocount |
numeric | the number of patients with evidence of both events |
concept1_ct |
numeric | the number of patients with evidence of event 1 |
concept2_ct |
numeric | the number of patients with evidence of event 2 |
concept_count_union |
numeric | the number of patients with evidence of either event |
jaccard_index |
numeric | the jaccard similarity index (cocount / concept_count_union) |
concept1_prop |
numeric | the proportion of patients with evidence of event 1 (cocount / concept1_ct) |
concept2_prop |
numeric | the proportion of patients with evidence of event 2 (cocount / concept2_ct) |

