Source and Concept Vocabularies: Single Site, Exploratory, Cross-Sectional Analysis
| dc.contributor | Patient-Centered Outcomes Research Institute |
| dc.contributor.author | PEDSnet Data Coordinating Center |
| dc.contributor.other | PEDSnet Data Coordinating Center |
| dc.date.accessioned | 2024-09-09T17:26:07Z |
| dc.date.created | 2024-06-05 |
| dc.description.abstract | This check provides exploratory analyses at the level of a single site. It generates a single snapshot of a high-level summary of how the source system mappings may impact the data representation. This check may only be executed if both the source code and the represented code are provided. |
| dc.identifier.uri | https://hdl.handle.net/20.500.14642/781 |
| dc.identifier.uri | https://doi.org/10.24373/pdsp-449 |
| dc.publisher | PEDSnet |
| dc.relation.uri | https://github.com/ssdqa/sourceconceptvocabularies/tree/main |
| dc.rights | a CC-BY Attribution 4.0 License. |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0 |
| dc.subject | Single Site Analysis |
| dc.subject | Exploratory Analysis |
| dc.subject | Cross-Sectional Analysis |
| dc.subject | Event-Level Analysis |
| dc.title | Source and Concept Vocabularies: Single Site, Exploratory, Cross-Sectional Analysis |
| dspace.entity.type | DQCheck |
| local.code.package | # install.packages("devtools") devtools::install_github('ssdqa/sourceconceptvocabularies') |
| local.description.raw | This check produces a raw data output containing nine columns of data: <br> | Column | Data Type | Definition | |-------------------|---------------------|------------------------------------------------------------------------------------------------------| |`site` | character | the name of the site being targeted OR "combined" if multiple sites were provided | |`domain` | character | the domain associated with the provided concept set | |`concept_id` | numeric / character | the primary concept, native to the CDM and mapped from the source | |`source_concept_id` | numeric / character | the source concept, from the source system and mapped to the CDM | |`ct` | numeric | the number of times the `concept_id` / `source_concept_id` pair occurs in the data | |`denom_concept_ct` | numeric | the number of times the `concept_id` appears in the data | |`denom_source_ct` | numeric | the number of times the `source_concept_id` appears in the data | |`concept_prop` | numeric | the proportion of `concept_id` appearences made up by the `concept_id` / `source_concept_id` pair | |`source_prop` | numeric | the proportion of `source_concept_id` appearances made up by the `concept_id` / `source_concept_id` pair | {.dqcheck-table} |
| local.description.viz | Heatmaps of the top mappings (`source_concept_id`) for the top CDM codes (`concept_id`). The gradient is blue to red where red corresponds to the highest proportion of concept mapping (concept_prop) and blue corresponds to the lowest. Concept names can be identified by hovering over the heatmaps. A separate reference table, lists all `concept_id`, their corresponding concept_name and their respective total counts. |
| local.dqcheck.category | Information Representation |
| local.dqcheck.clinicalprobe | Expected Clinical Event Representation |
| local.dqcheck.clinicalprobe | Clinical Data Distributions |
| local.dqcheck.measurement | Frequency Distribution |
| local.dqcheck.probe | Data Representation Errors |
| local.dqcheck.probe | Misclassification Detection |
| local.dqcheck.probe | Anomalous Values from Internal Distributions |
| local.dqcheck.requirement | cohort |
| local.dqcheck.requirement | concept_set |
| local.dqcheck.requirement | omop_or_pcornet |
| local.dqcheck.requirement | domain_tbl |
| local.dqcheck.requirement | code_type |
| local.dqcheck.requirement | code_domain |
| local.dqcheck.requirement | multi_or_single_site |
| local.dqcheck.requirement | anomaly_or_exploratory |
| local.dqcheck.requirement | p_value |
| local.dqcheck.requirement | age_groups |
| local.dqcheck.requirement | time |
| local.dqcheck.requirement | time_span |
| local.dqcheck.requirement | time_period |
| local.dqcheck.type | Concept Set Testing |
| local.dqcheck.viz | Heatmap |
| relation.isCodeOfDQCheck | 929c8dfc-2c8b-4e62-8e1d-0fa06c542832 |
| relation.isCodeOfDQCheck.latestForDiscovery | 929c8dfc-2c8b-4e62-8e1d-0fa06c542832 |
| relation.isDQResultOfDQCheck | ac372812-8a10-4aad-895a-74bfb4ca1858 |
| relation.isDQResultOfDQCheck.latestForDiscovery | ac372812-8a10-4aad-895a-74bfb4ca1858 |
