Data Resources
Data Resources
VUMC Electronic Health Record Data
VICTR’s EHR data resources provide access to rich, longitudinal clinical data to support a wide range of research use cases—from exploratory cohort discovery to custom data extraction. Whether working with identified or de-identified records, researchers can leverage decades of patient data for impactful insights.
The Research Derivative (RD) is Vanderbilt’s fully identified and robust repository of clinical data drawn from the electronic medical record (Epic, StarPanel, the Enterprise Data Warehouse, HEO, and other sources).
Learn more about the Research Derivative (RD) here.
The Synthetic Derivative (SD) is a rich, multi-source repository of data collected from VUMC’s clinical records and de-identified for use in research.
Learn more about the Synthetic Derivative (SD) here.
Genomics
VICTR’s genomic resources link DNA and omics data to comprehensive clinical records, enabling cutting-edge research in precision medicine, disease susceptibility, and therapeutic response. With one of the largest biobanks of its kind, VUMC is a leader in EHR-linked genomic discovery.
The Alliance for Genomic Discovery (AGD) provides whole genome sequencing data from 250,000 BioVU samples, linked to de-identified clinical data.
Learn more about Alliance for Genomic Discovery (AGD) here.
Imaging
VICTR’s imaging repository offers access to a large archive of clinical imaging studies, with support from experienced DICOM specialists. Imaging data can be linked to clinical records, making it a powerful resource for studies that combine radiologic and phenotypic data.
Biorepositories
VICTR’s biorepositories house diverse biospecimens—from microbial isolates to archival tissues—paired with detailed clinical data. They support mechanistic, diagnostic, and translational studies by enabling researchers to connect biospecimens with patient-level insights.
Clinical Trial Feasibility
Tools like TriNetX help researchers evaluate protocol feasibility and identify potential cohorts using real-world data. These platforms accelerate trial design by offering self-service access to timely, de-identified clinical information across local and national networks.
Network Data Sources
VUMC participates in national research networks that provide harmonized datasets for large-scale population health and comparative effectiveness studies. These sources offer access to EHR, genomic, and survey data from millions of diverse participants across the country.
The All of Us Research Program’s repository is a one-of-a-kind dataset stored on the Researcher Workbench, a secure, cloud-based platform, where registered researchers can access data from surveys, genomic analyses, electronic health records, physical measurements, and wearables to study the full range of factors that influence health and disease.
Learn more about the All of Us program here.
VUMC maintains a limited dataset containing clinical data to support research as a part of the Science, Technology and Research partnership (STAR), Clinical Research Network (CRN), funded by Patient Centered Outcomes Research Institute (PCORI), which is part of the National Patient-Centered Clinical Research Network, PCORnet®.
Learn more about PCORnet here.
The National Clinical Cohort Collaborative (N3C) COVID Enclave is a nationwide repository of EHR-derived data focused on COVID-19 research, systematically collecting information from multiple health institutions.
Learn more about N3C here.