The NIH and other organizations make the results of supported research available to the public enabling data reuse, increasing transparency, and the facilitation of reproducibility of research results.
IRB approval is required for some requests and while open-access data can be browsed online or downloaded without prior permission, controlled-access data can only be obtained after the requestor has been authorized.
Specific information about the common sources can be found below.
The database of Genotypes and Phenotypes (dbGaP)
- The PI will:
- submit an on-line request to The National Center for Biotechnology Information (NCBI).
- email required documents to Sponsored Program Services (SPS).
- NCBI will notify the SPS Signing Official (SO) of the request.
- SPS will secure NCBI application and Data Use Certification Agreement (DUC) signature(s).
- The SO will approve the request in the NCBI system.
NIDDK Specimen and Data Repository
National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK) Central Repository
The NIDDK Central Repository enables scientists to test new hypotheses without the need to collect any new data or biospecimens and provides the opportunity to pool data across several studies to increase the power of statistical analyses. In addition, most NIDDK-funded studies are collecting genetic biospecimens and carrying out high-throughput genotyping making it possible for other scientists to use Central Repository resources to match genotypes to phenotypes and to perform informative genetic analyses. | Data Access is Controlled:
Summary level data is open. Credentialed user must apply for access to individual level data. |
National Institute of Mental Health Data Archive (NDA)
National Institute of Mental Health Data Archive (NDA)
The National Institute of Mental Health Data Archive (NDA) makes available human subjects data collected from hundreds of research projects across many scientific domains. NDA provides infrastructure for sharing research data, tools, methods, and analyses enabling collaborative science and discovery. De-identified human subjects’ data, harmonized to a common standard, are available to qualified researchers. Summary data are available to all. | Data Access is Mixed. |
NDA access portal |
International Bio-repositories (UK BioBank)
International Bio-repositories (UK BioBank) is a uniquely powerful biomedical database that can be accessed globally by approved researchers to explore de-identified data from half a million UK Biobank participants to enable new discoveries to improve public health.
The UK Biobank Data Showcase provides a summary of all the information gathered by UK Biobank on our 500,000 participants and is available to explore. The showcase contains background information on how these data were collected and notes about future collections.
Before applying to access UK Biobank data, or if you are already accessing data, please keep up to date by checking the notes and additional resources provided with categories and data-fields for useful information. The user guide can be found here:
National Institutes of Mental Health NRGR (NIMH Repository and Genomics Resource)
The National Institute of Mental Health Repository and Genomics Resource allows users to search phenotypic & genetic data collections by disorder or study, including stem-cell data.
Principal Investigators (PIs) can request access for 3 years, however, once the access request expires, the Principal Investigator (PI) may not maintain any raw data files. To maintain access to distributions, the PI must submit a renewal request to renew their access.
It can take at least four weeks for NIMH to review your application. For questions regarding the application process, please contact the NIMH Genomics Resources Support Team at mailto:NIMH.genomics.resources@mail.nih.gov.