Skip to content

About

Genotype Imputation

Genotype imputation is a key component of genetic association studies, where it increases power, facilitates meta-analysis, and aids interpretation of signals. This imputation server allows registered users to perform genotype imputation for their own samples. Typically, this allows tens of millions of genetic variants to be genotyped with very high-fidelity in samples that have been genotyped with an affordable genotyping array. Imputation works by comparing short stretches of an individual genome to stretches of previously characterized genomes. Since all humans are related, this process can typically and accurately reconstruct much of the missing data in each genome.

The TOPMed Imputation Panel and Server

The TOPMed Imputation Reference panel is a diverse reference panel including information from 133,597 deeply sequenced human genomes. The panel is available to the community through a collaboration between TOPMed Study Investigators, the National Heart Lung and Blood Institute and the University of Michigan Imputation Server team.

The TOPMed Study

The Trans-Omics for Precision Medicine (TOPMed) program seeks to elucidate the genetic architecture and disease biology of heart, lung, blood, and sleep disorders, with the ultimate goal of improving diagnosis, treatment, and prevention. The initial phases of the program included whole genome sequencing of individuals with rich phenotypic data and diverse backgrounds. This reference panel is built on a subset of 133,597 TOPMed samples for whom high quality whole genome sequence data was available and for where contributing studies approved this use of the data, taking into account both the limited risks associated with including a sample in an imputation reference panel that is only available through this imputation server and also other study and sample specific use restrictions. Version r3 of the panel, which is the current version available to the scientific community at large, includes 133,597 reference samples and 445,600,184 genetic variants distributed across the 22 autosomes and the X chromosome.

For a more detailed description of the study, see the TOPMed pre-print currently on BioXriv Taliun, D. et al. (2019) Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Biorxiv, doi:10.1101/563866.

The Michigan Imputation Server

The Michigan Imputation Server is a joint of effort of the statistical genetics teams at the University of Michigan in the U.S.A. and at the University of Innsbruck in Austria. The Michigan Imputation Server technology provides a web-based service for imputation that facilitates access to imputation reference panels and greatly improves user experience and productivity. The server maintains a private list of reference genomes, which are not directly accessible by panel users. Users can request imputation of their own samples and securely download the results. For information on imputation server technology, please see:

The National Heart Lung and Blood Institute

The National Heart Lung and Blood Institute is part of the National Institutes of Health and has provided key support to the development of the studies that collectively constitute the TOPMed Program. The Institute has also sponsored the sequencing and analysis of whole genome sequence and other data within the TOPMed program. NHLBI is sponsoring this imputation server and providing underlying compute resources through its BioDataCatalyst program.

Citation

If you use the Imputation Server in your work, please cite:

Acknowledgements

Trans-Omics in Precision Medicine (TOPMed) program imputation panel (version TOPMed-r3) supported by the National Heart, Lung and Blood Institute (NHLBI); see topmed.nhlbi.nih.gov. TOPMed study investigators contributed data to the reference panel, which can be accessed through the TOPMed Imputation Server. The panel was constructed and implemented by the TOPMed Informatics Research Center at the University of Michigan (3R01HL-117626-02S1; contract HHSN268201800002I). The TOPMed Data Coordinating Center (3R01HL-120393-02S1; contract HHSN268201800001I) provided additional data management, sample identity checks, and overall program coordination and support. We gratefully acknowledge the studies and participants who provided biological samples and data for TOPMed.