UYSD: a novel data repository accessible via public website for worldwide population frequencies of Y-SNP haplogroups.

Ralf A, Zandstra D, van Wersch B, Köksal Z, Larmuseau MHD, Rosa A, Jobling MA, D'Amato ME, Courts C, Gysi M, Haas C, Flores R, Neis M, Wetton JH, Kiesler K, Ameur A, Azonbakin S, Bôžiková A, Choma A, De Ungria MC, Corradini B, Cruz C, Dunkelmann B, Ferri G, Fleckhaus J, Fragou D, Gaens N, Gonçalves R, Havaš Auguštin D, Helm K, Hölzl-Müller P, Kaliszan M, Kasu M, Kovatsi L, Lesaoana M, Mizuno N, Neuhuber F, Nováčková J, Ňuňuková A, Pamjav H, Parson W, Ramankulov Y, Rangel Villalobos H, Rębała K, Rootsi S, Salvador J, Šarac J, Steffen CR, Stenzl V, Török T, Villems R, Watahiki H, Zhabagin M, Schneider PM, Kayser M

Eur. J. Hum. Genet. 33 (7) 904-912 [2025-07-00; online 2025-05-08]

For decades, there has been scientific interest in the variation and geographic distribution of paternal lineages associated with the human Y chromosome. However, the relevant data have been dispersed across numerous publications, making it difficult to consolidate. Additionally, understanding the relationships between different variants, and the tools used to analyze them, have evolved over time, further complicating efforts to harmonize this information. The Universal Y-SNP Database (UYSD) marks a substantial advancement by providing a comprehensive and accessible platform for Y-SNP and haplogroup data from populations around the world. UYSD harmonizes diverse datasets into a unified repository, facilitating the exploration of global Y-chromosomal variation. The platform handles data generated with both high- and low-throughput technology and is compatible with the automated analysis software tool, Yleaf v3. Key functionalities include the ability to: i) visualize haplogroup distributions on an interactive world map, ii) estimate haplogroup frequencies in geographic regions with sparse data through interpolation, and iii) display detailed phylogenetic trees of Y-chromosomal haplogroups. Currently, UYSD encompasses data from over 6,600 males across 27 populations. This dataset largely aligns with known global Y-haplogroup patterns, but also reveals unexplored finer-scale geographic variations. While the present dataset is largely European-centered, UYSD is designed for ongoing expansion by the scientific community, aiming to include more global data and higher-resolution population sequencing data. The platform thus offers valuable insights into human genetic diversity and migration patterns, serving several fields of research such as: human population genetics, genetic anthropology, ancient DNA analysis and forensic genetics.

NGI Uppsala (Uppsala Genome Center) [Collaborative]

National Genomics Infrastructure [Collaborative]

PubMed 40341774

DOI 10.1038/s41431-025-01854-5

Crossref 10.1038/s41431-025-01854-5

pmc: PMC12229683
pii: 10.1038/s41431-025-01854-5


Publications 9.5.1