HSE Researchers Teach Neural Network to Distinguish Origins from Genetically Similar Populations
Researchers from the AI and Digital Science Institute, HSE Faculty of Computer Science, have proposed a new approach based on advanced machine learning techniques to determine a person’s genetic origin with high accuracy. This method uses graph neural networks, which make it possible to distinguish even very closely related populations.
Over the past 10–15 years, genetic analysis has become increasingly popular not only as a tool for medical diagnostics, but also as a means of ancestry research. DNA testing allows people to learn more about their ethnic background, identify the places where their ancestors lived, and determine the number of Neanderthal mutations in their genome.
This has become possible thanks to the development of modern technologies—such as genotyping, data storage and processing systems, and machine learning—and the significant reduction in their cost. However, current testing methods are unable to differentiate between genetically similar populations that have lived in adjacent regions for extended periods.
Researchers from the AI and Digital Science Institute have developed a method for distinguishing between individuals from closely related populations. At the heart of this technology are graph neural networks, which do not rely on DNA sequences but instead use graphs to represent genetic links between individuals with shared genome segments. These shared segments indicate the degree of kinship between people, revealing how many generations back their common ancestors lived. The more overlaps there are, the closer the ancestral connection. In the model, each person is represented by a vertex in the graph, and the edges between vertices indicate the strength of the connection between two people.
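The graph representation described above can be illustrated with a minimal Python sketch. It is not the researchers' AncestryGNN model: the sample identifiers, segment lengths, population labels, and the function names build_graph and vote_label are all hypothetical. A simple weighted vote among labelled neighbours stands in for the neural network, only to show how edge strength could carry information about population affiliation.

```python
from collections import defaultdict

# Hypothetical toy data: (person_a, person_b, shared_segment_length).
# Identifiers, lengths, and units are invented for illustration only.
SHARED_SEGMENTS = [
    ("p1", "p2", 30.0),
    ("p1", "p2", 12.5),
    ("p2", "p3", 8.0),
    ("p3", "p4", 40.0),
    ("p1", "p4", 5.0),
]

# Hypothetical known population labels; p2 is left unlabelled.
KNOWN_LABELS = {"p1": "A", "p3": "B", "p4": "B"}


def build_graph(segments):
    """Return an undirected weighted graph as {person: {neighbour: weight}}.

    Each person is a vertex; the edge weight is the total length of the
    genome segments that the two people share.
    """
    graph = defaultdict(lambda: defaultdict(float))
    for a, b, length in segments:
        graph[a][b] += length
        graph[b][a] += length
    return graph


def vote_label(graph, labels, person):
    """Guess a label from labelled neighbours, weighted by edge strength.

    This is a simple stand-in for a graph neural network, not the
    authors' method.
    """
    scores = defaultdict(float)
    for neighbour, weight in graph[person].items():
        if neighbour in labels:
            scores[labels[neighbour]] += weight
    return max(scores, key=scores.get) if scores else None


graph = build_graph(SHARED_SEGMENTS)
prediction = vote_label(graph, KNOWN_LABELS, "p2")
print(graph["p1"]["p2"], prediction)
```

In this toy example, p2 shares far more genome with p1 than with p3, so the vote assigns p2 to p1's population. A graph neural network would instead learn how to combine such neighbourhood information from training data.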
The method was tested on data from various regions. The results were particularly insightful for the population of the East European Plain, as a large dataset had already been compiled there. The graph neural network was able to accurately determine the population affiliation of individuals from genetically similar ethnic groups.
‘Existing methods of genetic analysis address a different task: they identify affiliation with large, isolated groups, such as determining whether someone has French, German, or English ancestry. Our method enables the analysis of closely related populations, which is particularly relevant for Russia, a country with a diverse ethnic background,’ said Aleksei Shmelev, one of the study's authors and Research Assistant at the HSE International Laboratory of Statistical and Computational Genomics, AI and Digital Science Institute.
In their future work, the researchers aim to train the neural network to predict the proportion of different populations within a genome.
They have named their development AncestryGNN, which stands for ‘Neural Network-Based Prediction of Population Affiliation via Shared Genome Segments.’
As noted by Vladimir Shchur, Head of the International Laboratory of Statistical and Computational Genomics at the AI and Digital Science Institute, HSE University, the proposed method holds great potential for a more accurate understanding of human history and can be applied in genealogy and anthropology research.
This research was supported by a grant from the Government of the Russian Federation as part of the federal program ‘Artificial Intelligence.’