AI to Enable Accurate Modelling of Data Storage System Performance
Researchers at the HSE Faculty of Computer Science have developed a new approach to modelling data storage systems based on generative machine learning models. This approach makes it possible to accurately predict the key performance characteristics of such systems under various conditions. Results have been published in the IEEE Access journal.
Data storage systems play an important role in today’s digital world, as they are responsible for the safety and prompt availability of vast amounts of information. These systems consist of many components, including controllers, HDD and SSD disks, as well as cache memory, which work together to ensure fast and efficient operation. To achieve optimal performance, it is essential to accurately predict how these systems will function in different scenarios, such as when the load on the system changes.
Researchers at the HSE Faculty of Computer Science developed a new approach to modelling data storage system performance, which relies on generative machine learning models. The authors proposed a method that provides high-precision predictions of the key performance characteristics of the systems: the number of input/output operations per second (IOPS) and latency.
The modelling includes two stages. First, the scientists collect data by measuring the system’s performance under various loads and configurations. This data is then fed to two special generative models: the CatBoost regression model and the normalizing flow model. CatBoost works well with tabular data and can accurately predict average values and performance deviations. The normalizing flow model produces a complete distribution of possible outcomes, taking into account data uncertainties and variability.
Mikhail Hushchyn
‘One of the main advantages of our method is that it does not require detailed knowledge of the internal structure of the system components. This is often impossible due to the manufacturers’ trade secrets. Instead, our generative models are trained directly on real-world data. For instance, in our study, we trained a model using 300,000 measurements. This makes our approach versatile and applicable to any type of data storage system,’ says study author Mikhail Hushchyn, a senior research fellow at the HSE Faculty of Computer Science.
The researchers tested the accuracy of the proposed approach using Little's law, a fundamental principle of queuing theory. According to test results, these predictions are highly consistent with real observations: prediction errors range from just 4–10% for IOPS and 3–16% for latency, while the correlation with the observed values reaches 0.99.
Aziz Temirkhanov
‘Our proposed approach opens up broad prospects for optimising and planning the operation of data centres. It makes it possible to predict the behaviour of the system amid load changes, identify potential performance issues, and optimise power consumption. Furthermore, expensive physical experiments are no longer required for accurate modelling,’ stated Aziz Temirkhanov, a junior research fellow at the Laboratory of Methods for Big Data Analysis.
The experimental code and measurements of the storage system performance are publicly available.
The research was carried out within the Mirror Laboratories project of HSE University on improving the efficiency of data centres and data storage systems using artificial intelligence methods.
See also:
HSE Neurolinguists Reveal What Makes Apps Effective for Aphasia Rehabilitation
Scientists at the HSE Centre for Language and Brain have identified key factors that increase the effectiveness of mobile and computer-based applications for aphasia rehabilitation. These key factors include automated feedback, a variety of tasks within the application, extended treatment duration, and ongoing interaction between the user and the clinician. The article has been published in NeuroRehabilitation.
'Our Goal Is Not to Determine Which Version Is Correct but to Explore the Variability'
The International Linguistic Convergence Laboratory at the HSE Faculty of Humanities studies the processes of convergence among languages spoken in regions with mixed, multiethnic populations. Research conducted by linguists at HSE University contributes to understanding the history of language development and explores how languages are perceived and used in multilingual environments. George Moroz, head of the laboratory, shares more details in an interview with the HSE News Service.
Slim vs Fat: Overweight Russians Earn Less
Overweight Russians tend to earn significantly less than their slimmer counterparts, with a 10% increase in body mass index (BMI) associated with a 9% decrease in wages. These are the findings made by Anastasiia Deeva, lecturer at the HSE Faculty of Economic Sciences and intern researcher in Laboratory of Economic Research in Public Sector. The article has been published in Voprosy Statistiki.
Scientists Reveal Cognitive Mechanisms Involved in Bipolar Disorder
An international team of researchers including scientists from HSE University has experimentally demonstrated that individuals with bipolar disorder tend to perceive the world as more volatile than it actually is, which often leads them to make irrational decisions. The scientists suggest that their findings could lead to the development of more accurate methods for diagnosing and treating bipolar disorder in the future. The article has been published in Translational Psychiatry.
Scientists Develop AI Tool for Designing Novel Materials
An international team of scientists, including researchers from HSE University, has developed a new generative model called the Wyckoff Transformer (WyFormer) for creating symmetrical crystal structures. The neural network will make it possible to design materials with specified properties for use in semiconductors, solar panels, medical devices, and other high-tech applications. The scientists will present their work at ICML, a leading international conference on machine learning, on July 15 in Vancouver. A preprint of the paper is available on arxiv.org, with the code and data released under an open-source license.
‘Economic Growth Without the AI Factor Is No Longer Possible’
The International Summer Institute on AI in Education has opened in Shanghai. The event is organised by the HSE Institute of Education in partnership with East China Normal University (ECNU). More than 50 participants and key speakers from over ten countries across Asia, Europe, North and South America have gathered to discuss the use of AI technologies in education and beyond.
HSE Linguists Study How Bilinguals Use Phrases with Numerals in Russian
Researchers at HSE University analysed over 4,000 examples of Russian spoken by bilinguals for whom Russian is a second language, collected from seven regions of Russia. They found that most non-standard numeral constructions are influenced not only by the speakers’ native languages but also by how frequently these expressions occur in everyday speech. For example, common phrases like 'two hours' or 'five kilometres’ almost always match the standard literary form, while less familiar expressions—especially those involving the numerals two to four or collective forms like dvoe and troe (used for referring to people)—often differ from the norm. The study has been published in Journal of Bilingualism.
Overcoming Baby Duck Syndrome: How Repeated Use Improves Acceptance of Interface Updates
Users often prefer older versions of interfaces due to a cognitive bias known as the baby duck syndrome, where their first experience with an interface becomes the benchmark against which all future updates are judged. However, an experiment conducted by researchers from HSE University produced an encouraging result: simply re-exposing users to the updated interface reduced the bias and improved their overall perception of the new version. The study has been published in Cognitive Processing.
Mathematicians from HSE Campus in Nizhny Novgorod Prove Existence of Robust Chaos in Complex Systems
Researchers from the International Laboratory of Dynamical Systems and Applications at the HSE Campus in Nizhny Novgorod have developed a theory that enables a mathematical proof of robust chaotic dynamics in networks of interacting elements. This research opens up new possibilities for exploring complex dynamical processes in neuroscience, biology, medicine, chemistry, optics, and other fields. The study findings have been accepted for publication in Physical Review Letters, a leading international journal. The findings are available on arXiv.org.
Mathematicians from HSE University–Nizhny Novgorod Solve 57-Year-Old Problem
In 1968, American mathematician Paul Chernoff proposed a theorem that allows for the approximate calculation of operator semigroups, complex but useful mathematical constructions that describe how the states of multiparticle systems change over time. The method is based on a sequence of approximations—steps which make the result increasingly accurate. But until now it was unclear how quickly these steps lead to the result and what exactly influences this speed. This problem has been fully solved for the first time by mathematicians Oleg Galkin and Ivan Remizov from the Nizhny Novgorod campus of HSE University. Their work paves the way for more reliable calculations in various fields of science. The results were published in the Israel Journal of Mathematics (Q1).