This shows the impact of the mobility of stations on the performance obtained. Credit: UPF

An article published in the advanced online edition of the journal Computer Communications shows that machine learning can improve the operation and performance of the future Wi-Fi networks of the 5G/6G ecosystem. The research was conducted by Marc Carrascosa and Boris Bellalta, researchers with the Wireless Networking Research Group at the UPF Department of Information and Communication Technologies (DTIC).

The authors focused their study on how to improve the association of Wi-Fi network users consisting of multiple access points in order to be able to serve a large number of users. This type of Wi-Fi network is common in business and academic environments or in public spaces in cities (streets, parks, libraries, etc.).

"In this study, we looked at how stations (PCs, tablets, mobile phones, etc.) may themselves decide dynamically which of the different access points available in their coverage area is offering the best service for their needs using Reinforcement Learning techniques," Carrascosa and Bellalta explain.

Each station takes decisions dynamically

In their proposal, each station is independent and takes decisions dynamically based on the quality of service offered by the Wi-Fi network over time, i.e., the station autonomously learns how the Wi-Fi network is behaving, identifying the impact of its own actions (choosing one or another ) on the benefits received.

"For this learning, as a basis we use an algorithm called ε-greedy, which alternates between choosing access points at random to obtain information (exploring), and choosing the best access points used based on this accumulated information (exploiting)," the authors suggest.

"Thus, the more information, the better decisions we take, considering, however, that there is a compromise between the time a station can devote to learning and the time it disposes of to use what it has learned successfully," they add.

A comparison of the performance obtained using Machine Learning and Load Balancing. Credit: UPF

A new algorithm that shortens station learning time

To solve the limitations of the ε-greedy algorithm, including long learning time, the authors propose a that they call ε-sticky, which includes the concept of emotional attachment. It works so that once the has found an access point that provides the service requested, even if it ceases to do so later, it does not immediately discard it to look for another new one again in the hope that in the future it might give the same good service.

With this new proposal, service disruptions to users and network instability are reduced, which also benefits stations that have not yet found an access point that offers the required . "Despite not being the goal of our work, the extrapolation to humans' social behavior is quite direct, as is the interpretation in this field of the results we present," Carrascosa and Bellalta comment.

"In the article, we study the impact of this change and how it allows us to get better results for the problem of Wi-Fi association. The ultimate goal is to show the effectiveness of techniques to solve problems in Wi-Fi networks that are not easily solved by preconfigured mechanisms. With our results, we also show that not all stations need to make use of these techniques, since if only a few stations implement the new algorithm, the entire benefits," the authors uphold.

More information: Marc Carrascosa et al, Multi-armed bandits for decentralized AP selection in enterprise WLANs, Computer Communications (2020). DOI: 10.1016/j.comcom.2020.05.023

Provided by Universitat Pompeu Fabra - Barcelona