loading page

UB3: Fixed Budget Best Beam Identification in mmWave Massive MISO via Pure Exploration Unimodal Bandits
  • Debamita Ghosh,
  • Manjesh K. Hanawal,
  • Nikola Zlatanov
Debamita Ghosh
IITB-Monash Research Academy, IIT Bombay

Corresponding Author:[email protected]

Author Profile
Manjesh K. Hanawal
MLioNS Lab, IEOR, IIT Bombay
Author Profile
Nikola Zlatanov
Innopolis University
Author Profile


One of the core problems in millimeter wave (mmWave) massive multiple-input-single-output (MISO) communication systems, which significantly affects the data rate, is the misalignment of the beam direction of the transmitter towards the receiver. In this paper, we investigate strategies that identify the best beam within a fixed duration of time. To this end, we develop an algorithm, named Unimodal Bandit for Best Beam (UB3), that exploits the unimodal structure of the mean received signal strength as a function of the available beams and identifies the best beam within a fixed time duration using pure exploration strategies. We derive an upper bound on the probability of misidentifying the best beam, and we prove that the upper bound is of the order O (log 2 K exp {−αnA}), where K is the number of beams, A is a problem-dependent constant, and αn is the number of pilots used in the channel estimation phase. In contrast, when the unimodal structure is not exploited, the error probability is of order O (log 2 K exp {−αnA/(K log K)}). Thus, by exploiting the unimodal structure, we achieve a much better error probability, which depends only logarithmically on K. We demonstrate that UB3 outperforms the state-of-the-art algorithms through extensive simulations.
02 Feb 2024Submitted to TechRxiv
02 Feb 2024Published in TechRxiv