We study the problem of predicting the next query to be recommended
in interactive data exploratory analysis to guide users to correct content.
Current query prediction approaches are based on sequence-to-sequence
learning, exploiting past interaction data. However, due to the resourcehungry
training process, such approaches fail to adapt to immediate
user feedback. Immediate feedback is essential and considered as a signal
of the user’s intent. We contribute with a novel query prediction
ensemble mechanism, which adapts to immediate feedback relying on
Multi-Armed Bandits (MAB) framework. Our mechanism, an extension
to the popular Exp3 algorithm, augments Transformer-based language
models for query predictions by combining predictions from experts,
thus dynamically building a candidate set during exploration. Immediate
feedback is leveraged to choose the appropriate prediction in
a probabilistic fashion. We provide comprehensive large-scale experimental
and comparative assessment using a popular online literature
discovery service, which showcases that our mechanism (i) improves
the per-round regret substantially against state-of-the-art Transformerbased
models and (ii) shows the superiority of causal language modelling
over masked language modelling for query recommendations

First published: 2 July 2024