In recent years, environmental sustainability has emerged as a critical issue for society, prompting governments, local authorities, and private organizations to promote initiatives that encourage more sustainable lifestyles. One such initiative is the AWorld application, which leverages gamification to educate, raise awareness, and inspire positive behavioral changes. The app engages users through personalized challenges, or missions, that require interaction with its features and reward participants with in-game points. In this article, we present a hierarchical Multi-Armed Bandit (MAB) framework, MABTree, designed to replace AWorld’s current random mission assignment with a targeted and adaptive approach guided by a custom reward function. Using real-world interaction data from AWorld, we evaluated MABTree in an offline setting across multiple policies and configurations. Results show that MABTree improves cumulative reward, user coverage, and mission diversity compared to classical MAB baselines, with particularly strong gains in balancing exploration of new features and exploitation of known user preferences. While these findings indicate the potential of MABTree to enhance personalization and engagement in sustainability-focused applications, further online testing is needed to validate its real-world impact on long-term user behavior change.

A Multi-Armed Bandit Framework for Personalized Sustainability Goal Assignment

Bucchiarone, Antonio
;
2026-01-01

Abstract

In recent years, environmental sustainability has emerged as a critical issue for society, prompting governments, local authorities, and private organizations to promote initiatives that encourage more sustainable lifestyles. One such initiative is the AWorld application, which leverages gamification to educate, raise awareness, and inspire positive behavioral changes. The app engages users through personalized challenges, or missions, that require interaction with its features and reward participants with in-game points. In this article, we present a hierarchical Multi-Armed Bandit (MAB) framework, MABTree, designed to replace AWorld’s current random mission assignment with a targeted and adaptive approach guided by a custom reward function. Using real-world interaction data from AWorld, we evaluated MABTree in an offline setting across multiple policies and configurations. Results show that MABTree improves cumulative reward, user coverage, and mission diversity compared to classical MAB baselines, with particularly strong gains in balancing exploration of new features and exploitation of known user preferences. While these findings indicate the potential of MABTree to enhance personalization and engagement in sustainability-focused applications, further online testing is needed to validate its real-world impact on long-term user behavior change.
File in questo prodotto:
Non ci sono file associati a questo prodotto.
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11697/280280
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact