A core area of my research is sequential decision making under uncertainty with resource constraints.
More precisely, I am interested in multi-armed bandit (MAB) models where pulling an arm (i.e., making a decision) requires a cost and the total spending is limited by a finite budget.
To tackle this problem, I have introduced a new model, called the budget-limited MAB, and have also proposed a number of arm pulling algorithms for which I have provided both theoretical and empirical performance analyses.
For more details, see:
More recently with Anshuka Rangi from UCSD we have investigated the adversarial version of this problem (also known as the adversarial bandits with 1-dim knapsack problem).
I am also interested in applying this bandit model (or its variances) to other domains of AI, such as
Other work within this topic includes:
My other core research area is algorithmic game theory. I work on large coalition formation games from both game theoretical and decision making perspective. In more detail, I look at systems where the number of participants is very large (typically thousands or more). Within these systems, calculating different solution concepts (e.g., the core, nucleolus, Shapley-value, etc.) are very hard. Given this, my goal is to identify approximation techniques that can efficiently provide high quality results. To do so, with some of my colleagues, we have introduced a novel, vector-based, representation model of the participating agents, with which we can calculate the abovementioned concepts in a significantly more efficient way. For more details, see our IJCAI 2013 paper (collaboration with Tri-Dung Nguyen from School of Maths, Southampton).
Staying within game theory, I also study different games with resource allocation from both aspects of classical and behaviourial game theory. In particular, I am interested in calculating different equilibria and price of anarchy. A preliminary version of work has been published at SAGT 2011. Within the repeated games setting, I aim to identify players' favourite resource allocation strategies when they repeatedly play such games against different opponents (e.g., Repeated Colonel Blotto, repeated Dollar Auctions).
More recently, I have been working on security games, where the Stackelberg model and its variants is used to model the game between defenders and attackers. In this model the defender is allowed to make the first move, and the attacker will reveal their best response in the second phase. Within this setting, I investigate the following research questions:
Other work on security games with structures:
Other work on game theory:
In the last few years I have been using principled AI techniques to tackle a number of societal and environmental challenges. These include:
I also have 2 projects with my colleagues in Vietnam. One is about building low-cost sensor systems for air pollution monitoring in Saigon (joint work with Hien vo from VGU and Huy-Dzung Han from HUST), and the other one is about building stand-alone intelligent devices for tuberculosis testing (with Cuong Pham from PTIT).
Another application domain of my research is crowdsourcing. In particular, I am interested in investigating the performance of different crowdsourcing systems from a theoretical perspective, aiming to provide rigorous performance guarantees for task allocation algorithms.
Some of our results are listed below:
I am heavily involved in the research work on home energy management.
In particular, we aim to improve the energy consumption profile of home owners, in order to reduce the CO2 emission of the domestic energy sector.
To do so, as the first step, we mainly focussed on the accurate learning and prediction of homeowners' habit, such as appliance usage and heating preferences.
Some currently published results are (all with Henry Cuong Truong):
More recently we look at combining ML techniques with user preference elicitation to further improve predictive home energy management. However, the preference elicitation part has to be done without annoying the users too much (e.g., to learn when to ask and when not to ask questions). Our results can be found at:
The cost of interference to closed evolving systems (joint work with The Anh Han's group from Teesside): We investigate what is the cost to interfere into closed systems, if we want the system to achieve some desirable states. As a first step, we look at evolving evolutionay games, where an external decision maker can invest his resources into the system (e.g., via a reward scheme) such that in the long term, the agents will follow a prefered behaviour. A preliminary result has been published at Nature's Scientific Reports. A follow-up result has been accepted to IJCAI 2018.
Non-monetary referral incentives (joint work with Victor Naroditskiy, Seb Stein, Micro Tonin, and Michael Vlassopoulos): I am also investigating how non-monetary referral incentivisation work in social networks. You can find a preliminary version of our work here. For more details, you can visit the website of our project, or watch a video about it. We also have a publication at HCOMP 2014.
Algebraic topology for machine learning: With my PhD student Tom Davies we are also investigating how to make the application of persistent diagrams and other techniques from algebraic topology more efficient and automated in machine learning systems. Our first result is a fuzzy clustering method for persistent diagrams: paper on Arxiv.