Long Tran-Thanh - Human-Agent Learning Research Lab

Contact Information

Department of Computer Science,
University of Warwick,
UK CV4 7AL

So far I have done some research in the following topics:

Online learning
Game theory
AI for society
Crowdsourcing
Energy usage management
Other topics

Online learning/sequential decision making under uncertainty

A core area of my research is sequential decision making under uncertainty with resource constraints. More precisely, I am interested in multi-armed bandit (MAB) models where pulling an arm (i.e., making a decision) requires a cost and the total spending is limited by a finite budget. To tackle this problem, I have introduced a new model, called the budget-limited MAB, and have also proposed a number of arm pulling algorithms for which I have provided both theoretical and empirical performance analyses. For more details, see:

AAAI 2010 paper
AAAI 2012 paper (honourable mention for the AAAI 2012 Outstanding Paper Award)
My PhD thesis (runner-up for the best European PhD thesis in AI, and runner-up for the best Computer Science PhD thesis in the UK)

More recently with Anshuka Rangi from UCSD we have investigated the adversarial version of this problem (also known as the adversarial bandits with 1-dim knapsack problem).

IJCAI 2019 paper

I am also interested in applying this bandit model (or its variances) to other domains of AI, such as

Decentralised controlling for UAVs: AAMAS 2012 paper
Information collection in wireless sensor networks: JAAMAS journal article
Budget-limited online keyword bidding: UAI 2014 paper (collaboration with Peter Key from MSR Cambridge)
Applications in security games (see Game Theory section below)
Applications in crowdsourcing (see Crowdsourcing section)

Other work within this topic includes:

IJCAI 2015 paper on online stochastic multiple-choice knapsacks (collaboration with Tao Qin and his team from MSR Asia)
NIPS 2015 paper on Thompson sampling for online matrix factorisation (collaboration with Adobe Research folks)

Game theory

My other core research area is algorithmic game theory. I work on large coalition formation games from both game theoretical and decision making perspective. In more detail, I look at systems where the number of participants is very large (typically thousands or more). Within these systems, calculating different solution concepts (e.g., the core, nucleolus, Shapley-value, etc.) are very hard. Given this, my goal is to identify approximation techniques that can efficiently provide high quality results. To do so, with some of my colleagues, we have introduced a novel, vector-based, representation model of the participating agents, with which we can calculate the abovementioned concepts in a significantly more efficient way. For more details, see our IJCAI 2013 paper (collaboration with Tri-Dung Nguyen from School of Maths, Southampton).

Staying within game theory, I also study different games with resource allocation from both aspects of classical and behaviourial game theory. In particular, I am interested in calculating different equilibria and price of anarchy. A preliminary version of work has been published at SAGT 2011. Within the repeated games setting, I aim to identify players' favourite resource allocation strategies when they repeatedly play such games against different opponents (e.g., Repeated Colonel Blotto, repeated Dollar Auctions).

AAMAS 2016 paper on repeated dollar auctions (joint work with Marcin Waniek)
AAAI 2017 paper on dollar auction with spiteful players (joint work Marcin Waniek)
AAAI 2020 paper on repeated colonel Blotto and Hide and Seek games (joint work with Dong Quan Vu et al. from Grenoble)

More recently, I have been working on security games, where the Stackelberg model and its variants is used to model the game between defenders and attackers. In this model the defender is allowed to make the first move, and the attacker will reveal their best response in the second phase. Within this setting, I investigate the following research questions:

What happens if the game is repeatedly played, and the defender has no prior information about the attackers? Our results on zero-sum security games (joint work with Haifeng Xu) can be found here (AAMAS 2016), and this work (in collaboration with Qinyu Guo from NTU) looks at the network games setting (IJCAI 2017).
What happens if the attacker can lie about their true utilities by faking the observation of the defender? Our ACM EC 2019 paper and NeurIPS 2019 paper (both with Jiarui Gan from Oxford) investigates this problem in more detail, and we provide optimal deception solutions for the attacker to fool their opponent.
Vice versa, how a defender can incentivise the attacker to follow a certain desired behaviour that will be beneficial for the system? For the single shot game setting, our IJCAI 2018 paper provides some insights (joint work with Fei Fang's group from CMU). While for the repeated setting, we have some working papers that look at last round convergence guarantees and payoff matrix design approaches that guarantee a single unique Nash equilibrium point. The idea is that once we can establish these properties (i.e., unique Nash and last round convergence guarantees), we can guide the opponent to converge to a favourable mixed strategy. The first work is joint work with my PhD student Cong Dinh (co-supervised by Tri-Dung Nguyen and Alain Zemkoho from Maths). The second work is co-led by Nick Bishop (my other PhD student) and Cong.

Other work on security games with structures:

Coalitional security games: AAMAS 2016 paper (joint work with Bo An's group from NTU)
Network interdiction games: IJCAI 2017 paper, AAAI 2019 paper (both are joint work with Bo An's group from NTU)
Inducibility of Stackelberg security games: AAAI 2019 paper (multiple coauthors, working together while visiting Milind Tambe at USC)
Network security games:AAAI 2020 paper (joint work with Xiaowei Wu and Minming Li)

Other work on game theory:

Regret analysis in empirical games: AAAI 2020 paper (joint work with Arunesh Sinha and folks from Michigan)
Social cost guarantees in smart traffic control: PRICAI 2019 paper (joint work with my former postdoc Paolo Serafino)
Selfish mining in blockchain systems: PRIMA 2019 paper (led by my PhD student Tin Leelavimolsilp)
Continuous influence maximisation games: AAMAS 2020 extended abstract paper, Complex Networks 2020 paper, Complex Networks 2019 paper (my PhD student, Guillermo Moreno, is the lead author)
Application to cyber security: AAMAS 2018 paper (collaboration with Milind Tambe's group from USC)

AI for society

In the last few years I have been using principled AI techniques to tackle a number of societal and environmental challenges. These include:

Work on housing allocation management for homeless youth in the US: AIES 2018 paper (joint work with Hau Chan)
Game theoretic and online learning based patrolling mechanisms against illegal poaching: AAMAS 2019 paper (joint work with Shahrzad Gholami)
Efficient resource allocation to mitigate wild fires: IJCAI 2020 paper (will be uploaded soon, joint work with Hau Chan and Vignesh Viswanathan)

I also have 2 projects with my colleagues in Vietnam. One is about building low-cost sensor systems for air pollution monitoring in Saigon (joint work with Hien vo from VGU and Huy-Dzung Han from HUST), and the other one is about building stand-alone intelligent devices for tuberculosis testing (with Cuong Pham from PTIT).

Crowdsourcing

Another application domain of my research is crowdsourcing. In particular, I am interested in investigating the performance of different crowdsourcing systems from a theoretical perspective, aiming to provide rigorous performance guarantees for task allocation algorithms. Some of our results are listed below:

ECAI 2012 paper (runner-up for the ECAI 2012 Best Student Paper Award)
AAMAS 2013 paperon offline budget allocation for crowdsourcing systems
AAMAS 2014 paper on online budget allocation in the Find-Fix-Verify workflow
AIJ journal article (2014). This work investigates the bounded bandit problem and its applications to expert crowdsourcing systems (Top 25 most cited papers of the journal between 2015-2019 period)
AAAI 2015 paper on resource allocation in complex crowdsourced workflows
PRICAI 2019 paper on crowd contest design with budget limits
NeurIPS 2019 paper on efficient Bayesian inference for crowdsourced data collection
AIJ journal paper (2019) on efficient data collection from the crowd

Home energy usage management

I am heavily involved in the research work on home energy management. In particular, we aim to improve the energy consumption profile of home owners, in order to reduce the CO2 emission of the domestic energy sector. To do so, as the first step, we mainly focussed on the accurate learning and prediction of homeowners' habit, such as appliance usage and heating preferences. Some currently published results are (all with Henry Cuong Truong):

ATES 2013 workshop paper
ACM E-Energy 2013 paper
IJCAI 2013 paper

More recently we look at combining ML techniques with user preference elicitation to further improve predictive home energy management. However, the preference elicitation part has to be done without annoying the users too much (e.g., to learn when to ask and when not to ask questions). Our results can be found at:

IJCAI 2016 paper (joint work with Henry Cuong Truong and Tim Baarslag)
AAMAS 2018 paper (joint work with Tiep Le et al. from NMSU)

Other research interests

The cost of interference to closed evolving systems (joint work with The Anh Han's group from Teesside): We investigate what is the cost to interfere into closed systems, if we want the system to achieve some desirable states. As a first step, we look at evolving evolutionay games, where an external decision maker can invest his resources into the system (e.g., via a reward scheme) such that in the long term, the agents will follow a prefered behaviour. A preliminary result has been published at Nature's Scientific Reports. A follow-up result has been accepted to IJCAI 2018.

Non-monetary referral incentives (joint work with Victor Naroditskiy, Seb Stein, Micro Tonin, and Michael Vlassopoulos): I am also investigating how non-monetary referral incentivisation work in social networks. You can find a preliminary version of our work here. For more details, you can visit the website of our project, or watch a video about it. We also have a publication at HCOMP 2014.

Algebraic topology for machine learning: With my PhD student Tom Davies we are also investigating how to make the application of persistent diagrams and other techniques from algebraic topology more efficient and automated in machine learning systems. Our first result is a fuzzy clustering method for persistent diagrams: paper on Arxiv.