Jien Weng

I am a research assistant working across mathematical modelling, multi-agent reinforcement learning, and market microstructure.

My main interest is not just whether a learning algorithm works, but what kind of information the agent receives and what kind of world that information makes visible. In many of the problems I care about, what the agent observes and what the model assumes matter more than the particular update rule used to train it.

This pushes my work toward questions of regime design, credit assignment, prediction, and execution. I am also building deeper foundations in stochastic modelling and environmental modelling, partly through current wastewater treatment plant research. That is the thread connecting most of the work here, from cooperation problems in reinforcement learning to execution problems in quantitative finance and environmental systems.

Start Here

Bio for the short research background and current questions.
Notes for working explanations, derivations, and technical clarifications.
Blog for essays, event write-ups, and less formal pieces.

Current Work

Information design and credit assignment in multi-agent cooperation
Optimal execution with predictive alpha signals

Selected Publications

The Primacy of Information Design Over Algorithm Selection in Multi-Agent Cooperation

Jien Weng Lai, Wei Lun Tan, Ying Loong Lee, Ming Fai Chow · Under Review @ IEEE Social Computation

Apr 2026

DOI

Abstract

A full-factorial experiment in the Public Goods Game shows that information regime and incentive strength explain 85.8% of cooperation-rate variance; algorithm choice accounts for just 3.8%. Agents with the least information cooperate most (83% vs 42% under full observation), attributable to state-space compression. TreeSHAP and Shapley-variance decomposition confirm information structure, not algorithm selection is the primary design lever for cooperation.

Optimal Execution with Alpha Signals

Jien Weng Lai · SSRN · Published

Apr 2026

PDFDOI

Abstract

The execution of large portfolio transactions requires balancing market impact and adverse price drift. The Almgren-Chriss (2001) framework provides a meanvariance trade-off for martingale price processes, but practitioners often utilize short-term alpha signals. This paper re-evaluates the optimal liquidation problem using Stochastic Optimal Control. By incorporating a mean-reverting alpha signal into the price dynamics, we derive a closed-form solution using the Hamilton-Jacobi-Bellman (HJB) equation. The resulting optimal trading rate is an affine function of the current inventory and the predictive signal. This results in a trajectory that adjusts execution speed to capture transient alpha. This work provides a transparent and additive framework for institutional execution desks.