NanoBanana侧面黑白鲨守-modified.png

🛬 Attending

🇰🇷ICML2026, Seoul, KR


🥰 TY to grants from:

Open Philanthropy

Thinking Machines Lab


<aside> 😎 Google Scholar

</aside>

<aside> 🤠 Research Gate

</aside>

<aside> 🧐 Academic CV

</aside>

Socials


<aside> 😝

Linkedin

</aside>

<aside> 😁 BlueSky

</aside>

<aside> 😆 X/Twitter

</aside>


The rise of reasoning and agentic AI is a double-edged sword, which motivates me to study what could go wrong with reasoning agents (esp. in multi-agents that emulate human collaboration/competition) in 2 directions:

  1. Reasoning Done Right: How can we make agents smarter? My vertical focus on improving capability evaluation better inform post-training (reasoning-driven RL) leveraging horizontal from actionable interpretability (e.g. model diffing) and robustness probes (e.g. longitudinal analysis as probe for contamination). Ultimately, this line of work contributes to AI4Science that enable frontier AI to better accelerate scientific discovery, where I’m particularly intrigued by AI application in exoplanetary astrophysics, phenomenology of high-energy particle physics and organic synthetic chemistry.
  2. Reasoning For Good: How can we make smarter agents safer? Reasoning has also enabled novel threat model such as deception, scheming and collusion. My vertical focus on identifying key triggers that suppress/encourage such agentic misaligned behavior and probing how models react differently in realistic (quasi-deployment) vs. fictional (quasi-evaluation) scenarios. Eventually, I believe models need to learn safety constraints via consequence-aware reasoning (CoI, Chain-of-Implication) similar to how legal deterrence work on us humans.

Selected Work (Full List: GScholar)

Slide: What could go wrong with Reasoning Machines?

Useful:

The following represents only my personal opinions:

Science of Evaluation

Science of Post-Training

Education


<aside> 🐔 PhD

Advisor:

</aside>

<aside> 🐥 MSc. Interdisciplinary Science ETH (CS and Physics) ETH Zurich, Switzerland (2024-2025)

Thesis Advisor: Prof. Zhijing Jin, Prof. Bernhard Schölkopf

</aside>

<aside> 🐣 BSc. Interdisciplinary Science ETH (CS, Physics and Chemistry) ETH Zurich, Switzerland (2023-2025)

Thesis Advisor: Prof. Mrinmaya Sachan

</aside>