Wenxiang Jiao (焦文祥)

Biography

Hi, this is Wenxiang Jiao (焦文祥), an NLP researcher whose interests include LLM agents, reasoning, and personality. Previously, I worked extensively on machine translation and multilingualism. I received my Ph.D. degree from the Chinese University of Hong Kong in 2021, under the supervision of Prof. Irwin King and Prof. Michael R. Lyu. Before that, I received my Bachelor's and M.Phil. degrees from Nanjing University in 2015 and 2017, respectively.

Experience

Tencent AI Lab (2021 - 2025), Senior Researcher
Tencent AI Lab (2019 - 2021), Research Intern

Research

Currently, I focus on large language models (LLMs), with work spanning agents and reasoning, personality, safety, and continual learning. Representative works include Is ChatGPT A Good Translator, Multi-Agent Debate, CipherChat, and PsychoBench.

LLM Agents & Reasoning

DeepAgent: A General Reasoning Agent with Scalable Toolsets. Under Review 2025.
LoopTool: Closing the Data–Training Loop for Robust LLM Tool Calls. Under Review 2025.
Curing Miracle Steps in LLM Mathematical Reasoning with Rubric Rewards. Under Review 2025.
REA-RL: Reflection-Aware Online Reinforcement Learning for Efficient Large Reasoning Models. Under Review 2025.
Learning to Ask: When LLM Agents Meet Unclear Instruction. EMNLP 2025.
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs’ Gaming Ability in Multi-Agent Environments. ICLR 2025.
Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate. EMNLP 2024.

LLM Personality

Fints: Efficient Inference-Time Personalization for LLMs with Fine-Grained Instance-Tailored Steering. Under Review 2025.
Emotionally Numb or Empathetic? Evaluating How LLMs Feel using EmotionBench. NeurIPS 2024.
On the Reliability of Psychological Scales on Large Language Models. EMNLP 2024.
On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs. ICLR 2024 Oral.

LLM Safety

Towards Evaluating Proactive Risk Awareness of Multimodal Language Models. NeurIPS 2025 D&B.
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training. ACL 2025.
Chain-of-Jailbreak Attack for Image Generation Models via Step by Step Editing. Findings of ACL 2025.
Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models. ACL 2024.
All Languages Matter: On the Multilingual Safety of Large Language Models. Findings of ACL 2024.
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher. ICLR 2024.

LLM Continual Learning

DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization. ACL 2025.
NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates. NeurIPS 2024 D&B.
Tencent’s Multilingual Machine Translation System for WMT22 Large-Scale African Languages. WMT 2022.

Sign Language Translation

Improving Gloss-free Sign Language Translation by Reducing Representation Density. NeurIPS 2024.
Unsupervised Sign Language Translation and Generation. Findings of ACL 2024.
Cross-modality Data Augmentation for End-to-End Sign Language Translation. Findings of EMNLP 2023.

🔥 News

Sep 19, 2025

One paper accepted to NeurIPS 2025 D&B. Congratulations!

Aug 21, 2025

One paper accepted to EMNLP 2025 main conference. Congratulations!

May 16, 2025

Three papers accepted to ACL 2025 conference (2 main + 1 findings). Congratulations to all the co-authors!

Jan 23, 2025

Great news! Congratulations to my PhD mentor, Prof. Irwin King, on being selected as a 2024 ACM Fellow.

Jan 23, 2025

Two papers accepted to ICLR 2025 conference. Congratulations to all the co-authors!

Sep 26, 2024

Three papers accepted to NeurIPS 2024 conference (2 main + 1 benchmark). Congratulations to all the co-authors!
