Biography
Hi, this is Wenxiang Jiao (焦文祥), an NLP researcher whose interests include LLM agents, reasoning, and personality.
Previously, I worked extensively on machine translation and multilingualism.
I received my Ph.D. degree from the Chinese University of Hong Kong in 2021, under the supervision of Prof. Irwin King and Prof. Michael R. Lyu. Before that, I received my Bachelor's and M.Phil. degrees from Nanjing University in 2015 and 2017, respectively.
Experience
- Tencent AI Lab (2021 - 2025), Senior Researcher
- Tencent AI Lab (2019 - 2021), Research Intern
Research
Currently, I focus on Large Language Models (LLMs), with topics spanning agents and reasoning, personality, safety, and continual learning.
Representative works include Is ChatGPT A Good Translator, Multi-Agent Debate, CipherChat, and PsychoBench.
LLM Agents & Reasoning
- DeepAgent: A General Reasoning Agent with Scalable Toolsets. Under Review 2025.
- LoopTool: Closing the Data–Training Loop for Robust LLM Tool Calls. Under Review 2025.
- Curing Miracle Steps in LLM Mathematical Reasoning with Rubric Rewards. Under Review 2025.
- REA-RL: Reflection-Aware Online Reinforcement Learning for Efficient Large Reasoning Models. Under Review 2025.
- Learning to Ask: When LLM Agents Meet Unclear Instruction. EMNLP 2025.
- How Far Are We on the Decision-Making of LLMs? Evaluating LLMs’ Gaming Ability in Multi-Agent Environments. ICLR 2025.
- Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate. EMNLP 2024.
LLM Personality
- Fints: Efficient Inference-Time Personalization for LLMs with Fine-Grained Instance-Tailored Steering. Under Review 2025.
- Emotionally Numb or Empathetic? Evaluating How LLMs Feel using EmotionBench. NeurIPS 2024.
- On the Reliability of Psychological Scales on Large Language Models. EMNLP 2024.
- On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs. ICLR 2024 Oral.
LLM Safety
- Towards Evaluating Proactive Risk Awareness of Multimodal Language Models. NeurIPS 2025 D&B.
- Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training. ACL 2025.
- Chain-of-Jailbreak Attack for Image Generation Models via Step by Step Editing. Findings of ACL 2025.
- Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models. ACL 2024.
- All Languages Matter: On the Multilingual Safety of Large Language Models. Findings of ACL 2024.
- GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher. ICLR 2024.
LLM Continual Learning
- DRPruning: Efficient Large Language Model Pruning through Distributionally Robust Optimization. ACL 2025.
- NewTerm: Benchmarking Real-Time New Terms for Large Language Models with Annual Updates. NeurIPS 2024 D&B.
- Tencent’s Multilingual Machine Translation System for WMT22 Large-Scale African Languages. WMT 2022.
Sign Language Translation
- Improving Gloss-free Sign Language Translation by Reducing Representation Density. NeurIPS 2024.
- Unsupervised Sign Language Translation and Generation. Findings of ACL 2024.
- Cross-modality Data Augmentation for End-to-End Sign Language Translation. Findings of EMNLP 2023.
Spotlight Projects
🔥 News
Sep 19, 2025
One paper accepted to NeurIPS 2025 D&B. Congratulations!
Aug 21, 2025
One paper accepted to EMNLP 2025 main conference. Congratulations!
May 16, 2025
Three papers accepted to ACL 2025 conference (2 main + 1 findings). Congratulations to all the co-authors!
Jan 23, 2025
Great news! Congratulations to my Ph.D. advisor, Prof. Irwin King, on being selected as a 2024 ACM Fellow.
Jan 23, 2025
Two papers accepted to ICLR 2025 conference. Congratulations to all the co-authors!
Sep 26, 2024
Three papers accepted to NeurIPS 2024 conference (2 main + 1 benchmark). Congratulations to all the co-authors!