LLMs · Agents · Reasoning · Safety

Wenxiang Jiao 焦文祥

I build benchmarks, agents, and alignment methods that expose where large language models break — in reasoning, safety, and psychological behavior — and turn the findings into systems that ship.

Wenxiang Jiao
CurrentXiaohongshu Inc., LLM Algorithm Expert, 2025 - Present
PreviouslyTencent AI Lab, Senior Researcher, 2021 - 2025
EducationPh.D., The Chinese University of Hong Kong, 2021
Service Reviewer for Nature Nature Machine Intelligence NeurIPS ICML ACL

Three papers accepted to ACL 2026 conference (2 main + 1 findings). Congratulations!

Two papers (REA-RL, DeepCompress) accepted to ICLR 2026. Congratulations!

DeepAgent accepted to WWW 2026. Congratulations!

One paper accepted to NeurIPS 2025 D&B. Congratulations!

ArXiv 2026

A benchmark for evaluating omni-modal general AI assistants.

WWW 2026

A general reasoning agent with scalable toolsets for complex multi-step tasks.

ICLR 2024

A safety evaluation framework for stealthy chat with LLMs via cipher encoding.

ICLR 2024 Oral

A benchmark for evaluating psychological portrayals in large language models.

General Agents

Multimodal, tool-using, and collaborative agents for complex tasks.

DeepAgent · OmniGAIA · MAD

LLM Reasoning

Mathematical, reflective, and efficient long-chain reasoning.

DeepCompress · REA-RL

LLM Safety

Risk awareness, jailbreak robustness, multilingual safety, and refusal.

CipherChat · DeRTa

LLM Personality

Emotion, personality, and psychological portrayals in conversational AI.

PsychoBench · EmotionBench · Fints