Wenxuan Wang 王文轩
Incoming Assistant Professor in Computer Science@Renmin University of China
I am extremely outgoing. Please feel free to reach me at:
Email: jwxwang@gmail.com
|
Competing Large Language Models in Multi-Agent Gaming Environments
Jen-tse Huang, Eric John Li, Man Ho Lam, Tian Liang, Wenxuan Wang (Corresponding), Youliang Yuan, Wenxiang Jiao, Xing Wang, Zhaopeng Tu, Michael R. Lyu
ICLR, 2025
| arXiv |
code |
|
|
VisFactor: Benchmarking Fundamental Visual Cognition in Multimodal Large Language Models
Jen-tse Huang, Dasen Dai, Jen-yuan Huang, Youliang Yuan, Xiaoyuan Liu, Wenxuan Wang (Corresponding), Wenxiang Jiao, Pinjia He, Zhaopeng Tu
arXiv, 2025
| arXiv |
code |
|
|
Can't See the Forest for the Trees: Benchmarking Multimodal Safety Awareness for Multimodal LLMs
Wenxuan Wang, Xiaoyuan Liu, Kuiyi Gao, Jen-tse Huang, Youliang Yuan, Pinjia He, Shuai Wang, Zhaopeng Tu
arXiv, 2025
| arXiv |
|
|
VLMs as GeoGuessr Masters: Exceptional Performance, Hidden Biases, and Privacy Risks
Jingyuan Huang, Jen-tse Huang, Ziyi Liu, Xiaoyuan Liu, Wenxuan Wang (Corresponding), Jieyu Zhao
arXiv, 2025
| arXiv |
code |
|
|
Fact-or-Fair: A Checklist for Behavioral Testing of AI Models on Fairness-Related Queries
Jen-tse Huang, Yuhang Yan, Linqi Liu, Yixin Wan, Wenxuan Wang (Corresponding), Kai-Wei Chang, Michael R Lyu
arXiv, 2025
| arXiv |
code |
|
|
Insight Over Sight? Exploring the Vision-Knowledge Conflicts in Multimodal LLMs
Xiaoyuan Liu, Wenxuan Wang, Youliang Yuan, Jen-tse Huang, Qiuzhi Liu, Pinjia He, Zhaopeng Tu
arXiv, 2024
| arXiv |
code |
|
|
Chain-of-Jailbreak Attack for Image Generation Models via Editing Step by Step
Wenxuan Wang, Kuiyi Gao, Zihan Jia, Youliang Yuan, Jen-tse Huang, Qiuzhi Liu, Shuai Wang, Wenxiang Jiao, Zhaopeng Tu
arXiv, 2024
| arXiv |
|
|
Learning to Ask: When LLMs Meet Unclear Instruction
Wenxuan Wang, Juluan Shi, Chaozheng Wang, Cheryl Lee, Youliang Yuan, Jen-tse Huang, Michael R. Lyu
arXiv, 2024
| arXiv |
|
|
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents
Jen-tse Huang, Jiaxu Zhou, Tailin Jin, Xuhui Zhou, Zixi Chen, Wenxuan Wang (Corresponding), Youliang Yuan, Michael R. Lyu, Maarten Sap
arXiv, 2024
| arXiv |
code |
|
|
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training
Youliang Yuan, Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Jiahao Xu, Tian Liang, Pinjia He, Zhaopeng Tu
arXiv, 2024
| arXiv |
code |
|
|
The Earth is Flat? Unveiling Factual Errors in Large Language Models
Wenxuan Wang, Juluan Shi, Zhaopeng Tu, Youliang Yuan, Jen-tse Huang, Wenxiang Jiao, Michael R. Lyu
arXiv, 2024
| arXiv |
|
|
Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine
Wenxiang Jiao, Wenxuan Wang, Jen-tse Huang, Xing Wang, Shuming Shi, Zhaopeng Tu
arXiv, 2023
| arXiv |
code |
|
|