📝 Publications
Selected Publications

WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality
Chunyang Li*, Yilun Zheng*, Xinting Huang, Tianqing Fang, Jiahao Xu, Lihui Chen, Yangqiu Song, Han Hu
The Fourteenth International Conference on Learning Representations. 2026.

Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations
Chunyang Li, Weiqi Wang, Tianshi Zheng, Yangqiu Song
In Findings of the Association for Computational Linguistics: ACL 2025.

MAVEN-FACT: A Large-scale Event Factuality Detection Dataset
Chunyang Li*, Hao Peng*, Xiaozhi Wang, Yunjia Qi, Lei Hou, Bin Xu, Juanzi Li
In Findings of the Association for Computational Linguistics: EMNLP 2024.

Baixuan Xu*, Chunyang Li*, Weiqi Wang*, Wei Fan, Tianshi Zheng, Haochen Shi, Tao Fan, Yangqiu Song, Qiang Yang

ChatLog: Carefully Evaluating the Evolution of ChatGPT Across Time
Shangqing Tu*, Chunyang Li*, Jifan Yu, Xiaozhi Wang, Lei Hou, Juanzi Li
* indicates equal contributions.
Full Publications
You can also find my latest publications on Google Scholar.
Journals & Conference Proceedings
ICLR 2026WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality, Chunyang Li*, Yilun Zheng*, Xinting Huang, Tianqing Fang, Jiahao Xu, Lihui Chen, Yangqiu Song, Han Hu. The Fourteenth International Conference on Learning Representations. 2026.Findings of ACL 2025Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations, Chunyang Li, Weiqi Wang, Tianshi Zheng, Yangqiu Song. In Findings of the Association for Computational Linguistics: ACL 2025. 2025.Findings of EMNLP 2024MAVEN-FACT: A Large-scale Event Factuality Detection Dataset, Chunyang Li*, Hao Peng*, Xiaozhi Wang, Yunjia Qi, Lei Hou, Bin Xu, Juanzi Li. In Findings of the Association for Computational Linguistics: EMNLP 2024. 2024.TMLRThe Curse of CoT: On the Limitations of Chain-of-Thought in In-Context Learning, Tianshi Zheng*, Yixiang Chen*, Chengxi Li*, Chunyang Li, Qing Zong, Haochen Shi, Baixuan Xu, Yangqiu Song, Ginny Y. Wong, Simon See. Transactions on Machine Learning Research (TMLR). 2025.EMNLP 2025LogiDynamics: Unraveling the Dynamics of Logical Inference in Large Language Model Reasoning, Tianshi Zheng, Jiayang Cheng, Chunyang Li, Haochen Shi, Zihao Wang, Jiaxin Bai, Yangqiu Song, Ginny Y. Wong, Simon See. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. 2025.ACL 2024CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning, Weiqi Wang, Tianqing Fang, Chunyang Li, Haochen Shi, Wenxuan Ding, Baixuan Xu, Zhaowei Wang, Jiaxin Bai, Xin Liu, Cheng Jiayang, Chunkit Chan, Yangqiu Song. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2024.ICLR 2024KoLA: Carefully Benchmarking World Knowledge of Large Language Models, Jifan Yu*, Xiaozhi Wang*, Shangqing Tu*, Shulin Cao, Daniel Zhang-Li, Xin Lv, Hao Peng, Zijun Yao, Xiaohan Zhang, Hanming Li, Chunyang Li, Zheyuan Zhang, Yushi Bai, Yantao Liu, Amy Xin, Nianyi Lin, Kaifeng Yun, Linlu Gong, Jianhui Chen, Zhili Wu, Yunjia Qi, Weikai Li, Yong Guan, Kaisheng Zeng, Ji Qi, Hailong Jin, Jinxin Liu, Yu Gu, Yuan Yao, Ning Ding, Lei Hou, Zhiyuan Liu, Bin Xu, Jie Tang, Juanzi Li. The Twelfth International Conference on Learning Representations. 2024.CIKM 2023LittleMu: Deploying an Online Virtual Teaching Assistant via Heterogeneous Sources Integration and Chain of Teach Prompts, Shangqing Tu*, Zheyuan Zhang*, Jifan Yu, Chunyang Li, Siyu Zhang, Zijun Yao, Lei Hou, Juanzi Li. Proceedings of the 32nd ACM International Conference on Information and Knowledge Management. 2023.
Arxiv Preprints
ArxivAutoSchemaKG: Autonomous Knowledge Graph Construction through Dynamic Schema Induction from Web-Scale Corpora, Jiaxin Bai*, Wei Fan*, Qi Hu*, Qing Zong, Chunyang Li, Hong Ting Tsang, Hongyu Luo, Yauwai Yim, Haoyu Huang, Xiao Zhou, Feng Qin, Tianshi Zheng, Xi Peng, Xin Yao, Huiwen Yang, Leijie Wu, Yi Ji, Gong Zhang, Renhai Chen, Yangqiu Song. Arxiv preprint. 2025.ArxivINFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling, Haochen Shi*, Tianshi Zheng*, Weiqi Wang*, Baixuan Xu, Chunyang Li, Chunkit Chan, Tao Fan, Yangqiu Song, Qiang Yang. Arxiv preprint. 2025.ArxivLegal Rule Induction: Towards Generalizable Principle Discovery from Analogous Judicial Precedents, Wei Fan, Tianshi Zheng, Yiran Hu, Zheye Deng, Weiqi Wang, Baixuan Xu, Chunyang Li, Haoran Li, Weixing Shen, Yangqiu Song. Arxiv preprint. 2025.ArxivTowards Multi-Agent Reasoning Systems for Collaborative Expertise Delegation: An Exploratory Design Study, Baixuan Xu*, Chunyang Li*, Weiqi Wang*, Wei Fan, Tianshi Zheng, Haochen Shi, Tao Fan, Yangqiu Song, Qiang Yang. Arxiv preprint. 2025.ArxivEvent-level Knowledge Editing, Hao Peng*, Xiaozhi Wang*, Chunyang Li, Kaisheng Zeng, Jiangshan Duo, Yixin Cao, Lei Hou, Juanzi Li. Arxiv preprint. 2024.ArxivChatLog: Carefully Evaluating the Evolution of ChatGPT Across Time, Shangqing Tu*, Chunyang Li*, Jifan Yu, Xiaozhi Wang, Lei Hou, Juanzi Li. Arxiv preprint. 2023.
* indicates equal contributions.