# Publications

Complete local list, based on Google Scholar as of May 19, 2026. # denotes equal contribution. Google Scholar remains the live source for citation counts.

# 2026

  1. Experience Sharing in Mutual Reinforcement Learning for Heterogeneous Language Models. Xiaoze Liu, Dhananjay Ram, Yuting Zhang, Zhaoyang Zhang, Wei Xia, Stefano Soatto, Preprint, 2026. Blog

  2. The Vision Wormhole: Latent-Space Communication in Heterogeneous Multi-Agent Systems. Xiaoze Liu#, Ruowang Zhang#, Weichen Yu, Siheng Xiong, Liu He, Feijie Wu, Hoin Jung, Matt Fredrikson, Xiaoqian Wang, Jing Gao, Preprint, 2026. Blog, Code

  3. The Trojan in the Vocabulary: Stealthy Sabotage of LLM Composition. Xiaoze Liu, Weichen Yu, Matt Fredrikson, Xiaoqian Wang, Jing Gao, Preprint, 2026. Blog

# 2025

  1. Survey on Factuality in Large Language Models. Cunxiang Wang#, Xiaoze Liu#, Yuanhao Yue#, Qipeng Guo, Xiangkun Hu, Xiangru Tang, Tianhang Zhang, Cheng Jiayang, Yunzhi Yao, Xuming Hu, Zehan Qi, Wenyang Gao, Yidong Wang, Linyi Yang, Jindong Wang, Xing Xie, Zheng Zhang, Yue Zhang, ACM Computing Surveys, 2025.

  2. CausalEval: Towards Better Causal Reasoning in Language Models. Longxuan Yu, Delin Chen, Siheng Xiong, Qingyang Wu, Dawei Li, Zhikai Chen, Xiaoze Liu, Liangming Pan, The 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics (NAACL), 2025.

  3. SUV: Scalable Large Language Model Copyright Compliance with Regularized Selective Unlearning. Tianyang Xu#, Xiaoze Liu#, Feijie Wu, Xiaoqian Wang, Jing Gao, The 2025 Conference on Language Modeling (COLM), 2025. Blog

  4. Knowledge Graphs for Multi-Modal Learning: Survey and Perspective. Zhuo Chen, Yichi Zhang, Yin Fang, Yuxia Geng, Lingbing Guo, Jiaoyan Chen, Xiaoze Liu, Jeff Z. Pan, Ningyu Zhang, Huajun Chen, Wen Zhang, Information Fusion, 2025.

  5. Towards Federated RLHF with Aggregated Client Preference for LLMs. Feijie Wu, Xiaoze Liu, Haoyu Wang, Xingchen Wang, Lu Su, Jing Gao, The Thirteenth International Conference on Learning Representations (ICLR), 2025.

# 2024

  1. SHIELD: Evaluation and Defense Strategies for Copyright Compliance in LLM Text Generation. Xiaoze Liu#, Ting Sun#, Tianyang Xu, Feijie Wu, Cunxiang Wang, Xiaoqian Wang, Jing Gao, The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024. Blog

  2. SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales. Tianyang Xu, Shujin Wu, Shizhe Diao, Xiaoze Liu, Xingyao Wang, Yangyi Chen, Jing Gao, The 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.

  3. MultiEM: Efficient and Effective Unsupervised Multi-Table Entity Matching. Xiaocan Zeng, Pengfei Wang, Yuren Mao, Lu Chen, Xiaoze Liu, Yunjun Gao, The 40th IEEE International Conference on Data Engineering (ICDE), 2024.

  4. Distributed Representations of Entities in Open-World Knowledge Graphs. Lingbing Guo, Zhuo Chen, Jiaoyan Chen, Yichi Zhang, Zequn Sun, Zhongpu Bo, Yin Fang, Xiaoze Liu, Huajun Chen, Wen Zhang, Knowledge-Based Systems, 2024.

  5. Evaluating the Factuality of Large Language Models Using Large-Scale Knowledge Graphs. Xiaoze Liu, Feijie Wu, Tianyang Xu, Zhuo Chen, Yichi Zhang, Xiaoqian Wang, Jing Gao, IEEE Data Engineering Bulletin, 2024.

# 2023

  1. Universal Multi-Modal Entity Alignment via Iteratively Fusing Modality Similarity Paths. Bolin Zhu, Xiaoze Liu, Xin Mao, Zhuo Chen, Lingbing Guo, Tao Gui, Qi Zhang, Preprint, 2023.

  2. Real-Time Workload Pattern Analysis for Large-Scale Cloud Databases. Jiaqi Wang, Tianyi Li, Anni Wang, Xiaoze Liu, Lu Chen, Jie Chen, Jianye Liu, Junyang Wu, Feifei Li, Yunjun Gao, Proceedings of the VLDB Endowment, 2023.

  3. Quiver: Supporting GPUs for Low-Latency, High-Throughput GNN Serving with Workload Awareness. Zeyuan Tan, Xiulong Yuan, Congjie He, Man-Kit Sit, Guo Li, Xiaoze Liu, Baole Ai, Kai Zeng, Peter Pietzuch, Luo Mai, Preprint, 2023.

  4. Unsupervised Entity Alignment for Temporal Knowledge Graphs. Xiaoze Liu, Junyang Wu, Tianyi Li, Lu Chen, Yunjun Gao, The ACM Web Conference (WWW), 2023.

# 2022

  1. ClusterEA: Scalable Entity Alignment with Stochastic Training and Normalized Mini-Batch Similarities. Yunjun Gao#, Xiaoze Liu#, Junyang Wu, Tianyi Li, Pengfei Wang, Lu Chen, The 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2022.

  2. PinSQL: Pinpoint Root Cause SQLs to Resolve Performance Issues in Cloud Databases. Xiaoze Liu, Zheng Yin, Chao Zhao, Congcong Ge, Lu Chen, Yunjun Gao, Dimeng Li, Ziting Wang, Gaozhong Liang, Jian Tan, Feifei Li, The 38th IEEE International Conference on Data Engineering (ICDE), 2022.

# 2021

  1. CollaborEM: A Self-Supervised Entity Matching Framework Using Multi-Features Collaboration. Congcong Ge, Pengfei Wang, Lu Chen, Xiaoze Liu, Baihua Zheng, Yunjun Gao, IEEE Transactions on Knowledge and Data Engineering, 2021.

  2. LargeEA: Aligning Entities for Large-Scale Knowledge Graphs. Congcong Ge, Xiaoze Liu, Lu Chen, Baihua Zheng, Yunjun Gao, Proceedings of the VLDB Endowment, 2021.

  3. Make It Easy: An Effective End-to-End Entity Alignment Framework. Congcong Ge, Xiaoze Liu, Lu Chen, Baihua Zheng, Yunjun Gao, The 44th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR), 2021.