๐ About Me
I am Chengyou Jia (่ดพๆ้), a final-year Ph.D. student in Computer Science at Xiโan Jiaotong University. I am going to be a researcher in the Hunyuan Team of Tencent (Qingyun Project). My Ph.D. advisor is Prof. Minnan Luo, and I am also working closely with Prof. Xiaojun Chang. Previously, I was a visiting student in Singapore under the supervision of Prof. Ivor, and a research intern at Shanghai AI LAB, supervised by Dr. Zhiyong Wu. I received my B.E. degree from Xiโan Jiaotong University in 2021.
I have authored several publications in top-tier conferences and journals, including CVPR, AAAI, ACL, IEEE TIP, among others. I also serve as a reviewer for esteemed conferences and journals like NIPS, ICML, CVPR, ECCV.
Research Interests
I am working in the field of CV & Multi-modal. My current research interests and past experience can be summarized as follows:
- Agentic Vision Generation: Consistent Generation, Video Generation๏ผReward Model and RL for visual Generation
- Multimodal Agent: GUIAgent, Multi-Agent Systems
๐ฅ News
- 2026.02: ย Our papers [PaCo-RL, Chain-of-Merging] are accepted by CVPR 2026๏ผSee you in Denver. ๐๐
- 2025.11: ย One paper is accepted by AAAI 2026. ๐๐
- 2025.10: ย CoFFT is accepted by NeurIPS 2025. ๐๐
- 2025.06: ย Our papers [T2IS, DenseDiT, AutoGPS] are recently released. ๐๐
- 2025.05: ย Three papers are accepeted by ACL 2025. ๐๐
- 2025.02: ย ChatGen is accepeted by CVPR 2025. ๐๐
- 2025.01: ย OS-Atlas is accepeted by ICLR 2025 (Spotlight). ๐๐
- 2024.12: ย One paper is accepted by IEEE TCSVT. ๐๐
- 2024.11: ย Our papers [ChatGen, OS-Atlas, AgentStore, OS-Genesis] are recently released. ๐๐
- 2024.10: ย Ended a fulfilling internship at Shanghai AI Lab and started as a visiting student in Singapore.
- 2024.07: ย One paper is accepted by ACM-MM 2024. See you in Melbourne, Australia. ๐๐
- 2023.11: ย Two papers are accepted by AAAI 2024. ๐๐
- 2023.09: ย One paper is accepted by IEEE TIP. ๐๐
๐ Educations
- 2021.09 - 2026.06 (expected), M.S. + Ph.D Student, Computer Science, Xiโan Jiaotong University. โ
- 2017.09 - 2021.06, B.S. in Computer Science, Xiโan Jiaotong University.
๐ป Internships
- 2026.03 - Present, Researcher @ Tencent Hunyuan Team.
- 2024.11 - 2025.11, Research Intern @ CFAR, A*STAR. Focus on Multimodal Agents for Image Generation.
- 2024.03 - 2024.10, Research Intern @ Shanghai AI LAB. Focus on Multimodal Agents for OS (Operating System).
- 2022.12 - 2024.03, Research Intern @ SGIT AI Lab, State Grid Corporation of China. Focus on Controllable Image Generation.
๐ Selected Publications

PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling [CCF-A]
Bowen Ping*, Chengyou Jia*, Minnan Luo, Changliang Xia, Xin Shen, Zhuohang Dang, Hangwei Qian
(* means equal contributions)
Project Page ย
Datasets ย
Code ย
ย

Flow-Factory: A Unified Framework for Reinforcement Learning in Flow-Matching Models [Workshop]
Bowen Ping, Chengyou Jia, Minnan Luo, Hangwei Qian, Ivor Tsang
Code ย

Why Settle for One? Text-to-ImageSet Generation and Evaluation ๐ฅ [Preprint]
Chengyou Jia, Xin Shen, Zhuohang Dang, Changliang Xia, Weijia Wu, Xinyu Zhang, Huangwei Qian, Ivor Tsang, Minnan Luo
Project Page ย
Datasets ย
Code ย
ย

From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios ๐ฅ [Preprint]
Changliang Xia*, Chengyou Jia*, Zhuohang Dang, Minnan Luo
(* means equal contributions)
Code ย
Project Page ย
Models ย
Datasets ย

ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting ๐ฅ๐ฅ [CCF-A]
Chengyou Jia*, Changliang Xia*, Zhuohang Dang, Weijia Wu, Hangwei Qian, Minnan Luo
(* means equal contributions)
Code ย
Project Page ย
Datasets ย
Models ย
ย

AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant ๐ฅ๐ฅ
Chengyou Jia, Minnan Luo, Zhuohang Dang, Qiushi Sun, Fangzhi Xu, Junlin Hu, Tianbao Xie, Zhiyong Wu
Code ย
Project Page ย

OS-ATLAS: A Foundation Action Model for Generalist GUI Agents ๐ฅ๐ฅ [CCF-A]
Zhiyong Wu, Zhenyu Wu, Fangzhi Xu, Yian Wang, Qiushi Sun, Chengyou Jia, Kanzhi Cheng, Zichen Ding, Liheng Chen, Paul Pu Liang, Yu Qiao
Code ย
Project Page ย
Demo ย

Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition [CCF-A]
Chengyou Jia, Minnan Luo, Xiaojun Chang, Zhuohang Dang, Mingfei Han, Mengmeng Wang, Guang Dai, Sizhe Dang, Jingdong Wang

SSMG: Spatial-semantic map guided diffusion model for free-form layout-to-image generation [CCF-A]
Chengyou Jia, Minnan Luo, Zhuohang Dang, Guang Dai, Xiaojun Chang, Mengmeng Wang, Jingdong Wang

Collaborative Contrastive Refining for Weakly Supervised Person Search [CCF-A]
Chengyou Jia, Minnan Luo, Caixia Yan, Linchao Zhu, Xiaojun Chang, Qinghua Zheng
๐งโ Other Paper
-
CVPR 2026Beyond Layer-Wise Merging: Chain-of-Merging for Vision-Language Models [CCF-A]
Xinyu Zhang, Yuxuan Dong, Lingling Zhang, Chengyou Jia, Zhuohang Dang, YiXing Yao, Yaqiang Wu, Basura Fernando, Jun Liu -
ICLR 2025Autogps: Automated geometry problem solving via multimodal formalization and deductive reasoning [CCF-A]
Bowen Ping, Minnan Luo, Zhuohang Dang, Chenxi Wang, Chengyou Jia -
PreprintMulti-Modal Dataset Distillation in the Wild
Zhuohang Dang, Minnan Luo, Chengyou Jia, Hangwei Qian, Xiaojun Chang, Ivor W Tsang -
AAAI 2026Correspondence Coverage Matters for Multi-Modal Dataset Distillation [CCF-A]
Zhuohang Dang, Minnan Luo, Chengyou Jia, Hangwei Qian, Xinyu Zhang, Xiaojun Chang, Ivor Tsang -
NeurIPS 2025CoFFT: Chain of Foresight-Focus Thought for Visual Language Models [CCF-A]
Xinyu Zhang, Yuxuan Dong, Lingling Zhang, Chengyou Jia, Zhuohang Dang, Basura Fernando, Jun Liu, Mike Zheng Shou -
ACL 2025PhysReason: A Comprehensive Benchmark towards Physics-Based Reasoning [CCF-A]
Xinyu Zhang, Yuxuan Dong, Yanrui Wu, Jiaxing Huang, Chengyou Jia, Basura Fernando, Mike Zheng Shou, Lingling Zhang, Jun Liu -
ACL 2025OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis [CCF-A]
Qiushi Sun, Kanzhi Cheng, Zichen Ding, Chuanyang Jin, Yian Wang, Fangzhi Xu, Zhenyu Wu, Chengyou Jia, Liheng Chen, Zhoumianze Liu,
Ben Kao, Guohao Li, Junxian He, Yu Qiao, Zhiyong Wu -
IEEE TCSVTPSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement [CCF-B]
Chengyou Jia, Minnan Luo, Zhuohang Dang, Guang Dai, Xiaojun Chang, Jingdong Wang, Qinghua Zheng -
ICASSP 2023Towards Real-time Person Search with Invariant Feature Learning [CCF-B]
Chengyou Jia, Minnan Luo, Zhuohang Dang, Xiaojun Chang, Qinghua Zheng -
AAAI 2024Noisy Correspondence Learning with Self-Reinforcing Errors Mitigation [CCF-A]
Zhuohang Dang, Minnan Luo, Chengyou Jia, Guang Dai, Xiaojun Chang, Jingdong Wang -
IEEE TCSVTDisentangled representation learning with transmitted information bottleneck [CCF-B]
Zhuohang Dang, Minnan Luo, Chengyou Jia, Guang Dai, Jihong Wang, Xiaojun Chang, Jingdong Wang -
IEEE TIPDisentangled Generation with Information Bottleneck for Enhanced Few-Shot Learning [CCF-A]
Zhuohang Dang, Minnan Luo, Jihong Wang, Chengyou Jia, Caixia Yan, Guang Dai, Xiaojun Chang, Qinghua Zheng -
IEEE TCSVTCounterfactual Generation Framework for Few-Shot Learning [CCF-B]
Zhuohang Dang, Minnan Luo, Chengyou Jia, Caixia Yan, Xiaojun Chang, Qinghua Zheng -
IEEE TIPDisentangled Noisy Correspondence Learning
Zhuohang Dang, Minnan Luo, Jihong Wang, Chengyou Jia, Haochen Han, Herun Wan, Guang Dai, Xiaojun Chang, Jingdong Wang
๐ Honors and Awards
Honors
- Oct.2022 โ Outstanding Graduate Student of Xiโan Jiaotong University, Award.
- Jun.2021 โ Excellent bachelor degree thesis award (Top 1%), Award.
- Jun.2021 โ Outstanding Graduate Student of Xiโan Jiaotong University, Award.
- Sep.2019 โ Outstanding Student in Xiโan Jiaotong University
- Sep.2018 โ Outstanding Student in Xiโan Jiaotong University
Competition Awards
- Jun.2021 โ Top 1% in the TianChi Global Video Cloud Innovation Challenge, Video Object Segmentation Algorithm Challenge (8/2904).
๐ Scholarships
- 2025.09 โ National Scholarship (ๅฝๅฎถๅฅๅญฆ้)
- 2023.09 โ Freshman First Prize Scholarship (PhD)
- 2022.09 โ First-Class Fellowships for Graduate Students at Xiโan Jiaotong University, Fellowships.
- 2020.09 โ Computer Science Special Scholarship at Xiโan Jiaotong University, Fellowships.
๐ฌ Academic Services
- Reviewer: NIPS, ECCV, AAAI, ICASSP, IEEE TIP, IEEE TCSVT and IEEE TNNLS.