Junyang Wang (王君阳)

Email: junyangwang@bjtu.edu.cn; junyangwang287@gmail.com

I am a research intern at the Institute for Intelligent Computing of Alibaba Group.

I am a Ph.D. candidate in the School of Computer and Information Technology, BJTU, working with Prof. Jitao Sang.

My current research focuses on Multi-modal Large Language Models (MLLMs), including MLLM hallucination and MLLM-based agents. I have also studied Vision-Language Pre-training (VLP) and social fairness in computer vision.

Recent News

* [03.2025] Our paper PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC has been accepted by ICLR 2025 Workshop.

* [11.2024] Our GitHub repository Mobile-Agent has gained 3k stars.

* [09.2024] Our paper Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration has been accepted by NeurIPS 2024.

* [07.2024] Our work Mobile-Agent won the Best Demo Award at the 23rd China National Conference on Computational Linguistics (CCL 2024).

* [03.2024] Our paper Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception has been accepted by ICLR 2024 Workshop.

* [07.2023] Our paper Improved Visual Fine-tuning with Natural Language Supervision has been accepted by ICCV 2023 as an oral presentation.

* [04.2023] Our paper From Association to Generation: Text-only Captioning by Unsupervised Cross-modal Mapping has been accepted by IJCAI 2023.

* [10.2022] I joined the Institute for Intelligent Computing of Alibaba Group as a research intern.

* [06.2022] Our paper Counterfactually Measuring and Eliminating Social Bias in Vision-Language Pre-training Models has been accepted by MM 2022.

Publications

See Google Scholar.

Experience/Education