Over the past few years, foundation models have fundamentally transformed the landscape of computer vision, enabling large-scale visual understanding, generation, and multimodal reasoning. Building upon these advances, vision-language agents, embodied or digital systems powered by multimodal foundation models, are rapidly emerging as a central paradigm for intelligent perception, decision-making, and human-AI interaction. These agents integrate perception (vision), cognition (language and reasoning), and action (planning and control) within a unified framework, thereby bridging the gap between visual recognition and autonomous behavior. However, the growing autonomy and complexity of such agents have also amplified their susceptibility to adversarial and safety-critical risks. Beyond traditional pixel-level perturbations, new attack surfaces arise from adversarial prompts, instruction injections, and jailbreak manipulations, which can disrupt reasoning chains, mislead perception, or induce harmful actions. These vulnerabilities highlight fundamental challenges in building safe, robust, and trustworthy vision-language agents for real-world applications, from autonomous driving and embodied robotics to interactive medical or industrial systems. Addressing these challenges demands a deeper understanding of multimodal robustness, causal reasoning, and secure perception-action coupling in complex environments.
The 6th Workshop on Adversarial Machine Learning in Computer Vision (6th AdvML@CV): Safety of Vision-Language Agents aims to bring together researchers and practitioners from computer vision, multimodal learning, and AI safety communities to advance the frontier of robust and trustworthy vision-language agents. Continuing the success of the previous five CVPR AdvML@CV workshops, which have attracted thousands of submissions, participants, and widespread attention, the 2026 edition will feature keynote talks by leading experts, contributed papers, and an international challenge on adversarial robustness for multimodal agents.
Through this workshop, we aim to foster cross-disciplinary collaboration, inspire new research directions, and catalyze the development of secure, reliable, and ethically aligned vision-language agents that can safely operate in dynamic and human-centered environments.
| Workshop Schedule | |||
| Event | Start time | End time | |
| Opening Remarks | 9:00 | 9:15 | |
| Invited Talk #1: Prof. Bo Li | 9:15 | 9:45 | |
| Invited Talk #2: Prof. Chaowei Xiao | 9:45 | 10:15 | |
| Contributed Talk #1 | 10:15 | 10:30 | |
| Coffee Break | 10:30 | 10:45 | |
| Invited Talk #3: Prof. Ziwei Liu | 10:45 | 11:15 | |
| Invited talk #4: Prof. Florian Tramèr | 11:15 | 11:45 | |
| Contributed Talk #2 | 11:45 | 12:00 | |
| Lunch (12:00-13:30) | |||
| Invited Talk #5: Dr. Nouha Dziri | 13:30 | 14:00 | |
| Invited Talk #6: Prof. Yaodong Yang | 14:00 | 14:30 | |
| Invited Talk #7: Prof. Aditi Raghunathan | 14:30 | 15:00 | |
| Poster Session | 15:00 | 16:00 | |
| Challenge Session | 16:00 | 16:30 | |
| Poster Session #2 | 16:30 | 17:00 | |
![]() |
Ziwei
|
|
Nanyang Technological |
|
Chaowei
|
|
Johns Hopkins University |
|
Nouha
|
|
Allen Institute for AI |
|
Florian
|
|
ETH Zürich |
![]() |
Yaodong
|
|
Peking University |
![]() |
Aditi
|
|
Carnegie Mellon University |
|
Bo
|
|
University of Illinois |
|
Aishan
|
|
Beihang University |
|
Jin
|
|
Zhongguancun |
|
Tianyuan
|
|
Beihang |
|
Aishan
|
|
Beihang |
|
Jiakai
|
|
Zhongguancun |
|
Julia
|
|
University of Oxford |
![]() |
Yinpeng
|
|
Tsinghua |
![]() |
Zhenfei
|
|
University of Oxford |
![]() |
Shao
|
|
Shanghai AI Laboratory |
![]() |
Juntao
|
|
BAAI |
|
Xinyun
|
|
Meta |
|
Xianglong
|
|
Beihang |
|
Vishal M.
|
|
Johns Hopkins University |
![]() |
Dawn
|
|
UC Berkeley |
![]() |
Alan
|
|
Johns Hopkins |
![]() |
Philip
|
|
Oxford |
|
Dacheng
|
|
Nanyang Technological |
Timeline
| Challenge Timeline | |
| Mar 15, 2026 | Competition starts |
| Mar 17, 2026 | Phase 1 starts |
| April 17, 2026 | Phase 1 ends |
| April 18, 2026 | Phase 2 starts |
| May 18, 2026 | Phase 2 ends |
| May 30, 2026 | Results will be released and participants will be selected to present |
| June 2026 | Awards and presentation |
Challenge Chair
![]() |
Tianyuan
|
|
Beihang |
![]() |
Jin
|
|
Zhongguancun |
![]() |
Zonglei
|
|
Beihang |
![]() |
Jiangfan
|
|
Beihang |
![]() |
Hainan
|
|
Data Space |
![]() |
Zhilei
|
|
Data Space |
![]() |
Zonghao
|
|
Beihang |
|
Yisong
|
|
Beihang |
![]() |
Lei
|
|
Tsinghua |
![]() |
Haotong
|
|
ETH |
|
Jiakai
|
|
Zhongguancun |
|
Xianglong
|
|
Beihang |




