Large Language Model-Brained GUI Agents: A Survey

AgentStudio: A Toolkit for Building General Virtual Agents

SeeClick: Harnessing GUI Grounding for Advanced Visual GUI Agents

Widget Captioning: Generating Natural Language Description for Mobile User Interface Elements
