CogAgent: A Visual Language Model for GUI Agents
Paper: https://arxiv.org/pdf/2312.08914v3.pdf
CVPR 2024: http://openaccess.thecvf.com//content/CVPR2024/papers/Hong_CogAgent_A_Visual_Language_Model_for_GUI_Agents_CVPR_2024_paper.pdf
Code1: https://github.com/thudm/cogvlm
Code2: https://github.com/digirl-agent/digirl
Code3: https://github.com/THUDM/CogAgent
Dataset: TextVQA
💠@Machine_learn
Paper: https://arxiv.org/pdf/2312.08914v3.pdf
CVPR 2024: http://openaccess.thecvf.com//content/CVPR2024/papers/Hong_CogAgent_A_Visual_Language_Model_for_GUI_Agents_CVPR_2024_paper.pdf
Code1: https://github.com/thudm/cogvlm
Code2: https://github.com/digirl-agent/digirl
Code3: https://github.com/THUDM/CogAgent
Dataset: TextVQA
💠@Machine_learn