Ferret-UI 2: Mastering Universal User Interface Understanding Across Platforms (Apple Machine Learning Research)
Building a generalist model for user interface (UI) understanding is challenging due to various foundational issues, such as platform diversity, resolution variation, and data limitation. In this paper, we introduce Ferret-UI 2, a multimodal large language model (MLLM) designed for universal UI understanding across a…