An GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.
🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward
An GUI Agent application based on UI-TARS(Vision-Lanuage Model) that allows you to control your computer using natural language.
最近更新: 21分钟前ComfyUI Lumi Batcher is a batch processing extension plugin designed for ComfyUI, aiming to improve workflow debugging efficiency. Traditional debu...
最近更新: 1个月前