Skip to content

Releases: microsoft/UFO

v0.1.1

08 May 14:44
d29ace8
Compare
Choose a tag to compare

Highlights

New Release for v0.1.1! We've made some significant updates! Previously known as AppAgent and ActAgent, we've rebranded them to HostAgent and AppAgent to better align with their functionalities. Explore the latest enhancements:
1. Learning from Human Demonstration: UFO now supports learning from human demonstration! Utilize the Windows Step Recorder to record your steps and demonstrate them for UFO. Refer to our detailed guide in README.md for more information.
2. Win32 Support: We've incorporated support for Win32 as a control backend, enhancing our UI automation capabilities.
3. Extended Application Interaction: UFO now goes beyond UI controls, allowing interaction with your application through keyboard inputs and native APIs! Presently, we support Word (examples), with more to come soon. Customize and build your own interactions.
4. Control Filtering: Streamline LLM's action process by using control filters to remove irrelevant control items. Enable them in config_dev.yaml under the control filtering section at the bottom.

What's Changed

New Contributors

Full Changelog: v0.0.1...v0.2.0

What's Changed

New Contributors

Full Changelog: v0.0.1...v0.1.1

v0.0.1

25 Mar 12:40
0f0a1c7
Compare
Choose a tag to compare

What's Changed

Key Updates:

1. We now support creating your help documents for each Windows application to become an app expert. Check the [README](./learner/README.md) for more details!
2. UFO now supports RAG from offline documents and online Bing search.
3. You can save the task completion trajectory into its memory for UFO's reference, improving its future success rate!
4. You can customize different GPT models for AppAgent and ActAgent. Text-only models (e.g., GPT-4) are now supported!

Detailed contributions

New Contributors

Full Changelog: https://github.com/microsoft/UFO/commits/v0.0.1