๐ Whatโs the hottest trend in AI agents right now?
Itโs not the traditional workflow-based or API-calling agents. The most interesting agents today are ๐๐จ๐ ๐ฎ๐ด๐ฒ๐ป๐๐ โ ones that see what you see on the screen. They donโt just execute calls behind the scenes โ they observe, interpret, and interact with visual interfaces just like human users.
This opens up a whole new class of capability: learning directly from real user behaviour, as it happens on-screen, rather than relying on disconnected logs or backend traces that miss subtle but critical interactions.
Proud to share that weโre making serious strides in this space. Join me in congratulating the CSIRO’s Data61 research team on winning the Distinguished Paper Award at the 2025 International Conference on Software Engineering! ๐
It introduces SeeAction, a deep learning-based computer vision tool that automatically recognises and describes user actions from screen recordings.
It enables powerful new applications:
– Automating UI testing and repetitive digital tasks
– Reproducing software bugs by retracing a userโs exact steps
– Helping AI tools learn from real human behaviour โ not just text or screenshots, but nuanced interactions in context