Definition
A structured map of what's on a screen that assistive tools and AI agents read instead of looking at pixels.
A semantic tree representation of UI elements exposed by an operating system or browser for screen readers and automation, used by computer-use agents as a non-pixel input channel.