Android Mobile

erichung9060

Control Android devices remotely through AI agents with full UI interaction capabilities including touch, swipe, typing, and app management.

Local (stdio)

What it does

  • Click and swipe on Android screens
  • Type text into input fields
  • Take screenshots of the device screen
  • Launch and list installed apps
  • Extract UI elements as structured JSON
  • Press system buttons (back, home, recent)

Best for

  • Mobile app testing and automation
  • Android UI interaction for AI agents
  • Remote device control scenarios
  • Requires ADB and USB debugging setup
  • Full coordinate-based touch control
  • Hierarchical UI element parsing

Tools (9)

mobile_init

Initialize the Android device connection. Must be called before using any other mobile tools.
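
Since the listing describes a local (stdio) MCP server, a client would normally reach these tools through JSON-RPC 2.0 `tools/call` requests. The sketch below only builds the request payloads, with `mobile_init` first, as required. It assumes `mobile_init` takes no arguments, which the listing does not state, and the coordinates are made-up values. The helper name `tool_call` is hypothetical.

```python
import json
from itertools import count

_ids = count(1)  # simple monotonically increasing request ids


def tool_call(name, arguments=None):
    """Build a JSON-RPC 2.0 'tools/call' request in the shape MCP uses."""
    return {
        "jsonrpc": "2.0",
        "id": next(_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments or {}},
    }


# mobile_init goes first; the other mobile tools depend on it.
# Assumption: mobile_init needs no arguments (not confirmed by the listing).
calls = [
    tool_call("mobile_init"),
    tool_call("mobile_click", {"x": 540, "y": 1200}),  # illustrative coordinates
]

for call in calls:
    print(json.dumps(call))
```

In practice an MCP client library handles this framing, so an agent would rarely assemble these messages by hand.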

mobile_dump_ui

Get the UI elements on the Android screen as hierarchical JSON. Each element contains its child elements, so the output preserves parent-child relationships. Only focusable elements, or elements with a text, content_desc, or hint attribute, are included.
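
Because the dump is hierarchical, an agent typically has to walk the tree to locate an element before clicking it. The following is a minimal sketch of that walk. The attribute names `text`, `content_desc`, and `hint` come from the description above, but the `children` key, the `bounds` key, and its `[left, top, right, bottom]` layout are assumptions made for illustration; the real output schema may differ.

```python
def find_by_label(node, wanted):
    """Depth-first search for the first element whose text,
    content_desc, or hint equals `wanted`. Returns None if absent."""
    for key in ("text", "content_desc", "hint"):
        if node.get(key) == wanted:
            return node
    for child in node.get("children", []):  # "children" is an assumed key
        hit = find_by_label(child, wanted)
        if hit is not None:
            return hit
    return None


def center(bounds):
    """Midpoint of an assumed [left, top, right, bottom] rectangle."""
    left, top, right, bottom = bounds
    return ((left + right) // 2, (top + bottom) // 2)


# Hypothetical dump, shaped only to exercise the functions above.
sample = {
    "children": [
        {"text": "Welcome", "children": []},
        {
            "content_desc": "Login form",
            "children": [
                {"hint": "Username", "bounds": [40, 300, 1040, 400]},
                {"text": "Login", "bounds": [100, 200, 300, 260]},
            ],
        },
    ]
}

button = find_by_label(sample, "Login")
x, y = center(button["bounds"])
print(x, y)
```

The resulting pair could then be passed as the `x` and `y` arguments of `mobile_click`.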

mobile_click

Click on a specific coordinate on the Android screen.

Args:
  x: X coordinate to click
  y: Y coordinate to click

mobile_type

Input text into the currently focused text field on Android.

Args:
  text: The text to input
  submit: Whether to submit text (press Enter key) after typing
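
The listing says the server requires ADB but does not say how typing is implemented. As one plausible illustration, not the server's actual code, text can be entered over ADB with `adb shell input text`, where `%s` stands for a space, followed by an Enter key event when submitting. The function name `type_commands` is hypothetical, and the sketch only builds the command lines without running them.

```python
import shlex


def type_commands(text, submit=False):
    """Return adb argument lists that would type `text`.

    Assumption: this mirrors what a tool like mobile_type might do;
    the real implementation is not documented in the listing.
    """
    # `input text` treats %s as a space, so spaces are encoded that way.
    # Limitation: a literal "%s" in the input would also become a space.
    encoded = text.replace(" ", "%s")
    # Quote for the device-side shell, which re-parses the argument.
    cmds = [["adb", "shell", "input", "text", shlex.quote(encoded)]]
    if submit:
        cmds.append(["adb", "shell", "input", "keyevent", "KEYCODE_ENTER"])
    return cmds


for cmd in type_commands("hello world", submit=True):
    print(" ".join(cmd))
```

Each list could be handed to `subprocess.run` on a machine with ADB installed and a device attached.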

mobile_key_press

Press a physical or virtual button on the Android device.

Args:
  button: Button name (BACK, HOME, RECENT, ENTER)
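
For orientation, the four button names line up naturally with standard Android key events. The sketch below shows one way such a mapping could look when issued through `adb shell input keyevent`. Treating RECENT as `KEYCODE_APP_SWITCH` is an assumption, as is the idea that the server uses this mechanism at all; the names `KEYCODES` and `key_press_command` are hypothetical.

```python
# Assumed mapping from the tool's button names to Android key event names.
KEYCODES = {
    "BACK": "KEYCODE_BACK",
    "HOME": "KEYCODE_HOME",
    "RECENT": "KEYCODE_APP_SWITCH",  # assumption: RECENT means the app switcher
    "ENTER": "KEYCODE_ENTER",
}


def key_press_command(button):
    """Return the adb argument list that would press `button`.

    Raises ValueError for names outside the four documented buttons.
    """
    try:
        keycode = KEYCODES[button.upper()]
    except KeyError:
        raise ValueError(f"unsupported button: {button!r}") from None
    return ["adb", "shell", "input", "keyevent", keycode]


print(" ".join(key_press_command("BACK")))
```

Rejecting unknown names up front keeps a typo from being forwarded to the device as an arbitrary key event.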

Alternatives