
Android Mobile MCP
Enables AI agents to control Android devices over an ADB connection using taps, swipes, text input, and app management. It takes screenshots and extracts UI elements for automated mobile testing and interaction.
What it does
- Take screenshots of Android device screens
- Click and swipe on specific screen coordinates
- Extract UI elements and their hierarchical structure
- Type text into focused input fields
- Launch and manage Android applications
- Press system buttons like back, home, and recent
Tools (9)
Initialize the Android device connection. Must be called before using any other mobile tools.
Get the UI elements on the current Android screen as hierarchical JSON. Each element contains its child elements, so parent-child relationships are preserved. Only focusable elements, or elements with a text, content_desc, or hint attribute, are included.
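As a rough illustration of how a client might consume this hierarchical JSON, the sketch below walks a nested structure depth-first and collects elements whose label matches a search string. The keys text, content_desc, and hint mirror the attributes named above; the children key, the sample data, and the helper names are assumptions for illustration, not the server's documented schema.

```python
from typing import Any, Dict, Iterator, List

# Hypothetical sample tree; the real tool's key names and nesting may differ.
SAMPLE_TREE: Dict[str, Any] = {
    "text": "",
    "content_desc": "Settings screen",
    "children": [
        {"text": "Wi-Fi", "children": []},
        {"hint": "Search settings", "children": [{"text": "Search"}]},
    ],
}

# Attributes the tool description says can carry a label.
LABEL_KEYS = ("text", "content_desc", "hint")


def iter_elements(node: Dict[str, Any]) -> Iterator[Dict[str, Any]]:
    """Yield a node and all of its descendants, depth-first (pre-order)."""
    yield node
    for child in node.get("children", []):  # "children" is an assumed key
        yield from iter_elements(child)


def find_by_label(tree: Dict[str, Any], needle: str) -> List[Dict[str, Any]]:
    """Return every element whose text, content_desc, or hint contains needle."""
    needle = needle.lower()
    return [
        element
        for element in iter_elements(tree)
        if any(needle in str(element.get(key, "")).lower() for key in LABEL_KEYS)
    ]
```

A caller could use find_by_label(SAMPLE_TREE, "wi-fi") to locate a candidate element before deciding where to click.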
Click on a specific coordinate on the Android screen. Args:
- x: X coordinate to click
- y: Y coordinate to click
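To show where click coordinates might come from, here is a minimal sketch that computes the center of an element's bounding box and builds the standard adb shell input tap command for it. The (left, top, right, bottom) bounds format and both helper names are assumptions, and the source does not say that the server issues this exact command internally.

```python
from typing import List, Tuple


def center_of(bounds: Tuple[int, int, int, int]) -> Tuple[int, int]:
    """Return the integer center of an assumed (left, top, right, bottom) box."""
    left, top, right, bottom = bounds
    return (left + right) // 2, (top + bottom) // 2


def tap_command(x: int, y: int) -> List[str]:
    """Build (but do not run) an adb command that taps pixel (x, y)."""
    if x < 0 or y < 0:
        raise ValueError("screen coordinates must be non-negative")
    return ["adb", "shell", "input", "tap", str(x), str(y)]
```

The resulting list can be passed to subprocess.run when a device is attached; building it separately keeps the coordinate logic testable without hardware.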
Input text into the currently focused text field on Android. Args:
- text: The text to input
- submit: Whether to press the Enter key after typing to submit the text
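One plausible way to map these arguments onto plain adb commands is sketched below. It relies on the adb shell input text convention that %s stands for a space, and on keycode 66 (KEYCODE_ENTER) for the submit step. The function name is hypothetical, shell metacharacters are deliberately not handled, and the server's actual implementation may differ.

```python
from typing import List

KEYCODE_ENTER = 66  # Android KeyEvent.KEYCODE_ENTER


def text_commands(text: str, submit: bool = False) -> List[List[str]]:
    """Build (but do not run) adb commands that type text, then optionally press Enter."""
    # `input text` reads "%s" as a space; other special characters are
    # out of scope for this sketch and would need extra escaping.
    encoded = text.replace(" ", "%s")
    commands = [["adb", "shell", "input", "text", encoded]]
    if submit:
        commands.append(["adb", "shell", "input", "keyevent", str(KEYCODE_ENTER)])
    return commands
```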
Press a physical or virtual button on the Android device. Args:
- button: Button name (BACK, HOME, RECENT, ENTER)
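The four button names correspond naturally to Android key codes, so a minimal sketch of the lookup might look like the following. The key code values are the standard Android KeyEvent constants; the mapping table and function name are illustrative assumptions rather than the server's own code.

```python
from typing import Dict, List

# Standard Android KeyEvent codes for the supported button names.
KEYCODES: Dict[str, int] = {
    "BACK": 4,      # KEYCODE_BACK
    "HOME": 3,      # KEYCODE_HOME
    "RECENT": 187,  # KEYCODE_APP_SWITCH
    "ENTER": 66,    # KEYCODE_ENTER
}


def button_command(button: str) -> List[str]:
    """Build (but do not run) an adb keyevent command for a named button."""
    try:
        code = KEYCODES[button.upper()]
    except KeyError:
        raise ValueError(f"unsupported button: {button!r}") from None
    return ["adb", "shell", "input", "keyevent", str(code)]
```

Rejecting unknown names up front gives the caller a clear error instead of sending an arbitrary key event to the device.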