Common patterns for browser automation. These examples show the agent’s workflow — you just describe what you need in plain language, and the agent handles the tool calls.Documentation Index
Fetch the complete documentation index at: https://agents.craft.do/docs/llms.txt
Use this file to discover all available pages before exploring further.
Login and Navigate
The agent can log into web apps using your credentials and navigate authenticated pages.Snapshot the page
Takes an accessibility snapshot to find the email/password inputs and login button — each gets a reference like
@e3, @e5, @e7.Fill a Complex Form
The agent can handle multi-field forms with dropdowns, checkboxes, and text areas.Extract Data from a Page
Use JavaScript execution to pull structured data from any webpage.Monitor Network Traffic
See what API calls a page is making — useful for debugging or discovering internal endpoints.Take Targeted Screenshots
Capture specific elements or regions instead of the full page.- Element reference — snapshot the page, find the chart’s ref, capture just that element
- CSS selector — target elements like
div[data-testid="revenue-chart"] - Coordinates — capture a specific pixel region
Send Keyboard Shortcuts
Trigger app-specific keyboard shortcuts for power-user workflows.Multi-Step Workflows
Combine multiple browser actions into complex workflows.Tips for best results
Tips for best results
- Be specific about what you want — “Extract the employee names and emails from the table” is better than “get the data”
- Mention if you’re already logged in — saves time skipping the login flow
- Describe the page structure if it’s complex — “the data is in the second tab, under the Summary section”
- Ask for a specific output format — “put it in a spreadsheet” or “format as a table”
Troubleshooting
Troubleshooting
- Page not loading? — The agent will retry navigation. If it keeps failing, check that the URL is correct and the site is accessible.
- Can’t find an element? — The agent re-snapshots the page after navigation. If elements load dynamically, it may need to wait or scroll first.
- Login not working? — Some sites use CAPTCHAs or multi-factor auth that the browser can’t automate. You may need to log in manually first, then let the agent continue.
- Interactions seem flaky? — The agent will re-snapshot and retry. Dynamic pages with animations may need a brief wait between actions.