Reference map for automating web pages from script mini-apps with browser-native
automation APIs such as Selenium/WebDriver.
- Automate a browser page without image search.
- Fill forms, click buttons, and read DOM state.
- Run multiple browser sessions independently.
- Download and set up WebDriver dependencies.
- Decide between Selenium and screen/input automation.
- Browser automation is a separate path from screen CV and send-input.
- Selenium/WebDriver controls the browser through the WebDriver protocol, via a
per-browser driver process, rather than through synthetic mouse and keyboard events.
- Browser instances are isolated from each other more cleanly than global mouse
and keyboard input.
- WebDriver dependencies (the packages and the per-browser driver executable)
are external to the script and may be downloaded or managed at runtime.
- Use screen/input automation only when the target surface is not accessible as
DOM/browser state.
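As a rough illustration of the session isolation described above, the following sketch starts two Edge sessions in parallel and reads each page title. It assumes the Selenium.WebDriver package is referenced and that a matching Edge driver can be resolved on the machine; the `ReadTitle` helper and both URLs are placeholders chosen for this example, not part of any library.

```csharp
using System;
using System.Threading.Tasks;
using OpenQA.Selenium;
using OpenQA.Selenium.Edge;

class Program
{
    // Hypothetical helper: one driver (and one browser window) per call.
    static string ReadTitle(string url)
    {
        // Each EdgeDriver owns its own browser session; disposing it ends that session only.
        using IWebDriver driver = new EdgeDriver();
        driver.Navigate().GoToUrl(url);
        return driver.Title;
    }

    static async Task Main()
    {
        // Two sessions run side by side without sharing focus, mouse, or keyboard.
        Task<string> first = Task.Run(() => ReadTitle("https://example.com"));
        Task<string> second = Task.Run(() => ReadTitle("https://example.org"));

        foreach (string title in await Task.WhenAll(first, second))
        {
            Console.WriteLine(title);
        }
    }
}
```

Because neither session depends on the foreground window, the same pattern would not be safe with global send-input automation, where both tasks would compete for a single cursor.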
Selenium.WebDriver - main WebDriver NuGet package.
WebDriverManager - NuGet helper that downloads and sets up driver binaries.
OpenQA.Selenium - core namespace (IWebDriver, IWebElement, By).
OpenQA.Selenium.Edge - namespace for the Microsoft Edge driver APIs.
EdgeDriverService - manages the driver process.
EdgeOptions - browser startup options.
IWebDriver.Navigate() - navigation.
FindElement(...), By.Name, By.CssSelector, By.XPath - DOM lookup.
IWebElement.SendKeys(...), Click(), Submit() - user-like actions.
- Add Selenium/WebDriver package references.
- Ensure a matching browser driver is available.
- Create browser options and driver service.
- Create the WebDriver.
- Navigate to a URL.
- Find DOM elements and interact with them.
- Dispose the driver, or intentionally leave the browser running.
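The steps above can be sketched end to end as follows. This is a minimal outline, assuming the Selenium.WebDriver and WebDriverManager packages are referenced and Microsoft Edge is installed; the URL and selectors target the public example.com placeholder page and are illustrative only. Recent Selenium releases can also resolve the driver on their own, in which case the WebDriverManager call may be unnecessary.

```csharp
using System;
using OpenQA.Selenium;
using OpenQA.Selenium.Edge;
using WebDriverManager;
using WebDriverManager.DriverConfigs.Impl;

class Program
{
    static void Main()
    {
        // Ensure an Edge driver binary is available (downloaded if not already cached).
        new DriverManager().SetUpDriver(new EdgeConfig());

        // Create the driver service and browser options.
        EdgeDriverService service = EdgeDriverService.CreateDefaultService();
        service.HideCommandPromptWindow = true; // suppress the driver console window
        var options = new EdgeOptions();
        options.AddArgument("--start-maximized");

        // Create the WebDriver; the using declaration disposes it when Main exits,
        // which ends the session and closes the browser.
        using IWebDriver driver = new EdgeDriver(service, options);

        // Navigate to a URL.
        driver.Navigate().GoToUrl("https://example.com");

        // Find DOM elements and interact with them.
        IWebElement heading = driver.FindElement(By.CssSelector("h1"));
        Console.WriteLine(heading.Text);

        IWebElement link = driver.FindElement(By.CssSelector("a"));
        link.Click();
        Console.WriteLine(driver.Url);
    }
}
```

To leave the browser running on purpose, the driver would have to be created without the `using` declaration and never disposed; that choice also leaves the driver process alive, so it is best made deliberately.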
- Prefer Selenium/WebDriver for websites and browser apps.
- Prefer DOM selectors over image matching when the DOM is available.
- Prefer explicit waits/conditions over fixed sleeps for real workflows.
- Prefer one WebDriver instance per independent browser session.
- Prefer tying driver disposal to the script/app lifetime so browser and driver
processes do not outlive it.
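One way to follow the explicit-wait preference is a small polling helper built on `WebDriverWait`. The sketch below assumes the Selenium.Support package is referenced (it provides the `OpenQA.Selenium.Support.UI` namespace); the `PageWaits` class and `WaitForVisible` method are hypothetical names introduced for this example.

```csharp
using System;
using OpenQA.Selenium;
using OpenQA.Selenium.Support.UI;

static class PageWaits
{
    // Polls until the element exists and is displayed, or the timeout elapses.
    // On timeout, WebDriverWait throws WebDriverTimeoutException.
    public static IWebElement WaitForVisible(IWebDriver driver, By locator, TimeSpan timeout)
    {
        var wait = new WebDriverWait(driver, timeout);

        // Until() retries while the lambda returns null; element-not-found
        // errors raised by FindElement are ignored by WebDriverWait by default.
        return wait.Until(d =>
        {
            IWebElement element = d.FindElement(locator);
            return element.Displayed ? element : null;
        });
    }
}
```

A call such as `PageWaits.WaitForVisible(driver, By.CssSelector("#results"), TimeSpan.FromSeconds(10))` returns as soon as the element appears, so the script waits only as long as the page actually needs; the `#results` selector here is a placeholder.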
- Avoid using global mouse/keyboard simulation for browser pages unless the DOM
is unavailable.
- Avoid assuming WebDriver binaries exist on every machine.
- Avoid long fixed sleeps as a synchronization mechanism in production mini-apps.
- Avoid storing API keys, passwords, or session cookies in source files.
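To keep secrets out of source files, one option is to read them from the process environment at runtime. The sketch below assumes a login form whose fields are named `username` and `password`; those selectors, the `LoginForm` class, and the `MINIAPP_USER` / `MINIAPP_PASSWORD` variable names are all hypothetical and would need to match the real page and deployment.

```csharp
using System;
using OpenQA.Selenium;

static class LoginForm
{
    // Reads a required environment variable, failing fast with a clear message.
    static string RequireEnv(string name) =>
        Environment.GetEnvironmentVariable(name)
        ?? throw new InvalidOperationException($"Environment variable '{name}' is not set.");

    // Fills and submits the (assumed) login form without embedding credentials in code.
    public static void Fill(IWebDriver driver)
    {
        string user = RequireEnv("MINIAPP_USER");
        string password = RequireEnv("MINIAPP_PASSWORD");

        driver.FindElement(By.Name("username")).SendKeys(user);
        driver.FindElement(By.Name("password")).SendKeys(password);
        driver.FindElement(By.CssSelector("button[type='submit']")).Click();
    }
}
```

An OS credential store or a secrets manager would serve the same purpose; environment variables are used here only because they need no extra dependencies.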
Selenium.WebDriver
WebDriverManager
OpenQA.Selenium
IWebDriver
IWebElement
By
EdgeDriver
EdgeDriverService
EdgeOptions
ChromeDriver
WebDriverWait
ExpectedConditions
- browser automation
- Selenium
- WebDriver
- EdgeDriver
- ChromeDriver
- DOM automation
- web page automation
- browser session
- form fill
- web scraper
scripting/references-and-resources.md
scripting/web-server.md
windows-subsystems/input-hooks-hotkeys.md
computer-vision/images.md