Browser Only Mode

CUGA can operate in browser-only mode, focusing purely on web automation, browser interaction, and web page manipulation without API capabilities.

🎯 What is Browser Only Mode?

Browser Only mode restricts CUGA to:

🌐 Web Automation: Navigate, interact with, and manipulate web pages
🖱️ Browser Interaction: Click, type, scroll, and interact with web elements
📄 Page Manipulation: Modify HTML content, extract data, fill forms
🔄 Web Workflows: Multi-step web-based processes and automation
No API Access: Cannot make direct API calls or backend requests

⚙️ Configuration

Enable Browser Only Mode

Edit your settings.toml file:

[features]
mode = "web"  # Set to browser-only mode

[web_mode]
enabled = true
browser_enabled = true
api_enabled = false
web_automation = true

Environment Variable

# Set via environment variable
export CUGA_MODE="web"

🚀 Getting Started

1. Start CUGA in Browser Mode

# Start with browser-only mode
cuga start demo --mode web

# Or set mode and start
export CUGA_MODE="web"
cuga start demo

2. Verify Mode

# Check current mode
cuga mode current

# Expected output: "web"

# Check configuration
cuga config show --section features

💬 Example Queries

Browser-Only Queries

These queries work perfectly in browser-only mode:

"navigate to google.com and search for CUGA"
"fill out the contact form on the current page"
"click the submit button and wait for confirmation"
"scroll down and take a screenshot of the page"
"extract all product prices from the e-commerce page"
"navigate through the website menu and find the about page"

Web Automation Examples

"go to the login page and enter my credentials"
"navigate to the shopping cart and add the first product"
"fill out the registration form with my information"
"click on the download button and wait for the file"
"scroll through the page and find the contact information"

🌐 Browser Capabilities

Page Navigation: Navigate to URLs, follow links
Browser Control: Open, close, switch between tabs/windows
History Management: Go back, forward, refresh pages
URL Manipulation: Modify and navigate to different URLs

Interaction

Click Operations: Click buttons, links, form elements
Text Input: Type text into forms, search boxes, input fields
Form Handling: Fill out forms, select options, submit data
Mouse Actions: Hover, drag, drop, scroll operations

Content Manipulation

HTML Modification: Change page content, add/remove elements
Data Extraction: Extract text, images, links, tables
Page Analysis: Analyze page structure and content
Screenshot Capture: Take screenshots of pages or elements

🔧 Setup for Browser Mode

Prerequisites

✅ CUGA running in browser-only mode
✅ Web browser installed (Chrome, Firefox, Safari)
✅ Browser automation tools enabled
✅ Web pages accessible

Browser Setup

# Ensure browser automation is enabled
cuga config show --section web_mode

# Verify browser tools
cuga tools list --type browser

# Check browser connectivity
cuga browser test --url https://example.com

📄 HTML Page Example

Simple HTML Page for Testing

Create a simple HTML page to test browser automation:

<!DOCTYPE html>
<html lang="en">
<head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>CUGA Browser Mode Demo</title>
    <style>
        body {
            font-family: Arial, sans-serif;
            max-width: 800px;
            margin: 0 auto;
            padding: 20px;
        }
        .form-section {
            background-color: #f5f5f5;
            padding: 20px;
            border-radius: 5px;
            margin: 20px 0;
        }
        .button {
            background-color: #2196f3;
            color: white;
            padding: 10px 20px;
            border: none;
            border-radius: 5px;
            cursor: pointer;
            margin: 5px;
        }
        .button:hover {
            background-color: #1976d2;
        }
        .result {
            background-color: #e8f5e8;
            padding: 15px;
            border-radius: 5px;
            margin: 20px 0;
            display: none;
        }
    </style>
</head>
<body>
    <h1>🤖 CUGA Browser Mode Demo</h1>
    
    <div class="form-section">
        <h2>Contact Form</h2>
        <form id="contact-form">
            <div>
                <label for="name">Name:</label><br>
                <input type="text" id="name" name="name" placeholder="Enter your name">
            </div>
            <br>
            <div>
                <label for="email">Email:</label><br>
                <input type="email" id="email" name="email" placeholder="Enter your email">
            </div>
            <br>
            <div>
                <label for="message">Message:</label><br>
                <textarea id="message" name="message" rows="4" placeholder="Enter your message"></textarea>
            </div>
            <br>
            <button type="submit" class="button">Submit Form</button>
            <button type="button" class="button" onclick="clearForm()">Clear Form</button>
        </form>
    </div>
    
    <div class="form-section">
        <h2>Navigation Test</h2>
        <button class="button" onclick="showSection('section1')">Show Section 1</button>
        <button class="button" onclick="showSection('section2')">Show Section 2</button>
        <button class="button" onclick="showSection('section3')">Show Section 3</button>
    </div>
    
    <div id="section1" class="result">
        <h3>Section 1 Content</h3>
        <p>This is the content for section 1. CUGA can navigate to this section.</p>
    </div>
    
    <div id="section2" class="result">
        <h3>Section 2 Content</h3>
        <p>This is the content for section 2. CUGA can interact with this section.</p>
    </div>
    
    <div id="section3" class="result">
        <h3>Section 3 Content</h3>
        <p>This is the content for section 3. CUGA can extract data from this section.</p>
    </div>
    
    <div class="form-section">
        <h2>Try These Browser Operations</h2>
        <p>Use CUGA to:</p>
        <ul>
            <li>"fill out the contact form with test data"</li>
            <li>"click the submit button on the form"</li>
            <li>"navigate to section 2 and show its content"</li>
            <li>"extract all text content from the page"</li>
            <li>"take a screenshot of the current page"</li>
        </ul>
    </div>
    
    <script>
        document.getElementById('contact-form').addEventListener('submit', function(e) {
            e.preventDefault();
            alert('Form submitted! (This is a demo)');
        });
        
        function clearForm() {
            document.getElementById('contact-form').reset();
        }
        
        function showSection(sectionId) {
            // Hide all sections
            document.querySelectorAll('.result').forEach(section => {
                section.style.display = 'none';
            });
            
            // Show selected section
            document.getElementById(sectionId).style.display = 'block';
        }
    </script>
</body>
</html>

Save and Open the Page

# Save the HTML content to a file
cat > browser_demo_page.html << 'EOF'
<!DOCTYPE html>
<html lang="en">
<head>
    <meta charset="UTF-8">
    <meta name="viewport" content="width=device-width, initial-scale=1.0">
    <title>CUGA Browser Mode Demo</title>
    <style>
        body {
            font-family: Arial, sans-serif;
            max-width: 800px;
            margin: 0 auto;
            padding: 20px;
        }
        .form-section {
            background-color: #f5f5f5;
            padding: 20px;
            border-radius: 5px;
            margin: 20px 0;
        }
        .button {
            background-color: #2196f3;
            color: white;
            padding: 10px 20px;
            border: none;
            border-radius: 5px;
            cursor: pointer;
            margin: 5px;
        }
        .button:hover {
            background-color: #1976d2;
        }
        .result {
            background-color: #e8f5e8;
            padding: 15px;
            border-radius: 5px;
            margin: 20px 0;
            display: none;
        }
    </style>
</head>
<body>
    <h1>🤖 CUGA Browser Mode Demo</h1>
    
    <div class="form-section">
        <h2>Contact Form</h2>
        <form id="contact-form">
            <div>
                <label for="name">Name:</label><br>
                <input type="text" id="name" name="name" placeholder="Enter your name">
            </div>
            <br>
            <div>
                <label for="email">Email:</label><br>
                <input type="email" id="email" name="email" placeholder="Enter your email">
            </div>
            <br>
            <div>
                <label for="message">Message:</label><br>
                <textarea id="message" name="message" rows="4" placeholder="Enter your message"></textarea>
            </div>
            <br>
            <button type="submit" class="button">Submit Form</button>
            <button type="button" class="button" onclick="clearForm()">Clear Form</button>
        </form>
    </div>
    
    <div class="form-section">
        <h2>Navigation Test</h2>
        <button class="button" onclick="showSection('section1')">Show Section 1</button>
        <button class="button" onclick="showSection('section2')">Show Section 2</button>
        <button class="button" onclick="showSection('section3')">Show Section 3</button>
    </div>
    
    <div id="section1" class="result">
        <h3>Section 1 Content</h3>
        <p>This is the content for section 1. CUGA can navigate to this section.</p>
    </div>
    
    <div id="section2" class="result">
        <h3>Section 2 Content</h3>
        <p>This is the content for section 2. CUGA can interact with this section.</p>
    </div>
    
    <div id="section3" class="result">
        <h3>Section 3 Content</h3>
        <p>This is the content for section 3. CUGA can extract data from this section.</p>
    </div>
    
    <div class="form-section">
        <h2>Try These Browser Operations</h2>
        <p>Use CUGA to:</p>
        <ul>
            <li>"fill out the contact form with test data"</li>
            <li>"click the submit button on the form"</li>
            <li>"navigate to section 2 and show its content"</li>
            <li>"extract all text content from the page"</li>
            <li>"take a screenshot of the current page"</li>
        </ul>
    </div>
    
    <script>
        document.getElementById('contact-form').addEventListener('submit', function(e) {
            e.preventDefault();
            alert('Form submitted! (This is a demo)');
        });
        
        function clearForm() {
            document.getElementById('contact-form').reset();
        }
        
        function showSection(sectionId) {
            // Hide all sections
            document.querySelectorAll('.result').forEach(section => {
                section.style.display = 'none';
            });
            
            // Show selected section
            document.getElementById(sectionId).style.display = 'block';
        }
    </script>
</body>
</html>
EOF

# Open the page in your browser
open browser_demo_page.html  # macOS
# OR
xdg-open browser_demo_page.html  # Linux
# OR
start browser_demo_page.html  # Windows

🧪 Testing Browser Operations

1. Open the Demo Page

# Open the HTML page in your browser
open browser_demo_page.html

2. Start CUGA in Browser Mode

# Start CUGA with browser-only mode
cuga start demo --mode web

3. Test Browser Operations

Try these queries in the CUGA interface:

"fill out the contact form with test data"
"click the submit button on the form"
"navigate to section 2 and show its content"
"extract all text content from the page"
"take a screenshot of the current page"

🔍 Browser Automation Features

Element Selection

CUGA can identify and interact with page elements using:

CSS Selectors: #id, .class, tag[attribute]
XPath: Complex element location expressions
Text Content: Finding elements by visible text
Element Hierarchy: Parent-child relationships

Interaction Methods

# Example of browser automation operations
def perform_browser_operations():
    # Navigate to page
    browser.navigate("https://example.com")
    
    # Find and fill form
    name_input = browser.find_element("#name")
    name_input.type("John Doe")
    
    # Click button
    submit_button = browser.find_element("button[type='submit']")
    submit_button.click()
    
    # Wait for result
    browser.wait_for_element(".result")
    
    # Extract data
    result_text = browser.get_text(".result")
    return result_text

Page Manipulation

// Example of how CUGA can modify page content
function modifyPageContent() {
    // Add new content
    const newSection = document.createElement('div');
    newSection.innerHTML = '<h3>CUGA Added Content</h3><p>This was added by CUGA automation.</p>';
    document.body.appendChild(newSection);
    
    // Modify existing content
    const title = document.querySelector('h1');
    title.textContent = '🤖 CUGA Browser Mode Demo - Modified';
    
    // Change styles
    document.body.style.backgroundColor = '#f0f8ff';
}

📊 Use Cases

Perfect for Browser Only Mode

Web Scraping: Extract data from websites
Form Automation: Fill out and submit web forms
UI Testing: Test web interface functionality
Content Management: Update web page content
Workflow Automation: Navigate through multi-step web processes

Example Scenarios

E-commerce Automation: Navigate product catalogs, add items to cart
Social Media Management: Post content, interact with posts
Web Application Testing: Test user flows and interface elements
Data Collection: Scrape information from multiple web pages
Content Publishing: Update blog posts, manage web content

🔧 Advanced Configuration

Browser Mode Settings

[web_mode]
# Core settings
enabled = true
browser_enabled = true
api_enabled = false

# Browser settings
headless = false
browser_type = "chrome"  # chrome, firefox, safari
browser_timeout = 60
page_load_timeout = 30

# Automation settings
auto_wait = true
wait_timeout = 10
retry_attempts = 3
screenshot_on_error = true

# Performance settings
lazy_loading = true
resource_blocking = false
image_loading = false

Browser Optimization

[web_mode.performance]
# Browser optimization
disable_images = true
disable_css = false
disable_javascript = false
disable_plugins = true

# Memory management
max_memory_usage = "2GB"
garbage_collection = "aggressive"
tab_limit = 5

# Network optimization
request_interception = true
response_caching = true
bandwidth_limit = "10MB/s"

🚨 Troubleshooting

Common Browser Mode Issues

Issue	Solution
🔴 Browser not starting	Check browser installation and drivers
🔴 Page not loading	Verify internet connectivity and URL accessibility
🔴 Elements not found	Check page structure and element selectors
🔴 Slow performance	Optimize browser settings and resource usage

Debug Commands

# Check browser mode status
cuga mode current

# Verify browser tools
cuga tools list --type browser

# Test browser connectivity
cuga browser test --url https://example.com

# Check page interaction
cuga browser test --page browser_demo_page.html

📈 Performance Considerations

Resource Usage

Browser-only mode uses significant resources:

Memory: Browser rendering and page content
CPU: Page rendering and JavaScript execution
Network: Page loading and resource fetching
Storage: Browser cache and page snapshots

Optimization Tips

Use Headless Mode: For production environments without UI
Limit Concurrent Browsers: Don't run too many browser instances
Implement Caching: Cache page elements and resources
Monitor Resources: Keep track of memory and CPU usage
Clean Up: Close unused tabs and browser instances

🔄 Switching Modes

To Hybrid Mode

# Switch to hybrid mode
cuga mode hybrid

# Or edit settings.toml
[features]
mode = "hybrid"

To API Only Mode

# Switch to API-only mode
cuga mode api

# Or edit settings.toml
[features]
mode = "api"

📚 Best Practices

Browser Automation Guidelines

Element Identification: Use stable selectors (IDs, data attributes)
Wait Strategies: Implement proper wait conditions for dynamic content
Error Handling: Handle network issues and page load failures
Resource Management: Clean up browser resources after operations
Testing: Test automation scripts on different browsers and devices

Performance Optimization

Minimize Page Loads: Reuse browser instances when possible
Efficient Selectors: Use fast and reliable element selectors
Batch Operations: Group related operations together
Resource Blocking: Block unnecessary resources (images, ads)
Memory Management: Monitor and manage browser memory usage

📚 Next Steps

After mastering browser-only mode:

Hybrid Mode: Add API interaction capabilities
API Only Mode: Focus on server-side operations
Save & Reuse: Optimize browser automation workflows
Advanced Features: Explore execution modes

Browser mode working? Explore Hybrid Mode next!

Browser Only Mode

On this page