Quick Start

An enterprise-ready configurable AI agent framework from IBM Research, delivering state-of-the-art performance on real-world automation tasks across web interfaces, APIs, and custom enterprise systems.

Get started quickly with these essential guides:

Installation

Complete installation guide for all environments

Configuration

Configure CUGA for your specific needs

Guides & Examples

OpenAPI integration, MCP servers, and more

Integrations

Build visual AI workflows with CUGA and Langflow

Save & Reuse

Capture and reuse successful execution paths

Evaluation

Test and validate your agent implementations

Policies

Manage agent behavior with Intent Guards, Playbooks, and more

Agent SDK

Python SDK for creating and running CUGA agents

Mission

Building reliable agents for real-world use is harder than it looks. Quick demos are easy; sustained, trustworthy performance in enterprise settings is not.

CUGA (ConfigUrable Generalist Agent) is an open-source generalist agent framework from IBM Research, purpose-built for enterprise automation. Designed for developers, CUGA combines and improves the best of foundational agentic patterns such as ReAct, CodeAct, and Planner-Executor — into a modular architecture enabling trustworthy, policy-aware, and composable automation across web interfaces, APIs, and custom enterprise systems.

CUGA achieves state-of-the-art performance on leading benchmarks:

🥇 #1 on AppWorld — a benchmark with 750 real-world tasks across 457 APIs
🥈 Top-tier on WebArena (#1 from 02/25 - 09/25) — a complex benchmark for autonomous web agents across application domains

What Makes CUGA Special

For us reliability and trust mean:

Handling the complexity of enterprise workflows
Respecting policies and user instructions
Delivering consistent execution by learning from success
Balancing cost, speed, and accuracy
Keeping a human in the loop when oversight matters

Key Features

CUGA is still early, but already provides useful building blocks:

Complex task execution: State of the art results across Web and APIs.
Flexible tool integrations: CUGA works across REST APIs via OpenAPI specs, MCP servers, and custom connectors.
Smart orchestration with code-generated API glue logic
Composable agent architecture: CUGA itself can be exposed as a tool to other agents, enabling nested reasoning and multi-agent collaboration.
Configurable reasoning modes: Choose between fast heuristics or deep planning depending on your task's complexity and latency needs.
Policy system: Configure agent behavior with Intent Guards, Playbooks, Tool Enrichment, Tool Approval, and Output Formatters to enforce security, guide workflows, and ensure compliance.
Save & Reuse (Experimental): CUGA captures and reuses successful execution paths, enabling consistent and faster behavior across repeated tasks.

Requirements

System Requirements

CUGA requires Python 3.12 or higher and works on Windows, macOS, and Linux systems.

Python: 3.12 or higher
RAM: 8GB minimum, 16GB recommended
Storage: 2GB free space (5GB for development)
Network: Internet connection for API access
OS: Ubuntu 20.04+, macOS 12+, or Windows 10+

Required Tools

Before you begin, ensure you have the following:

UV Package Manager - Modern Python dependency management (see installation guide below)
Git - For cloning the repository
API Keys - OpenAI API key or compatible LLM provider

Additional requirements vary by installation method. See the specific installation section below for details.

🚀 Quick Start

Get started with the essential setup steps to have CUGA running quickly using the standard installation method.

Clone the CUGA repository to your local machine:

git clone https://github.com/cuga-project/cuga-agent.git
cd cuga-agent

Create a virtual environment and install dependencies:

uv venv --python=3.12
source .venv/bin/activate  # On Windows: .venv\Scripts\activate
uv sync

Make sure to activate the virtual environment before running any CUGA commands.

Copy the example environment file and add your API key:

cp .env.example .env

Edit the .env file and add your OpenAI API key:

OPENAI_API_KEY=your_api_key_here

For IBM team members, you can use the ETE LiteLLM API key from the ETE LiteLLM portal.

First Run

Start CUGA in demo mode to explore its capabilities:

cuga start demo

Try this example query to test the system:

get my top account by revenue from digital sales

Congratulations! You've successfully installed CUGA. Continue reading for detailed configuration options and advanced setup.

📚 Resources

📖 Example
📧 Contact: CUGA Team

Team

Alon Oved
Asaf Adi
Avi Yaeli
Harold Ship
Ido Levy
Nir Mashkif
Offer Akrabi
Sami Marreed
Segev Shlomov
Yinon Goldshtein

Call for the Community

CUGA is open source because we believe trustworthy enterprise agents must be built together.
Here's how you can help:

Share use cases → Show us how you'd use CUGA in real workflows.
Request features → Suggest capabilities that would make it more useful.
Report bugs → Help improve stability by filing clear, reproducible reports.

All contributions are welcome through GitHub Issues - whether it's sharing use cases, requesting features, or reporting bugs!

Roadmap

CUGA is still early. Amongst other, we're exploring the following directions:

Policy support: procedural SOPs, domain knowledge, input/output guards, context- and tool-based constraints
Performance improvements: dynamic reasoning strategies that adapt to task complexity

Before Submitting a PR

Please follow the contribution guide in CONTRIBUTING.md.

📞 Support & Community

Contact Us: Contact Form
Documentation: Full Documentation

Made by the CUGA Team at IBM Research

Quick Start

Installation

Configuration

Guides & Examples

Integrations

Save & Reuse

Evaluation

Policies

Agent SDK

💼 Use Cases

🛠️ Technology Stack

📦 Installing UV

On this page