Logos Testing Framework

Declarative, multi-node blockchain testing for the Logos network

The Logos Testing Framework enables you to test consensus, data availability, and transaction workloads across local processes, Docker Compose, and Kubernetes deployments—all with a unified scenario API.

Get Started


Core Concept

Everything in this framework is a Scenario.

A Scenario is a controlled experiment over time, composed of:

  • Topology — The cluster shape (validators, executors, network layout)
  • Workloads — Traffic and conditions that exercise the system (transactions, DA, chaos)
  • Expectations — Success criteria verified after execution (liveness, inclusion, recovery)
  • Duration — The time window for the experiment

This single abstraction makes tests declarative, portable, and composable.


How It Works

flowchart LR
    Build[Define Scenario] --> Deploy[Deploy Topology]
    Deploy --> Execute[Run Workloads]
    Execute --> Evaluate[Check Expectations]
    
    style Build fill:#e1f5ff
    style Deploy fill:#fff4e1
    style Execute fill:#ffe1f5
    style Evaluate fill:#e1ffe1

  1. Define Scenario — Describe your test: topology, workloads, and success criteria
  2. Deploy Topology — Launch validators and executors using host, compose, or k8s runners
  3. Run Workloads — Drive transactions, DA traffic, and chaos operations
  4. Check Expectations — Verify consensus liveness, inclusion, and system health

Key Features

Declarative API

  • Express scenarios as topology + workloads + expectations
  • Reuse the same test definition across different deployment targets
  • Compose complex tests from modular components

Multiple Deployment Modes

  • Host Runner: Local processes for fast iteration
  • Compose Runner: Containerized environments with node control
  • Kubernetes Runner: Production-like cluster testing

Built-in Workloads

  • Transaction submission with configurable rates
  • Data availability (DA) blob dispersal and sampling
  • Chaos testing with controlled node restarts

Comprehensive Observability

  • Real-time block feed for monitoring consensus progress
  • Prometheus/Grafana integration for metrics
  • Per-node log collection and debugging

Quick Example

use std::time::Duration;

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_core::scenario::Deployer as _;
use testing_framework_runner_local::LocalDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    let mut scenario = ScenarioBuilder::topology_with(|t| {
        t.network_star()
            .validators(3)
            .executors(1)
    })
    .transactions_with(|tx| tx.rate(10).users(5))
    .expect_consensus_liveness()
    .with_run_duration(Duration::from_secs(60))
    .build();

    let deployer = LocalDeployer::default();
    let runner = deployer.deploy(&scenario).await?;
    runner.run(&mut scenario).await?;

    Ok(())
}

View complete examples


Choose Your Path

New to the Framework?

Start with the Quickstart Guide for a hands-on introduction that gets you running tests in minutes.

Ready to Write Tests?

Explore the User Guide to learn about authoring scenarios, workloads, expectations, and deployment strategies.

Setting Up CI/CD?

Jump to Operations & Deployment for prerequisites, environment configuration, and continuous integration patterns.

Extending the Framework?

Check the Developer Reference to implement custom workloads, expectations, and runners.


Project Context

Logos is a modular blockchain protocol composed of validators, executors, and a data-availability (DA) subsystem:

  • Validators participate in consensus and produce blocks
  • Executors are validators with the DA dispersal service enabled. They perform all validator functions plus submit blob data to the DA network
  • Data Availability (DA) ensures that blob data submitted via channel operations in transactions is published and retrievable by the network

These roles interact tightly, which is why meaningful testing must be performed in multi-node environments that include real networking, timing, and DA interaction.

The Logos Testing Framework provides the infrastructure to orchestrate these multi-node scenarios reliably across development, CI, and production-like environments.

Learn more about the protocol: Logos Project Documentation


Documentation Structure

| Section | Description |
| --- | --- |
| Foundations | Architecture, philosophy, and design principles |
| User Guide | Writing and running scenarios, workloads, and expectations |
| Developer Reference | Extending the framework with custom components |
| Operations & Deployment | Setup, CI integration, and environment configuration |
| Appendix | Quick reference, troubleshooting, FAQ, and glossary |


Ready to start? Head to the Quickstart

What You Will Learn

This book gives you a clear mental model for Logos multi-node testing, shows how to author scenarios that pair realistic workloads with explicit expectations, and guides you to run them across local, containerized, and cluster environments without changing the plan.

By the End of This Book, You Will Be Able To:

Understand the Framework

  • Explain the six-phase scenario lifecycle (Build, Deploy, Capture, Execute, Evaluate, Cleanup)
  • Describe how Deployers, Runners, Workloads, and Expectations work together
  • Navigate the crate architecture and identify extension points
  • Understand when to use each runner (Host, Compose, Kubernetes)

Author and Run Scenarios

  • Define multi-node topologies with validators and executors
  • Configure transaction and DA workloads with appropriate rates
  • Add consensus liveness and inclusion expectations
  • Run scenarios across all three deployment modes
  • Use BlockFeed to monitor block production in real-time
  • Implement chaos testing with node restarts

Operate in Production

  • Set up prerequisites and dependencies correctly
  • Configure environment variables for different runners
  • Integrate tests into CI/CD pipelines (GitHub Actions)
  • Troubleshoot common failure scenarios
  • Collect and analyze logs from multi-node runs
  • Optimize test durations and resource usage

Extend the Framework

  • Implement custom Workload traits for new traffic patterns
  • Create custom Expectation traits for domain-specific checks
  • Add new Deployer implementations for different backends
  • Contribute topology helpers and DSL extensions

Learning Path

Beginner (0-2 hours)

Intermediate (2-8 hours)

Advanced (8+ hours)

What This Book Does NOT Cover

  • Logos node internals — This book focuses on testing infrastructure, not the blockchain protocol implementation. See the Logos node repository (nomos-node) for protocol documentation.
  • Consensus algorithm theory — We assume familiarity with basic blockchain concepts (validators, blocks, transactions, data availability).
  • Rust language basics — Examples use Rust, but we don’t teach the language. See The Rust Book if you’re new to Rust.
  • Kubernetes administration — We show how to use the K8s runner, but don’t cover cluster setup, networking, or operations.
  • Docker fundamentals — We assume basic Docker/Compose knowledge for the Compose runner.

Quickstart

Get a working example running quickly.

From Scratch (Complete Setup)

If you’re starting from zero, here’s everything you need:

# 1. Install Rust nightly
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh
rustup default nightly

# 2. Clone the repository
git clone https://github.com/logos-blockchain/logos-blockchain-testing.git
cd logos-blockchain-testing

# 3. Run your first scenario (downloads dependencies automatically)
POL_PROOF_DEV_MODE=true scripts/run/run-examples.sh -t 60 -v 1 -e 1 host

First run takes 5-10 minutes (downloads ~120MB circuit assets, builds binaries).

Windows users: Use WSL2 (Windows Subsystem for Linux). Native Windows is not supported.


Prerequisites

If you already have the repository cloned:

  • Rust toolchain (nightly)
  • Unix-like system (tested on Linux and macOS)
  • For Docker Compose examples: Docker daemon running
  • For Docker Desktop on Apple silicon (compose/k8s): set NOMOS_BUNDLE_DOCKER_PLATFORM=linux/arm64 to avoid slow/fragile amd64 emulation builds
  • versions.env file at repository root (defines VERSION, NOMOS_NODE_REV, NOMOS_BUNDLE_VERSION)

Note: nomos-node binaries are built automatically on demand or can be provided via prebuilt bundles.

Important: The versions.env file is required by helper scripts. If missing, the scripts will fail with an error. The file should already exist in the repository root.

Your First Test

The framework ships with runnable example binaries in examples/src/bin/.

Recommended: Use the convenience script:

# From the logos-blockchain-testing directory
scripts/run/run-examples.sh -t 60 -v 1 -e 1 host

This handles circuit setup, binary building, and runs a complete scenario: 1 validator + 1 executor, mixed transaction + DA workload (5 tx/block + 1 channel + 1 blob), 60s duration.

Note: The DA workload attaches DaWorkloadExpectation, and channel/blob publishing is slower than tx submission. If you see DaWorkloadExpectation failures, rerun with a longer duration (e.g., -t 120), especially on CI or slower machines.

Alternative: Direct cargo run (requires manual setup):

# Requires circuits in place and NOMOS_NODE_BIN/NOMOS_EXECUTOR_BIN set
POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin local_runner

Core API Pattern (simplified example):

use std::time::Duration;

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_local::LocalDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

pub async fn run_local_demo() -> Result<()> {
    // Define the scenario (1 validator + 1 executor, tx + DA workload)
    let mut plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(1).executors(1))
        .wallets(1_000)
        .transactions_with(|txs| {
            txs.rate(5) // 5 transactions per block
                .users(500) // use 500 of the seeded wallets
        })
        .da_with(|da| {
            da.channel_rate(1) // 1 channel
                .blob_rate(1) // target 1 blob per block
                .headroom_percent(20) // default headroom when sizing channels
        })
        .expect_consensus_liveness()
        .with_run_duration(Duration::from_secs(60))
        .build();

    // Deploy and run
    let deployer = LocalDeployer::default();
    let runner = deployer.deploy(&plan).await?;
    let _handle = runner.run(&mut plan).await?;

    Ok(())
}

Note: The examples are binaries with #[tokio::main], not test functions. If you want to write integration tests, wrap this pattern in #[tokio::test] functions in your own test suite.
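
For instance, assuming the run_local_demo function above lives in your own integration test crate, a thin wrapper is enough. This is a sketch, not part of the framework; the test name is illustrative:

// Sketch: reuse the `run_local_demo` function above inside an integration test.
// Run with: POL_PROOF_DEV_MODE=true cargo test
#[tokio::test]
async fn local_demo_as_test() -> anyhow::Result<()> {
    run_local_demo().await
}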

Important: POL_PROOF_DEV_MODE=true disables expensive Groth16 zero-knowledge proof generation for leader election. Without it, proof generation is CPU-intensive and tests will time out. It is required for all runners (local, compose, k8s) to keep test runs practical. Never use it in production.

What you should see:

  • Nodes spawn as local processes
  • Consensus starts producing blocks
  • Scenario runs for the configured duration
  • Node state/logs written under a temporary per-run directory in the current working directory (removed after the run unless NOMOS_TESTS_KEEP_LOGS=1)
  • To write per-node log files to a stable location: set NOMOS_LOG_DIR=/path/to/logs (files will have prefix like nomos-node-0*, may include timestamps)

What Just Happened?

Let’s unpack the code:

1. Topology Configuration

use testing_framework_core::scenario::ScenarioBuilder;

pub fn step_1_topology() -> testing_framework_core::scenario::Builder<()> {
    ScenarioBuilder::topology_with(|t| {
        t.network_star() // Star topology: all nodes connect to seed
            .validators(1) // 1 validator node
            .executors(1) // 1 executor node (validator + DA dispersal)
    })
}

This defines what your test network looks like.

2. Wallet Seeding

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

pub fn step_2_wallets() -> testing_framework_core::scenario::Builder<()> {
    ScenarioBuilder::with_node_counts(1, 1).wallets(1_000) // Seed 1,000 funded wallet accounts
}

Provides funded accounts for transaction submission.

3. Workloads

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

pub fn step_3_workloads() -> testing_framework_core::scenario::Builder<()> {
    ScenarioBuilder::with_node_counts(1, 1)
        .wallets(1_000)
        .transactions_with(|txs| {
            txs.rate(5) // 5 transactions per block
                .users(500) // Use 500 of the 1,000 wallets
        })
        .da_with(|da| {
            da.channel_rate(1) // 1 DA channel (more spawned with headroom)
                .blob_rate(1) // target 1 blob per block
                .headroom_percent(20) // default headroom when sizing channels
        })
}

Generates both transaction and DA traffic to stress both subsystems.

4. Expectation

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

pub fn step_4_expectation() -> testing_framework_core::scenario::Builder<()> {
    ScenarioBuilder::with_node_counts(1, 1).expect_consensus_liveness() // This says what success means: blocks must be produced continuously.
}

This says what success means: blocks must be produced continuously.

5. Run Duration

use std::time::Duration;

use testing_framework_core::scenario::ScenarioBuilder;

pub fn step_5_run_duration() -> testing_framework_core::scenario::Builder<()> {
    ScenarioBuilder::with_node_counts(1, 1).with_run_duration(Duration::from_secs(60))
}

Run for 60 seconds (~27 blocks with default 2s slots, 0.9 coefficient). Framework ensures this is at least 2× the consensus slot duration. Adjust consensus timing via CONSENSUS_SLOT_TIME and CONSENSUS_ACTIVE_SLOT_COEFF.

6. Deploy and Execute

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_local::LocalDeployer;

pub async fn step_6_deploy_and_execute() -> Result<()> {
    let mut plan = ScenarioBuilder::with_node_counts(1, 1).build();

    let deployer = LocalDeployer::default(); // Use local process deployer
    let runner = deployer.deploy(&plan).await?; // Provision infrastructure
    let _handle = runner.run(&mut plan).await?; // Execute workloads & expectations

    Ok(())
}

Deployer provisions the infrastructure. Runner orchestrates execution.

Adjust the Topology

With run-examples.sh (recommended):

# Scale up to 3 validators + 2 executors, run for 2 minutes
scripts/run/run-examples.sh -t 120 -v 3 -e 2 host

With direct cargo run:

# Uses NOMOS_DEMO_* env vars (or legacy *_DEMO_* vars)
NOMOS_DEMO_VALIDATORS=3 \
NOMOS_DEMO_EXECUTORS=2 \
NOMOS_DEMO_RUN_SECS=120 \
POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin local_runner

Try Docker Compose

Use the same API with a different deployer for a reproducible containerized environment.

Recommended: Use the convenience script (handles everything):

scripts/run/run-examples.sh -t 60 -v 1 -e 1 compose

This automatically:

  • Fetches circuit assets (to testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params)
  • Builds/uses prebuilt binaries (via NOMOS_BINARIES_TAR if available)
  • Builds the Docker image
  • Runs the compose scenario

Alternative: Direct cargo run with manual setup:

# Option 1: Use prebuilt bundle (recommended for compose/k8s)
scripts/build/build-bundle.sh --platform linux  # Creates .tmp/nomos-binaries-linux-v0.3.1.tar.gz
export NOMOS_BINARIES_TAR=.tmp/nomos-binaries-linux-v0.3.1.tar.gz

# Option 2: Manual circuit/image setup (rebuilds during image build)
scripts/setup/setup-nomos-circuits.sh v0.3.1 /tmp/nomos-circuits
cp -r /tmp/nomos-circuits/* testing-framework/assets/stack/kzgrs_test_params/
scripts/build/build_test_image.sh

# Run with Compose
NOMOS_TESTNET_IMAGE=logos-blockchain-testing:local \
POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin compose_runner

Benefit: Reproducible containerized environment (Dockerized nodes, repeatable deployments).

Optional: Prometheus + Grafana

The runner can integrate with external observability endpoints. For a ready-to-run local stack:

scripts/setup/setup-observability.sh compose up
eval "$(scripts/setup/setup-observability.sh compose env)"

Then run your compose scenario as usual (the environment variables enable PromQL querying and node OTLP metrics export).

Note: Compose expects KZG parameters at /kzgrs_test_params/kzgrs_test_params inside containers (the directory name is repeated as the filename).

In code: Just swap the deployer:

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_compose::ComposeDeployer;

pub async fn run_with_compose_deployer() -> Result<()> {
    // ... same scenario definition ...
    let mut plan = ScenarioBuilder::with_node_counts(1, 1).build();

    let deployer = ComposeDeployer::default(); // Use Docker Compose
    let runner = deployer.deploy(&plan).await?;
    let _handle = runner.run(&mut plan).await?;

    Ok(())
}

Next Steps

Now that you have a working test:

Part I — Foundations

Conceptual chapters that establish the mental model for the framework and how it approaches multi-node testing.

Introduction

The Logos Testing Framework is a purpose-built toolkit for exercising Logos in realistic, multi-node environments. It solves the gap between small, isolated tests and full-system validation by letting teams describe a cluster layout, drive meaningful traffic, and assert the outcomes in one coherent plan.

It is for protocol engineers, infrastructure operators, and QA teams who need repeatable confidence that validators, executors, and data-availability components work together under network and timing constraints.

Multi-node integration testing is required because many Logos behaviors—block progress, data availability, liveness under churn—only emerge when several roles interact over real networking and time. This framework makes those checks declarative, observable, and portable across environments.

A Scenario in 20 Lines

Here’s the conceptual shape of every test you’ll write:

// 1. Define the cluster
let scenario = ScenarioBuilder::topology_with(|t| {
    t.network_star()
        .validators(3)
        .executors(2)
})
// 2. Add workloads (traffic)
.transactions_with(|tx| tx.rate(10).users(5))
.da_with(|da| da.channel_rate(2).blob_rate(2))

// 3. Define success criteria
.expect_consensus_liveness()

// 4. Set experiment duration
.with_run_duration(Duration::from_secs(60))
.build();

// 5. Deploy and run
let runner = deployer.deploy(&scenario).await?;
runner.run(&mut scenario).await?;

This pattern—topology, workloads, expectations, duration—repeats across all scenarios in this book.

Learn more: For protocol-level documentation and node internals, see the Logos Project Documentation.

Architecture Overview

The framework follows a clear flow: Topology → Scenario → Deployer → Runner → Workloads → Expectations.

Core Flow

flowchart LR
    A(Topology<br/>shape cluster) --> B(Scenario<br/>plan)
    B --> C(Deployer<br/>provision & readiness)
    C --> D(Runner<br/>orchestrate execution)
    D --> E(Workloads<br/>drive traffic)
    E --> F(Expectations<br/>verify outcomes)

Crate Architecture

flowchart TB
    subgraph Examples["Runner Examples"]
        LocalBin[local_runner.rs]
        ComposeBin[compose_runner.rs]
        K8sBin[k8s_runner.rs]
        CucumberBin[cucumber_*.rs]
    end
    
    subgraph Workflows["Workflows (Batteries Included)"]
        DSL[ScenarioBuilderExt<br/>Fluent API]
        TxWorkload[Transaction Workload]
        DAWorkload[DA Workload]
        ChaosWorkload[Chaos Workload]
        Expectations[Built-in Expectations]
    end
    
    subgraph Core["Core Framework"]
        ScenarioModel[Scenario Model]
        Traits[Deployer + Runner Traits]
        BlockFeed[BlockFeed]
        NodeClients[Node Clients]
        Topology[Topology Generation]
    end
    
    subgraph Deployers["Runner Implementations"]
        LocalDeployer[LocalDeployer]
        ComposeDeployer[ComposeDeployer]
        K8sDeployer[K8sDeployer]
    end
    
    subgraph Support["Supporting Crates"]
        Configs[Configs & Topology]
        Nodes[Node API Clients]
        Cucumber[Cucumber Extensions]
    end
    
    Examples --> Workflows
    Examples --> Deployers
    Workflows --> Core
    Deployers --> Core
    Deployers --> Support
    Core --> Support
    Workflows --> Support
    
    style Examples fill:#e1f5ff
    style Workflows fill:#e1ffe1
    style Core fill:#fff4e1
    style Deployers fill:#ffe1f5
    style Support fill:#f0f0f0

Layer Responsibilities

Runner Examples (Entry Points)

  • Executable binaries that demonstrate framework usage
  • Wire together deployers, scenarios, and execution
  • Provide CLI interfaces for different modes

Workflows (High-Level API)

  • ScenarioBuilderExt trait provides fluent DSL
  • Built-in workloads (transactions, DA, chaos)
  • Common expectations (liveness, inclusion)
  • Simplifies scenario authoring

Core Framework (Foundation)

  • Scenario model and lifecycle orchestration
  • Deployer and Runner traits (extension points)
  • BlockFeed for real-time block observation
  • RunContext providing node clients and metrics
  • Topology generation and validation

Runner Implementations

  • LocalDeployer - spawns processes on host
  • ComposeDeployer - orchestrates Docker Compose
  • K8sDeployer - deploys to Kubernetes cluster
  • Each implements Deployer trait

Supporting Crates

  • configs - Topology configuration and generation
  • nodes - HTTP/RPC client for node APIs
  • cucumber - BDD/Gherkin integration

Extension Points

flowchart LR
    Custom[Your Code] -.implements.-> Workload[Workload Trait]
    Custom -.implements.-> Expectation[Expectation Trait]
    Custom -.implements.-> Deployer[Deployer Trait]
    
    Workload --> Core[Core Framework]
    Expectation --> Core
    Deployer --> Core
    
    style Custom fill:#ffe1f5
    style Core fill:#fff4e1

Extend by implementing:

  • Workload - Custom traffic generation patterns
  • Expectation - Custom success criteria
  • Deployer - Support for new deployment targets

See Extending the Framework for details.

Components

  • Topology describes the cluster: how many nodes, their roles, and the high-level network and data-availability parameters they should follow.
  • Scenario combines that topology with the activities to run and the checks to perform, forming a single plan.
  • Deployer provisions infrastructure on the chosen backend (local processes, Docker Compose, or Kubernetes), waits for readiness, and returns a Runner.
  • Runner orchestrates scenario execution: starts workloads, observes signals, evaluates expectations, and triggers cleanup.
  • Workloads generate traffic and conditions that exercise the system.
  • Expectations observe the run and judge success or failure once activity completes.

Each layer has a narrow responsibility so that cluster shape, deployment choice, traffic generation, and health checks can evolve independently while fitting together predictably.

Entry Points

The framework is consumed via runnable example binaries in examples/src/bin/:

  • local_runner.rs — Spawns nodes as host processes
  • compose_runner.rs — Deploys via Docker Compose (requires NOMOS_TESTNET_IMAGE built)
  • k8s_runner.rs — Deploys via Kubernetes Helm (requires cluster + image)

Recommended: Use the convenience script:

scripts/run/run-examples.sh -t <duration> -v <validators> -e <executors> <mode>
# mode: host, compose, or k8s

This handles circuit setup, binary building/bundling, image building, and execution.

Alternative: Direct cargo run (requires manual setup):

POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin <name>

Important: All runners require POL_PROOF_DEV_MODE=true to avoid expensive Groth16 proof generation that causes timeouts.

These binaries use the framework API (ScenarioBuilder) to construct and execute scenarios.

Builder API

Scenarios are defined using a fluent builder pattern:

use std::time::Duration;

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

pub fn scenario_plan() -> testing_framework_core::scenario::Scenario<()> {
    ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(2))
        .wallets(50)
        .transactions_with(|txs| txs.rate(5).users(20))
        .da_with(|da| da.channel_rate(1).blob_rate(2))
        .expect_consensus_liveness()
        .with_run_duration(Duration::from_secs(90))
        .build()
}

Key API Points:

  • Topology uses .topology_with(|t| { t.validators(N).executors(M) }) closure pattern
  • Workloads are configured via _with closures (transactions_with, da_with, chaos_with)
  • Chaos workloads require .enable_node_control() and a compatible runner

Deployers

Three deployer implementations:

| Deployer | Backend | Prerequisites | Node Control |
| --- | --- | --- | --- |
| LocalDeployer | Host processes | Binaries (built on demand or via bundle) | No |
| ComposeDeployer | Docker Compose | Image with embedded assets/binaries | Yes |
| K8sDeployer | Kubernetes Helm | Cluster + image loaded | Not yet |

Compose-specific features:

  • Observability is external (set NOMOS_METRICS_QUERY_URL / NOMOS_METRICS_OTLP_INGEST_URL / NOMOS_GRAFANA_URL as needed)
  • Optional OTLP trace/metrics endpoints (NOMOS_OTLP_ENDPOINT, NOMOS_OTLP_METRICS_ENDPOINT)
  • Node control for chaos testing (restart validators/executors)

Assets and Images

Docker Image

Built via scripts/build/build_test_image.sh:

  • Embeds KZG circuit parameters and binaries from testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params
  • Includes runner scripts: run_nomos_node.sh, run_nomos_executor.sh
  • Tagged as NOMOS_TESTNET_IMAGE (default: logos-blockchain-testing:local)
  • Recommended: Use prebuilt bundle via scripts/build/build-bundle.sh --platform linux and set NOMOS_BINARIES_TAR before building image

Circuit Assets

KZG parameters required for DA workloads:

  • Host path: testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params (note repeated filename—directory contains file kzgrs_test_params)
  • Container path: /kzgrs_test_params/kzgrs_test_params (for compose/k8s)
  • Override: NOMOS_KZGRS_PARAMS_PATH=/custom/path/to/file (must point to file)
  • Fetch via: scripts/setup/setup-nomos-circuits.sh v0.3.1 /tmp/circuits or use scripts/run/run-examples.sh

Compose Stack

Templates and configs in testing-framework/runners/compose/assets/:

  • docker-compose.yml.tera — Stack template (validators, executors)
  • Cfgsync config: testing-framework/assets/stack/cfgsync.yaml
  • Monitoring assets (not deployed by the framework): testing-framework/assets/stack/monitoring/

Logging Architecture

Two separate logging pipelines:

| Component | Configuration | Output |
| --- | --- | --- |
| Runner binaries | RUST_LOG | Framework orchestration logs |
| Node processes | NOMOS_LOG_LEVEL, NOMOS_LOG_FILTER (+ NOMOS_LOG_DIR on host runner) | Consensus, DA, mempool logs |

Node logging:

  • Local runner: Writes to temporary directories by default (cleaned up). Set NOMOS_TESTS_TRACING=true + NOMOS_LOG_DIR for persistent files.
  • Compose runner: Default logs to container stdout/stderr (docker logs). To write per-node files, set tracing_settings.logger: !File in testing-framework/assets/stack/cfgsync.yaml (and mount a writable directory).
  • K8s runner: Logs to pod stdout/stderr (kubectl logs). To write per-node files, set tracing_settings.logger: !File in testing-framework/assets/stack/cfgsync.yaml (and mount a writable directory).

File naming: Per-node files use prefix nomos-node-{index} or nomos-executor-{index} (may include timestamps).

Observability

Prometheus-compatible metrics querying (optional):

  • The framework does not deploy Prometheus/Grafana.
  • Provide a Prometheus-compatible base URL (PromQL API) via NOMOS_METRICS_QUERY_URL.
  • Accessible in expectations when configured: ctx.telemetry().prometheus().map(|p| p.base_url())

Grafana dashboards (optional):

  • Dashboards live in testing-framework/assets/stack/monitoring/grafana/dashboards/ and can be imported into your Grafana.
  • If you set NOMOS_GRAFANA_URL, the deployer prints it in TESTNET_ENDPOINTS.

Node APIs:

  • HTTP endpoints per node for consensus info, network status, DA membership
  • Accessible in expectations: ctx.node_clients().validator_clients().get(0)

OTLP (optional):

  • Trace endpoint: NOMOS_OTLP_ENDPOINT=http://localhost:4317
  • Metrics endpoint: NOMOS_OTLP_METRICS_ENDPOINT=http://localhost:4318
  • Disabled by default (no noise if unset)

For detailed logging configuration, see Logging & Observability.

Testing Philosophy

This framework embodies specific principles that shape how you author and run scenarios. Understanding these principles helps you write effective tests and interpret results correctly.

Declarative over Imperative

Describe what you want to test, not how to orchestrate it:

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

pub fn declarative_over_imperative() {
    // Good: declarative
    let _plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(2).executors(1))
        .transactions_with(|txs| {
            txs.rate(5) // 5 transactions per block
        })
        .expect_consensus_liveness()
        .build();

    // Bad: imperative (framework doesn't work this way)
    // spawn_validator(); spawn_executor();
    // loop { submit_tx(); check_block(); }
}

Why it matters: The framework handles deployment, readiness, and cleanup. You focus on test intent, not infrastructure orchestration.

Protocol Time, Not Wall Time

Reason in blocks and consensus intervals, not wall-clock seconds.

Consensus defaults:

  • Slot duration: 2 seconds (NTP-synchronized, configurable via CONSENSUS_SLOT_TIME)
  • Active slot coefficient: 0.9 (90% block probability per slot, configurable via CONSENSUS_ACTIVE_SLOT_COEFF)
  • Expected rate: ~27 blocks per minute

use std::time::Duration;

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

pub fn protocol_time_not_wall_time() {
    // Good: protocol-oriented thinking
    let _plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(2).executors(1))
        .transactions_with(|txs| {
            txs.rate(5) // 5 transactions per block
        })
        .with_run_duration(Duration::from_secs(60)) // Let framework calculate expected blocks
        .expect_consensus_liveness() // "Did we produce the expected blocks?"
        .build();

    // Bad: wall-clock assumptions
    // "I expect exactly 30 blocks in 60 seconds"
    // This breaks on slow CI where slot timing might drift
}

Why it matters: Slot timing is fixed (2s by default, NTP-synchronized), so the expected number of blocks is predictable: ~27 blocks in 60s with the default 0.9 active slot coefficient. The framework calculates expected blocks from slot duration and run window, making assertions protocol-based rather than tied to specific wall-clock expectations. Assert on “blocks produced relative to slots” not “blocks produced in exact wall-clock seconds”.

Determinism First, Chaos When Needed

Default scenarios are repeatable:

  • Fixed topology
  • Predictable traffic rates
  • Deterministic checks

Chaos is opt-in:

use std::time::Duration;

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::{ChaosBuilderExt, ScenarioBuilderExt};

pub fn determinism_first() {
    // Separate: functional test (deterministic)
    let _plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(2).executors(1))
        .transactions_with(|txs| {
            txs.rate(5) // 5 transactions per block
        })
        .expect_consensus_liveness()
        .build();

    // Separate: chaos test (introduces randomness)
    let _chaos_plan =
        ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(2))
            .enable_node_control()
            .chaos_with(|c| {
                c.restart()
                    .min_delay(Duration::from_secs(30))
                    .max_delay(Duration::from_secs(60))
                    .target_cooldown(Duration::from_secs(45))
                    .apply()
            })
            .transactions_with(|txs| {
                txs.rate(5) // 5 transactions per block
            })
            .expect_consensus_liveness()
            .build();
}

Why it matters: Mixing determinism with chaos creates noisy, hard-to-debug failures. Separate concerns make failures actionable.

Observable Health Signals

Prefer user-facing signals over internal state:

Good checks:

  • Blocks progressing at expected rate (liveness)
  • Transactions included within N blocks (inclusion)
  • DA blobs retrievable (availability)

Avoid internal checks:

  • Memory pool size
  • Internal service state
  • Cache hit rates

Why it matters: User-facing signals reflect actual system health. Internal state can be “healthy” while the system is broken from a user perspective.

Minimum Run Windows

Always run long enough for meaningful block production:

use std::time::Duration;

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

pub fn minimum_run_windows() {
    // Bad: too short (~2 blocks with default 2s slots, 0.9 coeff)
    let _too_short = ScenarioBuilder::with_node_counts(1, 0)
        .with_run_duration(Duration::from_secs(5))
        .expect_consensus_liveness()
        .build();

    // Good: enough blocks for assertions (~27 blocks with default 2s slots, 0.9
    // coeff)
    let _good = ScenarioBuilder::with_node_counts(1, 0)
        .with_run_duration(Duration::from_secs(60))
        .expect_consensus_liveness()
        .build();
}

Note: Block counts assume default consensus parameters:

  • Slot duration: 2 seconds (configurable via CONSENSUS_SLOT_TIME)
  • Active slot coefficient: 0.9 (90% block probability per slot, configurable via CONSENSUS_ACTIVE_SLOT_COEFF)
  • Formula: blocks ≈ (duration / slot_duration) × active_slot_coeff

If upstream changes these parameters, adjust your duration expectations accordingly.

The framework enforces minimum durations (at least 2× slot duration), but be explicit. Very short runs risk false confidence—one lucky block doesn’t prove liveness.
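
As a back-of-the-envelope check, the formula above can be computed directly. The helper below is purely illustrative (it is not a framework API) and assumes the default parameters stated above:

/// Illustrative helper (not a framework API): expected block count for a run
/// window, given slot duration and active slot coefficient.
fn expected_blocks(run_secs: u64, slot_secs: u64, active_slot_coeff: f64) -> f64 {
    (run_secs as f64 / slot_secs as f64) * active_slot_coeff
}

fn main() {
    // Defaults: 2-second slots, 0.9 coefficient -> ~27 blocks per 60 seconds.
    let blocks = expected_blocks(60, 2, 0.9);
    assert!((blocks - 27.0).abs() < 1e-9);
    println!("expected blocks in 60s: {blocks}");
}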

Summary

These principles keep scenarios:

  • Portable across environments (protocol time, declarative)
  • Debuggable (determinism, separation of concerns)
  • Meaningful (observable signals, sufficient duration)

When authoring scenarios, ask: “Does this test the protocol behavior or my local environment quirks?”

Scenario Lifecycle

A scenario progresses through six distinct phases, each with a specific responsibility:

flowchart TB
    subgraph Phase1["1. Build Phase"]
        Build[Define Scenario]
        BuildDetails["• Declare topology<br/>• Attach workloads<br/>• Add expectations<br/>• Set run duration"]
        Build --> BuildDetails
    end
    
    subgraph Phase2["2. Deploy Phase"]
        Deploy[Provision Environment]
        DeployDetails["• Launch nodes<br/>• Wait for readiness<br/>• Establish connectivity<br/>• Return Runner"]
        Deploy --> DeployDetails
    end
    
    subgraph Phase3["3. Capture Phase"]
        Capture[Baseline Metrics]
        CaptureDetails["• Snapshot initial state<br/>• Start BlockFeed<br/>• Initialize expectations"]
        Capture --> CaptureDetails
    end
    
    subgraph Phase4["4. Execution Phase"]
        Execute[Drive Workloads]
        ExecuteDetails["• Submit transactions<br/>• Disperse DA blobs<br/>• Trigger chaos events<br/>• Run for duration"]
        Execute --> ExecuteDetails
    end
    
    subgraph Phase5["5. Evaluation Phase"]
        Evaluate[Check Expectations]
        EvaluateDetails["• Verify liveness<br/>• Check inclusion<br/>• Validate outcomes<br/>• Aggregate results"]
        Evaluate --> EvaluateDetails
    end
    
    subgraph Phase6["6. Cleanup Phase"]
        Cleanup[Teardown]
        CleanupDetails["• Stop nodes<br/>• Remove containers<br/>• Collect logs<br/>• Release resources"]
        Cleanup --> CleanupDetails
    end
    
    Phase1 --> Phase2
    Phase2 --> Phase3
    Phase3 --> Phase4
    Phase4 --> Phase5
    Phase5 --> Phase6
    
    style Phase1 fill:#e1f5ff
    style Phase2 fill:#fff4e1
    style Phase3 fill:#f0ffe1
    style Phase4 fill:#ffe1f5
    style Phase5 fill:#e1ffe1
    style Phase6 fill:#ffe1e1

Phase Details

1. Build the Plan

Declare a topology, attach workloads and expectations, and set the run window. The plan is the single source of truth for what will happen.

Key actions:

  • Define cluster shape (validators, executors, network topology)
  • Configure workloads (transaction rate, DA traffic, chaos patterns)
  • Attach expectations (liveness, inclusion, custom checks)
  • Set timing parameters (run duration, cooldown period)

Output: Immutable Scenario plan

2. Deploy

Hand the plan to a deployer. It provisions the environment on the chosen backend, waits for nodes to signal readiness, and returns a runner.

Key actions:

  • Provision infrastructure (processes, containers, or pods)
  • Launch validator and executor nodes
  • Wait for readiness probes (HTTP endpoints respond)
  • Establish node connectivity and metrics endpoints
  • Spawn BlockFeed for real-time block observation

Output: Runner + RunContext (with node clients, metrics, control handles)

3. Capture Baseline

Expectations snapshot initial state before workloads begin.

Key actions:

  • Record starting block height
  • Initialize counters and trackers
  • Subscribe to BlockFeed
  • Capture baseline metrics

Output: Captured state for later comparison

4. Drive Workloads

The runner starts traffic and behaviors for the planned duration.

Key actions:

  • Submit transactions at configured rates
  • Disperse and sample DA blobs
  • Trigger chaos events (node restarts)
  • Run concurrently for the specified duration
  • Observe blocks and metrics in real-time

Note: Network partitions/peer blocking are not yet supported by node control; today chaos is restart-based. See RunContext: BlockFeed & Node Control.

Duration: Controlled by with_run_duration()

5. Evaluate Expectations

Once activity stops (and optional cooldown completes), the runner checks liveness and workload-specific outcomes.

Key actions:

  • Verify consensus liveness (minimum block production)
  • Check transaction inclusion rates
  • Validate DA dispersal and sampling
  • Assess system recovery after chaos events
  • Aggregate pass/fail results

Output: Success or detailed failure report

6. Cleanup

Tear down resources so successive runs start fresh and do not inherit leaked state.

Key actions:

  • Stop all node processes/containers/pods
  • Remove temporary directories and volumes
  • Collect and archive logs (if NOMOS_TESTS_KEEP_LOGS=1)
  • Release ports and network resources
  • Cleanup observability stack (if spawned)

Guarantee: Runs even on panic via CleanupGuard

Design Rationale

  • Modular crates keep configuration, orchestration, workloads, and runners decoupled so each can evolve without breaking the others.
  • Pluggable runners let the same scenario run on a laptop, a Docker host, or a Kubernetes cluster, making validation portable across environments.
  • Separated workloads and expectations clarify intent: what traffic to generate versus how to judge success. This simplifies review and reuse.
  • Declarative topology makes cluster shape explicit and repeatable, reducing surprise when moving between CI and developer machines.
  • Maintainability through predictability: a clear flow from plan to deployment to verification lowers the cost of extending the framework and interpreting failures.

Part II — User Guide

Practical guidance for shaping scenarios, combining workloads and expectations, and running them across different environments.

Workspace Layout

The workspace focuses on multi-node integration testing and sits alongside a nomos-node checkout. Its crates separate concerns to keep scenarios repeatable and portable:

  • Configs: prepares high-level node, network, tracing, and wallet settings used across test environments.
  • Core scenario orchestration: the engine that holds topology descriptions, scenario plans, runtimes, workloads, and expectations.
  • Workflows: ready-made workloads (transactions, data-availability, chaos) and reusable expectations assembled into a user-facing DSL.
  • Runners: deployment backends for local processes, Docker Compose, and Kubernetes, all consuming the same scenario plan.
  • Runner Examples (crate name: runner-examples, path: examples/): runnable binaries (examples/src/bin/local_runner.rs, examples/src/bin/compose_runner.rs, examples/src/bin/k8s_runner.rs) that demonstrate complete scenario execution with each deployer.

This split keeps configuration, orchestration, reusable traffic patterns, and deployment adapters loosely coupled while sharing one mental model for tests.

Annotated Tree

Directory structure with key paths annotated:

logos-blockchain-testing/
├─ testing-framework/           # Core library crates
│  ├─ configs/                  # Node config builders, topology generation, tracing/logging config
│  ├─ core/                     # Scenario model (ScenarioBuilder), runtime (Runner, Deployer), topology, node spawning
│  ├─ workflows/                # Workloads (transactions, DA, chaos), expectations (liveness), builder DSL extensions
│  ├─ runners/                  # Deployment backends
│  │  ├─ local/                 # LocalDeployer (spawns local processes)
│  │  ├─ compose/               # ComposeDeployer (Docker Compose + Prometheus)
│  │  └─ k8s/                   # K8sDeployer (Kubernetes Helm)
│  └─ assets/                   # Docker/K8s stack assets
│     └─ stack/
│        ├─ kzgrs_test_params/  # KZG circuit parameters directory
│        │  └─ kzgrs_test_params  # Actual proving key file (note repeated name)
│        ├─ monitoring/         # Prometheus config
│        ├─ scripts/            # Container entrypoints
│        └─ cfgsync.yaml        # Config sync server template
│
├─ examples/                    # PRIMARY ENTRY POINT: runnable binaries
│  └─ src/bin/
│     ├─ local_runner.rs        # Host processes demo (LocalDeployer)
│     ├─ compose_runner.rs      # Docker Compose demo (ComposeDeployer)
│     └─ k8s_runner.rs          # Kubernetes demo (K8sDeployer)
│
├─ scripts/                     # Helper utilities
│  ├─ run-examples.sh           # Convenience script (handles setup + runs examples)
│  ├─ build-bundle.sh           # Build prebuilt binaries+circuits bundle
│  ├─ setup-circuits-stack.sh  # Fetch KZG parameters (Linux + host)
│  └─ setup-nomos-circuits.sh  # Legacy circuit fetcher
│
└─ book/                        # This documentation (mdBook)

Key Directories Explained

testing-framework/

Core library crates providing the testing API.

| Crate | Purpose | Key Exports |
| --- | --- | --- |
| configs | Node configuration builders | Topology generation, tracing config |
| core | Scenario model & runtime | ScenarioBuilder, Deployer, Runner |
| workflows | Workloads & expectations | ScenarioBuilderExt, ChaosBuilderExt |
| runners/local | Local process deployer | LocalDeployer |
| runners/compose | Docker Compose deployer | ComposeDeployer |
| runners/k8s | Kubernetes deployer | K8sDeployer |

testing-framework/assets/stack/

Docker/K8s deployment assets:

  • kzgrs_test_params/kzgrs_test_params: Circuit parameters file (note repeated name; override via NOMOS_KZGRS_PARAMS_PATH)
  • monitoring/: Prometheus config
  • scripts/: Container entrypoints

scripts/

Convenience utilities:

  • run-examples.sh: All-in-one script for host/compose/k8s modes (recommended)
  • build-bundle.sh: Create prebuilt binaries+circuits bundle for compose/k8s
  • build_test_image.sh: Build the compose/k8s Docker image (bakes in assets)
  • setup-circuits-stack.sh: Fetch KZG parameters for both Linux and host
  • cfgsync.yaml: Configuration sync server template

examples/ (Start Here!)

Runnable binaries demonstrating framework usage:

  • local_runner.rs — Local processes
  • compose_runner.rs — Docker Compose (requires NOMOS_TESTNET_IMAGE built)
  • k8s_runner.rs — Kubernetes (requires cluster + image)

Run with: POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin <name>

All runners require POL_PROOF_DEV_MODE=true to avoid expensive proof generation.

scripts/

Helper utilities:

  • setup-nomos-circuits.sh: Fetch KZG parameters from releases

Observability

Compose runner includes:

  • Prometheus at http://localhost:9090 (metrics scraping)
  • Node metrics exposed per validator/executor
  • Access in expectations: ctx.telemetry().prometheus().map(|p| p.base_url())

Logging controlled by:

  • NOMOS_LOG_DIR — Write per-node log files
  • NOMOS_LOG_LEVEL — Global log level (error/warn/info/debug/trace)
  • NOMOS_LOG_FILTER — Target-specific filtering (e.g., cryptarchia=trace,nomos_da_sampling=debug)
  • NOMOS_TESTS_TRACING — Enable file logging for local runner

See Logging & Observability for details.

| To Do This | Go Here |
| --- | --- |
| Run an example | examples/src/bin/ → cargo run -p runner-examples --bin <name> |
| Write a custom scenario | testing-framework/core/ → Implement using ScenarioBuilder |
| Add a new workload | testing-framework/workflows/src/workloads/ → Implement Workload trait |
| Add a new expectation | testing-framework/workflows/src/expectations/ → Implement Expectation trait |
| Modify node configs | testing-framework/configs/src/topology/configs/ |
| Extend builder DSL | testing-framework/workflows/src/builder/ → Add trait methods |
| Add a new deployer | testing-framework/runners/ → Implement Deployer trait |

For detailed guidance, see Internal Crate Reference.

Authoring Scenarios

Creating a scenario is a declarative exercise. This page walks you through the core authoring loop with concrete examples, explains the units and timing model, and shows how to structure scenarios in Rust test suites.


The Core Authoring Loop

Every scenario follows the same pattern:

flowchart LR
    A[1. Topology] --> B[2. Workloads]
    B --> C[3. Expectations]
    C --> D[4. Duration]
    D --> E[5. Deploy & Run]

  1. Shape the topology — How many nodes, what roles, what network shape
  2. Attach workloads — What traffic to generate (transactions, blobs, chaos)
  3. Define expectations — What success looks like (liveness, inclusion, recovery)
  4. Set duration — How long to run the experiment
  5. Choose a runner — Where to execute (local, compose, k8s)

Hello Scenario: Your First Test

Let’s build a minimal consensus liveness test step-by-step.

Step 1: Shape the Topology

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

let scenario = ScenarioBuilder::topology_with(|t| {
    t.network_star()      // Star network (one gateway + nodes)
        .validators(3)     // 3 validator nodes
        .executors(1)      // 1 executor node
})

What goes in topology?

  • Node counts (validators, executors)
  • Network shape (network_star() is currently the only built-in layout)
  • Role split (validators vs. executors)

What does NOT go in topology?

  • Traffic rates (that’s workloads)
  • Success criteria (that’s expectations)
  • Runtime configuration (that’s duration/runner)

Step 2: Attach Workloads

.wallets(20) // Seed funded wallet accounts for transaction workloads
.transactions_with(|tx| {
    tx.rate(10)    // 10 transactions per block
      .users(5)    // distributed across 5 wallets
})

What goes in workloads?

  • Transaction traffic (rate, users)
  • DA traffic (channels, blobs)
  • Chaos injection (restarts, delays)

Units explained:

  • .rate(10) = 10 transactions per block (not per second!)
  • .users(5) = use 5 distinct wallet accounts
  • The framework adapts to block time automatically

Step 3: Define Expectations

.expect_consensus_liveness()

What goes in expectations?

  • Health checks that run after the scenario completes
  • Liveness (blocks produced)
  • Inclusion (workload activity landed on-chain)
  • Recovery (system survived chaos)

When do expectations run? After the duration window ends, during the evaluation phase of the scenario lifecycle.

Step 4: Set Duration

use std::time::Duration;

.with_run_duration(Duration::from_secs(60))

How long is enough?

  • Minimum: 2× the expected block time × number of blocks you want
  • For consensus liveness: 30-60 seconds
  • For transaction inclusion: 60-120 seconds
  • For chaos recovery: 2-5 minutes

What happens during this window?

  • Nodes are running
  • Workloads generate traffic
  • Metrics/logs are collected
  • BlockFeed broadcasts observations in real-time

Step 5: Build and Deploy

.build();

// Choose a runner
use testing_framework_core::scenario::Deployer;
use testing_framework_runner_local::LocalDeployer;

let deployer = LocalDeployer::default();
let runner = deployer.deploy(&scenario).await?;
let _result = runner.run(&mut scenario).await?;

Complete “Hello Scenario”

Putting it all together:

use std::time::Duration;

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_local::LocalDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

#[tokio::test]
async fn hello_consensus_liveness() -> Result<()> {
    let mut scenario = ScenarioBuilder::topology_with(|t| {
        t.network_star()
            .validators(3)
            .executors(1)
    })
    .wallets(20)
    .transactions_with(|tx| tx.rate(10).users(5))
    .expect_consensus_liveness()
    .with_run_duration(Duration::from_secs(60))
    .build();

    let deployer = LocalDeployer::default();
    let runner = deployer.deploy(&scenario).await?;
    runner.run(&mut scenario).await?;

    Ok(())
}

Run it:

POL_PROOF_DEV_MODE=true cargo test hello_consensus_liveness

Understanding Units & Timing

Transaction Rate: Per-Block, Not Per-Second

Wrong mental model: .rate(10) = 10 tx/second

Correct mental model: .rate(10) = 10 tx/block

Why? The blockchain produces blocks at variable rates depending on consensus timing. The framework submits the configured rate per block to ensure predictable load regardless of block time.

Example:

  • Block time = 2 seconds
  • .rate(10) → 10 tx/block → 5 tx/second average
  • Block time = 5 seconds
  • .rate(10) → 10 tx/block → 2 tx/second average
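
The conversion in the example above is straightforward arithmetic; the sketch below is illustrative only (the helper is not part of the framework):

/// Illustrative helper (not a framework API): average transactions per second
/// implied by a per-block rate and an observed block time.
fn avg_tx_per_second(rate_per_block: u64, block_time_secs: f64) -> f64 {
    rate_per_block as f64 / block_time_secs
}

fn main() {
    // .rate(10) with 2-second blocks -> 5 tx/second on average.
    assert_eq!(avg_tx_per_second(10, 2.0), 5.0);
    // .rate(10) with 5-second blocks -> 2 tx/second on average.
    assert_eq!(avg_tx_per_second(10, 5.0), 2.0);
}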

Duration: Wall-Clock Time

.with_run_duration(Duration::from_secs(60)) means the scenario runs for 60 seconds of real time, not 60 blocks.

How many blocks will be produced? That depends on consensus timing (slot time, active slot coefficient). With the defaults used throughout this book (2-second slots, 0.9 active slot coefficient):

Rule of thumb:

  • 60 seconds → ~27 blocks
  • 120 seconds → ~54 blocks

Structuring Scenarios in a Test Suite

Pattern 1: Integration Test Module

// tests/integration_test.rs
use std::time::Duration;

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_local::LocalDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

#[tokio::test]
async fn test_consensus_liveness() -> Result<()> {
    let mut scenario = ScenarioBuilder::topology_with(|t| {
        t.network_star().validators(3).executors(1)
    })
    .expect_consensus_liveness()
    .with_run_duration(Duration::from_secs(30))
    .build();

    let deployer = LocalDeployer::default();
    let runner = deployer.deploy(&scenario).await?;
    runner.run(&mut scenario).await?;
    Ok(())
}

#[tokio::test]
async fn test_transaction_inclusion() -> Result<()> {
    let mut scenario = ScenarioBuilder::topology_with(|t| {
        t.network_star().validators(2).executors(1)
    })
    .wallets(10)
    .transactions_with(|tx| tx.rate(5).users(5))
    .expect_consensus_liveness()
    .with_run_duration(Duration::from_secs(60))
    .build();

    let deployer = LocalDeployer::default();
    let runner = deployer.deploy(&scenario).await?;
    runner.run(&mut scenario).await?;
    Ok(())
}

Pattern 2: Shared Scenario Builders

Extract common topology patterns:

// tests/helpers.rs
use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

pub fn minimal_topology() -> testing_framework_core::scenario::Builder<()> {
    ScenarioBuilder::topology_with(|t| {
        t.network_star().validators(2).executors(1)
    })
}

pub fn production_like_topology() -> testing_framework_core::scenario::Builder<()> {
    ScenarioBuilder::topology_with(|t| {
        t.network_star().validators(7).executors(3)
    })
}

// tests/consensus_tests.rs
mod helpers;

use std::time::Duration;

use helpers::*;
use testing_framework_workflows::ScenarioBuilderExt;

#[tokio::test]
async fn small_cluster_liveness() -> anyhow::Result<()> {
    let mut scenario = minimal_topology()
        .expect_consensus_liveness()
        .with_run_duration(Duration::from_secs(30))
        .build();
    // ... deploy and run
    Ok(())
}

#[tokio::test]
async fn large_cluster_liveness() -> anyhow::Result<()> {
    let mut scenario = production_like_topology()
        .expect_consensus_liveness()
        .with_run_duration(Duration::from_secs(60))
        .build();
    // ... deploy and run
    Ok(())
}

Pattern 3: Parameterized Scenarios

Test the same behavior across different scales:

use std::time::Duration;

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_local::LocalDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

async fn test_liveness_with_topology(validators: usize, executors: usize) -> Result<()> {
    let mut scenario = ScenarioBuilder::topology_with(|t| {
        t.network_star()
            .validators(validators)
            .executors(executors)
    })
    .expect_consensus_liveness()
    .with_run_duration(Duration::from_secs(60))
    .build();

    let deployer = LocalDeployer::default();
    let runner = deployer.deploy(&scenario).await?;
    runner.run(&mut scenario).await?;
    Ok(())
}

#[tokio::test]
async fn liveness_small() -> Result<()> {
    test_liveness_with_topology(2, 1).await
}

#[tokio::test]
async fn liveness_medium() -> Result<()> {
    test_liveness_with_topology(5, 2).await
}

#[tokio::test]
async fn liveness_large() -> Result<()> {
    test_liveness_with_topology(10, 3).await
}

What Belongs Where?

Topology

Do include:

  • Node counts (.validators(3), .executors(1))
  • Network shape (.network_star())
  • Role split (validators vs. executors)

Don’t include:

  • Traffic rates (workload concern)
  • Expected outcomes (expectation concern)
  • Runtime behavior (runner/duration concern)

Workloads

Do include:

  • Transaction traffic (.transactions_with(|tx| ...))
  • DA traffic (.da_with(|da| ...))
  • Chaos injection (.with_workload(RandomRestartWorkload::new(...)))
  • Rates, users, timing

Don’t include:

  • Node configuration (topology concern)
  • Success criteria (expectation concern)

Expectations

Do include:

  • Health checks (.expect_consensus_liveness())
  • Inclusion verification (built-in to workloads)
  • Custom assertions (.with_expectation(MyExpectation::new()))

Don’t include:

  • Traffic generation (workload concern)
  • Cluster shape (topology concern)

Best Practices

  1. Keep scenarios focused: One scenario = one behavior under test
  2. Start small: 2-3 validators, 1 executor, 30-60 seconds
  3. Use descriptive names: test_consensus_survives_validator_restart not test_1
  4. Extract common patterns: Shared topology builders, helper functions
  5. Document intent: Add comments explaining what you’re testing and why
  6. Mind the units: .rate(N) is per-block, .with_run_duration() is wall-clock
  7. Set realistic durations: Allow enough time for multiple blocks + workload effects

Next Steps

Core Content: Workloads & Expectations

Workloads describe the activity a scenario generates; expectations describe the signals that must hold when that activity completes. This page is the canonical reference for all built-in workloads and expectations, including configuration knobs, defaults, prerequisites, and debugging guidance.


Overview

flowchart TD
    I[Inputs<br/>topology + wallets + rates] --> Init[Workload init]
    Init --> Drive[Drive traffic]
    Drive --> Collect[Collect signals]
    Collect --> Eval[Expectations evaluate]

Key concepts:

  • Workloads run during the execution phase (generate traffic)
  • Expectations run during the evaluation phase (check health signals)
  • Each workload can attach its own expectations automatically
  • Expectations can also be added explicitly

Built-in Workloads

1. Transaction Workload

Submits user-level transactions at a configurable rate to exercise transaction processing and inclusion paths.

Import:

use testing_framework_workflows::workloads::transaction::Workload;

Configuration

| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| rate | u64 | Required | Transactions per block (not per second!) |
| users | Option<usize> | All wallets | Number of distinct wallet accounts to use |

DSL Usage

use testing_framework_workflows::ScenarioBuilderExt;

ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(1))
    .wallets(20)  // Seed 20 wallet accounts
    .transactions_with(|tx| {
        tx.rate(10)   // 10 transactions per block
          .users(5)   // Use only 5 of the 20 wallets
    })
    .with_run_duration(Duration::from_secs(60))
    .build();

Direct Instantiation

use testing_framework_workflows::workloads::transaction;

let tx_workload = transaction::Workload::with_rate(10)
    .expect("transaction rate must be non-zero");

ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(1))
    .wallets(20)
    .with_workload(tx_workload)
    .with_run_duration(Duration::from_secs(60))
    .build();

Prerequisites

  1. Wallet accounts must be seeded:

    .wallets(N)  // Before .transactions_with()

    The workload will fail during init() if no wallets are configured.

  2. Proof generation must be fast:

    export POL_PROOF_DEV_MODE=true
    

    Without this, proof generation takes ~30-60 seconds per transaction, causing timeouts.

  3. Circuit artifacts must be available:

    • Automatically staged by scripts/run/run-examples.sh
    • Or manually via scripts/setup/setup-circuits-stack.sh (recommended) / scripts/setup/setup-nomos-circuits.sh

Attached Expectation

TxInclusionExpectation — Verifies that submitted transactions were included in blocks.

What it checks:

  • At least N transactions were included on-chain (where N = rate × expected block count)
  • Uses BlockFeed to count transactions across all observed blocks

Failure modes:

  • “Expected >= X transactions, observed Y” (Y < X)
  • Common causes: proof generation timeouts, node crashes, insufficient duration

What Failure Looks Like

Error: Expectation failed: TxInclusionExpectation
  Expected: >= 600 transactions (10 tx/block × 60 blocks)
  Observed: 127 transactions
  
  Possible causes:
  - POL_PROOF_DEV_MODE not set (proof generation too slow)
  - Duration too short (nodes still syncing)
  - Node crashes (check logs for panics/OOM)
  - Wallet accounts not seeded (check topology config)

How to debug:

  1. Check logs for proof generation timing:
    grep "proof generation" $NOMOS_LOG_DIR/executor-0/*.log
    
  2. Verify POL_PROOF_DEV_MODE=true was set
  3. Increase duration: .with_run_duration(Duration::from_secs(120))
  4. Reduce rate: .rate(5) instead of .rate(10)

2. Data Availability (DA) Workload

Drives blob and channel activity to exercise data availability paths and storage.

Import:

use testing_framework_workflows::workloads::da::Workload;

Configuration

| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| blob_rate_per_block | NonZeroU64 | Required | Blobs to publish per block |
| channel_rate_per_block | NonZeroU64 | Required | Channels to create per block |
| headroom_percent | u64 | 20 | Extra capacity for channel planning (avoids saturation) |

DSL Usage

use testing_framework_workflows::ScenarioBuilderExt;

ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(2))
    .da_with(|da| {
        da.channel_rate(2)  // 2 channels per block
          .blob_rate(4)     // 4 blobs per block
    })
    .with_run_duration(Duration::from_secs(120))
    .build();

Direct Instantiation

use std::num::NonZeroU64;
use testing_framework_workflows::workloads::da;

let da_workload = da::Workload::with_rate(
    NonZeroU64::new(4).unwrap(),   // blob_rate_per_block
    NonZeroU64::new(2).unwrap(),   // channel_rate_per_block
    20,                            // headroom_percent
);

ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(2))
    .with_workload(da_workload)
    .with_run_duration(Duration::from_secs(120))
    .build();

Prerequisites

  1. Executors must be present:

    .executors(N)  // At least 1 executor

    DA workload requires executor nodes to handle blob publishing.

  2. Sufficient duration: Channel creation and blob publishing are slower than transaction submission. Allow 120+ seconds.

  3. Circuit artifacts: Same as transaction workload (POL_PROOF_DEV_MODE, circuits staged).

Attached Expectation

DaWorkloadExpectation — Verifies blobs and channels were created and published.

What it checks:

  • At least N channels were created (where N = channel_rate × expected blocks)
  • At least M blobs were published (where M = blob_rate × expected blocks × headroom)
  • Uses BlockFeed and executor API to verify

Failure modes:

  • “Expected >= X channels, observed Y” (Y < X)
  • “Expected >= X blobs, observed Y” (Y < X)
  • Common causes: executor crashes, insufficient duration, DA saturation

What Failure Looks Like

Error: Expectation failed: DaWorkloadExpectation
  Expected: >= 60 channels (2 channels/block × 30 blocks)
  Observed: 23 channels
  
  Possible causes:
  - Executors crashed or restarted (check executor logs)
  - Duration too short (channels still being created)
  - Blob publishing failed (check executor API errors)
  - Network issues (check validator/executor connectivity)

How to debug:

  1. Check executor logs:
    grep "channel\|blob" $NOMOS_LOG_DIR/executor-0/*.log
    
  2. Verify executors stayed running:
    grep "panic\|killed" $NOMOS_LOG_DIR/executor-*/*.log
    
  3. Increase duration: .with_run_duration(Duration::from_secs(180))
  4. Reduce rates: .channel_rate(1).blob_rate(2)

3. Chaos Workload (Random Restart)

Triggers controlled node restarts to test resilience and recovery behaviors.

Import:

use testing_framework_workflows::workloads::chaos::RandomRestartWorkload;

Configuration

| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| min_delay | Duration | Required | Minimum time between restart attempts |
| max_delay | Duration | Required | Maximum time between restart attempts |
| target_cooldown | Duration | Required | Minimum time before restarting the same node again |
| include_validators | bool | Required | Whether to restart validators |
| include_executors | bool | Required | Whether to restart executors |

Usage

use std::time::Duration;

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::{ScenarioBuilderExt, workloads::chaos::RandomRestartWorkload};

let scenario = ScenarioBuilder::topology_with(|t| {
    t.network_star().validators(3).executors(2)
})
.enable_node_control()  // REQUIRED for chaos
.with_workload(RandomRestartWorkload::new(
    Duration::from_secs(45),   // min_delay
    Duration::from_secs(75),   // max_delay
    Duration::from_secs(120),  // target_cooldown
    true,                      // include_validators
    true,                      // include_executors
))
.expect_consensus_liveness()
.with_run_duration(Duration::from_secs(180))
.build();

Prerequisites

  1. Node control must be enabled:

    .enable_node_control()

    This adds NodeControlCapability to the scenario.

  2. Runner must support node control:

    • Compose runner: Supported
    • Local runner: Not supported
    • K8s runner: Not yet implemented
  3. Sufficient topology:

    • For validators: Need >1 validator (workload skips if only 1)
    • For executors: Can restart all executors
  4. Realistic timing:

    • Total duration should be 2-3× the max_delay + cooldown
    • Example: max_delay=75s, cooldown=120s → duration >= 180s

Attached Expectation

None. You must explicitly add expectations (typically .expect_consensus_liveness()).

Why? Chaos workloads are about testing recovery under disruption. The appropriate expectation depends on what you’re testing:

  • Consensus survives restarts → .expect_consensus_liveness()
  • Height converges after chaos → Custom expectation checking BlockFeed

What Failure Looks Like

Error: Workload failed: chaos_restart
  Cause: NodeControlHandle not available
  
  Possible causes:
  - Forgot .enable_node_control() in scenario builder
  - Using local runner (doesn't support node control)
  - Using k8s runner (doesn't support node control)

Or:

Error: Expectation failed: ConsensusLiveness
  Expected: >= 20 blocks
  Observed: 8 blocks
  
  Possible causes:
  - Restart frequency too high (nodes can't recover)
  - Consensus timing too slow (increase duration)
  - Too many validators restarted simultaneously
  - Nodes crashed after restart (check logs)

How to debug:

  1. Check restart events in logs:
    grep "restarting\|restart complete" $NOMOS_LOG_DIR/*/*.log
    
  2. Verify node control is enabled:
    grep "NodeControlHandle" $NOMOS_LOG_DIR/*/*.log
    
  3. Increase cooldown: Duration::from_secs(180)
  4. Reduce restart scope: include_validators = false (test executors only)
  5. Increase duration: .with_run_duration(Duration::from_secs(300))

Built-in Expectations

1. Consensus Liveness

Verifies the system continues to produce blocks during the execution window.

Import:

use testing_framework_workflows::ScenarioBuilderExt;

DSL Usage

ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(1))
    .expect_consensus_liveness()
    .with_run_duration(Duration::from_secs(60))
    .build();

What It Checks

  • At least N blocks were produced (where N = duration / expected_block_time)
  • Uses BlockFeed to count observed blocks
  • Compares against a minimum threshold (typically 50% of theoretical max)
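
To make the threshold concrete, a back-of-envelope calculation. The 1-second block time and the 50% factor below are assumptions for illustration, taken from the description above rather than exact framework parameters:

// Rough liveness threshold: half of the theoretical maximum block count.
fn min_expected_blocks(run_secs: u64, assumed_block_time_secs: u64) -> u64 {
    (run_secs / assumed_block_time_secs) / 2
}

// e.g. a 60 s run with ~1 s blocks -> expect at least 30 blocks
// min_expected_blocks(60, 1) == 30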

Failure Modes

Error: Expectation failed: ConsensusLiveness
  Expected: >= 30 blocks
  Observed: 3 blocks
  
  Possible causes:
  - Nodes crashed or never started (check logs)
  - Consensus timing misconfigured (CONSENSUS_SLOT_TIME too high)
  - Insufficient validators (need >= 2 for BFT consensus)
  - Duration too short (nodes still syncing)

How to Debug

  1. Check if nodes started:
    grep "node started\|listening on" $NOMOS_LOG_DIR/*/*.log
    
  2. Check block production:
    grep "block.*height" $NOMOS_LOG_DIR/validator-*/*.log
    
  3. Check consensus participation:
    grep "consensus.*slot\|proposal" $NOMOS_LOG_DIR/validator-*/*.log
    
  4. Increase duration: .with_run_duration(Duration::from_secs(120))
  5. Check env vars: echo $CONSENSUS_SLOT_TIME $CONSENSUS_ACTIVE_SLOT_COEFF

2. Workload-Specific Expectations

Each workload automatically attaches its own expectation:

| Workload | Expectation | What It Checks |
|----------|-------------|----------------|
| Transaction | TxInclusionExpectation | Transactions were included in blocks |
| DA | DaWorkloadExpectation | Blobs and channels were created/published |
| Chaos | (None) | Add .expect_consensus_liveness() explicitly |

These expectations are added automatically when using the DSL (.transactions_with(), .da_with()).


Configuration Quick Reference

Transaction Workload

.wallets(20)
.transactions_with(|tx| tx.rate(10).users(5))

| What | Value | Unit |
|------|-------|------|
| Rate | 10 | tx/block |
| Users | 5 | wallet accounts |
| Wallets | 20 | total seeded |

DA Workload

.da_with(|da| da.channel_rate(2).blob_rate(4))

| What | Value | Unit |
|------|-------|------|
| Channel rate | 2 | channels/block |
| Blob rate | 4 | blobs/block |
| Headroom | 20 | percent |

Chaos Workload

.enable_node_control()
.with_workload(RandomRestartWorkload::new(
    Duration::from_secs(45),   // min
    Duration::from_secs(75),   // max
    Duration::from_secs(120),  // cooldown
    true,  // validators
    true,  // executors
))

Common Patterns

Pattern 1: Multiple Workloads

ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(2))
    .wallets(20)
    .transactions_with(|tx| tx.rate(5).users(10))
    .da_with(|da| da.channel_rate(2).blob_rate(2))
    .expect_consensus_liveness()
    .with_run_duration(Duration::from_secs(120))
    .build();

All workloads run concurrently. Expectations for each workload run after the execution window ends.

Pattern 2: Custom Expectation

use async_trait::async_trait;
use testing_framework_core::scenario::{DynError, Expectation, RunContext};

struct MyCustomExpectation;

#[async_trait]
impl Expectation for MyCustomExpectation {
    async fn evaluate(&self, ctx: &RunContext) -> Result<(), DynError> {
        // Access BlockFeed, metrics, topology, etc.
        let block_count = ctx.block_feed()?.count();
        if block_count < 10 {
            return Err("Not enough blocks".into());
        }
        Ok(())
    }
}

ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(1))
    .with_expectation(MyCustomExpectation)
    .with_run_duration(Duration::from_secs(60))
    .build();

Debugging Checklist

When a workload or expectation fails:

  1. Check logs: $NOMOS_LOG_DIR/*/ or docker compose logs or kubectl logs
  2. Verify environment variables: POL_PROOF_DEV_MODE, NOMOS_NODE_BIN, etc.
  3. Check prerequisites: wallets, executors, node control, circuits
  4. Increase duration: Double the run duration and retry
  5. Reduce rates: Half the traffic rates and retry
  6. Check metrics: Prometheus queries for block height, tx count, DA stats
  7. Reproduce locally: Use local runner for faster iteration

Core Content: ScenarioBuilderExt Patterns

When should I read this? After writing 2-3 scenarios. This page documents patterns that emerge from real usage—come back when you’re refactoring or standardizing your test suite.

Patterns that keep scenarios readable and reusable:

  • Topology-first: start by shaping the cluster (counts, layout) so later steps inherit a clear foundation.
  • Bundle defaults: use the DSL helpers to attach common expectations (like liveness) whenever you add a matching workload, reducing forgotten checks.
  • Intentional rates: express traffic in per-block terms to align with protocol timing rather than wall-clock assumptions.
  • Opt-in chaos: enable restart patterns only in scenarios meant to probe resilience; keep functional smoke tests deterministic.
  • Wallet clarity: seed only the number of actors you need; it keeps transaction scenarios deterministic and interpretable.

These patterns make scenario definitions self-explanatory while staying aligned with the framework’s block-oriented timing model.
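
A sketch of these patterns applied together — topology first, per-block rates, liveness bundled alongside the workloads, chaos left out, and only the wallets the test needs (the values are illustrative):

use std::time::Duration;

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(2))
    .wallets(10)                                   // seed only the actors needed
    .transactions_with(|tx| tx.rate(2).users(10))  // per-block rate, not per second
    .da_with(|da| da.channel_rate(1).blob_rate(2))
    .expect_consensus_liveness()                   // bundled health check
    .with_run_duration(Duration::from_secs(120))
    .build();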

Best Practices

This page collects proven patterns for authoring, running, and maintaining test scenarios that are reliable, maintainable, and actionable.

Scenario Design

State your intent

  • Document the goal of each scenario (throughput, DA validation, resilience) so expectation choices are obvious
  • Use descriptive variable names that explain topology purpose (e.g., star_topology_3val_2exec vs topology)
  • Add comments explaining why specific rates or durations were chosen

Keep runs meaningful

  • Choose durations that allow multiple blocks and make timing-based assertions trustworthy
  • Use FAQ: Run Duration Calculator to estimate minimum duration
  • Avoid runs shorter than 30 seconds unless testing startup behavior specifically

Separate concerns

  • Start with deterministic workloads for functional checks
  • Add chaos in dedicated resilience scenarios to avoid noisy failures
  • Don’t mix high transaction load with aggressive chaos in the same test (hard to debug)

Start small, scale up

  • Begin with minimal topology (1-2 validators) to validate scenario logic
  • Gradually increase topology size and workload rates
  • Use Host runner for fast iteration, then validate on Compose before production

Code Organization

Reuse patterns

  • Standardize on shared topology and workload presets so results are comparable across environments and teams
  • Extract common topology builders into helper functions
  • Create workspace-level constants for standard rates and durations

Example: Topology preset

pub fn standard_da_topology() -> GeneratedTopology {
    TopologyBuilder::new()
        .network_star()
        .validators(3)
        .executors(2)
        .generate()
}

Example: Shared constants

use std::time::Duration;

pub const STANDARD_TX_RATE: u64 = 10;
pub const STANDARD_DA_CHANNEL_RATE: u64 = 2;
pub const SHORT_RUN_DURATION: Duration = Duration::from_secs(60);
pub const LONG_RUN_DURATION: Duration = Duration::from_secs(300);

Debugging & Observability

Observe first, tune second

  • Rely on liveness and inclusion signals to interpret outcomes before tweaking rates or topology
  • Enable detailed logging (RUST_LOG=debug, NOMOS_LOG_LEVEL=debug) only after initial failure
  • Use NOMOS_TESTS_KEEP_LOGS=1 to persist logs when debugging failures

Use BlockFeed effectively

  • Subscribe to BlockFeed in expectations for real-time block monitoring
  • Track block production rate to detect liveness issues early
  • Use block statistics (block_feed.stats().total_transactions()) to verify inclusion, as sketched below
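
For example, an expectation's evaluation step can compare the feed's running totals against a minimum. A sketch assuming the stats() accessor described on the Workloads & Expectations page (the integer return type is assumed):

use testing_framework_core::scenario::{DynError, RunContext};

// Illustrative helper (not part of the framework): check observed transaction
// totals from BlockFeed statistics against a minimum.
fn check_min_transactions(ctx: &RunContext, min_txs: u64) -> Result<(), DynError> {
    let observed = ctx.block_feed().stats().total_transactions();
    if observed < min_txs {
        return Err(format!("expected >= {} transactions, observed {}", min_txs, observed).into());
    }
    Ok(())
}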

Collect metrics

  • Set up Prometheus/Grafana via scripts/setup/setup-observability.sh compose up for visualizing node behavior
  • Use metrics to identify bottlenecks before adding more load
  • Monitor mempool size, block size, and consensus timing

Environment & Runner Selection

Environment fit

  • Pick runners that match the feedback loop you need:
    • Host: Fast iteration during development, quick CI smoke tests
    • Compose: Reproducible environments (recommended for CI), chaos testing
    • K8s: Production-like fidelity, large topologies (10+ nodes)

Runner-specific considerations

| Runner | When to Use | When to Avoid |
|--------|-------------|---------------|
| Host | Development iteration, fast feedback | Chaos testing, container-specific issues |
| Compose | CI pipelines, chaos tests, reproducibility | Very large topologies (>10 nodes) |
| K8s | Production-like testing, cluster behaviors | Local development, fast iteration |

Minimal surprises

  • Seed only necessary wallets and keep configuration deltas explicit when moving between CI and developer machines
  • Use versions.env to pin node versions consistently across environments
  • Document non-default environment variables in scenario comments or README

CI/CD Integration

Use matrix builds

strategy:
  matrix:
    runner: [host, compose]
    topology: [small, medium]

Cache aggressively

  • Cache Rust build artifacts (target/)
  • Cache circuit parameters (assets/stack/kzgrs_test_params/)
  • Cache Docker layers (use BuildKit cache)

Collect logs on failure

- name: Collect logs on failure
  if: failure()
  run: |
    mkdir -p test-logs
    find /tmp -name "nomos-*.log" -exec cp {} test-logs/ \;
- uses: actions/upload-artifact@v3
  if: failure()
  with:
    name: test-logs-${{ matrix.runner }}
    path: test-logs/

Time limits

  • Set job timeout to prevent hung runs: timeout-minutes: 30
  • Use shorter durations in CI (60s) vs local testing (300s)
  • Run expensive tests (k8s, large topologies) only on main branch or release tags

See also: CI Integration for complete workflow examples

Anti-Patterns to Avoid

DON’T: Run without POL_PROOF_DEV_MODE

# BAD: Will hang/timeout on proof generation
cargo run -p runner-examples --bin local_runner

# GOOD: Fast mode for testing
POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin local_runner

DON’T: Use tiny durations

// BAD: Not enough time for blocks to propagate
.with_run_duration(Duration::from_secs(5))

// GOOD: Allow multiple consensus rounds
.with_run_duration(Duration::from_secs(60))

DON’T: Ignore cleanup failures

// BAD: Next run inherits leaked state
runner.run(&mut scenario).await?;
// forgot to call cleanup or use CleanupGuard

// GOOD: Cleanup via guard (automatic on panic)
let _cleanup = CleanupGuard::new(runner.clone());
runner.run(&mut scenario).await?;

DON’T: Mix concerns in one scenario

// BAD: Hard to debug when it fails
.transactions_with(|tx| tx.rate(50).users(100))  // high load
.chaos_with(|c| c.restart().min_delay(...))        // AND chaos
.da_with(|da| da.channel_rate(10).blob_rate(20))  // AND DA stress

// GOOD: Separate tests for each concern
// Test 1: High transaction load only
// Test 2: Chaos resilience only
// Test 3: DA stress only

DON’T: Hardcode paths or ports

// BAD: Breaks on different machines
let path = PathBuf::from("/home/user/circuits/kzgrs_test_params");
let port = 9000; // might conflict

// GOOD: Use env vars and dynamic allocation
let path = std::env::var("NOMOS_KZGRS_PARAMS_PATH")
    .unwrap_or_else(|_| "assets/stack/kzgrs_test_params/kzgrs_test_params".to_string());
let port = get_available_tcp_port();

DON’T: Ignore resource limits

# BAD: Large topology without checking resources
scripts/run/run-examples.sh -v 20 -e 10 compose
# (might OOM or exhaust ulimits)

# GOOD: Scale gradually and monitor resources
scripts/run/run-examples.sh -v 3 -e 2 compose  # start small
docker stats  # monitor resource usage
# then increase if resources allow

Scenario Design Heuristics

Minimal viable topology

  • Consensus: 3 validators (minimum for Byzantine fault tolerance)
  • DA: 2+ executors (test dispersal and sampling)
  • Network: Star topology (simplest for debugging)

Workload rate selection

  • Start with a low transaction rate (1-5 per block), then increase
  • DA: 1-2 channels and 1-3 blobs per block initially
  • Chaos: 30s+ intervals between restarts (allow recovery)

Duration guidelines

| Test Type | Minimum Duration | Typical Duration |
|-----------|------------------|------------------|
| Smoke test | 30s | 60s |
| Integration test | 60s | 120s |
| Load test | 120s | 300s |
| Resilience test | 120s | 300s |
| Soak test | 600s (10m) | 3600s (1h) |

Expectation selection

| Test Goal | Expectations |
|-----------|--------------|
| Basic functionality | expect_consensus_liveness() |
| Transaction handling | expect_consensus_liveness() + custom inclusion check |
| DA correctness | expect_consensus_liveness() + DA dispersal/sampling checks |
| Resilience | expect_consensus_liveness() + recovery time measurement |

Testing the Tests

Validate scenarios before committing

  1. Run on Host runner first (fast feedback)
  2. Run on Compose runner (reproducibility check)
  3. Check logs for warnings or errors
  4. Verify cleanup (no leaked processes/containers)
  5. Run 2-3 times to check for flakiness

Handling flaky tests

  • Increase run duration (timing-sensitive assertions need longer runs)
  • Reduce workload rates (might be saturating nodes)
  • Check resource limits (CPU/RAM/ulimits)
  • Add debugging output to identify race conditions
  • Consider if test is over-specified (too strict expectations)

Usage Patterns

  • Shape a topology, pick a runner: choose local for quick iteration, compose for reproducible multi-node stacks with observability, or k8s for cluster-grade validation.
  • Compose workloads deliberately: pair transactions and data-availability traffic for end-to-end coverage; add chaos only when assessing recovery and resilience.
  • Align expectations with goals: use liveness-style checks to confirm the system keeps up with planned activity, and add workload-specific assertions for inclusion or availability.
  • Reuse plans across environments: keep the scenario constant while swapping runners to compare behavior between developer machines and CI clusters.
  • Iterate with clear signals: treat expectation outcomes as the primary pass/fail indicator, and adjust topology or workloads based on what those signals reveal.

Examples

Concrete scenario shapes that illustrate how to combine topologies, workloads, and expectations.

View Complete Source Code:

Runnable examples: The repo includes complete binaries in examples/src/bin/:

  • local_runner.rs — Host processes (local)
  • compose_runner.rs — Docker Compose (requires image built)
  • k8s_runner.rs — Kubernetes (requires cluster access and image loaded)

Recommended: Use scripts/run/run-examples.sh -t <duration> -v <validators> -e <executors> <mode> where mode is host, compose, or k8s.

Alternative: Direct cargo run: POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin <name>

All runners require POL_PROOF_DEV_MODE=true to avoid expensive proof generation.

Code patterns below show how to build scenarios. Wrap these in #[tokio::test] functions for integration tests, or #[tokio::main] for binaries.
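
For instance, the simple_consensus pattern below can be wrapped in a test like this (a minimal sketch; the function name comes from the example that follows):

// Integration-test wrapper around a pattern function defined below.
#[tokio::test]
async fn simple_consensus_smoke() -> anyhow::Result<()> {
    simple_consensus().await
}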

Simple consensus liveness

Minimal test that validates basic block production:

use std::time::Duration;

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_local::LocalDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

pub async fn simple_consensus() -> Result<()> {
    let mut plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(0))
        .expect_consensus_liveness()
        .with_run_duration(Duration::from_secs(30))
        .build();

    let deployer = LocalDeployer::default();
    let runner = deployer.deploy(&plan).await?;
    let _handle = runner.run(&mut plan).await?;

    Ok(())
}

When to use: smoke tests for consensus on minimal hardware.

Transaction workload

Test consensus under transaction load:

use std::time::Duration;

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_local::LocalDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

pub async fn transaction_workload() -> Result<()> {
    let mut plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(2).executors(0))
        .wallets(20)
        .transactions_with(|txs| txs.rate(5).users(10))
        .expect_consensus_liveness()
        .with_run_duration(Duration::from_secs(60))
        .build();

    let deployer = LocalDeployer::default();
    let runner = deployer.deploy(&plan).await?;
    let _handle = runner.run(&mut plan).await?;

    Ok(())
}

When to use: validate transaction submission and inclusion.

DA + transaction workload

Combined test stressing both transaction and DA layers:

use std::time::Duration;

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_local::LocalDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

pub async fn da_and_transactions() -> Result<()> {
    let mut plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(2))
        .wallets(30)
        .transactions_with(|txs| txs.rate(5).users(15))
        .da_with(|da| da.channel_rate(2).blob_rate(2))
        .expect_consensus_liveness()
        .with_run_duration(Duration::from_secs(90))
        .build();

    let deployer = LocalDeployer::default();
    let runner = deployer.deploy(&plan).await?;
    let _handle = runner.run(&mut plan).await?;

    Ok(())
}

When to use: end-to-end coverage of transaction and DA layers.

Chaos resilience

Test system resilience under node restarts:

use std::time::Duration;

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_compose::ComposeDeployer;
use testing_framework_workflows::{ChaosBuilderExt, ScenarioBuilderExt};

pub async fn chaos_resilience() -> Result<()> {
    let mut plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(4).executors(2))
        .enable_node_control()
        .wallets(20)
        .transactions_with(|txs| txs.rate(3).users(10))
        .chaos_with(|c| {
            c.restart()
                .min_delay(Duration::from_secs(20))
                .max_delay(Duration::from_secs(40))
                .target_cooldown(Duration::from_secs(30))
                .apply()
        })
        .expect_consensus_liveness()
        .with_run_duration(Duration::from_secs(120))
        .build();

    let deployer = ComposeDeployer::default();
    let runner = deployer.deploy(&plan).await?;
    let _handle = runner.run(&mut plan).await?;

    Ok(())
}

When to use: resilience validation and operational readiness drills.

Note: Chaos tests require ComposeDeployer or another runner with node control support.

Advanced Examples

When should I read this? Skim now to see what’s possible, revisit later when you need load testing, chaos scenarios, or custom extensions. Start with basic examples first.

Realistic advanced scenarios demonstrating framework capabilities for production testing.

Summary

| Example | Topology | Workloads | Deployer | Key Feature |
|---------|----------|-----------|----------|-------------|
| Load Progression | 3 validators + 2 executors | Increasing tx rate | Compose | Dynamic load testing |
| Sustained Load | 4 validators + 2 executors | High tx + DA rate | Compose | Stress testing |
| Aggressive Chaos | 4 validators + 2 executors | Frequent restarts + traffic | Compose | Resilience validation |

Load Progression Test

Test consensus under progressively increasing transaction load:

use std::time::Duration;

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_compose::ComposeDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

pub async fn load_progression_test() -> Result<()> {
    for rate in [5, 10, 20, 30] {
        println!("Testing with rate: {}", rate);

        let mut plan =
            ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(2))
                .wallets(50)
                .transactions_with(|txs| txs.rate(rate).users(20))
                .expect_consensus_liveness()
                .with_run_duration(Duration::from_secs(60))
                .build();

        let deployer = ComposeDeployer::default();
        let runner = deployer.deploy(&plan).await?;
        let _handle = runner.run(&mut plan).await?;
    }

    Ok(())
}

When to use: Finding the maximum sustainable transaction rate for a given topology.

Sustained Load Test

Run high transaction and DA load for extended duration:

use std::time::Duration;

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_compose::ComposeDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

pub async fn sustained_load_test() -> Result<()> {
    let mut plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(4).executors(2))
        .wallets(100)
        .transactions_with(|txs| txs.rate(15).users(50))
        .da_with(|da| da.channel_rate(2).blob_rate(3))
        .expect_consensus_liveness()
        .with_run_duration(Duration::from_secs(300))
        .build();

    let deployer = ComposeDeployer::default();
    let runner = deployer.deploy(&plan).await?;
    let _handle = runner.run(&mut plan).await?;

    Ok(())
}

When to use: Validating stability under continuous high load over extended periods.

Aggressive Chaos Test

Frequent node restarts with active traffic:

use std::time::Duration;

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_compose::ComposeDeployer;
use testing_framework_workflows::{ChaosBuilderExt, ScenarioBuilderExt};

pub async fn aggressive_chaos_test() -> Result<()> {
    let mut plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(4).executors(2))
        .enable_node_control()
        .wallets(50)
        .transactions_with(|txs| txs.rate(10).users(20))
        .chaos_with(|c| {
            c.restart()
                .min_delay(Duration::from_secs(10))
                .max_delay(Duration::from_secs(20))
                .target_cooldown(Duration::from_secs(15))
                .apply()
        })
        .expect_consensus_liveness()
        .with_run_duration(Duration::from_secs(180))
        .build();

    let deployer = ComposeDeployer::default();
    let runner = deployer.deploy(&plan).await?;
    let _handle = runner.run(&mut plan).await?;

    Ok(())
}

When to use: Validating recovery and liveness under aggressive failure conditions.

Note: Requires ComposeDeployer for node control support.

Extension Ideas

These scenarios require custom implementations but demonstrate framework extensibility:

Mempool & Transaction Handling

Transaction Propagation & Inclusion Test

Concept: Submit the same batch of independent transactions to different nodes in randomized order/offsets, then verify all transactions are included and final state matches across nodes.

Requirements:

  • Custom workload: Generates a fixed batch of transactions and submits the same set to different nodes via ctx.node_clients(), with randomized submission order and timing offsets per node
  • Custom expectation: Verifies all transactions appear in blocks (order may vary), final state matches across all nodes (compare balances or state roots), and no transactions are dropped

Why useful: Exercises mempool propagation, proposer fairness, and transaction inclusion guarantees under realistic race conditions. Tests that the protocol maintains consistency regardless of which node receives transactions first.

Implementation notes: Requires both a custom Workload implementation (to submit same transactions to multiple nodes with jitter) and a custom Expectation implementation (to verify inclusion and state consistency).

Cross-Validator Mempool Divergence & Convergence

Concept: Drive different transaction subsets into different validators (or differing arrival orders) to create temporary mempool divergence, then verify mempools/blocks converge to contain the union (no permanent divergence).

Requirements:

  • Custom workload: Targets specific nodes via ctx.node_clients() with disjoint or jittered transaction batches
  • Custom expectation: After a convergence window, verifies that all transactions appear in blocks (order may vary) or that mempool contents converge across nodes
  • Run normal workloads during convergence period

Expectations:

  • Temporary mempool divergence is acceptable (different nodes see different transactions initially)
  • After convergence window, all transactions appear in blocks or mempools converge
  • No transactions are permanently dropped despite initial divergence
  • Mempool gossip/reconciliation mechanisms work correctly

Why useful: Exercises mempool gossip and reconciliation under uneven input or latency. Ensures no node “drops” transactions seen elsewhere, validating that mempool synchronization mechanisms correctly propagate transactions across the network even when they arrive at different nodes in different orders.

Implementation notes: Requires both a custom Workload implementation (to inject disjoint/jittered batches per node) and a custom Expectation implementation (to verify mempool convergence or block inclusion). Uses existing ctx.node_clients() capability—no new infrastructure needed.

Adaptive Mempool Pressure Test

Concept: Ramp transaction load over time to observe mempool growth, fee prioritization/eviction, and block saturation behavior, detecting performance regressions and ensuring backpressure/eviction work under increasing load.

Requirements:

  • Custom workload: Steadily increases transaction rate over time (optional: use fee tiers)
  • Custom expectation: Monitors mempool size, evictions, and throughput (blocks/txs per slot), flagging runaway growth or stalls
  • Run for extended duration to observe pressure buildup

Expectations:

  • Mempool size grows predictably with load (not runaway growth)
  • Fee prioritization/eviction mechanisms activate under pressure
  • Block saturation behavior is acceptable (blocks fill appropriately)
  • Throughput (blocks/txs per slot) remains stable or degrades gracefully
  • No stalls or unbounded mempool growth

Why useful: Detects performance regressions in mempool management. Ensures backpressure and eviction mechanisms work correctly under increasing load, preventing memory exhaustion or unbounded growth. Validates that fee prioritization correctly selects high-value transactions when mempool is full.

Implementation notes: Can be built with current workload model (ramping rate). Requires custom Expectation implementation that reads mempool metrics (via node HTTP APIs or Prometheus) and monitors throughput to judge behavior. No new infrastructure needed—uses existing observability capabilities.

Invalid Transaction Fuzzing

Concept: Submit malformed transactions and verify they’re rejected properly.

Implementation approach:

  • Custom workload that generates invalid transactions (bad signatures, insufficient funds, malformed structure)
  • Expectation verifies mempool rejects them and they never appear in blocks
  • Test mempool resilience and filtering

Why useful: Ensures mempool doesn’t crash or include invalid transactions under fuzzing.
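
A rough skeleton of such a fuzzing workload, assuming the Workload trait shape shown in the RunContext chapter; transaction construction and submission are left as comments because they depend on node client APIs not covered here:

use async_trait::async_trait;
use testing_framework_core::scenario::{DynError, RunContext, Workload};

// Hypothetical workload: submits deliberately invalid transactions; a paired
// expectation would verify none of them ever appear in blocks.
struct InvalidTxFuzzWorkload;

#[async_trait]
impl Workload for InvalidTxFuzzWorkload {
    fn name(&self) -> &str {
        "invalid_tx_fuzzing"
    }

    async fn start(&self, _ctx: &RunContext) -> Result<(), DynError> {
        // 1. Build transactions with bad signatures, insufficient funds, or a
        //    malformed structure.
        // 2. Submit them through the scenario's node clients (exact API omitted).
        // 3. Record what was submitted so the paired expectation can check that
        //    nothing invalid was included on-chain.
        Ok(())
    }
}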

Network & Gossip

Gossip Latency Gradient Scenario

Concept: Test consensus robustness under skewed gossip delays by partitioning nodes into latency tiers (tier A ≈10ms, tier B ≈100ms, tier C ≈300ms) and observing propagation lag, fork rate, and eventual convergence.

Requirements:

  • Partition nodes into three groups (tiers)
  • Apply per-group network delay via chaos: netem/iptables in compose; NetworkPolicy + netem sidecar in k8s
  • Run standard workload (transactions/block production)
  • Optional: Remove delays at end to check recovery

Expectations:

  • Propagation: Messages reach all tiers within acceptable bounds
  • Safety: No divergent finalized heads; fork rate stays within tolerance
  • Liveness: Chain keeps advancing; convergence after delays relaxed (if healed)

Why useful: Real networks have heterogeneous latency. This stress-tests proposer selection and fork resolution when some peers are “far” (high latency), validating that consensus remains safe and live under realistic network conditions.

Current blocker: Runner support for per-group delay injection (network delay via netem/iptables) is not present today. Would require new chaos plumbing in compose/k8s deployers to inject network delays per node group.

Byzantine Gossip Flooding (libp2p Peer)

Concept: Spin up a custom workload/sidecar that runs a libp2p host, joins the cluster’s gossip mesh, and publishes a high rate of syntactically valid but useless/stale messages to selected topics, testing gossip backpressure, scoring, and queue handling under a “malicious” peer.

Requirements:

  • Custom workload/sidecar that implements a libp2p host
  • Join the cluster’s gossip mesh as a peer
  • Publish high-rate syntactically valid but useless/stale messages to selected gossip topics
  • Run alongside normal workloads (transactions/block production)

Expectations:

  • Gossip backpressure mechanisms prevent message flooding from overwhelming nodes
  • Peer scoring correctly identifies and penalizes the malicious peer
  • Queue handling remains stable under flood conditions
  • Normal consensus operation continues despite malicious peer

Why useful: Tests Byzantine behavior (malicious peer) which is critical for consensus protocol robustness. More realistic than RPC spam since it uses the actual gossip protocol. Validates that gossip backpressure, peer scoring, and queue management correctly handle adversarial peers without disrupting consensus.

Current blocker: Requires adding gossip-capable helper (libp2p integration) to the framework. Would need a custom workload/sidecar implementation that can join the gossip mesh and inject messages. The rest of the scenario can use existing runners/workloads.

Network Partition Recovery

Concept: Test consensus recovery after network partitions.

Requirements:

  • Needs block_peer() / unblock_peer() methods in NodeControlHandle
  • Partition subsets of validators, wait, then restore connectivity
  • Verify chain convergence after partition heals

Why useful: Tests the most realistic failure mode in distributed systems.

Current blocker: Node control doesn’t yet support network-level actions (only process restarts).

Time & Timing

Time-Shifted Blocks (Clock Skew Test)

Concept: Test consensus and timestamp handling when nodes run with skewed clocks (e.g., +1s, −1s, +200ms jitter) to surface timestamp validation issues, reorg sensitivity, and clock drift handling.

Requirements:

  • Assign per-node time offsets (e.g., +1s, −1s, +200ms jitter)
  • Run normal workload (transactions/block production)
  • Observe whether blocks are accepted/propagated and the chain stays consistent

Expectations:

  • Blocks with skewed timestamps are handled correctly (accepted or rejected per protocol rules)
  • Chain remains consistent across nodes despite clock differences
  • No unexpected reorgs or chain splits due to timestamp validation issues

Why useful: Clock skew is a common real-world issue in distributed systems. This validates that consensus correctly handles timestamp validation and maintains safety/liveness when nodes have different clock offsets, preventing timestamp-based attacks or failures.

Current blocker: Runner ability to skew per-node clocks (e.g., privileged containers with libfaketime/chrony or time-offset netns) is not available today. Would require a new chaos/time-skew hook in deployers to inject clock offsets per node.

Block Timing Consistency

Concept: Verify block production intervals stay within expected bounds.

Implementation approach:

  • Custom expectation that consumes BlockFeed
  • Collect block timestamps during run
  • Assert intervals are within (slot_duration * active_slot_coeff) ± tolerance

Why useful: Validates consensus timing under various loads.

Topology & Membership

Dynamic Topology (Churn) Scenario

Concept: Nodes join and leave mid-run (new identities/addresses added; some nodes permanently removed) to exercise peer discovery, bootstrapping, reputation, and load balancing under churn.

Requirements:

  • Runner must be able to spin up new nodes with fresh keys/addresses at runtime
  • Update peer lists and bootstraps dynamically as nodes join/leave
  • Optionally tear down nodes permanently (not just restart)
  • Run normal workloads (transactions/block production) during churn

Expectations:

  • New nodes successfully discover and join the network
  • Peer discovery mechanisms correctly handle dynamic topology changes
  • Reputation systems adapt to new/removed peers
  • Load balancing adjusts to changing node set
  • Consensus remains safe and live despite topology churn

Why useful: Real networks experience churn (nodes joining/leaving). Unlike restarts (which preserve topology), churn changes the actual topology size and peer set, testing how the protocol handles dynamic membership. This exercises peer discovery, bootstrapping, reputation systems, and load balancing under realistic conditions.

Current blocker: Runner support for dynamic node addition/removal at runtime is not available today. Chaos today only restarts existing nodes; churn would require the ability to spin up new nodes with fresh identities/addresses, update peer lists/bootstraps dynamically, and permanently remove nodes. Would need new topology management capabilities in deployers.

API & External Interfaces

API DoS/Stress Test

Concept: Adversarial workload floods node HTTP/WS APIs with high QPS and malformed/bursty requests; expectation checks nodes remain responsive or rate-limit without harming consensus.

Requirements:

  • Custom workload: Targets node HTTP/WS API endpoints with mixed valid/invalid requests at high rate
  • Custom expectation: Monitors error rates, latency, and confirms block production/liveness unaffected
  • Run alongside normal workloads (transactions/block production)

Expectations:

  • Nodes remain responsive or correctly rate-limit under API flood
  • Error rates/latency are acceptable (rate limiting works)
  • Block production/liveness unaffected by API abuse
  • Consensus continues normally despite API stress

Why useful: Validates API hardening under abuse and ensures control/telemetry endpoints don’t destabilize the node. Tests that API abuse is properly isolated from consensus operations, preventing DoS attacks on API endpoints from affecting blockchain functionality.

Implementation notes: Requires custom Workload implementation that directs high-QPS traffic to node APIs (via ctx.node_clients() or direct HTTP clients) and custom Expectation implementation that monitors API responsiveness metrics and consensus liveness. Uses existing node API access—no new infrastructure needed.

State & Correctness

Wallet Balance Verification

Concept: Track wallet balances and verify state consistency.

Description: After transaction workload completes, query all wallet balances via node API and verify total supply is conserved. Requires tracking initial state, submitted transactions, and final balances. Validates that the ledger maintains correctness under load (no funds lost or created). This is a state assertion expectation that checks correctness, not just liveness.
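
A skeleton of what such a state assertion could look like, assuming the Expectation trait shape used elsewhere in these docs; the balance queries are left as comments since they depend on node API details not described on this page:

use async_trait::async_trait;
use testing_framework_core::scenario::{DynError, Expectation, RunContext};

// Hypothetical expectation: verifies that the sum of final wallet balances
// equals the expected total supply (conservation check).
struct BalanceConservationExpectation {
    expected_total_supply: u64,
}

#[async_trait]
impl Expectation for BalanceConservationExpectation {
    fn name(&self) -> &'static str {
        "balance_conservation"
    }

    async fn evaluate(&mut self, _ctx: &RunContext) -> Result<(), DynError> {
        // 1. Query each seeded wallet's final balance via the node HTTP API.
        // 2. Sum the balances into `observed_total`.
        // 3. Compare against the expected total supply:
        let observed_total: u64 = todo!("sum wallet balances via the node API");
        if observed_total != self.expected_total_supply {
            return Err(format!(
                "total supply not conserved: expected {}, observed {}",
                self.expected_total_supply, observed_total
            )
            .into());
        }
        Ok(())
    }
}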

Cucumber/BDD Interface

The Logos testing repo includes a small Cucumber (Gherkin) harness for “smoke” scenarios. It is useful when you want readable acceptance-style checks, but it intentionally exposes a limited surface area compared to Rust scenarios.


What Exists Today

  • Step definitions live in testing-framework/cucumber.
  • The runnable entrypoints are binaries in examples (crate runner-examples):
    • cucumber_host (local/host deployer)
    • cucumber_compose (compose deployer)
  • Feature files live in examples/cucumber/features/.
  • Supported deployers: local and compose (no k8s runner integration in Cucumber yet).

Example Feature (Matches Current Steps)

This is the shape used by the repo’s smoke features:

Feature: Testing Framework - Local Runner

  Scenario: Run a local smoke scenario (tx + DA + liveness)
    Given deployer is "local"
    And topology has 1 validators and 1 executors
    And run duration is 60 seconds
    And wallets total funds is 1000000000 split across 50 users
    And transactions rate is 1 per block
    And data availability channel rate is 1 per block and blob rate is 1 per block
    And expect consensus liveness
    When run scenario
    Then scenario should succeed

Running The Smoke Features

Local runner smoke:

POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin cucumber_host

Compose runner smoke:

POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin cucumber_compose

Available Steps (Current)

Topology / runner selection:

  • Given deployer is "local"|"compose"
  • Given topology has <validators> validators and <executors> executors

Run configuration:

  • Given run duration is <seconds> seconds
  • Given wallets total funds is <funds> split across <users> users

Workloads:

  • Given transactions rate is <rate> per block
  • Given transactions rate is <rate> per block using <users> users
  • Given data availability channel rate is <channel_rate> per block and blob rate is <blob_rate> per block

Expectations:

  • Given expect consensus liveness
  • Given consensus liveness lag allowance is <blocks>

Execution + assertion:

  • When run scenario
  • Then scenario should succeed

Notes

  • The Cucumber harness builds scenarios using the same core + workflow builder APIs as the Rust examples, so the same prerequisites apply (notably POL_PROOF_DEV_MODE=true for practical runs).
  • If you need more flexibility (custom workloads/expectations, richer checks, node control/chaos), write Rust scenarios instead: see Examples and Extending the Framework.

Running Scenarios

This page focuses on how scenarios are executed (deploy → run → evaluate → cleanup), what artifacts you get back, and how that differs across runners.

For “just run something that works” commands, see Running Examples.


Execution Flow (High Level)

When you run a built scenario via a deployer, the run follows the same shape:

flowchart TD
    Build[Scenario built] --> Deploy[Deploy]
    Deploy --> Capture[Capture]
    Capture --> Execute[Execute]
    Execute --> Evaluate[Evaluate]
    Evaluate --> Cleanup[Cleanup]

  • Deploy: provision infrastructure and start nodes (processes/containers/pods)
  • Capture: establish clients/observability and capture initial state
  • Execute: run workloads for the configured wall-clock duration
  • Evaluate: run expectations (after the execution window ends)
  • Cleanup: stop resources and finalize artifacts

The Core API

use std::time::Duration;

use testing_framework_core::scenario::{Deployer as _, ScenarioBuilder};
use testing_framework_runner_local::LocalDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

async fn run_once() -> anyhow::Result<()> {
    let mut scenario = ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(1))
        .wallets(20)
        .transactions_with(|tx| tx.rate(1).users(5))
        .expect_consensus_liveness()
        .with_run_duration(Duration::from_secs(60))
        .build();

    let runner = LocalDeployer::default().deploy(&scenario).await?;
    runner.run(&mut scenario).await?;

    Ok(())
}

Notes:

  • with_run_duration(...) is wall-clock time, not “number of blocks”.
  • .transactions_with(...) rates are per-block.
  • Most users should run scenarios via scripts/run/run-examples.sh unless they are embedding the framework in their own test crate.

Runner Differences

Local (Host) Runner

  • Best for: fast iteration and debugging
  • Logs/state: stored under a temporary run directory unless you set NOMOS_TESTS_KEEP_LOGS=1 and/or NOMOS_LOG_DIR=...
  • Limitations: no node-control capability (chaos workflows that require node control won’t work here)

Run the built-in local examples:

POL_PROOF_DEV_MODE=true \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 host

Compose Runner

  • Best for: reproducible multi-node environments and node control
  • Logs: primarily via docker compose logs (and any node-level log configuration you apply)
  • Debugging: set COMPOSE_RUNNER_PRESERVE=1 to keep the environment up after a run

Run the built-in compose examples:

POL_PROOF_DEV_MODE=true \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 compose

K8s Runner

  • Best for: production-like behavior, cluster scheduling/networking
  • Logs: kubectl logs ...
  • Debugging: set K8S_RUNNER_PRESERVE=1 and K8S_RUNNER_NAMESPACE=... to keep resources around

Run the built-in k8s examples:

POL_PROOF_DEV_MODE=true \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 k8s

Artifacts & Where to Look

  • Node logs: configure via NOMOS_LOG_DIR, NOMOS_LOG_LEVEL, NOMOS_LOG_FILTER (see Logging & Observability)
  • Runner logs: controlled by RUST_LOG (runner process only)
  • Keep run directories: set NOMOS_TESTS_KEEP_LOGS=1
  • Compose environment preservation: set COMPOSE_RUNNER_PRESERVE=1
  • K8s environment preservation: set K8S_RUNNER_PRESERVE=1

Runners

Runners turn a scenario plan into a live environment while keeping the plan unchanged. Choose based on feedback speed, reproducibility, and fidelity. For environment and operational considerations, see Operations Overview.

Important: All runners require POL_PROOF_DEV_MODE=true to avoid expensive Groth16 proof generation that causes timeouts.

Host runner (local processes)

  • Launches node processes directly on the host (via LocalDeployer).
  • Binary: local_runner.rs, script mode: host
  • Fastest feedback loop and minimal orchestration overhead.
  • Best for development-time iteration and debugging.
  • Can run in CI for fast smoke tests.
  • Node control: Not supported (chaos workloads not available)

Run with: scripts/run/run-examples.sh -t 60 -v 1 -e 1 host

Docker Compose runner

  • Starts nodes in containers to provide a reproducible multi-node stack on a single machine (via ComposeDeployer).
  • Binary: compose_runner.rs, script mode: compose
  • Discovers service ports and wires observability for convenient inspection.
  • Good balance between fidelity and ease of setup.
  • Recommended for CI pipelines (isolated environment, reproducible).
  • Node control: Supported (can restart nodes for chaos testing)

Run with: scripts/run/run-examples.sh -t 60 -v 1 -e 1 compose

Kubernetes runner

  • Deploys nodes onto a cluster for higher-fidelity, longer-running scenarios (via K8sDeployer).
  • Binary: k8s_runner.rs, script mode: k8s
  • Suits CI with cluster access or shared test environments where cluster behavior and scheduling matter.
  • Node control: Not supported yet (chaos workloads not available)

Run with: scripts/run/run-examples.sh -t 60 -v 1 -e 1 k8s

Common expectations

  • All runners require at least one validator and, for transaction scenarios, access to seeded wallets.
  • Readiness probes gate workload start so traffic begins only after nodes are reachable.
  • Environment flags can relax timeouts or increase tracing when diagnostics are needed.

Runner Comparison

flowchart TB
    subgraph Host["Host Runner (Local)"]
        H1["Speed: Fast"]
        H2["Isolation: Shared host"]
        H3["Setup: Minimal"]
        H4["Chaos: Not supported"]
        H5["CI: Quick smoke tests"]
    end
    
    subgraph Compose["Compose Runner (Docker)"]
        C1["Speed: Medium"]
        C2["Isolation: Containerized"]
        C3["Setup: Image build required"]
        C4["Chaos: Supported"]
        C5["CI: Recommended"]
    end
    
    subgraph K8s["K8s Runner (Cluster)"]
        K1["Speed: Slower"]
        K2["Isolation: Pod-level"]
        K3["Setup: Cluster + image"]
        K4["Chaos: Not yet supported"]
        K5["CI: Large-scale tests"]
    end
    
    Decision{Choose Based On}
    Decision -->|Fast iteration| Host
    Decision -->|Reproducibility| Compose
    Decision -->|Production-like| K8s
    
    style Host fill:#e1f5ff
    style Compose fill:#e1ffe1
    style K8s fill:#ffe1f5

Detailed Feature Matrix

| Feature | Host | Compose | K8s |
|---------|------|---------|-----|
| Speed | Fastest | Medium | Slowest |
| Setup Time | < 1 min | 2-5 min | 5-10 min |
| Isolation | Process-level | Container | Pod + namespace |
| Node Control | No | Yes | Not yet |
| Observability | Basic | External stack | Cluster-wide |
| CI Integration | Smoke tests | Recommended | Heavy tests |
| Resource Usage | Low | Medium | High |
| Reproducibility | Environment-dependent | High | Highest |
| Network Fidelity | Localhost only | Virtual network | Real cluster |
| Parallel Runs | Port conflicts | Isolated | Namespace isolation |

Decision Guide

flowchart TD
    Start[Need to run tests?] --> Q1{Local development?}
    Q1 -->|Yes| Q2{Testing chaos?}
    Q1 -->|No| Q5{Have cluster access?}
    
    Q2 -->|Yes| UseCompose[Use Compose]
    Q2 -->|No| Q3{Need isolation?}
    
    Q3 -->|Yes| UseCompose
    Q3 -->|No| UseHost[Use Host]
    
    Q5 -->|Yes| Q6{Large topology?}
    Q5 -->|No| Q7{CI pipeline?}
    
    Q6 -->|Yes| UseK8s[Use K8s]
    Q6 -->|No| UseCompose
    
    Q7 -->|Yes| Q8{Docker available?}
    Q7 -->|No| UseHost
    
    Q8 -->|Yes| UseCompose
    Q8 -->|No| UseHost
    
    style UseHost fill:#e1f5ff
    style UseCompose fill:#e1ffe1
    style UseK8s fill:#ffe1f5

Quick Recommendations

Use Host Runner when:

  • Iterating rapidly during development
  • Running quick smoke tests
  • Testing on a laptop with limited resources
  • Don’t need chaos testing

Use Compose Runner when:

  • Need reproducible test environments
  • Testing chaos scenarios (node restarts)
  • Running in CI pipelines
  • Want containerized isolation

Use K8s Runner when:

  • Testing large-scale topologies (10+ nodes)
  • Need production-like environment
  • Have cluster access in CI
  • Testing cluster-specific behaviors

RunContext: BlockFeed & Node Control

The deployer supplies a RunContext that workloads and expectations share. It provides:

  • Topology descriptors (GeneratedTopology)
  • Client handles (NodeClients / ClusterClient) for HTTP/RPC calls
  • Metrics (RunMetrics, Metrics) and block feed
  • Optional NodeControlHandle for managing nodes

BlockFeed: Observing Block Production

The BlockFeed is a broadcast stream of block observations that allows workloads and expectations to monitor blockchain progress in real-time. It polls a validator node continuously and broadcasts new blocks to all subscribers.

What BlockFeed Provides

Real-time block stream:

  • Subscribe to receive BlockRecord notifications as blocks are produced
  • Each record includes the block header (HeaderId) and full block payload
  • Backed by a background task that polls node storage every second

Block statistics:

  • Track total transactions across all observed blocks
  • Access via block_feed.stats().total_transactions()

Broadcast semantics:

  • Multiple subscribers can receive the same blocks independently
  • Late subscribers start receiving from current block (no history replay)
  • Lagged subscribers skip missed blocks automatically

Accessing BlockFeed

BlockFeed is available through RunContext:

let block_feed = ctx.block_feed();

Usage in Expectations

Expectations typically use BlockFeed to verify block production and inclusion of transactions/data.

Example: Counting blocks during a run

use std::sync::{
    Arc,
    atomic::{AtomicU64, Ordering},
};

use async_trait::async_trait;
use testing_framework_core::scenario::{DynError, Expectation, RunContext};

struct MinimumBlocksExpectation {
    min_blocks: u64,
    captured_blocks: Option<Arc<AtomicU64>>,
}

#[async_trait]
impl Expectation for MinimumBlocksExpectation {
    fn name(&self) -> &'static str {
        "minimum_blocks"
    }

    async fn start_capture(&mut self, ctx: &RunContext) -> Result<(), DynError> {
        let block_count = Arc::new(AtomicU64::new(0));
        let block_count_task = Arc::clone(&block_count);
        
        // Subscribe to block feed
        let mut receiver = ctx.block_feed().subscribe();
        
        // Spawn a task to count blocks
        tokio::spawn(async move {
            loop {
                match receiver.recv().await {
                    Ok(_record) => {
                        block_count_task.fetch_add(1, Ordering::Relaxed);
                    }
                    Err(tokio::sync::broadcast::error::RecvError::Lagged(skipped)) => {
                        tracing::debug!(skipped, "receiver lagged, skipping blocks");
                    }
                    Err(tokio::sync::broadcast::error::RecvError::Closed) => {
                        tracing::debug!("block feed closed");
                        break;
                    }
                }
            }
        });
        
        self.captured_blocks = Some(block_count);
        Ok(())
    }

    async fn evaluate(&mut self, ctx: &RunContext) -> Result<(), DynError> {
        let blocks = self.captured_blocks
            .as_ref()
            .expect("start_capture must be called first")
            .load(Ordering::Relaxed);
        
        if blocks < self.min_blocks {
            return Err(format!(
                "expected at least {} blocks, observed {}",
                self.min_blocks, blocks
            ).into());
        }
        
        tracing::info!(blocks, min = self.min_blocks, "minimum blocks expectation passed");
        Ok(())
    }
}

Example: Inspecting block contents

use testing_framework_core::scenario::{DynError, RunContext};

async fn start_capture(ctx: &RunContext) -> Result<(), DynError> {
    let mut receiver = ctx.block_feed().subscribe();
    
    tokio::spawn(async move {
        loop {
            match receiver.recv().await {
                Ok(record) => {
                    // Access block header
                    let header_id = &record.header;
                    
                    // Access full block
                    let tx_count = record.block.transactions().len();
                    
                    tracing::debug!(
                        ?header_id,
                        tx_count,
                        "observed block"
                    );
                    
                    // Process transactions, DA blobs, etc.
                }
                Err(tokio::sync::broadcast::error::RecvError::Closed) => break,
                Err(_) => continue,
            }
        }
    });
    
    Ok(())
}

Usage in Workloads

Workloads can use BlockFeed to coordinate timing or wait for specific conditions before proceeding.

Example: Wait for N blocks before starting

use async_trait::async_trait;
use testing_framework_core::scenario::{DynError, RunContext, Workload};

struct DelayedWorkload {
    wait_blocks: usize,
}

#[async_trait]
impl Workload for DelayedWorkload {
    fn name(&self) -> &str {
        "delayed_workload"
    }

    async fn start(&self, ctx: &RunContext) -> Result<(), DynError> {
        tracing::info!(wait_blocks = self.wait_blocks, "waiting for blocks before starting");
        
        // Subscribe to block feed
        let mut receiver = ctx.block_feed().subscribe();
        let mut count = 0;
        
        // Wait for N blocks
        while count < self.wait_blocks {
            match receiver.recv().await {
                Ok(_) => count += 1,
                Err(tokio::sync::broadcast::error::RecvError::Lagged(_)) => continue,
                Err(tokio::sync::broadcast::error::RecvError::Closed) => {
                    return Err("block feed closed before reaching target".into());
                }
            }
        }
        
        tracing::info!("warmup complete, starting actual workload");
        
        // Now do the actual work
        // ...
        
        Ok(())
    }
}

Example: Rate limiting based on block production

use testing_framework_core::scenario::{DynError, RunContext};

async fn generate_request() -> Option<()> {
    None
}

async fn start(ctx: &RunContext) -> Result<(), DynError> {
    let clients = ctx.node_clients().validator_clients();
    let mut receiver = ctx.block_feed().subscribe();
    let mut pending_requests: Vec<()> = Vec::new();

    loop {
        tokio::select! {
            // Issue a batch on each new block.
            Ok(_record) = receiver.recv() => {
                if !pending_requests.is_empty() {
                    tracing::debug!(count = pending_requests.len(), "issuing requests on new block");
                    for _req in pending_requests.drain(..) {
                        let _info = clients[0].consensus_info().await?;
                    }
                }
            }

            // Generate work continuously.
            Some(req) = generate_request() => {
                pending_requests.push(req);
            }
        }
    }
}

BlockFeed vs Direct Polling

Use BlockFeed when:

  • You need to react to blocks as they’re produced
  • Multiple components need to observe the same blocks
  • You want automatic retry/reconnect logic
  • You’re tracking statistics across many blocks

Use direct polling when:

  • You need to query specific historical blocks
  • You’re checking final state after workloads complete
  • You need transaction receipts or other indexed data
  • You’re implementing a one-time health check

Example direct polling in expectations:

use testing_framework_core::scenario::{DynError, RunContext};

async fn evaluate(ctx: &RunContext) -> Result<(), DynError> {
    let client = &ctx.node_clients().validator_clients()[0];
    
    // Poll current height once
    let info = client.consensus_info().await?;
    tracing::info!(height = info.height, "final block height");
    
    // This is simpler than BlockFeed for one-time checks
    Ok(())
}

Block Statistics

Access aggregated statistics without subscribing to the feed:

use testing_framework_core::scenario::{DynError, RunContext};

async fn evaluate(ctx: &RunContext, expected_min: u64) -> Result<(), DynError> {
    let stats = ctx.block_feed().stats();
    let total_txs = stats.total_transactions();
    
    tracing::info!(total_txs, "transactions observed across all blocks");
    
    if total_txs < expected_min {
        return Err(format!(
            "expected at least {} transactions, observed {}",
            expected_min, total_txs
        ).into());
    }
    
    Ok(())
}

Important Notes

Subscription timing:

  • Subscribe in start_capture() for expectations
  • Subscribe in start() for workloads
  • Late subscribers miss historical blocks (no replay)

Lagged receivers:

  • If your subscriber is too slow, it may lag behind
  • Handle RecvError::Lagged(skipped) gracefully
  • Consider increasing processing speed or reducing block rate

Feed lifetime:

  • BlockFeed runs for the entire scenario duration
  • Automatically cleaned up when the run completes
  • Closed channels signal graceful shutdown

Performance:

  • BlockFeed polls nodes every 1 second
  • Broadcasts to all subscribers with minimal overhead
  • Suitable for scenarios with hundreds of blocks

Real-World Examples

The framework’s built-in expectations use BlockFeed extensively:

  • ConsensusLiveness: Doesn’t directly subscribe but uses block feed stats to verify progress
  • DataAvailabilityExpectation: Subscribes to inspect DA blobs in each block and track inscription/dispersal
  • TransactionInclusion: Subscribes to find specific transactions in blocks

See Examples and Workloads & Expectations for more patterns.


Current Chaos Capabilities and Limitations

The framework currently supports process-level chaos (node restarts) for resilience testing:

Supported:

  • Restart validators (restart_validator)
  • Restart executors (restart_executor)
  • Random restart workload via .chaos().restart()

Not Yet Supported:

  • Network partitions (blocking peers, packet loss)
  • Resource constraints (CPU throttling, memory limits)
  • Byzantine behavior injection (invalid blocks, bad signatures)
  • Selective peer blocking/unblocking

For network partition testing, see Extension Ideas which describes the proposed block_peer/unblock_peer API (not yet implemented).

Accessing node control in workloads/expectations

Check for control support and use it conditionally:

use async_trait::async_trait;
use testing_framework_core::scenario::{DynError, RunContext, Workload};

struct RestartWorkload;

#[async_trait]
impl Workload for RestartWorkload {
    fn name(&self) -> &str {
        "restart_workload"
    }

    async fn start(&self, ctx: &RunContext) -> Result<(), DynError> {
        if let Some(control) = ctx.node_control() {
            // Restart the first validator (index 0) if supported.
            control.restart_validator(0).await?;
        }
        Ok(())
    }
}

When chaos workloads need control, require enable_node_control() in the scenario builder and deploy with a runner that supports it.

Current API surface

The NodeControlHandle trait currently provides:

use async_trait::async_trait;
use testing_framework_core::scenario::DynError;

#[async_trait]
pub trait NodeControlHandle: Send + Sync {
    async fn restart_validator(&self, index: usize) -> Result<(), DynError>;
    async fn restart_executor(&self, index: usize) -> Result<(), DynError>;
}

Future extensions may include peer blocking/unblocking or other control operations. For now, focus on restart-based chaos patterns as shown in the chaos workload examples.

Considerations

  • Always guard control usage: not all runners expose NodeControlHandle.
  • Treat control as best-effort: failures should surface as test failures, but workloads should degrade gracefully when control is absent.
  • Combine control actions with expectations (e.g., restart then assert height convergence) to keep scenarios meaningful.

Chaos Workloads

When should I read this? You don’t need chaos testing to be productive with the framework. Focus on basic scenarios first—chaos is for resilience validation and operational readiness drills once your core tests are stable.

Chaos in the framework uses node control to introduce failures and validate recovery. The built-in restart workload lives in testing_framework_workflows::workloads::chaos::RandomRestartWorkload.

How it works

  • Requires NodeControlCapability (enable_node_control() in the scenario builder) and a runner that provides a NodeControlHandle.
  • Randomly selects nodes (validators, executors) to restart based on your include/exclude flags.
  • Respects min/max delay between restarts and a target cooldown to avoid flapping the same node too frequently.
  • Runs alongside other workloads; expectations should account for the added disruption.
  • Support varies by runner: node control is not provided by the local runner and is not yet implemented for the k8s runner. Use a runner that advertises NodeControlHandle support (e.g., compose) for chaos workloads.

Usage

use std::time::Duration;

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::{ScenarioBuilderExt, workloads::chaos::RandomRestartWorkload};

pub fn random_restart_plan() -> testing_framework_core::scenario::Scenario<
    testing_framework_core::scenario::NodeControlCapability,
> {
    ScenarioBuilder::topology_with(|t| t.network_star().validators(2).executors(1))
        .enable_node_control()
        .with_workload(RandomRestartWorkload::new(
            Duration::from_secs(45),  // min delay
            Duration::from_secs(75),  // max delay
            Duration::from_secs(120), // target cooldown
            true,                     // include validators
            true,                     // include executors
        ))
        .expect_consensus_liveness()
        .with_run_duration(Duration::from_secs(150))
        .build()
}
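
To execute this plan, deploy it with a runner that advertises node control (currently the compose runner). The sketch below is illustrative: the testing_framework_runner_compose crate path, the ComposeDeployer::default() constructor, and the runner.run(...) call follow the pattern of the other runner examples and are assumptions, not a verbatim quote of the compose runner's API.

use testing_framework_core::scenario::Deployer as _;
use testing_framework_runner_compose::ComposeDeployer; // assumed crate path

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    // Node-control plan defined above.
    let mut scenario = random_restart_plan();

    // The compose runner provides a NodeControlHandle, so the restart workload can act.
    let deployer = ComposeDeployer::default(); // assumed constructor
    let runner = deployer.deploy(&scenario).await?;
    runner.run(&mut scenario).await?;

    Ok(())
}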

Expectations to pair

  • Consensus liveness: ensure blocks keep progressing despite restarts.
  • Height convergence: optionally check all nodes converge after the chaos window.
  • Any workload-specific inclusion checks if you’re also driving tx/DA traffic.

Best practices

  • Keep delays/cooldowns realistic; avoid back-to-back restarts that would never happen in production.
  • Limit chaos scope: toggle validators vs executors based on what you want to test.
  • Combine with observability: monitor metrics/logs to explain failures.

Topology & Chaos Patterns

This page focuses on cluster manipulation: node control, chaos patterns, and what the tooling supports today.

Node control availability

  • Supported: restart control via NodeControlHandle (compose runner).
  • Not supported: local runner does not expose node control; k8s runner does not support it yet.
  • Not yet supported: peer blocking/unblocking and network partitions.

See also: RunContext: BlockFeed & Node Control for the current node-control API surface and limitations.

Chaos patterns to consider

  • Restarts: random restarts with minimum delay/cooldown to test recovery.
  • Partitions (planned): block/unblock peers to simulate partial isolation, then assert height convergence after healing.
  • Validator churn (planned): stop one validator and start another (new key) mid-run to test membership changes; expect convergence.
  • Load SLOs: push tx/DA rates and assert inclusion/availability budgets instead of only liveness.
  • API probes: poll HTTP/RPC endpoints during chaos to ensure external contracts stay healthy (shape + latency).

Expectations to pair

  • Liveness/height convergence after chaos windows (a convergence-check sketch follows this list).
  • SLO checks: inclusion latency, DA responsiveness, API latency/shape.
  • Recovery checks: ensure nodes that were isolated or restarted catch up to cluster height within a timeout.
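
A minimal sketch of such a height-convergence check. It reuses the consensus_info() client call from earlier examples; the HeightConvergence name and the max_spread threshold are illustrative assumptions:

use async_trait::async_trait;
use testing_framework_core::scenario::{DynError, Expectation, RunContext};

pub struct HeightConvergence {
    // Maximum allowed height gap between the slowest and fastest validator.
    max_spread: u64,
}

#[async_trait]
impl Expectation for HeightConvergence {
    fn name(&self) -> &str {
        "height_convergence"
    }

    async fn evaluate(&mut self, ctx: &RunContext) -> Result<(), DynError> {
        // Collect the current height from every validator.
        let mut heights = Vec::new();
        for client in ctx.node_clients().validator_clients() {
            let info = client.consensus_info().await?;
            heights.push(info.height);
        }

        let min = heights.iter().copied().min().ok_or("no validators")?;
        let max = heights.iter().copied().max().ok_or("no validators")?;

        if max - min > self.max_spread {
            return Err(format!(
                "validator heights diverge: min={min}, max={max}, allowed spread={}",
                self.max_spread
            )
            .into());
        }
        Ok(())
    }
}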

Guidance

  • Keep chaos realistic: avoid flapping or patterns you wouldn’t operate in prod.
  • Scope chaos: choose validators vs executors intentionally; don’t restart all nodes at once unless you’re testing full outages.
  • Combine chaos with observability: capture block feed/metrics and API health so failures are diagnosable.

Part III — Developer Reference

Deep dives for contributors who extend the framework, evolve its abstractions, or maintain the crate set.

Scenario Model (Developer Level)

The scenario model defines clear, composable responsibilities:

  • Topology: a declarative description of the cluster—how many nodes, their roles, and the broad network and data-availability characteristics. It represents the intended shape of the system under test.
  • Scenario: a plan combining topology, workloads, expectations, and a run window. Building a scenario validates prerequisites (like seeded wallets) and ensures the run lasts long enough to observe meaningful block progression.
  • Workloads: asynchronous tasks that generate traffic or conditions. They use shared context to interact with the deployed cluster and may bundle default expectations.
  • Expectations: post-run assertions. They can capture baselines before workloads start and evaluate success once activity stops.
  • Runtime: coordinates workloads and expectations for the configured duration, enforces cooldowns when control actions occur, and ensures cleanup so runs do not leak resources.

Developers extending the model should keep these boundaries strict: topology describes, scenarios assemble, deployers provision, runners orchestrate, workloads drive, and expectations judge outcomes. For guidance on adding new capabilities, see Extending the Framework.

API Levels: Builder DSL vs. Direct Instantiation

The framework supports two styles for constructing scenarios:

  1. High-level Builder DSL (recommended): fluent helper methods (e.g. .transactions_with(...))
  2. Low-level direct instantiation: construct workload/expectation types explicitly, then attach them

Both styles produce the same runtime behavior because they ultimately call the same core builder APIs.

The DSL is implemented as extension traits (primarily testing_framework_workflows::ScenarioBuilderExt) on the core scenario builder.

use std::time::Duration;

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

let plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(2))
    .wallets(5)
    .transactions_with(|txs| txs.rate(5).users(3))
    .da_with(|da| da.channel_rate(1).blob_rate(1).headroom_percent(20))
    .expect_consensus_liveness()
    .with_run_duration(Duration::from_secs(60))
    .build();

When to use:

  • Most test code (smoke, regression, CI)
  • When you want sensible defaults and minimal boilerplate

Low-Level Direct Instantiation

Direct instantiation gives you explicit control over the concrete types you attach:

use std::{
    num::{NonZeroU64, NonZeroUsize},
    time::Duration,
};

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::{
    expectations::ConsensusLiveness,
    workloads::{da, transaction},
};

let tx_workload = transaction::Workload::with_rate(5)
    .expect("transaction rate must be non-zero")
    .with_user_limit(NonZeroUsize::new(3));

let da_workload = da::Workload::with_rate(
    NonZeroU64::new(1).unwrap(),  // blob rate per block
    NonZeroU64::new(1).unwrap(),  // channel rate per block
    da::Workload::default_headroom_percent(),
);

let plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(2))
    .wallets(5)
    .with_workload(tx_workload)
    .with_workload(da_workload)
    .with_expectation(ConsensusLiveness::default())
    .with_run_duration(Duration::from_secs(60))
    .build();

When to use:

  • Custom workload/expectation implementations
  • Reusing preconfigured workload instances across multiple scenarios
  • Debugging / exploring the underlying workload types

Method Correspondence

High-Level DSL → Low-Level Direct:

  • .transactions_with(|txs| txs.rate(5).users(3)) → .with_workload(transaction::Workload::with_rate(5).expect(...).with_user_limit(...))
  • .da_with(|da| da.blob_rate(1).channel_rate(1)) → .with_workload(da::Workload::with_rate(...))
  • .expect_consensus_liveness() → .with_expectation(ConsensusLiveness::default())

Bundled Expectations (Important)

Workloads can bundle expectations by implementing Workload::expectations().

These bundled expectations are attached automatically whenever you call .with_workload(...) (including when you use the DSL), because the core builder expands workload expectations during attachment.

Mixing Both Styles

Mixing is common: use the DSL for built-ins, and direct instantiation for custom pieces.

use std::time::Duration;

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::{ScenarioBuilderExt, workloads::transaction};

let tx_workload = transaction::Workload::with_rate(5)
    .expect("transaction rate must be non-zero");

let plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(2))
    .wallets(5)
    .with_workload(tx_workload)          // direct instantiation
    .expect_consensus_liveness()         // DSL
    .with_run_duration(Duration::from_secs(60))
    .build();

Implementation Detail (How the DSL Works)

The DSL methods are thin wrappers. For example:

builder.transactions_with(|txs| txs.rate(5).users(3))

is roughly equivalent to:

builder.transactions().rate(5).users(3).apply()

Troubleshooting

DSL method not found

  • Ensure the extension traits are in scope, e.g. use testing_framework_workflows::ScenarioBuilderExt;
  • Cross-check method names in Builder API Quick Reference

See Also

Extending the Framework

This guide shows how to extend the framework with custom workloads, expectations, runners, and topology helpers. Each section includes the trait outline and a minimal code example.

Adding a Workload

Steps:

  1. Implement testing_framework_core::scenario::Workload
  2. Provide a name and any bundled expectations
  3. Use init to derive inputs from topology/metrics; fail fast if prerequisites are missing
  4. Use start to drive async traffic using RunContext clients
  5. Expose from testing-framework/workflows and optionally add a DSL helper

Trait outline:

use async_trait::async_trait;
use testing_framework_core::scenario::{
    DynError, Expectation, RunContext, RunMetrics, Workload,
};
use testing_framework_core::topology::generation::GeneratedTopology;

struct MyExpectation;

#[async_trait]
impl Expectation for MyExpectation {
    fn name(&self) -> &str {
        "my_expectation"
    }

    async fn evaluate(&mut self, _ctx: &RunContext) -> Result<(), DynError> {
        Ok(())
    }
}

pub struct MyWorkload {
    // Configuration fields
    target_rate: u64,
}

impl MyWorkload {
    pub fn new(target_rate: u64) -> Self {
        Self { target_rate }
    }
}

#[async_trait]
impl Workload for MyWorkload {
    fn name(&self) -> &str {
        "my_workload"
    }

    fn expectations(&self) -> Vec<Box<dyn Expectation>> {
        // Return bundled expectations that should run with this workload
        vec![Box::new(MyExpectation)]
    }

    fn init(
        &mut self,
        topology: &GeneratedTopology,
        _run_metrics: &RunMetrics,
    ) -> Result<(), DynError> {
        // Validate prerequisites (e.g., enough nodes, wallet data present)
        if topology.validators().is_empty() {
            return Err("no validators available".into());
        }
        Ok(())
    }

    async fn start(&self, ctx: &RunContext) -> Result<(), DynError> {
        // Drive async activity: submit transactions, query nodes, etc.
        let clients = ctx.node_clients().validator_clients();
        
        for client in clients {
            let info = client.consensus_info().await?;
            tracing::info!(height = info.height, "workload queried node");
        }
        
        Ok(())
    }
}

Key points:

  • name() identifies the workload in logs
  • expectations() bundles default checks (can be empty)
  • init() validates topology before run starts
  • start() executes concurrently with other workloads; it should complete before the run duration expires

See Example: New Workload & Expectation for a complete, runnable example.

Adding an Expectation

Steps:

  1. Implement testing_framework_core::scenario::Expectation
  2. Use start_capture to snapshot baseline metrics (optional)
  3. Use evaluate to assert outcomes after workloads finish
  4. Return descriptive errors; the runner aggregates them
  5. Export from testing-framework/workflows if reusable

Trait outline:

use async_trait::async_trait;
use testing_framework_core::scenario::{DynError, Expectation, RunContext};

pub struct MyExpectation {
    expected_value: u64,
    captured_baseline: Option<u64>,
}

impl MyExpectation {
    pub fn new(expected_value: u64) -> Self {
        Self {
            expected_value,
            captured_baseline: None,
        }
    }
}

#[async_trait]
impl Expectation for MyExpectation {
    fn name(&self) -> &str {
        "my_expectation"
    }

    async fn start_capture(&mut self, ctx: &RunContext) -> Result<(), DynError> {
        // Optional: capture baseline state before workloads start
        let client = ctx.node_clients().validator_clients().first()
            .ok_or("no validators")?;
        
        let info = client.consensus_info().await?;
        self.captured_baseline = Some(info.height);
        
        tracing::info!(baseline = self.captured_baseline, "captured baseline");
        Ok(())
    }

    async fn evaluate(&mut self, ctx: &RunContext) -> Result<(), DynError> {
        // Assert the expected condition holds after workloads finish
        let client = ctx.node_clients().validator_clients().first()
            .ok_or("no validators")?;
        
        let info = client.consensus_info().await?;
        let final_height = info.height;
        
        let baseline = self.captured_baseline.unwrap_or(0);
        let delta = final_height.saturating_sub(baseline);
        
        if delta < self.expected_value {
            return Err(format!(
                "expected at least {} blocks, got {}",
                self.expected_value, delta
            ).into());
        }
        
        tracing::info!(delta, "expectation passed");
        Ok(())
    }
}

Key points:

  • name() identifies the expectation in logs
  • start_capture() runs before workloads start (optional)
  • evaluate() runs after workloads finish; return descriptive errors
  • Expectations run sequentially; keep them fast

Adding a Runner (Deployer)

Steps:

  1. Implement testing_framework_core::scenario::Deployer<Caps> for your capability type
  2. Deploy infrastructure and return a Runner
  3. Construct NodeClients and spawn a BlockFeed
  4. Build a RunContext and provide a CleanupGuard for teardown

Trait outline:

use async_trait::async_trait;
use testing_framework_core::scenario::{
    CleanupGuard, Deployer, DynError, Metrics, NodeClients, RunContext, Runner, Scenario,
    spawn_block_feed,
};
use testing_framework_core::topology::deployment::Topology;

pub struct MyDeployer {
    // Configuration: cluster connection details, etc.
}

impl MyDeployer {
    pub fn new() -> Self {
        Self {}
    }
}

#[async_trait]
impl Deployer<()> for MyDeployer {
    type Error = DynError;

    async fn deploy(&self, scenario: &Scenario<()>) -> Result<Runner, Self::Error> {
        // 1. Launch nodes using scenario.topology()
        // 2. Wait for readiness (e.g., consensus info endpoint responds)
        // 3. Build NodeClients for validators/executors
        // 4. Spawn a block feed for expectations (optional but recommended)
        // 5. Create NodeControlHandle if you support restarts (optional)
        // 6. Return a Runner wrapping RunContext + CleanupGuard

        tracing::info!("deploying scenario with MyDeployer");

        let topology: Option<Topology> = None; // Some(topology) if you spawned one
        let node_clients = NodeClients::default(); // Or NodeClients::from_topology(...)

        let client = node_clients
            .any_client()
            .ok_or("no api clients available")?
            .clone();
        let (block_feed, block_feed_guard) = spawn_block_feed(client).await?;

        let telemetry = Metrics::empty(); // or Metrics::from_prometheus(...)
        let node_control = None; // or Some(Arc<dyn NodeControlHandle>)

        let context = RunContext::new(
            scenario.topology().clone(),
            topology,
            node_clients,
            scenario.duration(),
            telemetry,
            block_feed,
            node_control,
        );

        // If you also have other resources to clean up (containers/pods/etc),
        // wrap them in your own CleanupGuard implementation and call
        // CleanupGuard::cleanup(Box::new(block_feed_guard)) inside it.
        Ok(Runner::new(context, Some(Box::new(block_feed_guard))))
    }
}

Key points:

  • deploy() must return a fully prepared Runner
  • Block until nodes are ready before returning (avoid false negatives)
  • Use a CleanupGuard to tear down resources on failure (and on RunHandle drop)
  • If you want chaos workloads, also provide a NodeControlHandle via RunContext

Adding Topology Helpers

Steps:

  1. Extend testing_framework_core::topology::config::TopologyBuilder with new layouts
  2. Keep defaults safe: ensure at least one participant, clamp dispersal factors
  3. Consider adding configuration presets for specialized parameters

Example:

use testing_framework_core::topology::{
    config::TopologyBuilder,
    configs::network::Libp2pNetworkLayout,
};

pub trait TopologyBuilderExt {
    fn network_full(self) -> Self;
}

impl TopologyBuilderExt for TopologyBuilder {
    fn network_full(self) -> Self {
        self.with_network_layout(Libp2pNetworkLayout::Full)
    }
}

Key points:

  • Maintain method chaining (consume and return Self, as the example above does)
  • Validate inputs: clamp factors, enforce minimums
  • Document assumptions (e.g., “requires at least 4 nodes”)

Adding a DSL Helper

To expose your custom workload through the high-level DSL, add a trait extension:

use async_trait::async_trait;
use testing_framework_core::scenario::{DynError, RunContext, ScenarioBuilder, Workload};

#[derive(Default)]
pub struct MyWorkloadBuilder {
    target_rate: u64,
    some_option: bool,
}

impl MyWorkloadBuilder {
    pub const fn target_rate(mut self, target_rate: u64) -> Self {
        self.target_rate = target_rate;
        self
    }

    pub const fn some_option(mut self, some_option: bool) -> Self {
        self.some_option = some_option;
        self
    }

    pub const fn build(self) -> MyWorkload {
        MyWorkload {
            target_rate: self.target_rate,
            some_option: self.some_option,
        }
    }
}

pub struct MyWorkload {
    target_rate: u64,
    some_option: bool,
}

#[async_trait]
impl Workload for MyWorkload {
    fn name(&self) -> &str {
        "my_workload"
    }

    async fn start(&self, _ctx: &RunContext) -> Result<(), DynError> {
        Ok(())
    }
}

pub trait MyWorkloadDsl {
    fn my_workload_with(
        self,
        f: impl FnOnce(MyWorkloadBuilder) -> MyWorkloadBuilder,
    ) -> Self;
}

impl MyWorkloadDsl for ScenarioBuilder {
    fn my_workload_with(
        self,
        f: impl FnOnce(MyWorkloadBuilder) -> MyWorkloadBuilder,
    ) -> Self {
        let builder = f(MyWorkloadBuilder::default());
        self.with_workload(builder.build())
    }
}

Users can then call:

ScenarioBuilder::topology_with(|t| t.network_star().validators(1).executors(1))
    .my_workload_with(|w| {
        w.target_rate(10)
         .some_option(true)
    })
    .build()

See Also

Example: New Workload & Expectation (Rust)

A minimal, end-to-end illustration of adding a custom workload and matching expectation. This shows the shape of the traits and where to plug into the framework; expand the logic to fit your real test.

Workload: simple reachability probe

Key ideas:

  • name: identifies the workload in logs.
  • expectations: workloads can bundle defaults so callers don’t forget checks.
  • init: derive inputs from the generated topology (e.g., pick a target node).
  • start: drive async activity using the shared RunContext.

use async_trait::async_trait;
use testing_framework_core::{
    scenario::{DynError, Expectation, RunContext, RunMetrics, Workload},
    topology::generation::GeneratedTopology,
};

pub struct ReachabilityWorkload {
    target_idx: usize,
}

impl ReachabilityWorkload {
    pub fn new(target_idx: usize) -> Self {
        Self { target_idx }
    }
}

#[async_trait]
impl Workload for ReachabilityWorkload {
    fn name(&self) -> &str {
        "reachability_workload"
    }

    fn expectations(&self) -> Vec<Box<dyn Expectation>> {
        vec![Box::new(
            crate::custom_workload_example_expectation::ReachabilityExpectation::new(
                self.target_idx,
            ),
        )]
    }

    fn init(
        &mut self,
        topology: &GeneratedTopology,
        _run_metrics: &RunMetrics,
    ) -> Result<(), DynError> {
        if topology.validators().get(self.target_idx).is_none() {
            return Err(Box::new(std::io::Error::new(
                std::io::ErrorKind::Other,
                "no validator at requested index",
            )));
        }
        Ok(())
    }

    async fn start(&self, ctx: &RunContext) -> Result<(), DynError> {
        let client = ctx
            .node_clients()
            .validator_clients()
            .get(self.target_idx)
            .ok_or_else(|| {
                Box::new(std::io::Error::new(
                    std::io::ErrorKind::Other,
                    "missing target client",
                )) as DynError
            })?;

        // Lightweight API call to prove reachability.
        client
            .consensus_info()
            .await
            .map(|_| ())
            .map_err(|e| e.into())
    }
}

Expectation: confirm the target stayed reachable

Key ideas:

  • start_capture: snapshot baseline if needed (not used here).
  • evaluate: assert the condition after workloads finish.

use async_trait::async_trait;
use testing_framework_core::scenario::{DynError, Expectation, RunContext};

pub struct ReachabilityExpectation {
    target_idx: usize,
}

impl ReachabilityExpectation {
    pub fn new(target_idx: usize) -> Self {
        Self { target_idx }
    }
}

#[async_trait]
impl Expectation for ReachabilityExpectation {
    fn name(&self) -> &str {
        "target_reachable"
    }

    async fn evaluate(&mut self, ctx: &RunContext) -> Result<(), DynError> {
        let client = ctx
            .node_clients()
            .validator_clients()
            .get(self.target_idx)
            .ok_or_else(|| {
                Box::new(std::io::Error::new(
                    std::io::ErrorKind::Other,
                    "missing target client",
                )) as DynError
            })?;

        client
            .consensus_info()
            .await
            .map(|_| ())
            .map_err(|e| e.into())
    }
}

How to wire it

  • Build your scenario as usual and call .with_workload(ReachabilityWorkload::new(0)); a wiring sketch follows this list.
  • The bundled expectation is attached automatically; you can add more with .with_expectation(...) if needed.
  • Keep the logic minimal and fast for smoke tests; grow it into richer probes for deeper scenarios.
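
A minimal wiring sketch using the same builder calls shown elsewhere in this book. The reachability_plan name, the one-validator topology, and the Scenario<()> return type (matching the Deployer<()> signature in Extending the Framework) are illustrative assumptions:

use std::time::Duration;

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

// Assumes ReachabilityWorkload from the example above is in scope.
pub fn reachability_plan() -> testing_framework_core::scenario::Scenario<()> {
    ScenarioBuilder::topology_with(|t| t.network_star().validators(1).executors(0))
        // The bundled ReachabilityExpectation is attached automatically.
        .with_workload(ReachabilityWorkload::new(0))
        .expect_consensus_liveness()
        .with_run_duration(Duration::from_secs(60))
        .build()
}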

Internal Crate Reference

High-level roles of the crates that make up the framework:

  • Configs (testing-framework/configs/): Prepares reusable configuration primitives for nodes, networking, tracing, data availability, and wallets, shared by all scenarios and runners. Includes topology generation and circuit asset resolution.

  • Core scenario orchestration (testing-framework/core/): Houses the topology and scenario model, runtime coordination, node clients, and readiness/health probes. Defines Deployer and Runner traits, ScenarioBuilder, and RunContext.

  • Workflows (testing-framework/workflows/): Packages workloads (transaction, DA, chaos) and expectations (consensus liveness) into reusable building blocks. Offers fluent DSL extensions (ScenarioBuilderExt, ChaosBuilderExt).

  • Runners (testing-framework/runners/{local,compose,k8s}/): Implements deployment backends (local host, Docker Compose, Kubernetes) that all consume the same scenario plan. Each provides a Deployer implementation (LocalDeployer, ComposeDeployer, K8sDeployer).

  • Runner Examples (crate name: runner-examples, path: examples/): Runnable binaries demonstrating framework usage and serving as living documentation. These are the primary entry point for running scenarios (examples/src/bin/local_runner.rs, examples/src/bin/compose_runner.rs, examples/src/bin/k8s_runner.rs).

Where to Add New Capabilities

| What You're Adding | Where It Goes | Examples |
| --- | --- | --- |
| Node config parameter | testing-framework/configs/src/topology/configs/ | Slot duration, log levels, DA params |
| Topology feature | testing-framework/core/src/topology/ | New network layouts, node roles |
| Scenario capability | testing-framework/core/src/scenario/ | New capabilities, context methods |
| Workload | testing-framework/workflows/src/workloads/ | New traffic generators |
| Expectation | testing-framework/workflows/src/expectations/ | New success criteria |
| Builder API | testing-framework/workflows/src/builder/ | DSL extensions, fluent methods |
| Deployer | testing-framework/runners/ | New deployment backends |
| Example scenario | examples/src/bin/ | Demonstration binaries |

Extension Workflow

Adding a New Workload

  1. Define the workload in testing-framework/workflows/src/workloads/your_workload.rs:
use async_trait::async_trait;
use testing_framework_core::scenario::{DynError, RunContext, Workload};

pub struct YourWorkload;

#[async_trait]
impl Workload for YourWorkload {
    fn name(&self) -> &'static str {
        "your_workload"
    }

    async fn start(&self, _ctx: &RunContext) -> Result<(), DynError> {
        // implementation
        Ok(())
    }
}

  2. Add builder extension in testing-framework/workflows/src/builder/mod.rs:
pub struct YourWorkloadBuilder;

impl YourWorkloadBuilder {
    pub fn some_config(self) -> Self {
        self
    }
}

pub trait ScenarioBuilderExt: Sized {
    fn your_workload(self) -> YourWorkloadBuilder;
}

  3. Use in examples in examples/src/bin/your_scenario.rs:
use testing_framework_core::scenario::ScenarioBuilder;

pub struct YourWorkloadBuilder;

impl YourWorkloadBuilder {
    pub fn some_config(self) -> Self {
        self
    }
}

pub trait YourWorkloadDslExt: Sized {
    fn your_workload_with<F>(self, configurator: F) -> Self
    where
        F: FnOnce(YourWorkloadBuilder) -> YourWorkloadBuilder;
}

impl<Caps> YourWorkloadDslExt for testing_framework_core::scenario::Builder<Caps> {
    fn your_workload_with<F>(self, configurator: F) -> Self
    where
        F: FnOnce(YourWorkloadBuilder) -> YourWorkloadBuilder,
    {
        let _ = configurator(YourWorkloadBuilder);
        self
    }
}

pub fn use_in_examples() {
    let _plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(0))
        .your_workload_with(|w| w.some_config())
        .build();
}

Adding a New Expectation

  1. Define the expectation in testing-framework/workflows/src/expectations/your_expectation.rs:
use async_trait::async_trait;
use testing_framework_core::scenario::{DynError, Expectation, RunContext};

pub struct YourExpectation;

#[async_trait]
impl Expectation for YourExpectation {
    fn name(&self) -> &'static str {
        "your_expectation"
    }

    async fn evaluate(&mut self, _ctx: &RunContext) -> Result<(), DynError> {
        // implementation
        Ok(())
    }
}

  2. Add builder extension in testing-framework/workflows/src/builder/mod.rs:
use testing_framework_core::scenario::ScenarioBuilder;

pub trait YourExpectationDslExt: Sized {
    fn expect_your_condition(self) -> Self;
}

impl<Caps> YourExpectationDslExt for testing_framework_core::scenario::Builder<Caps> {
    fn expect_your_condition(self) -> Self {
        self
    }
}

pub fn use_in_examples() {
    let _plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(0))
        .expect_your_condition()
        .build();
}

Adding a New Deployer

  1. Implement Deployer trait in testing-framework/runners/your_runner/src/deployer.rs:
use async_trait::async_trait;
use testing_framework_core::scenario::{Deployer, Runner, Scenario};

#[derive(Debug)]
pub struct YourError;

pub struct YourDeployer;

#[async_trait]
impl Deployer for YourDeployer {
    type Error = YourError;

    async fn deploy(&self, _scenario: &Scenario<()>) -> Result<Runner, Self::Error> {
        // Provision infrastructure
        // Wait for readiness
        // Return Runner
        todo!()
    }
}

  2. Provide cleanup and handle node control if supported.

  3. Add example in examples/src/bin/your_runner.rs.

For detailed examples, see Extending the Framework and Custom Workload Example.

Part IV — Operations & Deployment

This section covers operational aspects of running the testing framework: prerequisites, deployment configuration, continuous integration, and observability.

What You’ll Learn

  • Prerequisites & Setup: Required files, binaries, circuit assets, and environment configuration
  • Running Examples: How to execute scenarios across host, compose, and k8s runners
  • CI Integration: Automating tests in continuous integration pipelines with caching and matrix testing
  • Environment Variables: Complete reference of all configuration variables
  • Logging & Observability: Log collection strategies, metrics integration, and debugging techniques

Who This Section Is For

  • Operators setting up the framework for the first time
  • DevOps Engineers integrating tests into CI/CD pipelines
  • Developers debugging test failures or performance issues
  • Platform Engineers deploying across different environments (local, Docker, Kubernetes)

This section is organized for progressive depth:

  1. Start with Operations Overview for the big picture
  2. Follow Prerequisites & Setup to prepare your environment
  3. Use Running Examples to execute your first scenarios
  4. Integrate with CI Integration for automated testing
  5. Reference Environment Variables for complete configuration options
  6. Debug with Logging & Observability when issues arise

Key Principles

Operational Hygiene: Assets present, prerequisites satisfied, observability reachable

Environment Fit: Choose the right deployment target based on isolation, reproducibility, and resource needs

Clear Signals: Verify runners report node readiness before starting workloads

Failure Triage: Map failures to specific causes—missing prerequisites, platform issues, or unmet expectations


Ready to get started? Begin with Operations Overview

Operations & Deployment Overview

Operational readiness focuses on prerequisites, environment fit, and clear signals that ensure your test scenarios run reliably across different deployment targets.

Core Principles

  • Prerequisites First: Ensure all required files, binaries, and assets are in place before attempting to run scenarios
  • Environment Fit: Choose the right deployment target (host, compose, k8s) based on your isolation, reproducibility, and resource needs
  • Clear Signals: Verify runners report node readiness before starting workloads to avoid false negatives
  • Failure Triage: Map failures to specific causes—missing prerequisites, platform issues, or unmet expectations

Key Operational Concerns

Prerequisites:

  • versions.env file at repository root (required by helper scripts)
  • Node binaries (nomos-node, nomos-executor) available or built on demand
  • Platform requirements met (Docker for compose, cluster access for k8s)
  • Circuit assets for DA workloads

Artifacts:

  • KZG parameters (circuit assets) for Data Availability scenarios
  • Docker images for compose/k8s deployments
  • Binary bundles for reproducible builds

Environment Configuration:

  • POL_PROOF_DEV_MODE=true is REQUIRED for all runners to avoid expensive proof generation
  • Logging configured via NOMOS_LOG_* variables
  • Observability endpoints (Prometheus, Grafana) optional but useful

Readiness & Health:

  • Runners verify node readiness before starting workloads
  • Health checks prevent premature workload execution
  • Consensus liveness expectations validate basic operation

Runner-Agnostic Design

The framework is intentionally runner-agnostic: the same scenario plan runs across all deployment targets. Understanding which operational concerns apply to each runner helps you choose the right fit.

| Concern | Host | Compose | Kubernetes |
| --- | --- | --- | --- |
| Topology | Full support | Full support | Full support |
| Workloads | All workloads | All workloads | All workloads |
| Expectations | All expectations | All expectations | All expectations |
| Chaos / Node Control | Not supported | Supported | Not yet |
| Metrics / Observability | Manual setup | External stack | Cluster-wide |
| Log Collection | Temp files | Container logs | Pod logs |
| Isolation | Process-level | Container | Pod + namespace |
| Setup Time | < 1 min | 2-5 min | 5-10 min |
| CI Recommended? | Smoke tests | Primary | Large-scale only |

Key insight: Operational concerns (prerequisites, environment variables) are largely consistent across runners, while deployment-specific concerns (isolation, chaos support) vary by backend.

Operational Workflow

flowchart LR
    Setup[Prerequisites & Setup] --> Run[Run Scenarios]
    Run --> Monitor[Monitor & Observe]
    Monitor --> Debug{Success?}
    Debug -->|No| Triage[Failure Triage]
    Triage --> Setup
    Debug -->|Yes| Done[Complete]
  1. Setup: Verify prerequisites, configure environment, prepare assets
  2. Run: Execute scenarios using appropriate runner (host/compose/k8s)
  3. Monitor: Collect logs, metrics, and observability signals
  4. Triage: When failures occur, map to root causes and fix prerequisites

Documentation Structure

This Operations & Deployment section covers:

Philosophy: Treat operational hygiene—assets present, prerequisites satisfied, observability reachable—as the first step to reliable scenario outcomes.

Prerequisites & Setup

This page covers everything you need before running your first scenario.

Required Files

versions.env (Required)

All helper scripts require a versions.env file at the repository root:

VERSION=v0.3.1
NOMOS_NODE_REV=abc123def456789
NOMOS_BUNDLE_VERSION=v1

What it defines:

  • VERSION — Circuit release tag for KZG parameters
  • NOMOS_NODE_REV — Git revision of nomos-node to build/fetch
  • NOMOS_BUNDLE_VERSION — Bundle schema version

Where it’s used:

  • scripts/run/run-examples.sh
  • scripts/build/build-bundle.sh
  • scripts/setup/setup-nomos-circuits.sh
  • CI workflows

Error if missing:

ERROR: versions.env not found at repository root
This file is required and should define:
  VERSION=<circuit release tag>
  NOMOS_NODE_REV=<nomos-node git revision>
  NOMOS_BUNDLE_VERSION=<bundle schema version>

Fix: Ensure you’re in the repository root. The file should already exist in the checked-out repo.

Node Binaries

Scenarios need compiled nomos-node and nomos-executor binaries.

Option 1: Helper Script (Recommended)

scripts/run/run-examples.sh -t 60 -v 3 -e 1 host

This automatically:

  • Clones/updates nomos-node checkout
  • Builds required binaries
  • Sets NOMOS_NODE_BIN / NOMOS_EXECUTOR_BIN

Option 2: Manual Build

If you have a sibling nomos-node checkout:

cd ../nomos-node
cargo build --release --bin nomos-node --bin nomos-executor

# Set environment variables
export NOMOS_NODE_BIN=$PWD/target/release/nomos-node
export NOMOS_EXECUTOR_BIN=$PWD/target/release/nomos-executor

# Return to testing framework
cd ../nomos-testing

Option 3: Prebuilt Bundles (CI)

CI workflows use prebuilt artifacts:

- name: Download nomos binaries
  uses: actions/download-artifact@v3
  with:
    name: nomos-binaries-linux
    path: .tmp/

- name: Extract bundle
  run: |
    tar -xzf .tmp/nomos-binaries-linux-*.tar.gz -C .tmp/
    export NOMOS_NODE_BIN=$PWD/.tmp/nomos-node
    export NOMOS_EXECUTOR_BIN=$PWD/.tmp/nomos-executor

Circuit Assets (KZG Parameters)

Data Availability (DA) workloads require KZG cryptographic parameters.

Asset Location

Default path: testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params

Note: The directory kzgrs_test_params/ contains a file named kzgrs_test_params. This is the proving key file (~120MB).

Container path (compose/k8s): /kzgrs_test_params/kzgrs_test_params

Getting Assets

Option 1: Use helper script (recommended):

# Fetch circuits
scripts/setup/setup-nomos-circuits.sh v0.3.1 /tmp/nomos-circuits

# Copy to default location
mkdir -p testing-framework/assets/stack/kzgrs_test_params
cp -r /tmp/nomos-circuits/* testing-framework/assets/stack/kzgrs_test_params/

# Verify (should be ~120MB)
ls -lh testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params

Option 2: Let run-examples.sh handle it:

scripts/run/run-examples.sh -t 60 -v 3 -e 1 host

This automatically fetches and places assets.

Override Path

Set NOMOS_KZGRS_PARAMS_PATH to use a custom location:

NOMOS_KZGRS_PARAMS_PATH=/custom/path/to/kzgrs_test_params \
cargo run -p runner-examples --bin local_runner

When Are Assets Needed?

| Runner | When Required |
| --- | --- |
| Host (local) | Always (for DA workloads) |
| Compose | During image build (baked into image) |
| K8s | During image build + mounted via hostPath |

Error without assets:

Error: Custom { kind: NotFound, error: "Circuit file not found at: testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params" }

Platform Requirements

Host Runner (Local Processes)

Requires:

  • Rust nightly toolchain
  • Node binaries built
  • KZG circuit assets (for DA workloads)
  • Available ports (18080+, 3100+, etc.)

No Docker required.

Best for:

  • Quick iteration
  • Development
  • Smoke tests

Compose Runner (Docker Compose)

Requires:

  • Docker daemon running
  • Docker image built: logos-blockchain-testing:local
  • KZG assets baked into image
  • Docker Desktop (macOS) or Docker Engine (Linux)

Platform notes (macOS / Apple silicon):

  • Prefer NOMOS_BUNDLE_DOCKER_PLATFORM=linux/arm64 for native performance
  • Use linux/amd64 only if targeting amd64 environments (slower via emulation)

Best for:

  • Reproducible environments
  • CI testing
  • Chaos workloads (node control support)

K8s Runner (Kubernetes)

Requires:

  • Kubernetes cluster (Docker Desktop K8s, minikube, kind, or remote)
  • kubectl configured
  • Docker image built and loaded/pushed
  • KZG assets baked into image + mounted via hostPath

Local cluster setup:

# Docker Desktop: Enable Kubernetes in settings

# OR: Use kind
kind create cluster
kind load docker-image logos-blockchain-testing:local

# OR: Use minikube
minikube start
minikube image load logos-blockchain-testing:local

Remote cluster: Push image to registry and set NOMOS_TESTNET_IMAGE.

Best for:

  • Production-like testing
  • Resource isolation
  • Large topologies

Critical Environment Variable

POL_PROOF_DEV_MODE=true is REQUIRED for ALL runners!

Without this, proof generation uses expensive Groth16 proving, causing:

  • Tests “hang” for minutes
  • CPU spikes to 100%
  • Timeouts and failures

Always set:

POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin local_runner
POL_PROOF_DEV_MODE=true scripts/run/run-examples.sh -t 60 -v 3 -e 1 compose
# etc.

Or add to your shell profile:

# ~/.bashrc or ~/.zshrc
export POL_PROOF_DEV_MODE=true

Quick Setup Check

Run this checklist before your first scenario:

# 1. Verify versions.env exists
cat versions.env

# 2. Check circuit assets (for DA workloads)
ls -lh testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params

# 3. Verify POL_PROOF_DEV_MODE is set
echo $POL_PROOF_DEV_MODE  # Should print: true

# 4. For compose/k8s: verify Docker is running
docker ps

# 5. For compose/k8s: verify image exists
docker images | grep logos-blockchain-testing

# 6. For host runner: verify node binaries (if not using scripts)
$NOMOS_NODE_BIN --version
$NOMOS_EXECUTOR_BIN --version

The easiest path is to let the helper scripts handle everything:

# Host runner
scripts/run/run-examples.sh -t 60 -v 3 -e 1 host

# Compose runner
scripts/run/run-examples.sh -t 60 -v 3 -e 1 compose

# K8s runner
scripts/run/run-examples.sh -t 60 -v 3 -e 1 k8s

These scripts:

  • Verify versions.env exists
  • Clone/build nomos-node if needed
  • Fetch circuit assets if missing
  • Build Docker images (compose/k8s)
  • Load images into cluster (k8s)
  • Run the scenario with proper environment

Next Steps:

Running Examples

The framework provides three runner modes: host (local processes), compose (Docker Compose), and k8s (Kubernetes).

Use scripts/run/run-examples.sh for all modes—it handles all setup automatically:

# Host mode (local processes)
scripts/run/run-examples.sh -t 60 -v 3 -e 1 host

# Compose mode (Docker Compose)
scripts/run/run-examples.sh -t 60 -v 3 -e 1 compose

# K8s mode (Kubernetes)
scripts/run/run-examples.sh -t 60 -v 3 -e 1 k8s

Parameters:

  • -t 60 — Run duration in seconds
  • -v 3 — Number of validators
  • -e 1 — Number of executors
  • host|compose|k8s — Deployment mode

This script handles:

  • Circuit asset setup
  • Binary building/bundling
  • Image building (compose/k8s)
  • Image loading into cluster (k8s)
  • Execution with proper environment

Note: For k8s runs against non-local clusters (e.g. EKS), the cluster pulls images from a registry. In that case, build + push your image separately (see scripts/build/build_test_image.sh) and set NOMOS_TESTNET_IMAGE to the pushed reference.

Quick Smoke Matrix

For a small “does everything still run?” matrix across all runners:

scripts/run/run-test-matrix.sh -t 120 -v 1 -e 1

This runs host, compose, and k8s modes with various image-build configurations. Useful after making runner/image/script changes. Forwards --metrics-* options through to scripts/run/run-examples.sh.

Common options:

  • --modes host,compose,k8s — Restrict which modes run
  • --no-clean — Skip scripts/ops/clean.sh step
  • --no-bundles — Skip scripts/build/build-bundle.sh (reuses existing .tmp tarballs)
  • --no-image-build — Skip the “rebuild image” variants in the matrix (compose/k8s)
  • --allow-nonzero-progress — Soft-pass expectation failures if logs show non-zero progress (local iteration only)
  • --force-k8s-image-build — Allow the k8s image-build variant even on non-docker-desktop clusters

Environment overrides:

  • VERSION=v0.3.1 — Circuit version
  • NOMOS_NODE_REV=<commit> — nomos-node git revision
  • NOMOS_BINARIES_TAR=path/to/bundle.tar.gz — Use prebuilt bundle
  • NOMOS_SKIP_IMAGE_BUILD=1 — Skip image rebuild inside run-examples.sh (compose/k8s)
  • NOMOS_BUNDLE_DOCKER_PLATFORM=linux/arm64|linux/amd64 — Docker platform for bundle builds (macOS/Windows)
  • COMPOSE_CIRCUITS_PLATFORM=linux-aarch64|linux-x86_64 — Circuits platform for image builds
  • SLOW_TEST_ENV=true — Doubles built-in readiness timeouts (useful in CI / constrained laptops)
  • TESTNET_PRINT_ENDPOINTS=1 — Print TESTNET_ENDPOINTS / TESTNET_PPROF lines during deploy

Dev Workflow: Updating nomos-node Revision

The repo pins a nomos-node revision in versions.env for reproducible builds. To update it or point to a local checkout:

# Pin to a new git revision (updates versions.env + Cargo.toml git revs)
scripts/ops/update-nomos-rev.sh --rev <git_sha>

# Use a local nomos-node checkout instead (for development)
scripts/ops/update-nomos-rev.sh --path /path/to/nomos-node

# If Cargo.toml was marked skip-worktree, clear it
scripts/ops/update-nomos-rev.sh --unskip-worktree

Notes:

  • Don’t commit absolute NOMOS_NODE_PATH values; prefer --rev for shared history/CI
  • After changing rev/path, expect Cargo.lock to update on the next cargo build/cargo test

Cleanup Helper

If you hit Docker build failures, I/O errors, or disk space issues:

scripts/ops/clean.sh

For extra Docker cache cleanup:

scripts/ops/clean.sh --docker

Host Runner (Direct Cargo Run)

For manual control, run the local_runner binary directly:

POL_PROOF_DEV_MODE=true \
NOMOS_NODE_BIN=/path/to/nomos-node \
NOMOS_EXECUTOR_BIN=/path/to/nomos-executor \
cargo run -p runner-examples --bin local_runner

Host Runner Environment Variables

| Variable | Default | Effect |
| --- | --- | --- |
| NOMOS_DEMO_VALIDATORS | 1 | Number of validators (legacy: LOCAL_DEMO_VALIDATORS) |
| NOMOS_DEMO_EXECUTORS | 1 | Number of executors (legacy: LOCAL_DEMO_EXECUTORS) |
| NOMOS_DEMO_RUN_SECS | 60 | Run duration in seconds (legacy: LOCAL_DEMO_RUN_SECS) |
| NOMOS_NODE_BIN | | Path to nomos-node binary (required) |
| NOMOS_EXECUTOR_BIN | | Path to nomos-executor binary (required) |
| NOMOS_LOG_DIR | None | Directory for per-node log files |
| NOMOS_TESTS_KEEP_LOGS | 0 | Keep per-run temporary directories (useful for debugging/CI) |
| NOMOS_TESTS_TRACING | false | Enable debug tracing preset |
| NOMOS_LOG_LEVEL | info | Global log level: error, warn, info, debug, trace |
| NOMOS_LOG_FILTER | None | Fine-grained module filtering (e.g., cryptarchia=trace,nomos_da_sampling=debug) |
| POL_PROOF_DEV_MODE | | REQUIRED: Set to true for all runners |

Note: Requires circuit assets and host binaries. Use scripts/run/run-examples.sh host to handle setup automatically.


Compose Runner (Direct Cargo Run)

For manual control, run the compose_runner binary directly. Compose requires a Docker image with embedded assets.

Option 1: Build with Bundle (Recommended)

# 1. Build a Linux bundle (includes binaries + circuits)
scripts/build/build-bundle.sh --platform linux
# Creates .tmp/nomos-binaries-linux-v0.3.1.tar.gz

# 2. Build image (embeds bundle assets)
export NOMOS_BINARIES_TAR=.tmp/nomos-binaries-linux-v0.3.1.tar.gz
scripts/build/build_test_image.sh

# 3. Run
NOMOS_TESTNET_IMAGE=logos-blockchain-testing:local \
POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin compose_runner

Option 2: Manual Circuit/Image Setup

# Fetch and copy circuits
scripts/setup/setup-nomos-circuits.sh v0.3.1 /tmp/nomos-circuits
cp -r /tmp/nomos-circuits/* testing-framework/assets/stack/kzgrs_test_params/

# Build image
scripts/build/build_test_image.sh

# Run
NOMOS_TESTNET_IMAGE=logos-blockchain-testing:local \
POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin compose_runner

Platform Note (macOS / Apple Silicon)

  • Docker Desktop runs a linux/arm64 engine by default
  • For native performance: NOMOS_BUNDLE_DOCKER_PLATFORM=linux/arm64 (recommended for local testing)
  • For amd64 targets: NOMOS_BUNDLE_DOCKER_PLATFORM=linux/amd64 (slower via emulation)

Compose Runner Environment Variables

| Variable | Default | Effect |
|----------|---------|--------|
| NOMOS_TESTNET_IMAGE | | Image tag (required, must match built image) |
| POL_PROOF_DEV_MODE | | REQUIRED: Set to true for all runners |
| NOMOS_DEMO_VALIDATORS | 1 | Number of validators |
| NOMOS_DEMO_EXECUTORS | 1 | Number of executors |
| NOMOS_DEMO_RUN_SECS | 60 | Run duration in seconds |
| COMPOSE_NODE_PAIRS | | Alternative topology format: “validators×executors” (e.g., 3x2) |
| NOMOS_METRICS_QUERY_URL | None | Prometheus-compatible base URL for runner to query |
| NOMOS_METRICS_OTLP_INGEST_URL | None | Full OTLP HTTP ingest URL for node metrics export |
| NOMOS_GRAFANA_URL | None | Grafana base URL for printing/logging |
| COMPOSE_RUNNER_HOST | 127.0.0.1 | Host address for port mappings |
| COMPOSE_RUNNER_PRESERVE | 0 | Keep containers running after test |
| NOMOS_LOG_LEVEL | info | Node log level (stdout/stderr) |
| NOMOS_LOG_FILTER | None | Fine-grained module filtering |
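
For example, COMPOSE_NODE_PAIRS offers a compact way to set the topology when invoking the compose example directly (a sketch reusing the run command above):

COMPOSE_NODE_PAIRS=3x2 \
NOMOS_TESTNET_IMAGE=logos-blockchain-testing:local \
POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin compose_runner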

Config file option: testing-framework/assets/stack/cfgsync.yaml (tracing_settings.logger) — Switch node logs between stdout/stderr and file output

Compose-Specific Features

  • Node control support: Only runner that supports chaos testing (.enable_node_control() + chaos workloads)
  • External observability: Set NOMOS_METRICS_* / NOMOS_GRAFANA_URL to enable telemetry links and querying
    • Quickstart: scripts/setup/setup-observability.sh compose up then scripts/setup/setup-observability.sh compose env

Important:

  • Containers expect KZG parameters at /kzgrs_test_params/kzgrs_test_params (note the repeated filename)
  • Use scripts/run/run-examples.sh compose to handle all setup automatically

K8s Runner (Direct Cargo Run)

For manual control, run the k8s_runner binary directly. K8s requires the same image setup as Compose.

Prerequisites

  1. Kubernetes cluster with kubectl configured
  2. Test image built (same as Compose, preferably with prebuilt bundle)
  3. Image available in cluster (loaded or pushed to registry)

Build and Load Image

# 1. Build image with bundle (recommended)
scripts/build/build-bundle.sh --platform linux
export NOMOS_BINARIES_TAR=.tmp/nomos-binaries-linux-v0.3.1.tar.gz
scripts/build/build_test_image.sh

# 2. Load into cluster (choose one)
export NOMOS_TESTNET_IMAGE=logos-blockchain-testing:local

# For kind:
kind load docker-image logos-blockchain-testing:local

# For minikube:
minikube image load logos-blockchain-testing:local

# For remote cluster (push to registry):
docker tag logos-blockchain-testing:local your-registry/logos-blockchain-testing:latest
docker push your-registry/logos-blockchain-testing:latest
export NOMOS_TESTNET_IMAGE=your-registry/logos-blockchain-testing:latest

Run the Example

export NOMOS_TESTNET_IMAGE=logos-blockchain-testing:local
export POL_PROOF_DEV_MODE=true
cargo run -p runner-examples --bin k8s_runner

K8s Runner Environment Variables

| Variable | Default | Effect |
|----------|---------|--------|
| NOMOS_TESTNET_IMAGE | | Image tag (required) |
| POL_PROOF_DEV_MODE | | REQUIRED: Set to true for all runners |
| NOMOS_DEMO_VALIDATORS | 1 | Number of validators |
| NOMOS_DEMO_EXECUTORS | 1 | Number of executors |
| NOMOS_DEMO_RUN_SECS | 60 | Run duration in seconds |
| NOMOS_METRICS_QUERY_URL | None | Prometheus-compatible base URL for runner to query (PromQL) |
| NOMOS_METRICS_OTLP_INGEST_URL | None | Full OTLP HTTP ingest URL for node metrics export |
| NOMOS_GRAFANA_URL | None | Grafana base URL for printing/logging |
| K8S_RUNNER_NAMESPACE | Random | Kubernetes namespace (pin for debugging) |
| K8S_RUNNER_RELEASE | Random | Helm release name (pin for debugging) |
| K8S_RUNNER_NODE_HOST | | NodePort host resolution for non-local clusters |
| K8S_RUNNER_DEBUG | 0 | Log Helm stdout/stderr for install commands |
| K8S_RUNNER_PRESERVE | 0 | Keep namespace/release after run (for debugging) |

K8s + Observability (Optional)

export NOMOS_METRICS_QUERY_URL=http://your-prometheus:9090
# Prometheus OTLP receiver example:
export NOMOS_METRICS_OTLP_INGEST_URL=http://your-prometheus:9090/api/v1/otlp/v1/metrics
# Optional: print Grafana link in TESTNET_ENDPOINTS
export NOMOS_GRAFANA_URL=http://your-grafana:3000
cargo run -p runner-examples --bin k8s_runner

Notes:

  • NOMOS_METRICS_QUERY_URL must be reachable from the runner process (often via kubectl port-forward)
  • NOMOS_METRICS_OTLP_INGEST_URL must be reachable from nodes (pods/containers) and is backend-specific
    • Quickstart installer: scripts/setup/setup-observability.sh k8s install then scripts/setup/setup-observability.sh k8s env
    • Optional dashboards: scripts/setup/setup-observability.sh k8s dashboards

Alternatively, pass the URLs to the helper script directly:

scripts/run/run-examples.sh -t 60 -v 3 -e 1 k8s \
  --metrics-query-url http://your-prometheus:9090 \
  --metrics-otlp-ingest-url http://your-prometheus:9090/api/v1/otlp/v1/metrics

In Code (Optional)

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ObservabilityBuilderExt as _;

let plan = ScenarioBuilder::with_node_counts(1, 1)
    .with_metrics_query_url_str("http://your-prometheus:9090")
    .with_metrics_otlp_ingest_url_str("http://your-prometheus:9090/api/v1/otlp/v1/metrics")
    .build();

Important K8s Notes

  • K8s runner mounts testing-framework/assets/stack/kzgrs_test_params as a hostPath volume
  • File path inside pods: /kzgrs_test_params/kzgrs_test_params
  • No node control support yet: Chaos workloads (.enable_node_control()) will fail
  • Optimized for local clusters (Docker Desktop K8s / minikube / kind)
    • Remote clusters require additional setup (registry push, PV/CSI for assets, etc.)
  • Use scripts/run/run-examples.sh k8s to handle all setup automatically

CI Integration

Both LocalDeployer and ComposeDeployer work well in CI environments. Choose based on the trade-offs below.

Runner Comparison for CI

LocalDeployer (Host Runner):

  • Faster startup (no Docker overhead)
  • Good for quick smoke tests
  • Trade-off: Less isolation (processes share host resources)

ComposeDeployer (Recommended for CI):

  • Better isolation (containerized)
  • Reproducible environment
  • Can integrate with external Prometheus/Grafana (optional)
  • Trade-offs: Slower startup (Docker image build), requires Docker daemon

K8sDeployer:

  • Production-like environment
  • Full resource isolation
  • Trade-offs: Slowest (cluster setup + image loading), requires cluster access
  • Best for nightly/weekly runs or production validation

Existing Examples:

See .github/workflows/lint.yml (jobs: host_smoke, compose_smoke) for CI examples running the demo scenarios in this repository.

Complete CI Workflow Example

Here’s a comprehensive GitHub Actions workflow demonstrating host and compose runners with caching, matrix testing, and log collection:

name: Testing Framework CI

on:
  push:
    branches: [main, develop]
  pull_request:
    branches: [main]

env:
  POL_PROOF_DEV_MODE: true
  CARGO_TERM_COLOR: always
  RUST_BACKTRACE: 1

jobs:
  # Quick smoke test with host runner (no Docker)
  host_smoke:
    name: Host Runner Smoke Test
    runs-on: ubuntu-latest
    timeout-minutes: 15
    
    steps:
      - name: Checkout repository
        uses: actions/checkout@v4
      
      - name: Set up Rust toolchain
        uses: actions-rs/toolchain@v1
        with:
          profile: minimal
          toolchain: nightly
          override: true
      
      - name: Cache Rust dependencies
        uses: actions/cache@v3
        with:
          path: |
            ~/.cargo/bin/
            ~/.cargo/registry/index/
            ~/.cargo/registry/cache/
            ~/.cargo/git/db/
            target/
          key: ${{ runner.os }}-cargo-host-${{ hashFiles('**/Cargo.lock') }}
          restore-keys: |
            ${{ runner.os }}-cargo-host-
      
      - name: Cache nomos-node build
        uses: actions/cache@v3
        with:
          path: |
            ../nomos-node/target/release/nomos-node
            ../nomos-node/target/release/nomos-executor
          key: ${{ runner.os }}-nomos-${{ hashFiles('../nomos-node/**/Cargo.lock') }}
          restore-keys: |
            ${{ runner.os }}-nomos-
      
      - name: Run host smoke test
        run: |
          # Use run-examples.sh which handles setup automatically
          scripts/run/run-examples.sh -t 120 -v 3 -e 1 host
      
      - name: Upload logs on failure
        if: failure()
        uses: actions/upload-artifact@v3
        with:
          name: host-runner-logs
          path: |
            .tmp/
            *.log
          retention-days: 7

  # Compose runner matrix (with Docker)
  compose_matrix:
    name: Compose Runner (${{ matrix.topology }})
    runs-on: ubuntu-latest
    timeout-minutes: 25
    
    strategy:
      fail-fast: false
      matrix:
        topology:
          - "3v1e"
          - "5v1e"
    
    steps:
      - name: Checkout repository
        uses: actions/checkout@v4
      
      - name: Set up Rust toolchain
        uses: actions-rs/toolchain@v1
        with:
          profile: minimal
          toolchain: nightly
          override: true
      
      - name: Set up Docker Buildx
        uses: docker/setup-buildx-action@v2
      
      - name: Cache Rust dependencies
        uses: actions/cache@v3
        with:
          path: |
            ~/.cargo/bin/
            ~/.cargo/registry/index/
            ~/.cargo/registry/cache/
            ~/.cargo/git/db/
            target/
          key: ${{ runner.os }}-cargo-compose-${{ hashFiles('**/Cargo.lock') }}
          restore-keys: |
            ${{ runner.os }}-cargo-compose-
      
      - name: Cache Docker layers
        uses: actions/cache@v3
        with:
          path: /tmp/.buildx-cache
          key: ${{ runner.os }}-buildx-${{ hashFiles('Dockerfile', 'scripts/build/build_test_image.sh') }}
          restore-keys: |
            ${{ runner.os }}-buildx-
      
      - name: Run compose test
        env:
          TOPOLOGY: ${{ matrix.topology }}
        run: |
          # Parse "<validators>v<executors>e" (handles multi-digit counts like 10v2e)
          V=${TOPOLOGY%%v*}; E=${TOPOLOGY#*v}; E=${E%e}
          scripts/run/run-examples.sh -t 120 -v "$V" -e "$E" compose
      
      - name: Collect Docker logs on failure
        if: failure()
        run: |
          mkdir -p logs
          for container in $(docker ps -a --filter "name=nomos-compose-" -q); do
            docker logs $container > logs/$(docker inspect --format='{{.Name}}' $container).log 2>&1
          done
      
      - name: Upload logs and artifacts
        if: failure()
        uses: actions/upload-artifact@v3
        with:
          name: compose-${{ matrix.topology }}-logs
          path: |
            logs/
            .tmp/
          retention-days: 7
      
      - name: Clean up Docker resources
        if: always()
        run: |
          docker compose down -v 2>/dev/null || true
          docker ps -a --filter "name=nomos-compose-" -q | xargs -r docker rm -f

  # Cucumber/BDD integration tests (if enabled)
  cucumber_tests:
    name: Cucumber BDD Tests
    runs-on: ubuntu-latest
    timeout-minutes: 20
    
    steps:
      - name: Checkout repository
        uses: actions/checkout@v4
      
      - name: Set up Rust toolchain
        uses: actions-rs/toolchain@v1
        with:
          profile: minimal
          toolchain: nightly
          override: true
      
      - name: Cache dependencies
        uses: actions/cache@v3
        with:
          path: |
            ~/.cargo/bin/
            ~/.cargo/registry/index/
            ~/.cargo/registry/cache/
            ~/.cargo/git/db/
            target/
          key: ${{ runner.os }}-cargo-cucumber-${{ hashFiles('**/Cargo.lock') }}
          restore-keys: |
            ${{ runner.os }}-cargo-cucumber-
      
      - name: Run Cucumber tests
        run: |
          # Build prerequisites
          scripts/build/build-bundle.sh --platform linux
          export NOMOS_BINARIES_TAR=$(ls -t .tmp/nomos-binaries-linux-*.tar.gz | head -1)
          
          # Run Cucumber tests (host runner)
          cargo test -p runner-examples --bin cucumber_host
      
      - name: Upload test report
        if: always()
        uses: actions/upload-artifact@v3
        with:
          name: cucumber-report
          path: |
            target/cucumber-reports/
          retention-days: 14

  # Summary job (requires all tests to pass)
  ci_success:
    name: CI Success
    needs: [host_smoke, compose_matrix, cucumber_tests]
    runs-on: ubuntu-latest
    if: always()
    
    steps:
      - name: Check all jobs
        run: |
          if [[ "${{ needs.host_smoke.result }}" != "success" ]] || \
             [[ "${{ needs.compose_matrix.result }}" != "success" ]] || \
             [[ "${{ needs.cucumber_tests.result }}" != "success" ]]; then
            echo "One or more CI jobs failed"
            exit 1
          fi
          echo "All CI jobs passed!"

Workflow Features

  1. Matrix Testing: Runs compose tests with different topologies (3v1e, 5v1e)
  2. Caching: Caches Rust dependencies, Docker layers, and nomos-node builds for faster runs
  3. Log Collection: Automatically uploads logs and artifacts when tests fail
  4. Timeout Protection: Reasonable timeouts prevent jobs from hanging indefinitely
  5. Cucumber Integration: Shows how to integrate BDD tests into CI
  6. Clean Teardown: Ensures Docker resources are cleaned up even on failure

Customization Points

Topology Matrix:

Add more topologies for comprehensive testing:

matrix:
  topology:
    - "3v1e"
    - "5v1e"
    - "10v2e"  # Larger scale

Timeout Adjustments:

Increase timeout-minutes for longer-running scenarios or slower environments:

timeout-minutes: 30  # Instead of 15

Artifact Retention:

Change retention-days based on your storage needs:

retention-days: 14  # Keep logs for 2 weeks

Conditional Execution:

Run expensive tests only on merge to main:

if: github.event_name == 'push' && github.ref == 'refs/heads/main'

Best Practices

Required: Set POL_PROOF_DEV_MODE

Always set POL_PROOF_DEV_MODE=true globally in your workflow env:

env:
  POL_PROOF_DEV_MODE: true  # REQUIRED!

Without this, tests will hang due to expensive proof generation.

Use Helper Scripts

Prefer scripts/run/run-examples.sh which handles all setup automatically:

scripts/run/run-examples.sh -t 120 -v 3 -e 1 host

This is more reliable than manual cargo run commands.

Cache Aggressively

Cache Rust dependencies, nomos-node builds, and Docker layers to speed up CI:

- name: Cache Rust dependencies
  uses: actions/cache@v3
  with:
    path: |
      ~/.cargo/bin/
      ~/.cargo/registry/index/
      ~/.cargo/registry/cache/
      ~/.cargo/git/db/
      target/
    key: ${{ runner.os }}-cargo-${{ hashFiles('**/Cargo.lock') }}

Collect Logs on Failure

Always upload logs when tests fail for easier debugging:

- name: Upload logs on failure
  if: failure()
  uses: actions/upload-artifact@v3
  with:
    name: test-logs
    path: |
      .tmp/
      *.log
    retention-days: 7

Split Workflows for Faster Iteration

For large projects, split host/compose/k8s into separate workflow files:

  • .github/workflows/test-host.yml — Fast smoke tests
  • .github/workflows/test-compose.yml — Reproducible integration tests
  • .github/workflows/test-k8s.yml — Production-like validation (nightly)

Run K8s Tests Less Frequently

K8s tests are slower. Consider running them only on main branch or scheduled:

on:
  push:
    branches: [main]
  schedule:
    - cron: '0 2 * * *'  # Daily at 2 AM

Platform-Specific Notes

Ubuntu Runners

  • Docker pre-installed and running
  • Best for compose/k8s runners
  • Most common choice

macOS Runners

  • Docker Desktop not installed by default
  • Slower and more expensive
  • Use only if testing macOS-specific issues

Self-Hosted Runners

  • Cache Docker images locally for faster builds
  • Set resource limits (SLOW_TEST_ENV=true if needed)
  • Ensure cleanup scripts run (docker system prune)

Debugging CI Failures

Enable Debug Logging

Add debug environment variables temporarily:

env:
  RUST_LOG: debug
  NOMOS_LOG_LEVEL: debug

Preserve Containers (Compose)

Set COMPOSE_RUNNER_PRESERVE=1 to keep containers running for inspection:

- name: Run compose test (preserve on failure)
  env:
    COMPOSE_RUNNER_PRESERVE: 1
  run: scripts/run/run-examples.sh -t 120 -v 3 -e 1 compose

Access Artifacts

Download uploaded artifacts from the GitHub Actions UI to inspect logs locally.

Environment Variables Reference

Complete reference of environment variables used by the testing framework, organized by category.

Critical Variables

These MUST be set for successful test runs:

| Variable | Required | Default | Effect |
|----------|----------|---------|--------|
| POL_PROOF_DEV_MODE | YES | | REQUIRED for all runners. Set to true to use fast dev-mode proving instead of expensive Groth16. Without this, tests will hang/timeout. |

Example:

export POL_PROOF_DEV_MODE=true

Or add to your shell profile (~/.bashrc, ~/.zshrc):

# Required for nomos-testing framework
export POL_PROOF_DEV_MODE=true

Runner Selection & Topology

Control which runner to use and the test topology:

| Variable | Default | Effect |
|----------|---------|--------|
| NOMOS_DEMO_VALIDATORS | 1 | Number of validators (all runners) |
| NOMOS_DEMO_EXECUTORS | 1 | Number of executors (all runners) |
| NOMOS_DEMO_RUN_SECS | 60 | Run duration in seconds (all runners) |
| LOCAL_DEMO_VALIDATORS | | Legacy: Number of validators (host runner only) |
| LOCAL_DEMO_EXECUTORS | | Legacy: Number of executors (host runner only) |
| LOCAL_DEMO_RUN_SECS | | Legacy: Run duration (host runner only) |
| COMPOSE_NODE_PAIRS | | Compose-specific topology format: “validators×executors” (e.g., 3x2) |

Example:

# Run with 5 validators, 2 executors, for 120 seconds
NOMOS_DEMO_VALIDATORS=5 \
NOMOS_DEMO_EXECUTORS=2 \
NOMOS_DEMO_RUN_SECS=120 \
scripts/run/run-examples.sh -t 120 -v 5 -e 2 host

Node Binaries (Host Runner)

Required for host runner when not using helper scripts:

| Variable | Required | Default | Effect |
|----------|----------|---------|--------|
| NOMOS_NODE_BIN | Yes (host) | | Path to nomos-node binary |
| NOMOS_EXECUTOR_BIN | Yes (host) | | Path to nomos-executor binary |
| NOMOS_NODE_PATH | No | | Path to nomos-node git checkout (dev workflow) |

Example:

export NOMOS_NODE_BIN=/path/to/nomos-node/target/release/nomos-node
export NOMOS_EXECUTOR_BIN=/path/to/nomos-node/target/release/nomos-executor
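
For the dev workflow, NOMOS_NODE_PATH can point at a local nomos-node checkout; a sketch that pairs it with the scripts/ops/update-nomos-rev.sh helper shown earlier (paths illustrative):

# Point at a local checkout (avoid committing absolute paths)
export NOMOS_NODE_PATH=/path/to/nomos-node
scripts/ops/update-nomos-rev.sh --path "$NOMOS_NODE_PATH"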

Docker Images (Compose / K8s)

Required for compose and k8s runners:

| Variable | Required | Default | Effect |
|----------|----------|---------|--------|
| NOMOS_TESTNET_IMAGE | Yes (compose/k8s) | logos-blockchain-testing:local | Docker image tag for node containers |
| NOMOS_TESTNET_IMAGE_PULL_POLICY | No | IfNotPresent (local) / Always (ECR) | K8s imagePullPolicy used by the runner |
| NOMOS_BINARIES_TAR | No | | Path to prebuilt bundle (.tar.gz) for image build |
| NOMOS_SKIP_IMAGE_BUILD | No | 0 | Skip image rebuild (compose/k8s); assumes image already exists |
| NOMOS_FORCE_IMAGE_BUILD | No | 0 | Force rebuilding the image even when the script would normally skip it (e.g. non-local k8s) |

Example:

# Using prebuilt bundle
export NOMOS_BINARIES_TAR=.tmp/nomos-binaries-linux-v0.3.1.tar.gz
export NOMOS_TESTNET_IMAGE=logos-blockchain-testing:local
scripts/build/build_test_image.sh

# Using pre-existing image (skip build)
export NOMOS_SKIP_IMAGE_BUILD=1
scripts/run/run-examples.sh -t 60 -v 3 -e 1 compose
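
Conversely, NOMOS_FORCE_IMAGE_BUILD forces a rebuild when the script would normally skip it (a sketch, e.g. for a non-local k8s target):

# Force the image rebuild even if the script would skip it
export NOMOS_FORCE_IMAGE_BUILD=1
scripts/run/run-examples.sh -t 60 -v 3 -e 1 k8s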

Circuit Assets (KZG Parameters)

Circuit asset configuration for DA workloads:

| Variable | Default | Effect |
|----------|---------|--------|
| NOMOS_KZGRS_PARAMS_PATH | testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params | Path to KZG proving key file |
| NOMOS_KZG_DIR_REL | testing-framework/assets/stack/kzgrs_test_params | Directory containing KZG assets (relative to workspace root) |
| NOMOS_KZG_FILE | kzgrs_test_params | Filename of the proving key within NOMOS_KZG_DIR_REL |
| NOMOS_KZG_CONTAINER_PATH | /kzgrs_test_params/kzgrs_test_params | File path where the node expects KZG params inside containers |
| NOMOS_KZG_MODE | Runner-specific | K8s only: hostPath (mount from host) or inImage (embed into image) |
| NOMOS_KZG_IN_IMAGE_PARAMS_PATH | /opt/nomos/kzg-params/kzgrs_test_params | K8s inImage mode: where the proving key is stored inside the image |
| VERSION | From versions.env | Circuit release tag (used by helper scripts) |
| NOMOS_CIRCUITS | | Directory containing fetched circuit bundles (set by scripts/setup/setup-circuits-stack.sh) |
| NOMOS_CIRCUITS_VERSION | | Legacy alias for VERSION (supported by some build scripts) |
| NOMOS_CIRCUITS_PLATFORM | Auto-detected | Override circuits platform (e.g. linux-x86_64, macos-aarch64) |
| NOMOS_CIRCUITS_HOST_DIR_REL | .tmp/nomos-circuits-host | Output dir for host circuits bundle (relative to repo root) |
| NOMOS_CIRCUITS_LINUX_DIR_REL | .tmp/nomos-circuits-linux | Output dir for linux circuits bundle (relative to repo root) |
| NOMOS_CIRCUITS_NONINTERACTIVE | 0 | Set to 1 to overwrite outputs without prompting in setup scripts |
| NOMOS_CIRCUITS_REBUILD_RAPIDSNARK | 0 | Set to 1 to force rebuilding rapidsnark (host bundle only) |

Example:

# Use custom circuit assets
NOMOS_KZGRS_PARAMS_PATH=/custom/path/to/kzgrs_test_params \
cargo run -p runner-examples --bin local_runner
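
For k8s, NOMOS_KZG_MODE selects how the proving key reaches the pods. A hedged sketch of the inImage mode, assuming the variable is simply exported before a k8s run:

# Embed the KZG params into the image instead of mounting a hostPath (k8s only)
NOMOS_KZG_MODE=inImage \
POL_PROOF_DEV_MODE=true \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 k8s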

Node Logging

Control node log output (not framework runner logs):

| Variable | Default | Effect |
|----------|---------|--------|
| NOMOS_LOG_LEVEL | info | Global log level: error, warn, info, debug, trace |
| NOMOS_LOG_FILTER | | Fine-grained module filtering (e.g., cryptarchia=trace,nomos_da_sampling=debug) |
| NOMOS_LOG_DIR | | Host runner: directory for per-node log files (persistent). Compose/k8s: use cfgsync.yaml for file logging. |
| NOMOS_TESTS_KEEP_LOGS | 0 | Keep per-run temporary directories (useful for debugging/CI artifacts) |
| NOMOS_TESTS_TRACING | false | Enable debug tracing preset (combine with NOMOS_LOG_DIR unless external tracing backends are configured) |

Important: Node logging ignores RUST_LOG; use NOMOS_LOG_LEVEL and NOMOS_LOG_FILTER for node logs.

Example:

# Debug logging to files
NOMOS_LOG_DIR=/tmp/test-logs \
NOMOS_LOG_LEVEL=debug \
NOMOS_LOG_FILTER="cryptarchia=trace,nomos_da_sampling=debug" \
POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin local_runner

# Inspect logs
ls /tmp/test-logs/
# nomos-node-0.2024-12-18T14-30-00.log
# nomos-node-1.2024-12-18T14-30-00.log

Common filter targets:

| Target Prefix | Subsystem |
|---------------|-----------|
| cryptarchia | Consensus (Cryptarchia) |
| nomos_da_sampling | DA sampling service |
| nomos_da_dispersal | DA dispersal service |
| nomos_da_verifier | DA verification |
| nomos_blend | Mix network/privacy layer |
| chain_service | Chain service (node APIs/state) |
| chain_network | P2P networking |
| chain_leader | Leader election |

Observability & Metrics

Optional observability integration:

| Variable | Default | Effect |
|----------|---------|--------|
| NOMOS_METRICS_QUERY_URL | | Prometheus-compatible base URL for runner to query (e.g., http://localhost:9090) |
| NOMOS_METRICS_OTLP_INGEST_URL | | Full OTLP HTTP ingest URL for node metrics export (e.g., http://localhost:9090/api/v1/otlp/v1/metrics) |
| NOMOS_GRAFANA_URL | | Grafana base URL for printing/logging (e.g., http://localhost:3000) |
| NOMOS_OTLP_ENDPOINT | | OTLP trace endpoint (optional) |
| NOMOS_OTLP_METRICS_ENDPOINT | | OTLP metrics endpoint (optional) |

Example:

# Enable Prometheus querying
export NOMOS_METRICS_QUERY_URL=http://localhost:9090
export NOMOS_METRICS_OTLP_INGEST_URL=http://localhost:9090/api/v1/otlp/v1/metrics
export NOMOS_GRAFANA_URL=http://localhost:3000

scripts/run/run-examples.sh -t 60 -v 3 -e 1 compose

Compose Runner Specific

Variables specific to Docker Compose deployment:

| Variable | Default | Effect |
|----------|---------|--------|
| COMPOSE_RUNNER_HOST | 127.0.0.1 | Host address for port mappings |
| COMPOSE_RUNNER_PRESERVE | 0 | Keep containers running after test (for debugging) |
| COMPOSE_RUNNER_HTTP_TIMEOUT_SECS | | Override HTTP readiness timeout (seconds) |
| COMPOSE_RUNNER_HOST_GATEWAY | host.docker.internal:host-gateway | Controls extra_hosts entry injected into compose (set to disable to omit) |
| TESTNET_RUNNER_PRESERVE | | Alias for COMPOSE_RUNNER_PRESERVE |

Example:

# Keep containers after test for debugging
COMPOSE_RUNNER_PRESERVE=1 \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 compose

# Containers remain running
docker ps --filter "name=nomos-compose-"
docker logs <container-id>
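
If the injected extra_hosts entry causes problems in your environment, COMPOSE_RUNNER_HOST_GATEWAY can be set to disable to omit it (a sketch using the value from the table above):

# Omit the host.docker.internal extra_hosts entry
COMPOSE_RUNNER_HOST_GATEWAY=disable \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 compose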

K8s Runner Specific

Variables specific to Kubernetes deployment:

| Variable | Default | Effect |
|----------|---------|--------|
| K8S_RUNNER_NAMESPACE | Random UUID | Kubernetes namespace (pin for debugging) |
| K8S_RUNNER_RELEASE | Random UUID | Helm release name (pin for debugging) |
| K8S_RUNNER_NODE_HOST | | NodePort host resolution for non-local clusters |
| K8S_RUNNER_DEBUG | 0 | Log Helm stdout/stderr for install commands |
| K8S_RUNNER_PRESERVE | 0 | Keep namespace/release after run (for debugging) |
| K8S_RUNNER_DEPLOYMENT_TIMEOUT_SECS | | Override deployment readiness timeout |
| K8S_RUNNER_HTTP_TIMEOUT_SECS | | Override HTTP readiness timeout (port-forwards) |
| K8S_RUNNER_HTTP_PROBE_TIMEOUT_SECS | | Override HTTP readiness timeout (NodePort probes) |
| K8S_RUNNER_PROMETHEUS_HTTP_TIMEOUT_SECS | | Override Prometheus readiness timeout |
| K8S_RUNNER_PROMETHEUS_HTTP_PROBE_TIMEOUT_SECS | | Override Prometheus NodePort probe timeout |

Example:

# Pin namespace for debugging
K8S_RUNNER_NAMESPACE=nomos-test-debug \
K8S_RUNNER_PRESERVE=1 \
K8S_RUNNER_DEBUG=1 \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 k8s

# Inspect resources
kubectl get pods -n nomos-test-debug
kubectl logs -n nomos-test-debug -l nomos/logical-role=validator
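
On slower or remote clusters, the readiness timeouts from the table above can be raised; the values here are illustrative:

# Allow more time for deployments and port-forwards to become ready
K8S_RUNNER_DEPLOYMENT_TIMEOUT_SECS=600 \
K8S_RUNNER_HTTP_TIMEOUT_SECS=120 \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 k8s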

Platform & Build Configuration

Platform-specific build configuration:

| Variable | Default | Effect |
|----------|---------|--------|
| NOMOS_BUNDLE_DOCKER_PLATFORM | Host arch | Docker platform for bundle builds: linux/arm64 or linux/amd64 (macOS/Windows hosts) |
| NOMOS_BIN_PLATFORM | | Legacy alias for NOMOS_BUNDLE_DOCKER_PLATFORM |
| COMPOSE_CIRCUITS_PLATFORM | Host arch | Circuits platform for image builds: linux-aarch64 or linux-x86_64 |
| NOMOS_EXTRA_FEATURES | | Extra cargo features to enable when building bundles (used by scripts/build/build-bundle.sh) |

macOS / Apple Silicon:

# Native performance (recommended for local testing)
export NOMOS_BUNDLE_DOCKER_PLATFORM=linux/arm64

# Or target amd64 (slower via emulation)
export NOMOS_BUNDLE_DOCKER_PLATFORM=linux/amd64

Timeouts & Performance

Timeout and performance tuning:

| Variable | Default | Effect |
|----------|---------|--------|
| SLOW_TEST_ENV | false | Doubles built-in readiness timeouts (useful in CI / constrained laptops) |
| TESTNET_PRINT_ENDPOINTS | 0 | Print TESTNET_ENDPOINTS / TESTNET_PPROF lines during deploy (set automatically by scripts/run/run-examples.sh) |
| NOMOS_DISPERSAL_TIMEOUT_SECS | 20 | DA dispersal timeout (seconds) |
| NOMOS_RETRY_COOLDOWN_SECS | 3 | Cooldown between retries (seconds) |
| NOMOS_GRACE_PERIOD_SECS | 1200 | Grace period before enforcing strict time-based expectations (seconds) |
| NOMOS_PRUNE_DURATION_SECS | 30 | Prune step duration (seconds) |
| NOMOS_PRUNE_INTERVAL_SECS | 5 | Interval between prune cycles (seconds) |
| NOMOS_SHARE_DURATION_SECS | 5 | Share duration (seconds) |
| NOMOS_COMMITMENTS_WAIT_SECS | 1 | Commitments wait duration (seconds) |
| NOMOS_SDP_TRIGGER_DELAY_SECS | 5 | SDP trigger delay (seconds) |

Example:

# Increase timeouts for slow environments
SLOW_TEST_ENV=true \
scripts/run/run-examples.sh -t 120 -v 5 -e 2 compose
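
Individual timeouts can be raised as well; for instance, a sketch that extends the DA dispersal timeout (the value is illustrative):

# Give DA dispersal more time on constrained machines
NOMOS_DISPERSAL_TIMEOUT_SECS=60 \
scripts/run/run-examples.sh -t 120 -v 5 -e 2 compose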

Node Configuration (Advanced)

Node-level configuration passed through to nomos-node/nomos-executor:

| Variable | Default | Effect |
|----------|---------|--------|
| CONSENSUS_SLOT_TIME | | Consensus slot time (seconds) |
| CONSENSUS_ACTIVE_SLOT_COEFF | | Active slot coefficient (0.0-1.0) |
| NOMOS_USE_AUTONAT | Unset | If set, use AutoNAT instead of a static loopback address for libp2p NAT settings |
| NOMOS_CFGSYNC_PORT | 4400 | Port used for cfgsync service inside the stack |
| NOMOS_TIME_BACKEND | monotonic | Select time backend (used by compose/k8s stack scripts and deployers) |

Example:

# Faster block production
CONSENSUS_SLOT_TIME=5 \
CONSENSUS_ACTIVE_SLOT_COEFF=0.9 \
POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin local_runner

Framework Runner Logging (Not Node Logs)

Control framework runner process logs (uses RUST_LOG, not NOMOS_*):

| Variable | Default | Effect |
|----------|---------|--------|
| RUST_LOG | | Framework runner log level (e.g., debug, info) |
| RUST_BACKTRACE | | Enable Rust backtraces on panic (1 or full) |
| CARGO_TERM_COLOR | | Cargo output color (always, never, auto) |

Example:

# Debug framework runner (not nodes)
RUST_LOG=debug \
RUST_BACKTRACE=1 \
cargo run -p runner-examples --bin local_runner

Helper Script Variables

Variables used by helper scripts (scripts/run/run-examples.sh, etc.):

| Variable | Default | Effect |
|----------|---------|--------|
| NOMOS_NODE_REV | From versions.env | nomos-node git revision to build/fetch |
| NOMOS_BUNDLE_VERSION | From versions.env | Bundle schema version |
| NOMOS_IMAGE_SELECTION | | Internal: image selection mode set by run-examples.sh (local/ecr/auto) |
| NOMOS_NODE_APPLY_PATCHES | 1 | Set to 0 to disable applying local patches when building bundles |
| NOMOS_NODE_PATCH_DIR | patches/nomos-node | Patch directory applied to nomos-node checkout during bundle builds |
| NOMOS_NODE_PATCH_LEVEL | | Patch application level (all or an integer) for bundle builds |
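
Example (a sketch; the revision value is illustrative):

# Build a Linux bundle from a pinned nomos-node revision, without applying local patches
NOMOS_NODE_REV=0123abcd \
NOMOS_NODE_APPLY_PATCHES=0 \
scripts/build/build-bundle.sh --platform linux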

Quick Reference Examples

Minimal Host Run

POL_PROOF_DEV_MODE=true \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 host

Debug Logging (Host)

POL_PROOF_DEV_MODE=true \
NOMOS_LOG_DIR=/tmp/logs \
NOMOS_LOG_LEVEL=debug \
NOMOS_LOG_FILTER="cryptarchia=trace" \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 host

Compose with Observability

POL_PROOF_DEV_MODE=true \
NOMOS_METRICS_QUERY_URL=http://localhost:9090 \
NOMOS_GRAFANA_URL=http://localhost:3000 \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 compose

K8s with Debug

POL_PROOF_DEV_MODE=true \
K8S_RUNNER_NAMESPACE=nomos-debug \
K8S_RUNNER_DEBUG=1 \
K8S_RUNNER_PRESERVE=1 \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 k8s

CI Environment

env:
  POL_PROOF_DEV_MODE: true
  RUST_BACKTRACE: 1
  NOMOS_TESTS_KEEP_LOGS: 1

Logging & Observability

Comprehensive guide to log collection, metrics, and debugging across all runners.

Node Logging vs Framework Logging

Critical distinction: Node logs and framework logs use different configuration mechanisms.

| Component | Controlled By | Purpose |
|-----------|---------------|---------|
| Framework binaries (cargo run -p runner-examples --bin local_runner) | RUST_LOG | Runner orchestration, deployment logs |
| Node processes (validators, executors spawned by runner) | NOMOS_LOG_LEVEL, NOMOS_LOG_FILTER (+ NOMOS_LOG_DIR on host runner) | Consensus, DA, mempool, network logs |

Common mistake: Setting RUST_LOG=debug only increases verbosity of the runner binary itself. Node logs remain at their default level unless you also set NOMOS_LOG_LEVEL=debug.

Example:

# This only makes the RUNNER verbose, not the nodes:
RUST_LOG=debug cargo run -p runner-examples --bin local_runner

# This makes the NODES verbose:
NOMOS_LOG_LEVEL=debug cargo run -p runner-examples --bin local_runner

# Both verbose (typically not needed):
RUST_LOG=debug NOMOS_LOG_LEVEL=debug cargo run -p runner-examples --bin local_runner

Logging Environment Variables

See Environment Variables Reference for complete details. Quick summary:

| Variable | Default | Effect |
|----------|---------|--------|
| NOMOS_LOG_DIR | None (console only) | Host runner: directory for per-node log files. Compose/k8s: use cfgsync.yaml |
| NOMOS_LOG_LEVEL | info | Global log level: error, warn, info, debug, trace |
| NOMOS_LOG_FILTER | None | Fine-grained target filtering (e.g., cryptarchia=trace,nomos_da_sampling=debug) |
| NOMOS_TESTS_TRACING | false | Enable debug tracing preset |
| NOMOS_OTLP_ENDPOINT | None | OTLP trace endpoint (optional) |
| NOMOS_OTLP_METRICS_ENDPOINT | None | OTLP metrics endpoint (optional) |

Example: Full debug logging to files:

NOMOS_TESTS_TRACING=true \
NOMOS_LOG_DIR=/tmp/test-logs \
NOMOS_LOG_LEVEL=debug \
NOMOS_LOG_FILTER="cryptarchia=trace,nomos_da_sampling=debug,nomos_da_dispersal=debug,nomos_da_verifier=debug" \
POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin local_runner

Per-Node Log Files

When NOMOS_LOG_DIR is set, each node writes logs to separate files:

File naming pattern:

  • Validators: Prefix nomos-node-0, nomos-node-1, etc. (may include timestamp suffix)
  • Executors: Prefix nomos-executor-0, nomos-executor-1, etc. (may include timestamp suffix)

Example filenames:

  • nomos-node-0.2024-12-18T14-30-00.log
  • nomos-node-1.2024-12-18T14-30-00.log
  • nomos-executor-0.2024-12-18T14-30-00.log

Local runner note: The local runner uses per-run temporary directories under the current working directory and removes them after the run unless NOMOS_TESTS_KEEP_LOGS=1. Use NOMOS_LOG_DIR=/path/to/logs to write per-node log files to a stable location.

Filter Target Names

Common target prefixes for NOMOS_LOG_FILTER:

| Target Prefix | Subsystem |
|---------------|-----------|
| cryptarchia | Consensus (Cryptarchia) |
| nomos_da_sampling | DA sampling service |
| nomos_da_dispersal | DA dispersal service |
| nomos_da_verifier | DA verification |
| nomos_blend | Mix network/privacy layer |
| chain_service | Chain service (node APIs/state) |
| chain_network | P2P networking |
| chain_leader | Leader election |

Example filter:

NOMOS_LOG_FILTER="cryptarchia=trace,nomos_da_sampling=debug,chain_service=info,chain_network=info"

Accessing Logs by Runner

Local Runner (Host Processes)

Default (temporary directories, auto-cleanup):

POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin local_runner
# Logs written to temporary directories in working directory
# Automatically cleaned up after test completes

Persistent file output:

NOMOS_LOG_DIR=/tmp/local-logs \
POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin local_runner

# After test completes:
ls /tmp/local-logs/
# Files with prefix: nomos-node-0*, nomos-node-1*, nomos-executor-0*
# May include timestamps in filename

Tip: Use NOMOS_LOG_DIR for persistent per-node log files, and NOMOS_TESTS_KEEP_LOGS=1 if you want to keep the per-run temporary directories (configs/state) for post-mortem inspection.

Compose Runner (Docker Containers)

Via Docker logs (default, recommended):

# List containers (note the UUID prefix in names)
docker ps --filter "name=nomos-compose-"

# Stream logs from specific container
docker logs -f <container-id-or-name>

# Or use name pattern matching:
docker logs -f $(docker ps --filter "name=nomos-compose-.*-validator-0" -q | head -1)

# Show last 100 lines
docker logs --tail 100 <container-id>

Via file collection (advanced):

To write per-node log files inside containers, set tracing_settings.logger: !File in testing-framework/assets/stack/cfgsync.yaml (and ensure the directory is writable). To access them, you must either:

  1. Copy files out after the run:
# Ensure cfgsync.yaml is configured to log to /logs
NOMOS_TESTNET_IMAGE=logos-blockchain-testing:local \
POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin compose_runner

# After test, copy files from containers:
docker ps --filter "name=nomos-compose-"
docker cp <container-id>:/logs/node* /tmp/
  2. Mount a host volume (requires modifying compose template):
volumes:
  - /tmp/host-logs:/logs  # Add to docker-compose.yml.tera

Recommendation: Use docker logs by default. File collection inside containers is complex and rarely needed.

Keep containers for debugging:

COMPOSE_RUNNER_PRESERVE=1 \
NOMOS_TESTNET_IMAGE=logos-blockchain-testing:local \
cargo run -p runner-examples --bin compose_runner
# Containers remain running after test—inspect with docker logs or docker exec

Compose debugging variables:

  • COMPOSE_RUNNER_HOST=127.0.0.1 — host used for readiness probes
  • COMPOSE_RUNNER_HOST_GATEWAY=host.docker.internal:host-gateway — controls extra_hosts entry (set to disable to omit)
  • TESTNET_RUNNER_PRESERVE=1 — alias for COMPOSE_RUNNER_PRESERVE=1
  • COMPOSE_RUNNER_HTTP_TIMEOUT_SECS=<secs> — override HTTP readiness timeout

Note: Container names follow pattern nomos-compose-{uuid}-validator-{index}-1 where {uuid} changes per run.

K8s Runner (Kubernetes Pods)

Via kubectl logs (use label selectors):

# List pods
kubectl get pods

# Stream logs using label selectors (recommended)
# Helm chart labels:
# - nomos/logical-role=validator|executor
# - nomos/validator-index / nomos/executor-index
kubectl logs -l nomos/logical-role=validator -f
kubectl logs -l nomos/logical-role=executor -f

# Stream logs from specific pod
kubectl logs -f nomos-validator-0

# Previous logs from crashed pods
kubectl logs --previous -l nomos/logical-role=validator

Download logs for offline analysis:

# Using label selectors
kubectl logs -l nomos/logical-role=validator --tail=1000 > all-validators.log
kubectl logs -l nomos/logical-role=executor --tail=1000 > all-executors.log

# Specific pods
kubectl logs nomos-validator-0 > validator-0.log
kubectl logs nomos-executor-1 > executor-1.log

K8s debugging variables:

  • K8S_RUNNER_DEBUG=1 — logs Helm stdout/stderr for install commands
  • K8S_RUNNER_PRESERVE=1 — keep namespace/release after run
  • K8S_RUNNER_NODE_HOST=<ip|hostname> — override NodePort host resolution
  • K8S_RUNNER_NAMESPACE=<name> / K8S_RUNNER_RELEASE=<name> — pin namespace/release (useful for debugging)

Specify namespace (if not using default):

kubectl logs -n my-namespace -l nomos/logical-role=validator -f

Note: K8s runner is optimized for local clusters (Docker Desktop K8s, minikube, kind). Remote clusters require additional setup.


OTLP and Telemetry

OTLP exporters are optional. If you see errors about unreachable OTLP endpoints, it’s safe to ignore them unless you’re actively collecting traces/metrics.

To enable OTLP:

NOMOS_OTLP_ENDPOINT=http://localhost:4317 \
NOMOS_OTLP_METRICS_ENDPOINT=http://localhost:4318 \
cargo run -p runner-examples --bin local_runner

To silence OTLP errors: Simply leave these variables unset (the default).


Observability: Prometheus and Node APIs

Runners expose metrics and node HTTP endpoints for expectation code and debugging.

Prometheus-Compatible Metrics Querying (Optional)

  • Runners do not provision Prometheus automatically
  • For a ready-to-run stack, use scripts/setup/setup-observability.sh:
    • Compose: scripts/setup/setup-observability.sh compose up then scripts/setup/setup-observability.sh compose env
    • K8s: scripts/setup/setup-observability.sh k8s install then scripts/setup/setup-observability.sh k8s env
  • Provide NOMOS_METRICS_QUERY_URL (PromQL base URL) to enable ctx.telemetry() queries
  • Access from expectations when configured: ctx.telemetry().prometheus().map(|p| p.base_url())

Example:

# Start observability stack (Compose)
scripts/setup/setup-observability.sh compose up

# Get environment variables
eval $(scripts/setup/setup-observability.sh compose env)

# Run scenario with metrics
POL_PROOF_DEV_MODE=true \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 compose

Grafana (Optional)

  • Runners do not provision Grafana automatically (but scripts/setup/setup-observability.sh can)
  • If you set NOMOS_GRAFANA_URL, the deployer prints it in TESTNET_ENDPOINTS
  • Dashboards live in testing-framework/assets/stack/monitoring/grafana/dashboards/ (the bundled stack auto-provisions them)

Example:

# Bring up the bundled Prometheus+Grafana stack (optional)
scripts/setup/setup-observability.sh compose up
eval $(scripts/setup/setup-observability.sh compose env)

export NOMOS_GRAFANA_URL=http://localhost:3000
POL_PROOF_DEV_MODE=true scripts/run/run-examples.sh -t 60 -v 3 -e 1 compose

Default bundled Grafana login: admin / admin (see scripts/observability/compose/docker-compose.yml).

Node APIs

  • Access from expectations: ctx.node_clients().validator_clients().get(0)
  • Endpoints: consensus info, network info, DA membership, etc.
  • See testing-framework/core/src/nodes/api_client.rs for available methods

Example usage in expectations:

use testing_framework_core::scenario::{DynError, RunContext};

async fn evaluate(ctx: &RunContext) -> Result<(), DynError> {
    let client = &ctx.node_clients().validator_clients()[0];

    let info = client.consensus_info().await?;
    tracing::info!(height = info.height, "consensus info from validator 0");

    Ok(())
}

Observability Flow

flowchart TD
    Expose[Runner exposes endpoints/ports] --> Collect[Runtime collects block/health signals]
    Collect --> Consume[Expectations consume signals<br/>decide pass/fail]
    Consume --> Inspect[Operators inspect logs/metrics<br/>when failures arise]

Quick Reference

Debug Logging (Host)

NOMOS_LOG_DIR=/tmp/logs \
NOMOS_LOG_LEVEL=debug \
NOMOS_LOG_FILTER="cryptarchia=trace" \
POL_PROOF_DEV_MODE=true \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 host

Compose with Observability

# Start observability stack
scripts/setup/setup-observability.sh compose up
eval $(scripts/setup/setup-observability.sh compose env)

# Run with metrics
POL_PROOF_DEV_MODE=true \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 compose

# Access Grafana at http://localhost:3000

K8s with Debug

K8S_RUNNER_NAMESPACE=nomos-debug \
K8S_RUNNER_DEBUG=1 \
K8S_RUNNER_PRESERVE=1 \
POL_PROOF_DEV_MODE=true \
scripts/run/run-examples.sh -t 60 -v 3 -e 1 k8s

# Inspect logs
kubectl logs -n nomos-debug -l nomos/logical-role=validator

Part V — Appendix

Quick reference materials, troubleshooting guides, and supplementary information.

Contents

  • Builder API Quick Reference: Cheat sheet for DSL methods
  • Troubleshooting Scenarios: Common issues and their solutions, including “What Failure Looks Like” with realistic examples
  • FAQ: Frequently asked questions
  • Glossary: Terminology reference

When to Use This Section

  • Quick lookups: Find DSL method signatures without reading full guides
  • Debugging failures: Match symptoms to known issues and fixes
  • Clarifying concepts: Look up unfamiliar terms in the glossary
  • Common questions: Check FAQ before asking for help

This section complements the main documentation with practical reference materials that you’ll return to frequently during development and operations.


Builder API Quick Reference

Quick reference for the scenario builder DSL. All methods are chainable.

Imports

use std::time::Duration;

use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_compose::ComposeDeployer;
use testing_framework_runner_k8s::K8sDeployer;
use testing_framework_runner_local::LocalDeployer;
use testing_framework_workflows::{ChaosBuilderExt, ScenarioBuilderExt};

Topology

use testing_framework_core::scenario::{Builder, ScenarioBuilder};

pub fn topology() -> Builder<()> {
    ScenarioBuilder::topology_with(|t| {
        t.network_star() // Star topology (all connect to seed node)
            .validators(3) // Number of validator nodes
            .executors(2) // Number of executor nodes
    })
}

Wallets

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

pub fn wallets_plan() -> testing_framework_core::scenario::Scenario<()> {
    ScenarioBuilder::topology_with(|t| t.network_star().validators(1).executors(0))
        .wallets(50) // Seed 50 funded wallet accounts
        .build()
}

Transaction Workload

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

pub fn transactions_plan() -> testing_framework_core::scenario::Scenario<()> {
    ScenarioBuilder::topology_with(|t| t.network_star().validators(1).executors(0))
        .wallets(50)
        .transactions_with(|txs| {
            txs.rate(5) // 5 transactions per block
                .users(20) // Use 20 of the seeded wallets
        }) // Finish transaction workload config
        .build()
}

DA Workload

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

pub fn da_plan() -> testing_framework_core::scenario::Scenario<()> {
    ScenarioBuilder::topology_with(|t| t.network_star().validators(1).executors(1))
        .wallets(50)
        .da_with(|da| {
            da.channel_rate(1) // number of DA channels to run
                .blob_rate(2) // target 2 blobs per block (headroom applied)
                .headroom_percent(20) // optional headroom when sizing channels
        }) // Finish DA workload config
        .build()
}

Chaos Workload (Requires enable_node_control())

use std::time::Duration;

use testing_framework_core::scenario::{NodeControlCapability, ScenarioBuilder};
use testing_framework_workflows::{ChaosBuilderExt, ScenarioBuilderExt};

pub fn chaos_plan() -> testing_framework_core::scenario::Scenario<NodeControlCapability> {
    ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(2))
        .enable_node_control() // Enable node control capability
        .chaos_with(|c| {
            c.restart() // Random restart chaos
                .min_delay(Duration::from_secs(30)) // Min time between restarts
                .max_delay(Duration::from_secs(60)) // Max time between restarts
                .target_cooldown(Duration::from_secs(45)) // Cooldown after restart
                .apply() // Required for chaos configuration
        })
        .build()
}

Expectations

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

pub fn expectations_plan() -> testing_framework_core::scenario::Scenario<()> {
    ScenarioBuilder::topology_with(|t| t.network_star().validators(1).executors(0))
        .expect_consensus_liveness() // Assert blocks are produced continuously
        .build()
}

Run Duration

use std::time::Duration;

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

pub fn run_duration_plan() -> testing_framework_core::scenario::Scenario<()> {
    ScenarioBuilder::topology_with(|t| t.network_star().validators(1).executors(0))
        .with_run_duration(Duration::from_secs(120)) // Run for 120 seconds
        .build()
}

Build

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

pub fn build_plan() -> testing_framework_core::scenario::Scenario<()> {
    ScenarioBuilder::topology_with(|t| t.network_star().validators(1).executors(0)).build() // Construct the final Scenario
}

Deployers

use testing_framework_runner_compose::ComposeDeployer;
use testing_framework_runner_k8s::K8sDeployer;
use testing_framework_runner_local::LocalDeployer;

pub fn deployers() {
    // Local processes
    let _deployer = LocalDeployer::default();

    // Docker Compose
    let _deployer = ComposeDeployer::default();

    // Kubernetes
    let _deployer = K8sDeployer::default();
}

Execution

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_local::LocalDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

pub async fn execution() -> Result<()> {
    let mut plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(1).executors(0))
        .expect_consensus_liveness()
        .build();

    let deployer = LocalDeployer::default();
    let runner = deployer.deploy(&plan).await?;
    let _handle = runner.run(&mut plan).await?;

    Ok(())
}

Complete Example

use std::time::Duration;

use anyhow::Result;
use testing_framework_core::scenario::{Deployer, ScenarioBuilder};
use testing_framework_runner_local::LocalDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

pub async fn run_test() -> Result<()> {
    let mut plan = ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(2))
        .wallets(50)
        .transactions_with(|txs| {
            txs.rate(5) // 5 transactions per block
                .users(20)
        })
        .da_with(|da| {
            da.channel_rate(1) // number of DA channels
                .blob_rate(2) // target 2 blobs per block
                .headroom_percent(20) // optional channel headroom
        })
        .expect_consensus_liveness()
        .with_run_duration(Duration::from_secs(90))
        .build();

    let deployer = LocalDeployer::default();
    let runner = deployer.deploy(&plan).await?;
    let _handle = runner.run(&mut plan).await?;

    Ok(())
}

Troubleshooting Scenarios

Prerequisites for All Runners:

  • versions.env file at repository root (required by helper scripts)
  • POL_PROOF_DEV_MODE=true MUST be set for all runners (host, compose, k8s) to avoid expensive Groth16 proof generation that causes timeouts
  • KZG circuit assets must be present at testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params (note the repeated filename) for DA workloads

Platform/Environment Notes:

  • macOS + Docker Desktop (Apple silicon): prefer NOMOS_BUNDLE_DOCKER_PLATFORM=linux/arm64 for local compose/k8s runs to avoid slow/fragile amd64 emulation builds.
  • Disk space: bundle/image builds are storage-heavy. If you see I/O errors or Docker build failures, check free space and prune old artifacts (.tmp/, target/, and Docker build cache) before retrying.
    • Quick cleanup: scripts/ops/clean.sh (and scripts/ops/clean.sh --docker if needed).
    • Destructive cleanup (last resort): scripts/ops/clean.sh --docker-system --dangerous (add --volumes if you also want to prune Docker volumes).
  • K8s runner scope: the default Helm chart mounts KZG params via hostPath and uses a local image tag (logos-blockchain-testing:local). This is intended for local clusters (Docker Desktop / minikube / kind), not remote managed clusters without additional setup.

Recommended: Use scripts/run/run-examples.sh which handles all setup automatically.

Quick Symptom Guide

Common symptoms and likely causes:

  • No or slow block progression: missing POL_PROOF_DEV_MODE=true, missing KZG circuit assets (/kzgrs_test_params/kzgrs_test_params file) for DA workloads, too-short run window, port conflicts, or resource exhaustion—set required env vars, verify assets exist, extend duration, check node logs for startup errors.
  • Transactions not included: unfunded or misconfigured wallets (check .wallets(N) vs .users(M)), or a transaction rate that exceeds block capacity—reduce the rate, increase the wallet count, and verify wallet setup in the logs.
  • Chaos stalls or never runs: chaos (node control) only works with ComposeDeployer; the host runner (LocalDeployer) and K8sDeployer don’t support it and simply can’t execute chaos workloads. With compose, an aggressive restart cadence can prevent consensus from recovering—widen the restart intervals.
  • Observability gaps: metrics or logs unreachable because ports clash or services are not exposed—adjust observability ports and confirm runner wiring.
  • Flaky behavior across runs: mixing chaos with functional smoke tests or inconsistent topology between environments—separate deterministic and chaos scenarios and standardize topology presets.

What Failure Looks Like

This section shows what you’ll actually see when common issues occur. Each example includes realistic console output and the fix.

1. Missing POL_PROOF_DEV_MODE=true (Most Common!)

Symptoms:

  • Test “hangs” with no visible progress
  • CPU usage spikes to 100%
  • Eventually hits timeout after several minutes
  • Nodes appear to start but blocks aren’t produced

What you’ll see:

$ cargo run -p runner-examples --bin local_runner
    Finished dev [unoptimized + debuginfo] target(s) in 0.48s
     Running `target/debug/local_runner`
[INFO  runner_examples::local_runner] Starting local runner scenario
[INFO  testing_framework_runner_local] Launching 3 validators
[INFO  testing_framework_runner_local] Waiting for node readiness...
(hangs here for 5+ minutes, CPU at 100%)
thread 'main' panicked at 'readiness timeout expired'

Root Cause: Groth16 proof generation is extremely slow without dev mode. The system tries to compute real cryptographic proofs, which can take minutes per block.

Fix:

POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin local_runner

Prevention: Set this in your shell profile or .env file so you never forget it.


2. Missing versions.env File

Symptoms:

  • Helper scripts fail immediately
  • Error about missing file at repo root
  • Scripts can’t determine which circuit/node versions to use

What you’ll see:

$ scripts/run/run-examples.sh -t 60 -v 1 -e 1 host
ERROR: versions.env not found at repository root
This file is required and should define:
  VERSION=<circuit release tag>
  NOMOS_NODE_REV=<nomos-node git revision>
  NOMOS_BUNDLE_VERSION=<bundle schema version>

Root Cause: Helper scripts need versions.env to know which versions to build/fetch.

Fix: Ensure you’re in the repository root directory. The versions.env file should already exist—verify it’s present:

cat versions.env
# Should show:
# VERSION=v0.3.1
# NOMOS_NODE_REV=abc123def456
# NOMOS_BUNDLE_VERSION=v1

3. Missing KZG Circuit Assets (DA Workloads)

Symptoms:

  • DA workload tests fail
  • Error messages about missing circuit files
  • Nodes crash during DA operations

What you’ll see:

$ POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin local_runner
[INFO  testing_framework_runner_local] Starting DA workload
[ERROR nomos_da_dispersal] Failed to load KZG parameters
Error: Custom { kind: NotFound, error: "Circuit file not found at: testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params" }
thread 'main' panicked at 'workload init failed'

Root Cause: DA (Data Availability) workloads require KZG cryptographic parameters. The file must exist at: testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params (note the repeated filename).

Fix (recommended):

# Use run-examples.sh which handles setup automatically
scripts/run/run-examples.sh -t 60 -v 1 -e 1 host

Fix (manual):

# Fetch circuits
scripts/setup/setup-nomos-circuits.sh v0.3.1 /tmp/nomos-circuits

# Copy to expected location
mkdir -p testing-framework/assets/stack/kzgrs_test_params
cp -r /tmp/nomos-circuits/* testing-framework/assets/stack/kzgrs_test_params/

# Verify (should be ~120MB)
ls -lh testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params

4. Node Binaries Not Found

Symptoms:

  • Error about missing nomos-node or nomos-executor binary
  • “file not found” or “no such file or directory”
  • Environment variables NOMOS_NODE_BIN / NOMOS_EXECUTOR_BIN not set

What you’ll see:

$ POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin local_runner
[INFO  testing_framework_runner_local] Spawning validator 0
Error: Os { code: 2, kind: NotFound, message: "No such file or directory" }
thread 'main' panicked at 'failed to spawn nomos-node process'

Root Cause: The local runner needs compiled nomos-node and nomos-executor binaries, but doesn’t know where they are.

Fix (recommended):

# Use run-examples.sh which builds binaries automatically
scripts/run/run-examples.sh -t 60 -v 1 -e 1 host

Fix (manual - set paths explicitly):

# Build binaries first
cd ../nomos-node  # or wherever your nomos-node checkout is
cargo build --release --bin nomos-node --bin nomos-executor

# Set environment variables
export NOMOS_NODE_BIN=$PWD/target/release/nomos-node
export NOMOS_EXECUTOR_BIN=$PWD/target/release/nomos-executor

# Return to testing framework
cd ../nomos-testing
POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin local_runner

5. Docker Daemon Not Running (Compose)

Symptoms:

  • Compose tests fail immediately
  • “Cannot connect to Docker daemon”
  • Docker commands don’t work

What you’ll see:

$ scripts/run/run-examples.sh -t 60 -v 1 -e 1 compose
[INFO  runner_examples::compose_runner] Starting compose deployment
Error: Cannot connect to the Docker daemon at unix:///var/run/docker.sock. Is the docker daemon running?
thread 'main' panicked at 'compose deployment failed'

Root Cause: Docker Desktop isn’t running, or your user doesn’t have permission to access Docker.

Fix:

# macOS: Start Docker Desktop application
open -a Docker

# Linux: Start Docker daemon
sudo systemctl start docker

# Verify Docker is working
docker ps

# If permission denied, add your user to docker group (Linux)
sudo usermod -aG docker $USER
# Then log out and log back in

6. Image Not Found (Compose/K8s)

Symptoms:

  • Compose/K8s tests fail during deployment
  • “Image not found: logos-blockchain-testing:local”
  • Containers fail to start

What you’ll see:

$ POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin compose_runner
[INFO  testing_framework_runner_compose] Starting compose deployment
Error: Failed to pull image 'logos-blockchain-testing:local': No such image
thread 'main' panicked at 'compose deployment failed'

Root Cause: The Docker image hasn’t been built yet, or was pruned.

Fix (recommended):

# Use run-examples.sh which builds the image automatically
scripts/run/run-examples.sh -t 60 -v 1 -e 1 compose

Fix (manual):

# 1. Build Linux bundle
scripts/build/build-bundle.sh --platform linux

# 2. Set bundle path
export NOMOS_BINARIES_TAR=$(ls -t .tmp/nomos-binaries-linux-*.tar.gz | head -1)

# 3. Build Docker image
scripts/build/build_test_image.sh

# 4. Verify image exists
docker images | grep logos-blockchain-testing

# 5. For kind/minikube: load image into cluster
kind load docker-image logos-blockchain-testing:local
# OR: minikube image load logos-blockchain-testing:local

7. Port Conflicts

Symptoms:

  • “Address already in use” errors
  • Tests fail during node startup
  • Observability stack (Prometheus/Grafana) won’t start

What you’ll see:

$ POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin local_runner
[INFO  testing_framework_runner_local] Launching validator 0 on port 18080
Error: Os { code: 48, kind: AddrInUse, message: "Address already in use" }
thread 'main' panicked at 'failed to bind port 18080'

Root Cause: Previous test didn’t clean up properly, or another service is using the port.

Fix:

# Find processes using the port
lsof -i :18080   # macOS/Linux
netstat -ano | findstr :18080  # Windows

# Kill orphaned nomos processes
pkill nomos-node
pkill nomos-executor

# For compose: ensure containers are stopped
docker compose down
docker ps -a --filter "name=nomos-compose-" -q | xargs docker rm -f

# Check if port is now free
lsof -i :18080  # Should return nothing

For Observability Stack Port Conflicts:

# Edit ports in observability compose file
vim scripts/observability/compose/docker-compose.yml

# Change conflicting port mappings:
# ports:
#   - "9090:9090"  # Prometheus - change to "19090:9090" if needed
#   - "3000:3000"  # Grafana - change to "13000:3000" if needed

8. Wallet Seeding Failed (Insufficient Funds)

Symptoms:

  • Transaction workload reports wallet issues
  • “Insufficient funds” errors
  • Transactions aren’t being submitted

What you’ll see:

$ POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin local_runner
[INFO  testing_framework_workflows] Starting transaction workload with 10 users
[ERROR testing_framework_workflows] Wallet seeding failed: requested 10 users but only 3 wallets available
thread 'main' panicked at 'workload init failed: insufficient wallets'

Root Cause: The topology provides fewer funded wallets than the workload needs: the transaction workload requests .users(M) but the topology only configures .wallets(N), with N < M.

Fix:

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

let scenario = ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(1))
    .wallets(20) // ← Increase wallet count
    .transactions_with(|tx| {
        tx.users(10) // ← Must be ≤ wallets(20)
            .rate(5)
    })
    .build();

9. Resource Exhaustion (OOM / CPU)

Symptoms:

  • Nodes crash randomly
  • “OOM Killed” messages
  • Test becomes flaky under load
  • Docker containers restart repeatedly

What you’ll see:

$ docker ps --filter "name=nomos-compose-"
CONTAINER ID   STATUS
abc123def456   Restarting (137) 30 seconds ago  # 137 = OOM killed

$ docker logs abc123def456
[INFO  nomos_node] Starting validator
[INFO  consensus] Processing block
Killed  # ← OOM killer terminated the process

Root Cause: Too many nodes, too much workload traffic, or insufficient Docker resources.

Fix:

# 1. Reduce topology size
# In your scenario:
#   .topology(Topology::preset_3v1e())  # Instead of preset_10v2e()

# 2. Reduce workload rates
#   .workload(TransactionWorkload::new().rate(5.0))  # Instead of rate(100.0)

# 3. Increase Docker resources (Docker Desktop)
# Settings → Resources → Memory: 8GB minimum (12GB+ recommended for large topologies)
# Settings → Resources → CPUs: 4+ cores recommended

# 4. Increase file descriptor limits (Linux/macOS)
ulimit -n 4096

# 5. Close other heavy applications (browsers, IDEs, etc.)

10. Logs Disappear After Run

Symptoms:

  • Test completes but no logs on disk
  • Can’t debug failures because logs are gone
  • Temporary directories cleaned up automatically

What you’ll see:

$ POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin local_runner
[INFO  runner_examples] Test complete, cleaning up
[INFO  testing_framework_runner_local] Removing temporary directories
$ ls .tmp/
# Empty or missing

Root Cause: Framework cleans up temporary directories by default to avoid disk bloat.

Fix:

# Persist logs to a specific directory
NOMOS_LOG_DIR=/tmp/test-logs \
NOMOS_TESTS_KEEP_LOGS=1 \
POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin local_runner

# Logs persist after run
ls /tmp/test-logs/
# nomos-node-0.2024-12-18T14-30-00.log
# nomos-node-1.2024-12-18T14-30-00.log
# ...

11. Consensus Timing Too Tight / Run Duration Too Short

Symptoms:

  • “Consensus liveness expectation failed”
  • Only 1-2 blocks produced (or zero)
  • Nodes appear healthy but not making progress

What you’ll see:

$ POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin local_runner
[INFO  testing_framework_core] Starting workloads
[INFO  testing_framework_core] Run window: 10 seconds
[INFO  testing_framework_core] Evaluating expectations
[ERROR testing_framework_core] Consensus liveness expectation failed: expected min 5 blocks, got 1
thread 'main' panicked at 'expectations failed'

Root Cause: The run duration is too short for the configured consensus timing. If CONSENSUS_SLOT_TIME=20s but the run window is only 10s, there is barely time to produce a single block, let alone the expected minimum.

Fix:

use std::time::Duration;

use testing_framework_core::scenario::ScenarioBuilder;
use testing_framework_workflows::ScenarioBuilderExt;

// Increase run duration to allow more blocks.
let scenario = ScenarioBuilder::topology_with(|t| t.network_star().validators(3).executors(1))
    .expect_consensus_liveness()
    .with_run_duration(Duration::from_secs(120)) // ← Give more time
    .build();

Or adjust consensus timing (if you control node config):

# Faster block production (shorter slot time)
CONSENSUS_SLOT_TIME=5 \
CONSENSUS_ACTIVE_SLOT_COEFF=0.9 \
POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin local_runner

Summary: Quick Checklist for Failed Runs

When a test fails, check these in order:

  1. POL_PROOF_DEV_MODE=true is set (REQUIRED for all runners)
  2. versions.env exists at repo root
  3. KZG circuit assets present (for DA workloads): testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params
  4. Node binaries available (NOMOS_NODE_BIN / NOMOS_EXECUTOR_BIN set, or using run-examples.sh)
  5. Docker daemon running (for compose/k8s)
  6. Docker image built (logos-blockchain-testing:local exists for compose/k8s)
  7. No port conflicts (lsof -i :18080, kill orphaned processes)
  8. Sufficient wallets (.wallets(N).users(M))
  9. Enough resources (Docker memory 8GB+, ulimit -n 4096)
  10. Run duration appropriate (long enough for consensus timing)
  11. Logs persisted (NOMOS_LOG_DIR + NOMOS_TESTS_KEEP_LOGS=1 if needed)

Still stuck? Check node logs (see Where to Find Logs) for the actual error.

Where to Find Logs

Log Location Quick Reference

  • Host (local): Default output goes to per-run temporary directories under the current working directory (removed unless NOMOS_TESTS_KEEP_LOGS=1). With NOMOS_LOG_DIR set, per-node files are written with prefix nomos-node-{index}. Access: cat $NOMOS_LOG_DIR/nomos-node-0*
  • Compose: Default output is Docker container stdout/stderr. For file logging, set tracing_settings.logger: !File in testing-framework/assets/stack/cfgsync.yaml (and mount a writable directory). Access: docker ps, then docker logs <container-id>
  • K8s: Default output is pod stdout/stderr. For file logging, set tracing_settings.logger: !File in testing-framework/assets/stack/cfgsync.yaml (and mount a writable directory). Access: kubectl logs -l nomos/logical-role=validator

Important Notes:

  • Host runner (local processes): Per-run temporary directories are created under the current working directory and removed after the run unless NOMOS_TESTS_KEEP_LOGS=1. To write per-node log files to a stable location, set NOMOS_LOG_DIR=/path/to/logs.
  • Compose/K8s: Node log destination is controlled by testing-framework/assets/stack/cfgsync.yaml (tracing_settings.logger). By default, rely on docker logs or kubectl logs.
  • File naming: Log files use prefix nomos-node-{index}* or nomos-executor-{index}* with timestamps, e.g., nomos-node-0.2024-12-01T10-30-45.log (NOT just .log suffix).
  • Container names: Compose containers include a per-run project UUID, e.g., nomos-compose-<uuid>-validator-0-1, where <uuid> is randomly generated for each run.

Accessing Node Logs by Runner

Local Runner

Console output (default):

POL_PROOF_DEV_MODE=true cargo run -p runner-examples --bin local_runner 2>&1 | tee test.log

Persistent file output:

NOMOS_LOG_DIR=/tmp/debug-logs \
NOMOS_LOG_LEVEL=debug \
POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin local_runner

# Inspect logs (note: filenames include timestamps):
ls /tmp/debug-logs/
# Example: nomos-node-0.2024-12-01T10-30-45.log
tail -f /tmp/debug-logs/nomos-node-0*  # Use wildcard to match timestamp

Compose Runner

Stream live logs:

# List running containers (note the UUID prefix in names)
docker ps --filter "name=nomos-compose-"

# Find your container ID or name from the list, then:
docker logs -f <container-id>

# Or filter by name pattern:
docker logs -f $(docker ps --filter "name=nomos-compose-.*-validator-0" -q | head -1)

# Show last 100 lines
docker logs --tail 100 <container-id>

Keep containers for post-mortem debugging:

COMPOSE_RUNNER_PRESERVE=1 \
NOMOS_TESTNET_IMAGE=logos-blockchain-testing:local \
POL_PROOF_DEV_MODE=true \
cargo run -p runner-examples --bin compose_runner

# OR: Use run-examples.sh (handles setup automatically)
COMPOSE_RUNNER_PRESERVE=1 scripts/run/run-examples.sh -t 60 -v 1 -e 1 compose

# After test failure, containers remain running:
docker ps --filter "name=nomos-compose-"
docker exec -it <container-id> /bin/sh
docker logs <container-id> > debug.log

Note: Container names follow the pattern nomos-compose-{uuid}-validator-{index}-1 or nomos-compose-{uuid}-executor-{index}-1, where {uuid} is randomly generated per run.

K8s Runner

Important: Always verify your namespace and use label selectors instead of assuming pod names.

Stream pod logs (use label selectors):

# Check your namespace first
kubectl config view --minify | grep namespace

# All validator pods (add -n <namespace> if not using default)
kubectl logs -l nomos/logical-role=validator -f

# All executor pods
kubectl logs -l nomos/logical-role=executor -f

# Specific pod by name (find exact name first)
kubectl get pods -l nomos/logical-role=validator  # Find the exact pod name
kubectl logs -f <actual-pod-name>        # Then use it

# With explicit namespace
kubectl logs -n my-namespace -l nomos/logical-role=validator -f

Download logs from crashed pods:

# Previous logs from crashed pod
kubectl get pods -l nomos/logical-role=validator  # Find crashed pod name first
kubectl logs --previous <actual-pod-name> > crashed-validator.log

# Or use label selector for all crashed validators
for pod in $(kubectl get pods -l nomos/logical-role=validator -o name); do
  kubectl logs --previous $pod > $(basename $pod)-previous.log 2>&1
done

Access logs from all pods:

# All pods in current namespace
for pod in $(kubectl get pods -o name); do
  echo "=== $pod ==="
  kubectl logs $pod
done > all-logs.txt

# Or use label selectors (recommended)
kubectl logs -l nomos/logical-role=validator --tail=500 > validators.log
kubectl logs -l nomos/logical-role=executor --tail=500 > executors.log

# With explicit namespace
kubectl logs -n my-namespace -l nomos/logical-role=validator --tail=500 > validators.log

Debugging Workflow

When a test fails, follow this sequence:

1. Check Framework Output

Start with the test harness output—did expectations fail? Was there a deployment error?

Look for:

  • Expectation failure messages
  • Timeout errors
  • Deployment/readiness failures

2. Verify Node Readiness

Ensure all nodes started successfully and became ready before workloads began.

Commands:

# Local: check process list
ps aux | grep nomos

# Compose: check container status (note UUID in names)
docker ps -a --filter "name=nomos-compose-"

# K8s: check pod status (use label selectors, add -n <namespace> if needed)
kubectl get pods -l nomos/logical-role=validator
kubectl get pods -l nomos/logical-role=executor
kubectl describe pod <actual-pod-name>  # Get name from above first

3. Inspect Node Logs

Focus on the first node that exhibited problems or the node with the highest index (often the last to start).

Common error patterns:

  • “ERROR: versions.env missing” → missing required versions.env file at repository root
  • “Failed to bind address” → port conflict
  • “Connection refused” → peer not ready or network issue
  • “Proof verification failed” or “Proof generation timeout” → missing POL_PROOF_DEV_MODE=true (REQUIRED for all runners)
  • “Failed to load KZG parameters” or “Circuit file not found” → missing KZG circuit assets at testing-framework/assets/stack/kzgrs_test_params/
  • “Insufficient funds” → wallet seeding issue (increase .wallets(N) or reduce .users(M))

4. Check Log Levels

If logs are too sparse, increase verbosity:

NOMOS_LOG_LEVEL=debug \
NOMOS_LOG_FILTER="cryptarchia=trace,nomos_da_sampling=debug" \
cargo run -p runner-examples --bin local_runner

If metric updates are polluting your logs (fields like counter.* / gauge.*), move those events to a dedicated tracing target (e.g. target: "nomos_metrics") and set NOMOS_LOG_FILTER="nomos_metrics=off,..." so they don’t get formatted into log output.

5. Verify Observability Endpoints

If expectations report observability issues:

Prometheus (Compose):

curl http://localhost:9090/-/healthy

Node HTTP APIs:

curl http://localhost:18080/consensus/info  # Adjust port per node

6. Compare with Known-Good Scenario

Run a minimal baseline test (e.g., 2 validators, consensus liveness only). If it passes, the issue is in your workload or topology configuration.
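
A minimal baseline might look like the following sketch. The calls mirror the examples earlier in this guide; the 2-validator/1-executor shape and 30-second window are illustrative choices, not required values.

use std::time::Duration;

use testing_framework_core::scenario::{Deployer as _, ScenarioBuilder};
use testing_framework_runner_local::LocalDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    // Baseline: small topology, no workloads, consensus liveness only.
    let mut scenario = ScenarioBuilder::topology_with(|t| {
        t.network_star().validators(2).executors(1)
    })
    .expect_consensus_liveness()
    .with_run_duration(Duration::from_secs(30))
    .build();

    let deployer = LocalDeployer::default();
    let runner = deployer.deploy(&scenario).await?;
    runner.run(&mut scenario).await?;
    Ok(())
}

If this baseline passes but your real scenario fails, diff the two definitions one change at a time (workload rates, topology size, duration).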

Common Error Messages

“Consensus liveness expectation failed”

  • Cause: Not enough blocks produced during the run window, missing POL_PROOF_DEV_MODE=true (causes slow proof generation), or missing KZG assets for DA workloads.
  • Fix:
    1. Verify POL_PROOF_DEV_MODE=true is set (REQUIRED for all runners).
    2. Verify KZG assets exist at testing-framework/assets/stack/kzgrs_test_params/ (for DA workloads).
    3. Extend with_run_duration() to allow more blocks.
    4. Check node logs for proof generation or DA errors.
    5. Reduce transaction/DA rate if nodes are overwhelmed.

“Wallet seeding failed”

  • Cause: Topology doesn’t have enough funded wallets for the workload.
  • Fix: Increase .wallets(N) count or reduce .users(M) in the transaction workload (ensure N ≥ M).

“Node control not available”

  • Cause: Runner doesn’t support node control (only ComposeDeployer does), or enable_node_control() wasn’t called.
  • Fix:
    1. Use ComposeDeployer for chaos tests (LocalDeployer and K8sDeployer don’t support node control).
    2. Ensure .enable_node_control() is called in the scenario before .chaos() (see the sketch below).
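
A hedged sketch of the required ordering is shown below. It assumes ComposeDeployer is exported from the testing_framework_runner_compose crate and constructed via Default, mirroring LocalDeployer; the exact chaos builder arguments are not shown here, so check the workloads guide for the real signature.

use std::time::Duration;

use testing_framework_core::scenario::{Deployer as _, ScenarioBuilder};
use testing_framework_runner_compose::ComposeDeployer; // assumed crate path/export
use testing_framework_workflows::ScenarioBuilderExt;

#[tokio::main]
async fn main() -> anyhow::Result<()> {
    let mut scenario = ScenarioBuilder::topology_with(|t| {
        t.network_star().validators(3).executors(1)
    })
    .enable_node_control() // must be enabled before the chaos workload
    .chaos()               // exact chaos configuration API: see the workloads guide
    .expect_consensus_liveness()
    .with_run_duration(Duration::from_secs(120))
    .build();

    // Only the Compose runner exposes node control; Local and K8s runners do not.
    let deployer = ComposeDeployer::default();
    let runner = deployer.deploy(&scenario).await?;
    runner.run(&mut scenario).await?;
    Ok(())
}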

“Readiness timeout”

  • Cause: Nodes didn’t become responsive within expected time (often due to missing prerequisites).
  • Fix:
    1. Verify POL_PROOF_DEV_MODE=true is set (REQUIRED for all runners—without it, proof generation is too slow).
    2. Check node logs for startup errors (port conflicts, missing assets).
    3. Verify network connectivity between nodes.
    4. For DA workloads, ensure KZG circuit assets are present.

“ERROR: versions.env missing”

  • Cause: Helper scripts (run-examples.sh, build-bundle.sh, setup-circuits-stack.sh) require the versions.env file at the repository root.
  • Fix: Ensure you’re running from the repository root directory. The versions.env file should already exist and contain:
  VERSION=<circuit release tag>
  NOMOS_NODE_REV=<nomos-node git revision>
  NOMOS_BUNDLE_VERSION=<bundle schema version>

Use the checked-in versions.env at the repository root as the source of truth.

“Port already in use”

  • Cause: Previous test didn’t clean up, or another process holds the port.
  • Fix: Kill orphaned processes (pkill nomos-node), wait for Docker cleanup (docker compose down), or restart Docker.

“Image not found: logos-blockchain-testing:local”

  • Cause: Docker image not built for Compose/K8s runners, or KZG assets not baked into the image.
  • Fix (recommended): Use run-examples.sh which handles everything:
    scripts/run/run-examples.sh -t 60 -v 1 -e 1 compose
    
  • Fix (manual):
    1. Build bundle: scripts/build/build-bundle.sh --platform linux
    2. Set bundle path: export NOMOS_BINARIES_TAR=.tmp/nomos-binaries-linux-v0.3.1.tar.gz
    3. Build image: scripts/build/build_test_image.sh
    4. kind/minikube: load the image into the cluster nodes (e.g. kind load docker-image logos-blockchain-testing:local, or minikube image load ...), or push to a registry and set NOMOS_TESTNET_IMAGE accordingly.

“Failed to load KZG parameters” or “Circuit file not found”

  • Cause: DA workload requires KZG circuit assets. The file testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params (note repeated filename) must exist. Inside containers, it’s at /kzgrs_test_params/kzgrs_test_params.
  • Fix (recommended): Use run-examples.sh which handles setup:
    scripts/run/run-examples.sh -t 60 -v 1 -e 1 <mode>
    
  • Fix (manual):
    1. Fetch assets: scripts/setup/setup-nomos-circuits.sh v0.3.1 /tmp/nomos-circuits
    2. Copy to expected path: cp -r /tmp/nomos-circuits/* testing-framework/assets/stack/kzgrs_test_params/
    3. Verify file exists: ls -lh testing-framework/assets/stack/kzgrs_test_params/kzgrs_test_params
    4. For Compose/K8s: rebuild image with assets baked in

For detailed logging configuration and observability setup, see Logging & Observability.

FAQ

Why block-oriented timing?
Slots advance at a fixed rate (NTP-synchronized, 2s by default), so reasoning about blocks and consensus intervals keeps assertions aligned with protocol behavior rather than arbitrary wall-clock durations.

Can I reuse the same scenario across runners?
Yes. The plan stays the same; swap runners (local, compose, k8s) to target different environments.
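
For example, the same scenario value can be handed to different deployers; only the deployer changes. In the sketch below, the compose deployer’s crate path and Default constructor are assumptions mirroring the local runner.

use std::time::Duration;

use testing_framework_core::scenario::{Deployer as _, ScenarioBuilder};
use testing_framework_runner_compose::ComposeDeployer; // assumed export
use testing_framework_runner_local::LocalDeployer;
use testing_framework_workflows::ScenarioBuilderExt;

async fn run(use_compose: bool) -> anyhow::Result<()> {
    // The plan is identical regardless of where it runs.
    let mut scenario = ScenarioBuilder::topology_with(|t| {
        t.network_star().validators(3).executors(1)
    })
    .transactions_with(|tx| tx.rate(10).users(5))
    .expect_consensus_liveness()
    .with_run_duration(Duration::from_secs(60))
    .build();

    // Swap the deployer to change the target environment.
    if use_compose {
        let runner = ComposeDeployer::default().deploy(&scenario).await?;
        runner.run(&mut scenario).await?;
    } else {
        let runner = LocalDeployer::default().deploy(&scenario).await?;
        runner.run(&mut scenario).await?;
    }
    Ok(())
}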

When should I enable chaos workloads?
Only when testing resilience or operational recovery; keep functional smoke tests deterministic.

How long should runs be?
The framework enforces a minimum of 2× the slot duration (4 seconds with the default 2s slots), but the practical recommendations are:

  • Smoke tests: 30s minimum (~14 blocks with default 2s slots, 0.9 coefficient)
  • Transaction workloads: 60s+ (~27 blocks) to observe inclusion patterns
  • DA workloads: 90s+ (~40 blocks) to account for dispersal and sampling
  • Chaos tests: 120s+ (~54 blocks) to allow recovery after restarts

Very short runs (< 30s) risk false confidence—one or two lucky blocks don’t prove liveness.
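
The block counts above follow from a simple estimate: expected blocks ≈ run duration ÷ slot time × active-slot coefficient. A quick sketch of that arithmetic (plain Rust, not a framework API), using the defaults stated above of 2s slots and a 0.9 coefficient:

// Rough estimate only: actual block production is probabilistic (slot lottery).
fn expected_blocks(run_secs: f64, slot_secs: f64, active_slot_coeff: f64) -> f64 {
    run_secs / slot_secs * active_slot_coeff
}

fn main() {
    for run_secs in [30.0, 60.0, 90.0, 120.0] {
        // With 2s slots and a 0.9 coefficient: ~14, ~27, ~40, ~54 blocks.
        println!("{run_secs}s run ≈ {:.0} blocks", expected_blocks(run_secs, 2.0, 0.9));
    }
}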

Do I always need seeded wallets?
Only for transaction scenarios. Data-availability or pure chaos scenarios may not require them, but liveness checks still need validators producing blocks.

What if expectations fail but workloads “look fine”?
Trust expectations first—they capture the intended success criteria. Use the observability signals and runner logs to pinpoint why the system missed the target.

Glossary

  • Validator: node role responsible for participating in consensus and block production.
  • Executor: a validator node with the DA dispersal service enabled. Executors can submit transactions and disperse blob data to the DA network, in addition to performing all validator functions.
  • DA (Data Availability): subsystem ensuring blobs or channel data are published and retrievable for validation.
  • Deployer: component that provisions infrastructure (spawns processes, creates containers, or launches pods), waits for readiness, and returns a Runner. Examples: LocalDeployer, ComposeDeployer, K8sDeployer.
  • Runner: component returned by deployers that orchestrates scenario execution—starts workloads, observes signals, evaluates expectations, and triggers cleanup.
  • Workload: traffic or behavior generator that exercises the system during a scenario run.
  • Expectation: post-run assertion that judges whether the system met the intended success criteria.
  • Topology: declarative description of the cluster shape, roles, and high-level parameters for a scenario.
  • Scenario: immutable plan combining topology, workloads, expectations, and run duration.
  • Blockfeed: stream of block observations used for liveness or inclusion signals during a run.
  • Control capability: the ability for a runner to start, stop, or restart nodes, used by chaos workloads.
  • Slot duration: time interval between consensus rounds in Cryptarchia. Blocks are produced at multiples of the slot duration based on lottery outcomes.
  • Block cadence: observed rate of block production in a live network, measured in blocks per second or seconds per block.
  • Cooldown: waiting period after a chaos action (e.g., node restart) before triggering the next action, allowing the system to stabilize.
  • Run window: total duration a scenario executes, specified via with_run_duration(). Framework auto-extends to at least 2× slot duration.
  • Readiness probe: health check performed by runners to ensure nodes are reachable and responsive before starting workloads. Prevents false negatives from premature traffic.
  • Liveness: property that the system continues making progress (producing blocks) under specified conditions. Contrasts with safety/correctness which verifies that state transitions are accurate.
  • State assertion: expectation that verifies specific values in the system state (e.g., wallet balances, UTXO sets) rather than just progress signals. Also called “correctness expectations.”
  • Mantle transaction: transaction type in Logos that can contain UTXO transfers (LedgerTx) and operations (Op), including channel data (ChannelBlob).
  • Channel: logical grouping for DA blobs; each blob belongs to a channel and references a parent blob in the same channel, creating a chain of related data.
  • POL_PROOF_DEV_MODE: environment variable that disables expensive Groth16 zero-knowledge proof generation for leader election. Required for all runners (local, compose, k8s) for practical testing—without it, proof generation causes timeouts. Should never be used in production environments.

External Resources