Shadow Configuration Specification

Shadow uses the standard YAML 1.2 format to accept configuration options, with the following extensions:

The following describes Shadow's YAML format and all of the options that Shadow supports that can be used to customize a simulation.

Example:


general:
  stop_time: 2 min
network:
  graph:
    type: gml
    inline: |
      graph [
        node [
          id 0
          host_bandwidth_down "140 Mbit"
          host_bandwidth_up "18 Mbit"
        ]
        edge [
          source 0
          target 0
          latency "50 ms"
          packet_loss 0.01
        ]
      ]
hosts:
  server:
    network_node_id: 0
    processes:
    - path: /usr/sbin/nginx
      args: -c ../../../nginx.conf -p .
      start_time: 1
      expected_final_state: running
  client1: &client_host
    network_node_id: 0
    host_options:
      log_level: debug
    processes:
    - path: /usr/bin/curl
      args: server --silent
      start_time: 5
  client2: *client_host
  client3: *client_host

general
general.bootstrap_end_time
general.data_directory
general.heartbeat_interval
general.log_level
general.model_unblocked_syscall_latency
general.parallelism
general.progress
general.seed
general.stop_time
general.template_directory
network
network.graph
network.graph.type
network.graph.<file|inline>
network.graph.file.path
network.graph.file.compression
network.use_shortest_path
experimental
experimental.interface_qdisc
experimental.max_unapplied_cpu_latency
experimental.native_preemption_enabled
experimental.native_preemption_native_interval
experimental.native_preemption_sim_interval
experimental.report_errors_to_stderr
experimental.runahead
experimental.scheduler
experimental.socket_recv_autotune
experimental.socket_recv_buffer
experimental.socket_send_autotune
experimental.socket_send_buffer
experimental.strace_logging_mode
experimental.unblocked_syscall_latency
experimental.unblocked_vdso_latency
experimental.use_cpu_pinning
experimental.use_dynamic_runahead
experimental.use_memory_manager
experimental.use_new_tcp
experimental.use_object_counters
experimental.use_preload_libc
experimental.use_preload_openssl_crypto
experimental.use_preload_openssl_rng
experimental.use_sched_fifo
experimental.use_syscall_counters
experimental.use_worker_spinning
host_option_defaults
host_option_defaults.log_level
host_option_defaults.pcap_capture_size
host_option_defaults.pcap_enabled
hosts
hosts.<hostname>.bandwidth_down
hosts.<hostname>.bandwidth_up
hosts.<hostname>.ip_addr
hosts.<hostname>.network_node_id
hosts.<hostname>.host_options
hosts.<hostname>.processes
hosts.<hostname>.processes[*].args
hosts.<hostname>.processes[*].environment
hosts.<hostname>.processes[*].expected_final_state
hosts.<hostname>.processes[*].path
hosts.<hostname>.processes[*].shutdown_signal
hosts.<hostname>.processes[*].shutdown_time
hosts.<hostname>.processes[*].start_time

`general.bootstrap_end_time`

Default: "0 sec"
Type: String OR Integer

The simulated time that ends Shadow's high network bandwidth/reliability bootstrap period.

If the bootstrap end time is greater than 0, Shadow uses a simulation bootstrapping period where hosts have unrestricted network bandwidth and no packet drop. This can help to bootstrap large networks quickly when the network hosts have low network bandwidth or low network reliability.

`general.data_directory`

Default: "shadow.data"
Type: String

Path to store simulation output.

`general.heartbeat_interval`

Default: "1 sec"
Type: String OR Integer OR null

Interval at which to print simulation heartbeat messages.

`general.log_level`

Default: "info"
Type: "error" OR "warning" OR "info" OR "debug" OR "trace"

Log level of output written on stdout. If Shadow was built in release mode, then messages at level 'trace' will always be dropped.

`general.model_unblocked_syscall_latency`

Default: false
Type: Bool

Whether to model syscalls and VDSO functions that don't block as having some latency. This should have minimal effect on typical simulations, but can be helpful for programs with "busy loops" that otherwise deadlock under Shadow.

`general.parallelism`

Default: 0
Type: Integer

How many parallel threads to use to run the simulation. Optimal performance is usually obtained with the number of physical CPU cores (nproc without hyperthreading or nproc/2 with hyperthreading).

A value of 0 will allow Shadow to choose the number of threads, typically the number of physical CPU cores available in the current CPU affinity mask and cgroup.

Virtual hosts depend on network packets that can potentially arrive from other virtual hosts, so each worker can only advance according to the propagation delay to avoid dependency violations. Therefore, not all threads will have 100% CPU utilization.

`general.progress`

Default: false
Type: Bool

Show the simulation progress on stderr.

When running in a tty, the progress will be updated every second and shown at the bottom of the terminal. Otherwise the progress will be printed without ANSI escape codes at intervals which increase as the simulation progresses.

`general.seed`

Default: 1
Type: Integer

Initialize randomness using seed N.

`general.stop_time`

Required
Type: String OR Integer

The simulated time at which the simulation ends.

`general.template_directory`

Default: null
Type: String OR null

Path to recursively copy during startup and use as the data-directory.

`network`

Required

Network settings.

`network.graph`

Required

The network topology graph.

A network topology represented by a connected graph with certain attributes specified on the network nodes and edges. For more information on how to structure this data, see the Network Graph Overview.

Example:


network:
  graph:
    type: gml
    inline: |
      graph [
        ...
      ]

`network.graph.type`

Required
Type: "gml" OR "1_gbit_switch"

The network graph can be specified in the GML format, or a built-in "1_gbit_switch" graph with a single network node can be used instead.

The built-in "1_gbit_switch" graph contains the following:


graph [
  directed 0
  node [
    id 0
    host_bandwidth_up "1 Gbit"
    host_bandwidth_down "1 Gbit"
  ]
  edge [
    source 0
    target 0
    latency "1 ms"
    packet_loss 0.0
  ]
]

`network.graph.<file|inline>`

Required if network.graph.type is "gml"
Type: Object OR String

If the network graph type is not a built-in network graph, the graph data can be specified as a path to an external file, or as an inline string.

`network.graph.file.path`

Required
Type: String

The path to the file.

If the path begins with ~/, it will be considered relative to the current user's home directory. No other shell expansion is performed on the path.

`network.graph.file.compression`

Default: null
Type: "xz" OR null

The file's compression format.

`network.use_shortest_path`

Default: true
Type: Bool

When routing packets, follow the shortest path rather than following a direct edge between network nodes. If false, the network graph is required to be complete (including self-loops) and to have exactly one edge between any two nodes.

`experimental`

Experimental experiment settings. Unstable and may change or be removed at any time, regardless of Shadow version.

`experimental.interface_qdisc`

Default: "fifo"
Type: "fifo" OR "round-robin"

The queueing discipline to use at the network interface.

`experimental.max_unapplied_cpu_latency`

Default: "1 microsecond"
Type: String

Max amount of execution-time latency allowed to accumulate before the clock is moved forward. Moving the clock forward is a potentially expensive operation, so larger values reduce simulation overhead, at the cost of coarser time jumps.

Note also that accumulated-but-unapplied latency is discarded when a thread is blocked on a syscall.

No effect when CPU latency isn't being modeled, e.g. via general.model_unblocked_syscall_latency or experimental.native_preemption_enabled.

`experimental.native_preemption_enabled`

Default: false
Type: Bool

When true, and when managed code runs for an extended time without returning control to shadow (e.g. by making a syscall), shadow preempts the managed code and moves simulated time forward.

This usually shouldn't be needed, and breaks simulation determinism, but can be used to escape "pure-CPU busy-loops". See [limitations.md#cpu-busy-loops].

`experimental.native_preemption_native_interval`

Default: "100 milliseconds"
Type: String

When native_preemption_enabled is true, amount of native CPU-time to wait before preempting managed code that hasn't returned control to shadow.

Using a relatively long value here avoids triggering preemption when it isn't needed (and thereby unnecessarily reducing determinism of the simulation), but may cause the simulation to take longer to escape a "CPU-only busy-loop" when it is needed.

Only supports microsecond granularity, and values below 1 microsecond are rejected.

No effect when native_preemption_enabled is false.

`experimental.native_preemption_sim_interval`

Default: "10 milliseconds"
Type: String

When native_preemption_enabled is true, amount of simulated time to consume after native_preemption_native_interval has elapsed without returning control to shadow.

Larger values here may mean fewer preemptions, and therefore less real time, are required to escape a CPU-only busy loop, but result in larger time-jumps inside the simulation, which may have unexpected effects.

For simulation efficiency, this latency is only actually applied when max_unapplied_cpu_latency is reached.

No effect when native_preemption_enabled is false.

`experimental.report_errors_to_stderr`

Default: true
Type: Bool

Report Error-level log messages to shadow's stderr in addition to logging them to stdout.

`experimental.runahead`

Default: "1 ms"
Type: String OR null

If set, overrides the automatically calculated minimum time workers may run ahead when sending events between virtual hosts.

`experimental.scheduler`

Default: "thread-per-core"
Type: "thread-per-core" OR "thread-per-host"

The host scheduler implementation, which decides how to assign hosts to threads and threads to CPU cores.

`experimental.socket_recv_autotune`

Default: true
Type: Bool

Enable receive window autotuning.

`experimental.socket_recv_buffer`

Default: "174760 B"
Type: String OR Integer

Initial size of the socket's receive buffer.

`experimental.socket_send_autotune`

Default: true
Type: Bool

Enable send window autotuning.

`experimental.socket_send_buffer`

Default: "131072 B"
Type: String OR Integer

Initial size of the socket's send buffer.

`experimental.strace_logging_mode`

Default: "off"
Type: "off" OR "standard" OR "deterministic"

Log the syscalls for each process to individual "strace" files.

The mode determines the format that the syscalls are logged in. For example, the "deterministic" mode will avoid logging memory addresses or potentially uninitialized memory.

The logs will be stored at shadow.data/hosts/<hostname>/<procname>.<pid>.strace.

Limitations:

Syscalls run natively will not log the syscall arguments or return value (for example SYS_getcwd).
Syscalls processed within Shadow's C code will not log the syscall arguments.
Syscalls that are interrupted by a signal may not be logged (for example SYS_read).
Syscalls that are interrupted by a signal may be logged inaccurately. For example, the log may show syscall(...) = -1 (EINTR), but the managed process may not actually see this return value. Instead the syscall may be restarted.

`experimental.unblocked_syscall_latency`

Default: "1 microseconds"
Type: String

The simulated latency of an unblocked syscall. For simulation efficiency, this latency is only added when max_unapplied_cpu_latency is reached.

Ignored when general.model_unblocked_syscall_latency is false.

`experimental.unblocked_vdso_latency`

The default of 65535 bytes is the maximum length of an IP packet.

`host_option_defaults.pcap_enabled`

Default: false
Type: Bool

Should Shadow generate pcap files?

Logs all network input and output for this host in PCAP format (for viewing in e.g. wireshark). The pcap files will be stored in the host's data directory, for example shadow.data/hosts/myhost/eth0.pcap.

`hosts`

Required
Type: Object

The simulated hosts which execute processes. Each field corresponds to a host configuration, with the field name being used as the network hostname. A hostname must follow the character requirements of hostname(7).

Shadow assigns each host to a network node in the network graph.

In Shadow, each host is given an RNG whose seed is derived from the global seed (general.seed) and the hostname. This means that changing a host's name will change that host's RNG seed, subtly affecting the simulation results.

`hosts.<hostname>.bandwidth_down`

Default: null
Type: String OR Integer OR null

Downstream bandwidth capacity of the host.

Overrides any default bandwidth values set in the assigned network graph node.

`hosts.<hostname>.bandwidth_up`

Default: null
Type: String OR Integer OR null

Upstream bandwidth capacity of the host.

Overrides any default bandwidth values set in the assigned network graph node.

`hosts.<hostname>.ip_addr`

Default: null
Type: String OR null

IP address to assign to the host.

This IP address must not conflict with the address of any other host (two hosts must not have the same IP address). The address must also not be 0.0.0.0, a loopback address, a multicast address, or a broadcast address.

If set to null, an address will be chosen for the host automatically. Automatic addresses begin at 11.0.0.1, assigned to hosts in alphabetical order according to their hostname. Shadow's automatic IP address assignment is meant for hosts where their specific IP is unimportant. It's recommended to not rely on Shadow's specific IP assignment behaviour, and to specify IP addresses explicitly when a fixed IP address is needed.

`hosts.<hostname>.network_node_id`

Required
Type: Integer

Network graph node ID to assign the host to.

`hosts.<hostname>.host_options`

See host_option_defaults for supported fields.

Example:


hosts:
  client:
    ...
    host_options:
      log_level: debug

`hosts.<hostname>.processes`

Required
Type: Array

Virtual software processes that the host will run.

`hosts.<hostname>.processes[*].args`

Default: ""
Type: String OR Array of String

Process arguments.

The arguments can be specified as a string in a shell command-line format:


args: "--user-agent 'Mozilla/5.0 (compatible; ...)' http://myserver:8080"

Or as an array of strings:


args: ['--user-agent', 'Mozilla/5.0 (compatible; ...)', 'http://myserver:8080']

Shell expansion (which includes ~/ expansion) is not performed on either format. In the command-line format, the string is parsed as an argument vector following typical shell quotation parsing rules.

`hosts.<hostname>.processes[*].environment`

Default: ""
Type: Object

Environment variables passed when executing this process.

Shell expansion (which includes ~/ expansion) is not performed on any fields.

Examples:


environment:
  ENV_A: "1"
  ENV_B: foo


environment: { ENV_A: "1", ENV_B: foo }

`hosts.<hostname>.processes[*].expected_final_state`

Default: {exited: 0}
Type: {"exited": <Integer>} OR {"signaled": Unix Signal} OR "running"

The expected state of the process at the end of the simulation. If the process exits before the end of the simulation with an unexpected state, or is still running at the end of the simulation when this was not running, shadow will log an error and return a non-zero status for the simulation.

Use exited to indicate that a process should have exited normally; e.g. by returning from main or calling exit.

Use signaled to indicate that a process should have been killed by a signal.

Use running for a process expected to still be running at the end of the simulation, such as a server process that you didn't arrange to shutdown before the end of the simulation. (All processes will be killed by Shadow when the simulation ends).

Examples:

{exited: 0}
{exited: 1}
{signaled: SIGINT}
{signaled: 9}
running

Only processes started directly from the configuration have an expected_final_state. Processes that those processes start (e.g. via fork in C, or running an executable in a shell script) don't have one. Generally it's the parent process's responsibility to do any necessary validation of the exit status of its children (e.g. via waitpid in C, or checking $? in a bash script).

`hosts.<hostname>.processes[*].path`

Required
Type: String

If the path begins with ~/, it will be considered relative to the current user's home directory. No other shell expansion is performed on the path.

Bare file basenames like sleep will be located using Shadow's PATH environment variable (e.g. to /usr/bin/sleep).

`hosts.<hostname>.processes[*].shutdown_signal`

Default: "SIGTERM"
Type: Unix Signal

The signal that will be sent to the process at hosts.<hostname>.processes[*].shutdown_time. Signals specified by name should be all-caps and include the SIG prefix; e.g. "SIGTERM".

Many long-running processes support exiting cleanly when sent SIGTERM or SIGINT.

If the process is expected to be killed directly by the signal instead of catching it and exiting cleanly, you can set expected_final_state to prevent Shadow from interpreting this as an error. e.g. SIGKILL cannot be caught, so will always result in an end state of {signaled: SIGKILL} if the process didn't already exit before the signal was sent.


path: sleep
args: "1000"
start_time: 1s
shutdown_time: 2s
shutdown_signal: SIGKILL
expected_final_state: {signaled: SIGKILL}

`hosts.<hostname>.processes[*].shutdown_time`

Default: null
Type: String OR Integer OR null

The simulated time at which to send hosts.<hostname>.processes[*].shutdown_signal to the process. This must be before general.stop_time.

`hosts.<hostname>.processes[*].start_time`

Default: "0 sec"
Type: String OR Integer

The simulated time at which to execute the process. This must be before general.stop_time.

Keyboard shortcuts

The Shadow Simulator