Task Spawning

The main entrypoint to Dagger is @spawn:

Dagger.@spawn [option=value]... f(args...; kwargs...)

or spawn if it's more convenient:

Dagger.spawn(f, Dagger.Options(options), args...; kwargs...)

When called, it creates an DTask (also known as a "task" or "thunk") object representing a call to function f with the arguments args and keyword arguments kwargs. If it is called with other tasks as args/kwargs, such as in Dagger.@spawn f(Dagger.@spawn g()), then, in this example, the function f gets passed the results of executing g(), once that result is available. If g() isn't yet finished executing, then the execution of f waits on g() to complete before executing.

An important observation to make is that, for each argument to @spawn/spawn, if the argument is the result of another @spawn/spawn call (thus it's an DTask), the argument will be computed first, and then its result will be passed into the function receiving the argument. If the argument is not an DTask (instead, some other type of Julia object), it'll be passed as-is to the function f (with some exceptions).

Task / thread occupancy

By default, Dagger assumes that tasks saturate the thread they are running on and does not try to schedule other tasks on the thread. This default can be controlled by specifying Options (more details can be found under Task and Scheduler options). The section Changing the thread occupancy shows a runnable example of how to achieve this.

Options

The Options struct in the second argument position is optional; if provided, it is passed to the scheduler to control its behavior. Options contains option key-value pairs, which can be any field in Options (see Task and Scheduler options).

Simple example

Let's see a very simple directed acyclic graph (or DAG) constructed with Dagger:

using Dagger

add1(value) = value + 1
add2(value) = value + 2
combine(a...) = sum(a)

p = Dagger.@spawn add1(4)
q = Dagger.@spawn add2(p)
r = Dagger.@spawn add1(3)
s = Dagger.@spawn combine(p, q, r)

@assert fetch(s) == 16

The tasks p, q, r, and s have the following structure:

graph

The final result (from fetch(s)) is the obvious consequence of the operation:

add1(4) + add2(add1(4)) + add1(3)

(4 + 1) + ((4 + 1) + 2) + (3 + 1) == 16

Eager Execution

Dagger's @spawn macro works similarly to @async and Threads.@spawn: when called, it wraps the function call specified by the user in an DTask object, and immediately places it onto a running scheduler, to be executed once its dependencies are fulfilled.

x = rand(400,400)
y = rand(400,400)
zt = Dagger.@spawn x * y
z = fetch(zt)
@assert isapprox(z, x * y)

One can also wait on the result of @spawn and check completion status with isready:

x = Dagger.@spawn sleep(10)
@assert !isready(x)
wait(x)
@assert isready(x)

Like @async and Threads.@spawn, Dagger.@spawn synchronizes with locally-scoped @sync blocks:

function sleep_and_print(delay, str)
    sleep(delay)
    println(str)
end
@sync begin
    Dagger.@spawn sleep_and_print(3, "I print first")
end
wait(Dagger.@spawn sleep_and_print(1, "I print second"))

One can also safely call @spawn from another worker (not ID 1), and it will be executed correctly:

x = fetch(Distributed.@spawnat 2 Dagger.@spawn 1+2) # fetches the result of `@spawnat`
x::DTask
@assert fetch(x) == 3 # fetch the result of `@spawn`

This is useful for nested execution, where an @spawn'd task calls @spawn. This is detailed further in Dynamic Scheduler Control.

Options

The Options struct in the second argument position is optional; if provided, it is passed to the scheduler to control its behavior. Options contains a NamedTuple of option key-value pairs, which can be any of:

Any field in Options (see Task and Scheduler options)
meta::Bool – Pass the input Chunk objects themselves to f and not the value contained in them.

There are also some extra options that can be passed, although they're considered advanced options to be used only by developers or library authors:

get_result::Bool – return the actual result to the scheduler instead of Chunk objects. Used when f explicitly constructs a Chunk or when return value is small (e.g. in case of reduce)
persist::Bool – the result of this Thunk should not be released after it becomes unused in the DAG
cache::Bool – cache the result of this Thunk such that if the thunk is evaluated again, one can just reuse the cached value. If it’s been removed from cache, recompute the value.

Errors

If a task errors while running under the eager scheduler, it will be marked as having failed, all dependent (downstream) tasks will be marked as failed, and any future tasks that use a failed task as input will fail. Failure can be determined with fetch, which will re-throw the error that the originally-failing task threw. wait and isready will not check whether a task or its upstream failed; they only check if the task has completed, error or not.

This failure behavior is not the default for lazy scheduling (Lazy API), but can be enabled by setting the scheduler/task option (Task and Scheduler options) allow_error to true. However, this option isn't terribly useful for non-dynamic usecases, since any task failure will propagate down to the output task regardless of where it occurs.

Cancellation

Sometimes a task runs longer than expected (maybe it's hanging due to a bug), or the user decides that they don't want to wait on a task to run to completion. In these cases, Dagger provides the Dagger.cancel! function, which allows for stopping a task while it's running, or terminating it before it gets the chance to start running.

t = Dagger.@spawn sleep(1000)
# We're bored, let's cancel `t`
Dagger.cancel!(t)

Dagger.cancel! is generally safe to call, as it will not actually force a task to stop; instead, Dagger will simply "abandon" the task and allow it to finish on its own in the background, and it will not block the execution of other DTasks that are queued to run. It is possible to force-cancel a task by doing Dagger.cancel!(t; force=true), but this is generally discouraged, as it can cause memory leaks, hangs, and segfaults.

If it's desired to cancel all tasks that are scheduled or running, one can call Dagger.cancel!(), and all tasks will be abandoned (or force-cancelled, if specified). Additionally, if Dagger's scheduler needs to be restarted for any reason, one can call Dagger.cancel!(;halt_sch=true) to stop the scheduler and all tasks. The scheduler will be automatically restarted on the next @spawn/spawn call.

Lazy API

Alongside the modern eager API, Dagger also has a legacy lazy API, accessible via @par or delayed. The above computation can be executed with the lazy API by substituting @spawn with @par and fetch with collect:

p = Dagger.@par add1(4)
q = Dagger.@par add2(p)
r = Dagger.@par add1(3)
s = Dagger.@par combine(p, q, r)

@assert collect(s) == 16

or similarly, in block form:

s = Dagger.@par begin
    p = add1(4)
    q = add2(p)
    r = add1(3)
    combine(p, q, r)
end

@assert collect(s) == 16

Alternatively, if you want to compute but not fetch the result of a lazy operation, you can call compute on the task. This will return a Chunk object which references the result (see Chunks for more details):

x = Dagger.@par 1+2
cx = compute(x)
cx::Chunk
@assert collect(cx) == 3

Note that, as a legacy API, usage of the lazy API is generally discouraged for modern usage of Dagger. The reasons for this are numerous:

Nothing useful is happening while the DAG is being constructed, adding extra latency
Dynamically expanding the DAG can't be done with @par and delayed, making recursive nesting annoying to write
Each call to compute/collect starts a new scheduler, and destroys it at the end of the computation, wasting valuable time on setup and teardown
Distinct schedulers don't share runtime metrics or learned parameters, thus causing the scheduler to act less intelligently
Distinct schedulers can't share work or data directly

Task and Scheduler options

While Dagger generally "just works", sometimes one needs to exert some more fine-grained control over how the scheduler allocates work. There are two parallel mechanisms to achieve this: Task options (from Options) and Scheduler options (from Sch.SchedulerOptions). Scheduler options operate globally across an entire DAG, and Task options operate on a task-by-task basis.

Scheduler options can be constructed and passed to collect() or compute() as the keyword argument options for lazy API usage:

t = Dagger.@par 1+2
opts = Dagger.Sch.SchedulerOptions(;single=1) # Execute on worker 1

compute(t; options=opts)

collect(t; options=opts)

Task options can be passed to @spawn/spawn, @par, and delayed similarly:

# Execute on worker 1

Dagger.@spawn single=1 1+2
Dagger.spawn(+, Dagger.Options(;single=1), 1, 2)

delayed(+; single=1)(1, 2)

Changing the thread occupancy

One of the supported Options is the occupancy keyword. This keyword can be used to communicate that a task is not expected to fully saturate a CPU core (e.g. due to being IO-bound). The basic usage looks like this:

Dagger.@spawn occupancy=Dict(Dagger.ThreadProc=>0) fn

Consider the following function definitions:

using Dagger

function inner()
    sleep(0.1)
end

function outer_full_occupancy()
    @sync for _ in 1:2
        # By default, full occupancy is assumed
        Dagger.@spawn inner()
    end
end

function outer_low_occupancy()
    @sync for _ in 1:2
        # Here, we're explicitly telling the scheduler to assume low occupancy
        Dagger.@spawn occupancy=Dict(Dagger.ThreadProc => 0) inner()
    end
end

When running the first outer function N times in parallel, you should see parallelization until all threads are blocked:

for N in [1, 2, 4, 8, 16]
    @time fetch.([Dagger.@spawn outer_full_occupancy() for _ in 1:N])
end

The results from the above code snippet should look similar to this (the timings will be influenced by your specific machine):

  0.124829 seconds (44.27 k allocations: 3.055 MiB, 12.61% compilation time)
  0.104652 seconds (14.80 k allocations: 1.081 MiB)
  0.110588 seconds (28.94 k allocations: 2.138 MiB, 4.91% compilation time)
  0.208937 seconds (47.53 k allocations: 2.932 MiB)
  0.527545 seconds (79.35 k allocations: 4.384 MiB, 0.64% compilation time)

Whereas running the outer function that communicates a low occupancy (outer_low_occupancy) should run fully in parallel:

for N in [1, 2, 4, 8, 16]
    @time fetch.([Dagger.@spawn outer_low_occupancy() for _ in 1:N])
end

In comparison, the outer_low_occupancy snippet should show results like this:

  0.120686 seconds (44.38 k allocations: 3.070 MiB, 13.00% compilation time)
  0.105665 seconds (15.40 k allocations: 1.072 MiB)
  0.107495 seconds (28.56 k allocations: 1.940 MiB)
  0.109904 seconds (55.03 k allocations: 3.631 MiB)
  0.117239 seconds (87.95 k allocations: 5.372 MiB)

Different ways to spawn tasks

Beyond the standard function call syntax Dagger.@spawn f(args...), Dagger also supports several other convenient ways to spawn tasks, mirroring Julia's own syntax variations.

Broadcast

Tasks can be spawned using Julia's broadcast syntax. This is useful for applying an operation element-wise to collections.

using Dagger
A = rand(4)
B = rand(4)

# Spawn a task to compute A .+ B
add_task = Dagger.@spawn A .+ B
@assert fetch(add_task) ≈ A .+ B

x = randn(100)
abs_task = Dagger.@spawn abs.(x)
@assert fetch(abs_task) == abs.(x)

Do block

Dagger supports spawning tasks using Julia's do block syntax, which is often used for functions that take another function as an argument, especially anonymous functions.

using Dagger
A = rand(4)

# Spawn a task using a do block with sum
sum_do_task = Dagger.@spawn sum(A) do a
    a + 1
end
@assert fetch(sum_do_task) ≈ sum(a -> a + 1, A)

# Spawn a task with a function that accepts a do block
do_f = f -> f(42)
do_task = Dagger.@spawn do_f() do x
    x + 1
end
@assert fetch(do_task) == 43

Anonymous direct call

Tasks can be spawned directly from anonymous function definitions.

using Dagger
A = rand(4)

# Spawn a task from an anonymous function
anon_task = Dagger.@spawn A -> sum(A)
@assert fetch(anon_task) == sum(A)

# Anonymous function with closed-over arguments
dims = 1
anon_kwargs_task = Dagger.@spawn A -> sum(A; dims=dims)
@assert fetch(anon_kwargs_task) == sum(A; dims=dims)

Getindex

Spawning tasks that retrieve elements from indexable collections, such as arrays, using index notation is supported.

using Dagger
A = rand(4, 4)

# Spawn a task to get A[1, 2]
getindex_task1 = Dagger.@spawn A[1, 2]
@assert fetch(getindex_task1) == A[1, 2]

# Spawn a task to get A[2] (linear indexing)
getindex_task2 = Dagger.@spawn A[2]
@assert fetch(getindex_task2) == A[2]

# Getindex from a DTask result
B_task = Dagger.@spawn rand(4, 4)
getindex_task_from_dtask = Dagger.@spawn B_task[1, 2]
@assert fetch(getindex_task_from_dtask) == fetch(B_task)[1, 2]

R = Ref(42)
# Spawn a task to get R[]
ref_getindex_task = Dagger.@spawn R[]
@assert fetch(ref_getindex_task) == 42

Setindex!

Similarly, tasks can be spawned to modify elements of mutable collections (such as arrays). The object being modified must be running under Datadeps, or wrapped with Dagger.@mutable, to ensure that its contents can be mutated correctly.

using Dagger
A = Dagger.@mutable rand(4, 4)

# Spawn a task to set A[1, 2] = 3.0
setindex_task1 = Dagger.@spawn A[1, 2] = 3.0
fetch(setindex_task1) # Wait for the setindex! to complete
@assert fetch(Dagger.@spawn A[1, 2]) == 3.0

# Spawn a task to set A[2] = 4.0 (linear indexing)
setindex_task2 = Dagger.@spawn A[2] = 4.0
fetch(setindex_task2)
@assert fetch(Dagger.@spawn A[2]) == 4.0

R = Dagger.@mutable Ref(42)
# Spawn a task to set R[] = 43
ref_setindex_task = Dagger.@spawn R[] = 43
fetch(ref_setindex_task)
@assert fetch(Dagger.@spawn R[]) == 43

NamedTuple

Tasks can be spawned to conveniently create NamedTuples.

using Dagger

# Spawn a task to create a NamedTuple
nt_task = Dagger.@spawn (;a=1, b=2)
@assert fetch(nt_task) == (;a=1, b=2)

# Spawn a task to create an empty NamedTuple
empty_nt_task = Dagger.@spawn (;)
@assert fetch(empty_nt_task) == (;)

Getproperty

Tasks can be spawned to access properties of NamedTuples (or other objects supporting getproperty).

using Dagger
nt = (;a=1, b=2)

# Spawn a task to get nt.b
getprop_task = Dagger.@spawn nt.b
@assert fetch(getprop_task) == nt.b

# Getproperty from a DTask result
nt2_task = Dagger.@spawn (;a=1, b=3)
getprop_task_from_dtask = Dagger.@spawn nt2_task.b
@assert fetch(getprop_task_from_dtask) == fetch(nt2_task).b