Introduction to incrementalism in OCurrent
Welcome to the beginning of theses docs on building things with OCurrent – an OCaml DSL for generating incremental pipelines to build… well almost anything. This introductory chapter will give a solid understanding of the foundation upon which the OCurrent tower stands including:
- The core eDSL features and how it achieves incrementalism
- The notion of an OCurrent plugin
- The broad overview of the suite of existing OCurrent features
Why be incremental?
An incremental approach to building is used within and outside software development. Not only does it allow for steady progress to be made, but also responsive and fast rebuilding in the face of changes. If you want to paint a wall yellow, you don’t tear down the building and start from scratch!
Incrementalism nearly always provides:
- Faster rebuilds
- Less resource-intensive lifetimes of applications
This provides users with a better experience (almost instant continuous integration from a hot pipeline) and puts the resources to good use!
For an incremental approach to work well, individual commands or steps need to have a good understanding of their dependencies in order to react if they change. This is provided by OCurrent in the eDSL (more on that later). But you can think of it like any process which reacts to a changing dependency – your heating turning off when the thermometer reaches a certain temperature or your git repository rebuilding when the main ref
changes.
With this added reactivity, the incremental process not only has the afore mentioned benefits but now it has a real sense of automation too!
We have seen the underlying mechanism for providing incrementalism in OCurrent (Current_incr
and Term
). Now we’ll look at the high-level, user-facing library where we have the full-power of asynchronous programming to build pipelines.
Current
Current
is the intended library for users to interact with. We’ll take a look at its core modules which provided a meaningful way to program incrementally and asynchronously. But first… let’s rewrite plus
in Current
.
# #require "current"
# open Current.Syntax
# let plus a b =
let open Current in
component "PLUS" |>
let** a = a
and* b = b in
return (a + b)
val plus : int Current.term -> int Current.term -> int Current.term = <fun>
And now we can run this inside the Current.Engine
on a thread. The major difference now is how we generate our changeable inputs, the a
and the b
from before. Now we have to use the functor.
# module CInt = Current.Var (struct
include Int
let pp = Format.pp_print_int
end)
module CInt :
sig
type t
val get : t -> int Current.term
val create : name:string -> int Current_term.Output.t -> t
val set : t -> int Current_term.Output.t -> unit
val update :
t -> (int Current_term.Output.t -> int Current_term.Output.t) -> unit
end
From the signature you can also see we need a way to work with Current_term.Output.t
s –
Largely based on the very excellent write-up by Thomas Leonard.
An embedded domain specific language (eDSL) is a set of primitive values and functions written in a host-language to enable a programming experience of a new language. For those old enough, think jQuery
. Within OCurrent the entire eDSL lives in the Current_incr
package.
# #require "current_incr"
# Current_incr.var
- : 'a -> 'a Current_incr.var = <fun>
As with a lot of things in OCaml, the eDSL takes on this monadic look as it wraps things up into its on type (thinks 'a list
or 'a Lwt.t
).
A simple arithmetic example
The result of a plus
operator has two dependencies, the operands.
# let plus a b = a + b
val plus : int -> int -> int = <fun>
Nothing too surprising here, but what if we want to make this incremental so that it updates if a
or b
change. At this point a
and b
must be incremental values (i.e. int Current_incr.t
).
# let plus a b =
let open Current_incr in
read a (fun a -> read b (fun b -> write (a + b))) |> of_cc
val plus : int Current_incr.t -> int Current_incr.t -> int Current_incr.t =
<fun>
# let a = Current_incr.var 3
val a : int Current_incr.var = <abstr>
# let b = Current_incr.var 6
val b : int Current_incr.var = <abstr>
# let c = Current_incr.(plus (of_var a) (of_var b))
val c : int Current_incr.t = <abstr>
Now we can actually have a look at the value we computed using the observe
function. Most importantly we can change the dependency variables a
and b
to a new integer and then run propagate
and see the incrementalism happen before our eyes!
# Current_incr.observe c
- : int = 9
# Current_incr.change a 10
- : unit = ()
# Current_incr.propagate ()
- : unit = ()
# Current_incr.observe c
- : int = 16
Abstracting away
The primitives for incrementalism are small and easy to understand, but not ideal for building larger applications. For one it would be nice to know ahead of time (statically) what are computation graph looks like. It would also be nice to incorporate ('a, 'b) Result.t
style exception handling because… things go wrong.
This is exactly what current.term
and eventually Current
do! The Term
module provides the static analysis and error handling whilst the final user-facing Current
module provides asynchronous computations using Lwt
and persistent logging. But first Term
.
Term Module
To build our usable Term
module, we need to use the function Current_term.Make
. It expects the simplest of module arguments:
# #require "current.term"
# #show Current_term
module Current_term :
sig
module S = Current_term__.S
module Output = Current_term__.Output
module Make : functor (Metadata : sig type t end) -> sig ... end
end
Something with a type t
. This is used (as the argument name helpfully points out) for Metadata
information. For now it isn’t too important.
# module Term = Current_term.Make (Unit)
module Term :
sig
type 'a t = 'a Current_term.Make(Unit).t
type description = Current_term.Make(Unit).description
val active : Current_term.Output.active -> 'a t
val return : ?label:string -> 'a -> 'a t
val fail : string -> 'a t
val state :
?hidden:bool ->
'a t ->
('a, [ `Active of Current_term.Output.active | `Msg of string ]) result
t
val catch : ?hidden:bool -> 'a t -> 'a Current_term.S.or_error t
val ignore_value : 'a t -> unit t
val of_output : 'a Current_term__.Output.t -> 'a t
val map : ('a -> 'b) -> 'a t -> 'b t
val map_error : (string -> string) -> 'a t -> 'a t
val pair : 'a t -> 'b t -> ('a * 'b) t
val list_map :
(module Current_term.S.ORDERED with type t = 'a) ->
?collapse_key:string -> ('a t -> 'b t) -> 'a list t -> 'b list t
val list_iter :
(module Current_term.S.ORDERED with type t = 'a) ->
?collapse_key:string -> ('a t -> unit t) -> 'a list t -> unit t
val list_seq : 'a t list -> 'a list t
val option_map : ('a t -> 'b t) -> 'a option t -> 'b option t
val option_seq : 'a t option -> 'a option t
val all : unit t list -> unit t
val all_labelled : (string * unit t) list -> unit t
val gate : on:unit t -> 'a t -> 'a t
val collapse : key:string -> value:string -> input:'b t -> 'a t -> 'a t
val with_context : 'b t -> (unit -> 'a t) -> 'a t
val bind : ?info:description -> ('a -> 'b t) -> 'a t -> 'b t
type 'a primitive =
('a Current_term.Output.t * unit option) Current_incr.t
val primitive : info:description -> ('a -> 'b primitive) -> 'a t -> 'b t
val component : ('a, Format.formatter, unit, description) format4 -> 'a
module Syntax :
sig
val ( let+ ) : 'a t -> ('a -> 'b) -> 'b t
val ( and+ ) : 'a t -> 'b t -> ('a * 'b) t
val ( let* ) : 'a t -> ('a -> 'b t) -> 'b t
val ( let> ) : 'a t -> ('a -> 'b primitive) -> description -> 'b t
val ( let** ) : 'a t -> ('a -> 'b t) -> description -> 'b t
val ( and* ) : 'a t -> 'b t -> ('a * 'b) t
val ( and> ) : 'a t -> 'b t -> ('a * 'b) t
end
module Analysis :
sig
val metadata : 'a t -> unit option t
val pp : 'a t Fmt.t
val pp_dot :
env:(string * string) list ->
collapse_link:(k:string -> v:string -> string option) ->
job_info:(unit -> Current_term.Output.active option * string option) ->
'a t Fmt.t
val stats : 'a t -> Current_term.S.stats
end
module Executor :
sig val run : 'a t -> 'a Current_term__.Output.t Current_incr.t end
end
Now that we have a Term
module we can rebuild our plus
operator from earlier.
# open Term.Syntax
# let plus a b =
Term.component "PLUS" |>
let** a = a
and* b = b in
Term.return (a + b)
val plus : int Term.t -> int Term.t -> int Term.t = <fun>
Wahhh what’s this crazy let**
syntax? Since OCaml 4.08 we’ve had binding operators. Just like you can define infix operators such as ( >>= )
these operators are very similar except they happen on the let
bindings.
# Term.Syntax.( let** )
- : 'a Term.t -> ('a -> 'b Term.t) -> Term.description -> 'b Term.t = <fun>
As you can see it is just our friendly bind operator with a way for passing a description (more on that later). Give me something wrapped up in something and a function that works on the wrapped up thing, and I’ll give you that function applied to the inner value wrapped up.
# let res =
let a = 3 |> Term.return ~label:"a" in
let b = 7 |> Term.return ~label:"a" in
Term.Executor.run (plus a b)
val res : int Current_term__.Output.t Current_incr.t = <abstr>
# Current_incr.observe res
- : int Current_term__.Output.t = Ok 10
Here we also see the result type making it’s way in. Term
also gives us a way to fail too.
# Current_incr.observe (Term.fail "Woops!" |> Term.Executor.run)
- : 'a Current_term__.Output.t = Error (`Msg "Woops!")
Of course inside a pipeline we might not know if something has failed or not, in which case we can expose the underlying result using Term.catch
and pattern-match on Ok
and Error
.
The Extra Metadata
One of the original goals of OCurrent was not only to build incremental pipelines but to expose them to users. The Term
library (under the hood) is adding extra metadata and static analysis to be able to generate useful graphics and information. For example:
# let a = 3 |> Term.return ~label:"Operand 1" in
let b = 4 |> Term.return ~label:"Operand 2" in
Format.printf "%a@." Term.Analysis.pp (plus a b)
Operand 1
||
Operand 2 >>=
PLUS
- : unit = ()
Now, hopefully, it becomes apparent why the component "PLUS"
and the let**
operator were needed.