Introduction

I like Language Oriented Programming (LOP) a lot, but I am allergic to doing the dreaded big up front design. Pretty much all LOP and Domain Specific Language (DSL) materials assume or recommend a design phase.

Last year, I set out to do LOP without any foresight beyond that required to design the code I would write that day. The challenge that I chose as my framework for this was Advent of Code (AoC) 2019. There were two rules to this challenge:

  1. I was not allowed to use any existing programming language to solve any part of any of the problems.
  2. I was not allowed to look past the current prompt until it was completed.
  3. I could only use the Racket ecosystem. I chose this after looking at the available language workbenches. (This was rule 3/2)

Notice that there was no rule that the prompts must all be solved in the same DSL. It is perfectly okay to build onto an old DSL or to start over.

When it comes to the choice of Racket, I compared the seven entrants to the Language Workbench Challenge and came away liking two of them. Racket won out over MPS because MPS has vendor lock-in, is closed source, and reminds me of Java. I like MPS, but not as much as Racket. There was also the option of using some other language which is not a language workbench. This would have been a poor choice in my opinion, because language workbenches allow for rapid prototyping, and that seemed necessary if I wanted to get anywhere quickly. I did not want to go to the trouble of implementing an interpreter in Haskell, Python, Java, or C, because those languages seemed better suited to implementing languages that already had a design. Language workbenches seemed better for designing languages than for implementing them. When it comes to language implementation, Crafting Interpreters gives an excellent treatment of the subject, and even it prototypes the language in Java before moving to C. Prototyping in Racket seemed like the next step on that spectrum to me.

If you’re interested in learning Racket, I recommend The Little Schemer for the fundamentals of Scheme, SICP for great material on using Scheme to build data abstractions and function abstractions, Realm of Racket for learning general purpose Racket, Fear of Macros for macros in Racket, and Beautiful Racket for LOP.

The idea behind the AoC LOP challenge was to simulate a Speed to Value (S2V) delivery style project, where you don’t have the luxury of looking too far ahead. AoC was also chosen for its broadness. It is a stress test of sorts, because the domain of AoC is very large. The only thing the prompts have in common is that they all read one file and produce one output that can be typed into a box. In short, AoC is unpredictable like a real world problem, but broader than a good deal of real problems. It also has the charm of not constraining your tool set, and of not including anything that requires user input or networking.

A year later, I have made a lot of mistakes and have only just managed to complete the first prompt. Obviously this could be viewed as an abject failure. Therefore I am writing this post to go back in time and deliver it to myself. Its main message is that I was correct in assuming that I didn’t need a design phase.

The reason I was not able to complete this in a timely fashion is that I did not know about the major pitfall involved in this sort of task. In this post I will talk about two things:

  1. What a thin slice of a language feature can look like in the best of times.
  2. Warning signs for when you have gone off the rails and are trending toward spending eleven months on a twenty minute task.
  3. Another shout out to the best LOP book: Beautiful Racket. I would not have gotten anywhere without it. It is an inspiration, and everyone should buy it. Its design phases are also very short, and it takes thin slices. In a lot of ways, this post is redundant with the book; it is only shorter and more focused on the idea of keeping design phases narrow. In the book the design phases are narrow, but the focus is on the LOP techniques themselves. The true test of this post will be in later posts, when further days of AoC are picked up and either produce something terrible or something good. (This is thing 3/2.)

Case Study: Adding the “/” Built-in Function to aoclop

First, let us start with a line of code from a made up tiny DSL. In this language, all programs are a single line of code. I named it aoclop (pronounced eyoo-clop) for mysterious reasons. This is a program that takes in a file which has a column of numbers in it, divides each of those numbers by three, and then prints out the whole list of numbers. This is the first part of the first prompt of AoC 2019.

read: 1 nl | v | / 3 | ^

This is a line of code in the target language. Reading it out loud sounds like

  1. read: 1 nl “Read file 1, and assume each line in the file is a different value.”

  2. | “And then…”

  3. v “Downscope. In step one we were looking at the whole file. Now we are looking at each line in the file separately.”

  4. | “And then…”

  5. / 3 “Divide the single value by 3.”

  6. ^ “Look at the whole result, not each individual number.”

So if I give it a file called 1.txt with contents:

3
6
9

This program will output: '(1 2 3)

For the sake of this post, I will assume that everything except for / is already there. I will now add / to make the example above work.

Syntax

The first step is making Racket aware of / as a term. Since it falls between | characters, it is already picked up as an operation. This is the result of a bit of magic in the parser (written in brag), which looks like:

#lang brag
aoclop-program : /NEWLINE read scope-block /NEWLINE
read           : /READ INTEGER delimiter
scope-block    : /PIPE /DOWNSCOPE /PIPE all-ops /PIPE /UPSCOPE
delimiter      : DELIMITER
all-ops        : op*
op             : OP

There is a lot here, but the important part is that an op is something that falls between pipes. The pipes are among the symbols preceded by a / character (meaning “cut”), so they are used for parsing but dropped before they get any further. An aoclop-program has a read statement, then a scope-block. A scope block has a pipe, a downscope, a pipe, some number of ops, a pipe, and then an upscope. This means that /, as something that falls between pipes, will be considered an op. I have left out some glue code because it is boring and will confuse you. If you really want to see the connections, they are in main.rkt and tokenizer.rkt, but I don’t expect to update those for a while.
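If it helps to see the shape of that glue without the detail, the tokenizer side of a brag project is usually just a thin wrapper that hands the lexer to the parser one token at a time. A minimal sketch in the Beautiful Racket style (the real tokenizer.rkt may differ):

; tokenizer.rkt, sketched: wrap the lexer in a thunk the brag parser can pull from.
(define (make-tokenizer port)
  (define (next-token)
    (aoclop-lexer port)) ; the lexer shown below
  next-token)
(provide make-tokenizer)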

There is one issue, though. If you look at the grammar above, it says that an op is a single thing. / is not a single thing. It is a single thing that takes a number. Therefore we have to edit the op line to make it understand that ops can take numbers. This looks like:

op             : OP | OP INTEGER

And so the result is:

#lang brag
aoclop-program : /NEWLINE read scope-block /NEWLINE
read           : /READ INTEGER delimiter
scope-block    : /PIPE /DOWNSCOPE /PIPE all-ops /PIPE /UPSCOPE
delimiter      : DELIMITER
all-ops        : op*
op             : OP | OP INTEGER

The trick now is to make / available as a token. That is done by adding this line to the lexer:

["/" (token 'OP lexeme)]

After this addition, the lexer looks like:

(define aoclop-lexer
  (lexer-srcloc
   ["read:" (token 'READ lexeme)]
   ["\n" (token 'NEWLINE lexeme)]
   ["nl" (token 'DELIMITER lexeme)]
   ["|" (token 'PIPE lexeme)]
   ["/" (token 'OP lexeme)]
   ["^" (token 'UPSCOPE lexeme)]
   ["v" (token 'DOWNSCOPE lexeme)]
   [digits (token 'INTEGER (string->number lexeme))]
   [whitespace (token lexeme #:skip? #t)]))
(provide aoclop-lexer)

Together the tokenizer, parser, and lexer can turn: read: 1 nl | v | / 3 | ^

into this: (aoclop-program (read 1 (delimiter "nl")) (scope-block (all-ops (op "/" 3))))

complete with all sorts of nice things like source locations, so that errors can be reported against the correct place in the code. Conveniently, the result looks like a Lisp s-expression, which makes it really easy to write macros and functions for. In the big picture, this is already code. It simply has no semantic bindings.

Semantics

Next, we have to imbue / with what it does. The previous action made it a word. Now we give it meaning. To do this, we have two options:

  1. Make a Racket function that does what we want.
  2. Make a Racket macro that turns what we have into Racket code that already does what we want.
  3. Both (this is option 3/2).

I went with option two this time. This was probably a bad choice but I like macros.
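For contrast, option one would have meant an ordinary Racket function that dispatches on the operator name. A sketch of that road not taken, in #lang racket (not code from the project; make-op is a hypothetical name):

; Option one, sketched: an op becomes a plain one-argument function,
; and the expander would compose these with map.
(define (make-op name . args)
  (match name
    ["/" (λ (x) (/ x (first args)))]))

((make-op "/" 3) 9) ; => 3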

Below is the line of code that handles the / function’s semantics. It is taken out of context for simplicity, but the context will be there in the next block if that makes you nervous. #' is shorthand that generates a Racket syntax object. Macros in Racket take in syntax and produce syntax. This is useful because there is more to syntax than you see: syntax objects also carry source locations and bindings. Some macro forms hide the #' behind still more macros, but that does not mean the syntax objects are not there.

[(op x "/" divisor) #'(/ x divisor)]

This shows how / can be interpreted: “If you see an op, and then something, and then the symbol / followed by a number, take that thing and convert it to Racket syntax which applies the Racket function / to the thing with the number you saw.”
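Those extras are easy to see for yourself. A quick illustration in a #lang racket REPL (not project code):

(define stx #'(/ x 3))
(syntax->datum stx)  ; => '(/ x 3), the bare s-expression
(syntax-line stx)    ; => the source line where the #'(...) was written
(syntax-source stx)  ; => the file (or string) it came from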

That line is part of this larger function that turns operations into Racket code (“stx” is shorthand for “syntax”):

(define-syntax (op stx)
  (syntax-case stx ()
    [(op x "/" divisor) #'(/ x divisor)]))
(provide op)

This is a case expression with one case. I expect to add to it soon.

That in turn has to be called from somewhere:

(define-syntax-rule (aoclop-program read-expr (scope-block (all-ops op ...)))
  (map (λ~> op ...) read-expr))
(provide aoclop-program)

The define-syntax-rule form is a convenient macro that allows you to take in one pattern and produce a different syntax pattern. The input to this is the entire parsed program. The output is a mix of Racket code and more half-baked code to grab with a macro later.
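If define-syntax-rule is new to you, the classic standalone example is a swap macro. It has nothing to do with aoclop, but it shows the one-pattern-in, one-template-out shape:

; (swap! x y) is rewritten into the let form below before it ever runs.
(define-syntax-rule (swap! a b)
  (let ([tmp a])
    (set! a b)
    (set! b tmp)))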

Back in aoclop-program, op ... is every operation. This macro merely surrounds the ops with a map call (to replace the scope block) and a threading macro (to simulate function chaining). λ~> is a threading macro that takes a series of functions and threads x through them in order. That is why the receiver macro op takes in an x even though you don’t see one: λ~> is the progenitor of this phantom x. With the threading macro in place, x is bound first to a single number from the file, and after that to the result of whatever ops ran before.
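For reference, λ~> comes from the threading package, so the expander needs (require threading). Its behavior in miniature (the first two comment lines are my paraphrase of the expansion, not the package’s actual implementation):

; (λ~> (op "/" 3) (op "/" 5)) behaves like
; (λ (x) (op (op x "/" 3) "/" 5)): x is threaded in as the first argument.
(require threading)
((λ~> add1 (* 2)) 5) ; => 12, i.e. (* (add1 5) 2)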

For the input: (aoclop-program (read 1 (delimiter "nl")) (scope-block (all-ops (op "/" 3) (op "/" 5))))

You will see:

(op x "/" 3)
(op (op x "/" 3) "/" 5)

passed in to the macro. The first time, x is just x (a number from the list we are mapping over); the second time, x is (op x "/" 3), which eventually evaluates to a number.

As a note on process, I did not just imagine all of this into existence. Typing the original #lang aoclop program into an editor threw an error in the parser. Fixing that gave me an error in the lexer. Once that was fixed, I wrote a unit test for the lexer, took the expected s-expression the parser produced, and pasted it into the expander. Once the expander could evaluate that expression into the value I wanted, I knew I was done. After that, I deleted the pasted expression and ran the whole thing end-to-end. Unit testing macro-heavy code is not a topic for this post, but it is much more possible than people let on, and can easily be built into this process.
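To make that concrete, the “paste the expected s-expression into the expander” step is essentially one rackunit check (names are from the code above, and it assumes the 1.txt from earlier is on disk):

(require rackunit)
; If the expander is right, the pasted parse tree evaluates to the answer.
(check-equal?
 (aoclop-program (read 1 (delimiter "nl"))
                 (scope-block (all-ops (op "/" 3))))
 '(1 2 3))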

That is the whole slice! Now for the pitfall:

The One Big Pitfall

It is tempting to write the simplest code that will produce the input/output pairs that you want. If you do that but make the wrong sort of shortcut to get there, you will regret it.

Let me take a shortcut to get the same result that we had before, and then we can talk about why this is a terrible idea.

What if I didn’t want to think about the fact that I can have multiple operations just yet, and also reasoned that I can just do the mapping inside the / operation? The syntax would imply that / works on a stream of single numbers, but under the covers it can take the whole list in at once. We could devise something like this:


(define-syntax-rule (aoclop-program read-expr (scope-block (all-ops op)))
    (bigop (op read-expr)))
(provide aoclop-program)

(define-syntax-rule (bigop ((op "/" divisor) inp))
  (div-each divisor inp))
(provide bigop)

(define (div-each divisor xs)
  (map (λ (x) (/ x divisor)) xs))

This was much easier to write. No threading macros, no syntax cases. I had to do one tricky thing and introduce a bigop that had scope on both the op and the read block (because in the input program the only thing with enough scope was aoclop-program but I didn’t want all of that extra stuff, and I really didn’t want to introduce global data, syntax parameters, or data mutation). Once I did that, I could just tell bigop to pass the whole input list to div-each, which is a regular function that calls map. Now when I want to add my next operation, I can just …

Well, no. Let’s assume we want to add floor, because that is the next operation in the first question. Right now I am capturing only one op, so I need to capture two. If I want to be lazy, I can pattern match against exactly two, but that breaks this program, which has only one. Or I can match with .... That works, but now bigop must take all of the ops. What will it do with the second one? A recursive call with the output of the first? That could maybe work. Bear in mind that each op must now have a map call in it to preserve the intended semantics. Now we have a pile of recursive operations, each with a map call, and a growing headache.

This is not hypothetical. Here is the code with the map call pulled out, but without the threading operator, and with explicitly recursive macros:

(define-syntax (x-then-ops-in-order stx)
  (syntax-case stx ()
    [(_ (lambda-x (ops ...)))
     (syntax-case #'(ops ...) ()
       [((_ "identity"))
        #'(identity lambda-x)]
       [((_ "identity") otherop ...)
        #'(identity (x-then-ops-in-order (lambda-x (otherop ...))))]
       [((_ "floor"))
        #'(floor lambda-x)]
       [((_ "floor") otherop ...)
        #'(floor (x-then-ops-in-order (lambda-x (otherop ...))))])]))

And it only gets worse from here.

All of this happened because we stripped the semantics of scope-block. Reading the input program and how it interacts with the grammar in the parser, scope-block is a mappy thing. Taking that power from it and giving it to / introduces a thread that you’ll be pulling on for months if you’re not careful.

A design phase could have prevented this kind of issue. If I had bothered to design up front a bit, I could have made it a part of the design that the scope-block is semantically equivalent to map, and the rest would have shaken out.

Without the design phase, it becomes necessary to make sure that your code is consistent at all times. Because there is no “gate” of design, there is no single point at which you make sure that you’re not breaking your own rules. Therefore, consistency must be achieved continuously, or at the least at the branch or commit level. If you make a scope block in your syntax and ignore it semantically, you’re breaking your design, whether that design is expressed in a design document or in code. This has sort of an Extreme Programming flavor, so I find it appealing.

Note that I am not talking about syntactic sugar here. There are different opinions on that, and I am not against sprinkling or even dumping syntactic sugar into a DSL. I am talking about when one piece of syntax steals another piece of syntax’s semantics. When you look at a specific macro or function in your expander, you should not see it introducing semantics that should be attributed to a different function or macro.

Avoiding this pitfall leaves you in a really good place. You now have a case statement that you can add more operators to. Each one only needs to be concerned with integers, as the mapping is done for them. There is no explicit recursion in your macros. Future changes can be in the range of 4-5 lines of code at a time.

Takeaway

Be very careful about implementation at every phase to ensure syntactic and semantic consistency. Misses in the semantic mappings between source and target will only get worse over time.

Philosophical Rambling

A DSL is defined by that which may be assumed. If your DSL grows into a General Purpose Language (GPL), it loses value. This concept is handled very well in Domain-Specific Languages by Martin Fowler. DSLs increase your number of assumptions. Assumptions are things you don’t have to think about; they also represent system inflexibility. The more inflexibility you can introduce into the system, the more flexibility you can achieve in the flexible bits, and the easier it is to add functionality. The issue is that you can get this wrong, and then you’re in a world of hurt. The fix is that you have to be very willing to throw away old code. Anywhere the intended semantics do not map perfectly onto what is really there must be fixed early on, or it will only get worse. Of course, this is a moving target as features are added.

Next Research Questions

The next questions to look at related to these ideas are in practical LOP. I think that these advanced tools make it practical to thin slice the delivery of a language without a big up front design. The question is whether doing so allows for the foresight to know when to split languages, and whether there is strong enough support for disparate language composition. Racket makes it easy to mix domain specific languages (see the above example with brag), so the idea of writing a dozen tiny languages to fix one problem seems more tenable than one would imagine. The ideal for DSLs would be something that feels like changing tools while working on a carpentry project. In particular, there should be many, many languages that are so small that learning them and using them is effortless. It would be odd if people said they only did carpentry using a saw, or only using lacquer. That does not even make sense. In practice, there is a general purpose tool (the hand) which drives the switching of small, highly specialized tools. This is the ideal that software development should aspire to. The two specific questions become:

  1. Can LOP be done quickly without an up front design? I am sure it can be done, but am not sure about how long it will take.
  2. Is the judgement required to decide when to continue expanding a DSL vs. starting a new one beyond what people are capable of?
  3. What language should be used to glue DSLs together? Another DSL, or the implementation language of the DSLs (this is question 3/2)?

Next Steps

I am planning to keep working on aoclop even though it is way over budget and over schedule. If you would like to help, or even just want to talk about it, please let me know! There is a lot that did not make it into this post.

Appendix

Getting from the above slice to an answer for AoC19 is only a few minutes of work.

AoC 1.1 Answer

The target language work ends up being:

#lang aoclop
read: 1 nl | v | / 3 | floor | - 2 | ^ | sum

Which is enabled by the grammar:

#lang brag
aoclop-program : /NEWLINE read scope-block /PIPE collect /NEWLINE
read           : /READ INTEGER delimiter
scope-block    : /PIPE /DOWNSCOPE /PIPE all-ops /PIPE /UPSCOPE
delimiter      : DELIMITER
all-ops        : op*
op             : OP | OP INTEGER | OP /PIPE | OP INTEGER /PIPE
collect        : COLLECT

The lexer follows the same pattern:

(define aoclop-lexer
  (lexer-srcloc
   ["read:" (token 'READ lexeme)]
   ["\n" (token 'NEWLINE lexeme)]
   ["nl" (token 'DELIMITER lexeme)]
   ["floor" (token 'OP lexeme)]
   ["identity" (token 'OP lexeme)]
   ["|" (token 'PIPE lexeme)]
   ["/" (token 'OP lexeme)]
   ["-" (token 'OP lexeme)]
   ["floor" (token 'OP lexeme)]
   ["sum" (token 'COLLECT lexeme)]
   ["^" (token 'UPSCOPE lexeme)]
   ["v" (token 'DOWNSCOPE lexeme)]
   [digits (token 'INTEGER (string->number lexeme))]
   [whitespace (token lexeme #:skip? #t)]))
(provide aoclop-lexer)

And then the expander has only to take into account these small changes:

(define-syntax-rule (aoclop-program read-expr (scope-block (all-ops op ...)) collect)
  (apply collect (map (λ~> op ...) read-expr)))

(define-syntax (op stx)
  (syntax-case stx ()
    [(op x "identity") #'(identity x)]
    [(op x "floor") #'(floor x)]
    [(op x "/" divisor) #'(/ x divisor)]
    [(op x "-" difference) #'(- x difference)]))
(provide op)

(define-syntax-rule (collect "sum") +)
(provide collect)

And that is it! The biggest operative change is that the programs no longer return lists. They return a single number that has been collected from a list using some sort of folding operation (in this example sum or +).
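Concretely, since (collect "sum") expands to +, the 1.1 program boils down to roughly this (using the sample 3/6/9 file from earlier):

; The expanded shape of the 1.1 program, in effect:
(apply + (map (λ~> (op "/" 3) (op "floor") (op "-" 2)) '(3 6 9)))
; 3 → 1 → 1 → -1, 6 → 2 → 2 → 0, 9 → 3 → 3 → 1, and the sum is 0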

Most gratifying is that getting from 1.1 to 1.2 requires even less change.

AoC 1.2 Answer

For 1.2, the task is the same, except everything inside the scope block is run until it bottoms out at zero, and all of the results are added up. Most readers would assume we need some concept of recursion, which usually requires the ability to define functions; at the least we should need the Y combinator. But because we have a layer of abstraction between our target language and our implementation, all notions of recursion or fixpoint calculation can live in the expander. As a result, we can just define a syntax for convergence and go from there.

#lang aoclop
read: 1 nl | v |> / 3 | floor | - 2 <| ^ | sum

Note the > and <, which mean “do this until you stop” in this little language.

To implement this, we need to parse it:

aoclop-program : /NEWLINE read scope-block /PIPE collect /NEWLINE
read           : /READ INTEGER delimiter
scope-block    : /PIPE /DOWNSCOPE /PIPE all-ops /PIPE /UPSCOPE | /PIPE /DOWNSCOPE /PIPE converge-block /PIPE /UPSCOPE
delimiter      : DELIMITER
all-ops        : op*
converge-block : /GT all-ops /LT
op             : OP | OP INTEGER | OP /PIPE | OP INTEGER /PIPE
collect        : COLLECT

This just says we can replace a standard all-ops block with a converge block under scoping. This may be overspecific, but it works for this use case. If we ever need to do convergence somewhere else, we can move it later. That would not step into the one big pitfall, because we are associating the semantics of convergence with its syntax. If that syntax moves, that is okay.

The lexer is predictable:

(define aoclop-lexer
  (lexer-srcloc
   ["read:" (token 'READ lexeme)]
   ["\n" (token 'NEWLINE lexeme)]
   ["nl" (token 'DELIMITER lexeme)]
   ["floor" (token 'OP lexeme)]
   ["identity" (token 'OP lexeme)]
   ["|" (token 'PIPE lexeme)]
   ["/" (token 'OP lexeme)]
   ["-" (token 'OP lexeme)]
   ["floor" (token 'OP lexeme)]
   [">" (token 'GT lexeme)]
   ["<" (token 'LT lexeme)]
   ["sum" (token 'COLLECT lexeme)]
   ["^" (token 'UPSCOPE lexeme)]
   ["v" (token 'DOWNSCOPE lexeme)]
   [digits (token 'INTEGER (string->number lexeme))]
   [whitespace (token lexeme #:skip? #t)]))
(provide aoclop-lexer)

And now the expander is the thing that has to be upgraded, but it is only a few lines added:

(define-syntax (aoclop-program stx)
  (syntax-case stx ()
    [(aoclop-program read-expr (scope-block (converge-block (all-ops op ...))) collect)
     #'(apply collect (map ((curry converge) (λ~> op ...)) read-expr))]
    [(aoclop-program read-expr (scope-block (all-ops op ...)) collect)
     #'(apply collect (map (λ~> op ...) read-expr))]))
(provide aoclop-program)

(define (converge proc x)
  (define step (proc x))
  (cond [(<= step 0) 0]
        [else (+ step (converge proc step))]))

The tricky parts here are converge and curry. curry is a built-in that allows me to pass one parameter at a time to a function. This is useful because converge takes two parameters, but the second one is going to be involved in a map operation, and map takes a function of one parameter, so I have to bind the proc to converge first. In this case the proc is (λ~> op ...), which represents the one-arity combination of all of the operations piped together. The remaining new functionality is converge. converge runs a proc to generate a step; if the step is zero or below, it stops, and otherwise it adds up all of the results. This is a little overspecialized, but it works for this case. It is tempting to replace the hard-coded 0 with the identity of the folding operation, and to pass in the folding operation instead of always using +. In practice + is the most common, with * being the second most popular. If we used *, we would hope to bottom out at 1, but the value may never get there exactly, so some acceptable error threshold would be useful. This is all discussed at length in SICP, linked above.
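To see those two pieces in isolation, here is converge applied to a hand-written proc. The 14 is the worked example from the 1.2 prompt, where the answer is 2:

(define fuel-step (λ (x) (- (floor (/ x 3)) 2)))
((curry converge) fuel-step)       ; => a one-argument function, ready for map
(((curry converge) fuel-step) 14)  ; => 2, because 14 → 2, then 2 → -2 stops the recursion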

Astute readers may notice that we could break the aoclop-program macro up. I agree, and I think this would be a good refactor to do before the next slice. It should be possible to have that macro rearrange the big pieces and then delegate to lower-level macros for greater long-term flexibility. This would look like:


(define (converge proc x)
  (define step (proc x))
  (cond [(<= step 0) 0]
        [else (+ step (converge proc step))]))

(define-syntax-rule (aoclop-program read scope-block collect)
  (apply collect (read-scope scope-block read)))
(provide aoclop-program)

(define-syntax (read-scope stx)
  (syntax-case stx ()
    [(read-scope (scope-block (converge-block all-ops)) read) #'(map ((curry converge) all-ops) read)]
    [(read-scope (scope-block all-ops) read) #'(map all-ops read)]))

(define-syntax-rule (all-ops op ...)
  (λ~> op ...))
(provide all-ops)

(define-syntax (op stx)
  (syntax-case stx ()
    [(op x "identity") #'(identity x)]
    [(op x "floor") #'(floor x)]
    [(op x "/" divisor) #'(/ x divisor)]
    [(op x "-" difference) #'(- x difference)]))
(provide op)

(define-syntax-rule (collect "sum") +)
(provide collect)

(define-syntax-rule (delimiter "nl") "\n")
(provide delimiter)

In my opinion this is slightly harder to understand, as it is necessary to introduce an intermediate macro, read-scope, which does not appear in the grammar, in order to give the mapping operation associated with the scope block visibility into the read data. It is more flexible for future changes, as it localizes the concept of convergence into a smaller area and more tightly defines all-ops as essentially equivalent to a threading lambda. As a last upgrade, we could use more sophisticated macros in the form of syntax-parse to give better error messages to users of the DSL, or add code coloring and the like.
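As a taste of that last upgrade, the op macro under syntax-parse might look like this sketch (not committed code; divisor:integer and difference:integer use the built-in integer syntax class, so a malformed program gets a pointed error message instead of a generic bad-syntax one):

(require (for-syntax syntax/parse))

(define-syntax (op stx)
  (syntax-parse stx
    [(_ x "identity") #'(identity x)]
    [(_ x "floor") #'(floor x)]
    [(_ x "/" divisor:integer) #'(/ x divisor)]
    [(_ x "-" difference:integer) #'(- x difference)]))
(provide op)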

Why You Should Care

This is a promising update, as it took a short amount of time and a tiny amount of code. It shows that the lack of planning did not put us in a bad place for part two. Instead, being disciplined and making sure the code was in a good place at each step left us flexible enough to leap in any direction easily.

Next Tactics

To cover the last obvious question, we can look at the ease of composition of these languages. There is an issue: if we want to go outside of what is allowed in the target language, we have no recourse but to extend the language’s syntax and semantics. This is dangerous, as bigger DSLs are less useful than small ones. Alternatively, you can start over. This is also suboptimal, as it requires a DSL to be written for each slightly different problem. Ideally, subtasks can be done in multiple DSLs and stitched together (much in the way brag is generally useful). Making this possible requires less change than expected. Let’s go back to the aoclop-program macro. Taken literally, whatever this macro produces is what you get at the end when you run any #lang aoclop script. Right now, it produces a number. Instead, it could expose a variable binding. For example:

(define-syntax-rule (aoclop-program read-expr (scope-block (all-ops op ...)))
  (map (λ~> op ...) read-expr))
(provide aoclop-program)

Becomes

(define-syntax-rule (aoclop-program read-expr (scope-block (all-ops op ...)))
  (begin
    (define answer (map (λ~> op ...) read-expr))
    (provide answer) ; so that requiring modules can see the binding
    answer))
(provide aoclop-program)

And now I can either run the script or import the script into a Racket file and expect answer to be there.
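A sketch of that second use, assuming the script above is saved as day1.rkt next to a plain Racket file:

(require "day1.rkt")
answer ; => the aoclop program's result, now an ordinary Racket binding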