Awesome

Calculating Correct Compilers

This repository contains the supplementary material for the paper "Calculating Correct Compilers" (Journal of Functional Programming, 25, 2015) by Patrick Bahr and Graham Hutton. The material includes Coq formalisations of all calculations in the paper. In addition, we also include Coq formalisations for calculations that were mentioned but not explicitly carried out in the paper.

Paper vs. Coq Proofs

The Coq proofs proceed as the calculations in the paper. There are, however, two minor technical difference due to the nature of the Coq system.

In the paper the derived VMs are tail recursive, first-order functions. The Coq system must be able to prove termination of all recursive function definitions. Since Coq's termination checker is not powerful enough to prove termination for some of the VMs (VMs from sections 3.1, 4.1, 5) or the VMs are not expected to terminate in general (VMs for lambda calculi / for language with loops), we had to define the VMs as relations instead. In particular, all VMs are defined as a small-step semantics. Each tail recursive function of a VM corresponds to a configuration constructor in the small-step semantics. As a consequence, the calculations do not prove equations, but rather instances of the relation =>>, which is the transitive, reflexive closure of the relation ==> that defines the VM.
The Coq files contain the final result of the calculation, and thus do not reflect the process of discovering the definition of the compiler and the VM. That is, the files already contain the full definitions of the compiler and the virtual machine. But we used the same methodology as described in the paper to develop the Coq proofs. This is achieved by initially defining the Code data type as an empty type, defining the ==> relation as an empty relation (i.e. with no rules), and defining the compiler function using the term Admit (which corresponds to Haskell's "undefined"). This setup then allows us to calculate the definition of the Code data type, the VM, and the compiler as described in the paper.

File Structure

Below we list the relevant Coq files for the calculations in the paper:

Arith.v: arithmetic expressions (section 2)
Exceptions.v: exceptions, first approach (section 3.1)
ExceptionsTwoCont.v: exceptions, second approach (section 3.2)
StateGlobal.v: global state (section 4.1)
StateLocal.v: local state (section 4.2)
Lambda.v: call-by-value lambda calculus (section 5)

In addition, we also include calculations for the following languages:

LambdaCBName.v: call-by-name lambda calculus
LambdaCBNeed.v: call-by-need lambda calculus
LambdaExceptions.v: call-by-value lambda calculus with exceptions
StateGlobalSeq.v: global state with explicit sequence operator
Loop.v: a simple imperative language with while loops

The remaining files are used to define the Coq tactics to support reasoning in calculation style (Tactics.v) and to specify auxiliary concepts (Heap.v, ListIndex.v).

Haskell Code

Haskell definitions of the calculated compilers from the paper can be found in the Haskell subdirectory. In addition, the extraction subdirectory contains Haskell definitions of the compilers generated from the Coq proofs using Coq's code extraction facility (see below).

Technical Details

Dependencies

Tested with Coq versions 8.17, 8.18

Proof Checking

The complete Coq development in this repository is proof-checked automatically. The current status is:

To check and compile the complete Coq development yourself, you can use the Makefile:

> make

Code Extraction

The Haskell definitions in the subdirectory extraction have be obtained by code extraction. The code extraction can be repeated as follows:

> make
> cd extraction
> make