.. _getting-started: Getting Started =============== **DaeDaLus** is a domain-specific programming language for specifying *parsers*. It supports *data-dependent* parsing, meaning that the behavior of a parser can be affected by the *semantic values* parsed from other parts of the input. This makes DaeDaLus extremely well-suited for the concise and precise specification of both text and binary formats. DaeDaLus parser specifications may be directly applied to inputs via the ``daedalus`` interpreter to produce serialized representations of the resulting semantic values, *or* the specifications may be compiled to either Haskell or C++ sources for parsers to be used in larger software ecosystems. .. note:: This tutorial assumes that you have a working knowledge of common programming concepts and are comfortable working at the command line. Familiarity with functional programming will be particularly helpful but is not a strict requirement; any concepts needed will be covered in the relevant sections. Installation ------------ DaeDaLus currently has no binary releases available for installation, but it can be easily built and installed from source. Since DaeDaLus is implemented in Haskell, you will need a suitable Haskell environment; the easiest way to set this up is with `ghcup `_. Simply follow the instructions on that page, and you will have the ability to install the necessary components to build DaeDaLus. At the time of this writing, we recommend that you use ``ghcup`` to install GHC 8.10.7 and Cabal 3.6.2.0: .. code-block:: bash > ghcup install ghc 8.10.7 && ghcup set ghc 8.10.7 > ghcup install cabal 3.6.2.0 && ghcup set cabal 3.6.2.0 If you prefer, you can run ``ghcup tui`` to interactively install and set default GHC and Cabal versions. With Haskell installed, clone `the DaeDaLus repository `_. Once cloned, ``cd`` into the repository root and run: .. code-block:: bash > cabal install exe:daedalus --installdir=DIR --overwrite-policy=always This will build and install the executable ``daedalus``, placing links to the executable in the directory ``DIR``. We recommend setting this installation directory to something on your ``PATH``, so that the interpreter can be invoked simply as ``daedalus`` at the command line -- the remainder of this tutorial assumes this setup for brevity. The ``--overwrite-policy=always`` flag will make sure that, if you clone a new version of ``daedalus``, this installation step will overwrite the existing versions of the executable installed on the system. The DaeDaLus Command-Line Interface ----------------------------------- Once DaeDaLus has been compiled and installed, we recommend reading :ref:`The Command-Line Tools` section to become familiar with how to run the DaeDaLus tool. For the purposes of this tutorial, it will be helpful to be familiar with the ``show-types`` DaeDaLus command. This command is useful to get DaeDaLus to show you the types of the elements of your specification, both to check that your specification is valid and also to serve as a learning aid while going through the tutorial. DaeDaLus Syntax Highlighting / Editing Modes -------------------------------------------- The DaeDaLus repository ships with support for editing DaeDaLus specifications in a few popular editors. The files can be found in the ``syntax-highlight`` subdirectory of the DaeDaLus repository. Please see your editor's documentation for details on how to install the files for your editor of choice. Downloading The Sample Specifications ------------------------------------- For convenience, the two full specifications we look at in this tutorial, for the PPM and PNG image formats, are provided as :download:`a compressed TAR <../../build/examples.tar.gz>`. You should follow along with the ``plain-ppm.ddl`` specification for the first part, and fill out the ``png-template.ddl`` as you complete the exercises. ``png.ddl`` is the full solution to all the PNG exercises. Your First DaeDaLus Specification --------------------------------- In order to give a feel for what DaeDaLus specifications look like, we now present a well-known image format, PPM, and a DaeDaLus parser for it. This example will be broken down in detail in the following sections of the tutorial as a means of exploring the available language features. The Portable PixMap Format ^^^^^^^^^^^^^^^^^^^^^^^^^^ PPM is a simple image format designed to make exchange between different platforms easy. For the purposes of this introduction, we'll be looking specifically at the ASCII PPM format, which describes color RGB images in a human-readable format. Informally, this format consists of: 1. A magic number identifying the file type (for ASCII PPM, this is ``P3``) 2. The dimensions of the image (width then height) 3. The maximum color value 4. A 'matrix' of RGB triples for each pixel defined in row-major order The format also allows for single-line comments, but we will ignore these for now. An Example PPM Image ^^^^^^^^^^^^^^^^^^^^ Here is a small example of a PPM image: .. code-block:: P3 4 4 15 0 0 0 0 0 0 0 0 0 15 0 15 0 0 0 0 15 7 0 0 0 0 0 0 0 0 0 0 0 0 0 15 7 0 0 0 15 0 15 0 0 0 0 0 0 0 0 0 We can match this up with the format description given above: 1. The magic number is ``P3``, indicating an ASCII RGB image 2. The width and height are both ``4`` 3. The maximum color value is ``15`` 4. There is a four-by-four grid of triples, one triple per pixel A DaeDaLus PPM Specification ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ Our goal now is to provide a DaeDaLus specification for this format, so that we may parse well-formed PPM values into semantic values for further processing in Haskell or C++ (you might imagine we are writing a program to transform images represented in this PPM format). Here it is: .. literalinclude:: ../../build/downloads/source/examples/plain-ppm.ddl :language: DaeDaLus :linenos: Note that this specification only specifies the format's data layout and does not perform any validation of the image data (such as checking whether the color values don't exceed the declared maximum value). Later, we’ll discuss the pros and cons of including validation in parsers and some strategies for deciding whether or not that is best left to other parts of the application consuming the formatted data. For now, let’s break down this example to understand the building blocks of parser specifications.