| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
| 1. Introduction | ||
| 2. User Guide | ||
| 3. Standard Rules | ||
| 4. Rolling Your Own | ||
| 5. Installation | ||
| 6. Bugs and Shortcomings |
This is a guide to using DrIFT, a type sensitive preprocessor for Haskell 98.
DrIFT is a tool which parses a Haskell module for structures (data & newtype declarations) and commands. These commands cause rules to be fired on the parsed data, generating new code which is then appended to the bottom of the input file, or redirected to another. These rules are expressed as Haskell code, and it is intended that the user can add new rules as required.
DrIFT is written in pure Haskell 98, however code it generates is free to make use of extensions when appropriate. DrIFT is currently tested against hugs and ghc.
| 1.1 So, What Does DrIFT do? | ||
| 1.2 Features | ||
| 1.3 Why Do We Need DrIFT? | ||
| 1.4 An Example |
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
DrIFT allows derivation of instances for classes that aren't supported by the standard compilers. In addition, instances can be produced in separate modules to that containing the type declaration. This allows instances to be derived for a type after the original module has been compiled. As a bonus, simple utility functions can also be produced for types.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
Currently supported derivations are the following. This list is obtainable by
running DrIFT -l.
Binary:
Binary efficient binary encoding of terms
GhcBinary byte sized binary encoding of terms
Debugging:
Observable HOOD observable
General:
NFData provides 'rnf' to reduce to normal form (deepSeq)
Typeable derive Typeable for Dynamic
Generics:
FunctorM derive reasonable fmapM implementation
HFoldable Strafunski hfoldr
Monoid derive reasonable Data.Monoid implementation
RMapM derive reasonable rmapM implementation
Term Strafunski representation via Dynamic
Prelude:
Bounded
Enum
Eq
Ord
Read
Show
Representation:
ATermConvertible encode terms in the ATerm format
Haskell2Xml encode terms as XML (HaXml<=1.13)
XmlContent encode terms as XML (HaXml>=1.14)
Utility:
Parse parse values back from standard 'Show'
Query provide a QueryFoo class with 'is', 'has',
'from', and 'get' routines
from provides fromFoo for each constructor
get for label 'foo' provide foo_g to get it
has hasfoo for record types
is provides isFoo for each constructor
test output raw data for testing
un provides unFoo for unary constructors
update for label 'foo' provides 'foo_u' to update it
and foo_s to set it
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
The original motivation for DrIFT came from reading one of the Glasgow Parallel Haskell papers on Strategies. Strategies require producing instances of a class which reduces to normal form (called NFData). It was commented that it was a shame that instances of NFData couldn't be automatically derived; the rules to generate the instances are simple, and adding instances by hand is tiresome. Many classes' instances follow simple patterns. This is what makes coding up instances so tedious: there's no thought involved!
The idea to extend DrIFT to work on imported types came from a discussion of the Haskell mailing list, arising from a point made by Olaf Chitil :
Why is the automatic derivation of instances for some standard classes linked to data and newtype declarations? It happened already several times to me that I needed a standard instance of a data type that I imported from a module that did not provide that instance and which I did not want to change (a library; GHC, which I mainly want to extend by further modules, not spread changes over 250 modules). When declaring a new data type one normally avoids deriving (currently) unneeded instances, because it costs program code (and maybe one even wants to enable the user of the module to define his own instances).
The third feature of DrIFT, providing utility functions to manipulate new types, especially records was caused by finding oneself writing the same sort of code over and over again. These functions couldn't be captured in a class, but have a similar form for each type they are defined on. A thread on the Haskell mailing list made a related point: untagging and manipulating newtypes was more cumbersome than it should be.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
Here's an example of what how DrIFT is used. This Haskell module
contains commands to the DrIFT preprocessor. These are annotated with
{-! ... !-}. After processing with DrIFT the generated code
is glued on the bottom of the file, beneath a marker indicating where
the new code starts. The machine generated code is quite long, and
would really have been a drudge to type in by hand.
| 1.4.1 Source Code | ||
| 1.4.2 After processing with DrIFT |
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
-- example script for DrIFT
module Example where
import Foo
{-!for Foo derive : Read,NFData !-} -- apply rules to imported type
{-! global : is !-} -- global to this module
{-!for Data derive : update,Show,Read!-} -- stand alone comand syntax
{-!for Maybe derive : NFData !-} -- apply rules to prelude type
data Data = D {name :: Name,
constraints :: [(Class,Var)],
vars :: [Var],
body :: [(Constructor,[(Name,Type)])],
derive :: [Class],
statement :: Statement}
data Statement = DataStmt | NewTypeStmt
deriving Eq {-!derive : Ord,Show,Read !-} -- abbreviated syntax
|
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
module Example where
import Foo
{-!for Foo derive : Read,NFData !-} -- apply rules to imported type
{-! global : is !-} -- global to this module
{-!for Data derive : update,Show,Read!-} -- stand alone comand syntax
{-!for Maybe derive : NFData !-} -- apply rules to prelude type
data Data = D {name :: Name,
constraints :: [(Class,Var)],
vars :: [Var],
body :: [(Constructor,[(Name,Type)])],
derive :: [Class],
statement :: Statement}
data Statement = DataStmt | NewTypeStmt
deriving Eq {-!derive : Ord,Show,Read !-}
{-* Generated by DrIFT-v1.0 : Look, but Don't Touch. *-}
isD (D aa ab ac ad ae af) = True
isD _ = False
instance Ord Statement where
compare DataStmt (DataStmt) = EQ
compare DataStmt (NewTypeStmt) = LT
compare NewTypeStmt (DataStmt) = GT
compare NewTypeStmt (NewTypeStmt) = EQ
instance Show Statement where
showsPrec d (DataStmt) = showString "DataStmt"
showsPrec d (NewTypeStmt) = showString "NewTypeStmt"
instance Read Statement where
readsPrec d input =
(\ inp -> [((DataStmt) , rest)
| ("DataStmt" , rest) <- lex inp])
input
++
(\ inp ->
[((NewTypeStmt) , rest)
| ("NewTypeStmt" , rest) <- lex inp])
input
isDataStmt (DataStmt) = True
isDataStmt _ = False
isNewTypeStmt (NewTypeStmt) = True
isNewTypeStmt _ = False
instance (NFData a) => NFData (Maybe a) where
rnf (Just aa) = rnf aa
rnf (Nothing) = ()
body_u f r@D{body} = r{body = f body}
constraints_u f r@D{constraints} = r{constraints = f constraints}
derive_u f r@D{derive} = r{derive = f derive}
name_u f r@D{name} = r{name = f name}
statement_u f r@D{statement} = r{statement = f statement}
vars_u f r@D{vars} = r{vars = f vars}
body_s v = body_u (const v)
constraints_s v = constraints_u (const v)
derive_s v = derive_u (const v)
name_s v = name_u (const v)
statement_s v = statement_u (const v)
vars_s v = vars_u (const v)
instance Show Data where
showsPrec d (D aa ab ac ad ae af) = showParen (d >= 10)
(showString "D" . showChar '{' .
showString "name" . showChar '=' . showsPrec 10 aa
. showChar ',' .
showString "constraints" . showChar '=' . showsPrec 10 ab
. showChar ',' .
showString "vars" . showChar '=' . showsPrec 10 ac
. showChar ',' .
showString "body" . showChar '=' . showsPrec 10 ad
. showChar ',' .
showString "derive" . showChar '=' . showsPrec 10 ae
. showChar ',' .
showString "statement" . showChar '=' . showsPrec 10 af
. showChar '}')
instance Read Data where
readsPrec d input =
readParen (d > 9)
(\ inp ->
[((D aa ab ac ad ae af) , rest) | ("D" , inp) <- lex inp ,
("{" , inp) <- lex inp , ("name" , inp) <- lex inp ,
("=" , inp) <- lex inp , (aa , inp) <- readsPrec 10 inp ,
("," , inp) <- lex inp , ("constraints" , inp) <- lex inp ,
("=" , inp) <- lex inp , (ab , inp) <- readsPrec 10 inp ,
("," , inp) <- lex inp , ("vars" , inp) <- lex inp ,
("=" , inp) <- lex inp , (ac , inp) <- readsPrec 10 inp ,
("," , inp) <- lex inp , ("body" , inp) <- lex inp ,
("=" , inp) <- lex inp , (ad , inp) <- readsPrec 10 inp ,
("," , inp) <- lex inp , ("derive" , inp) <- lex inp ,
("=" , inp) <- lex inp , (ae , inp) <- readsPrec 10 inp ,
("," , inp) <- lex inp , ("statement" , inp) <- lex inp ,
("=" , inp) <- lex inp , (af , inp) <- readsPrec 10 inp ,
("}" , rest) <- lex inp])
input
-- Imported from other files :-
instance Read Foo where
readsPrec d input =
(\ inp -> [((Foo) , rest)
| ("Foo" , rest) <- lex inp]) input
++
(\ inp -> [((Bar) , rest)
| ("Bar" , rest) <- lex inp]) input
++
(\ inp -> [((Bub) , rest)
| ("Bub" , rest) <- lex inp]) input
instance NFData Foo where
rnf (Foo) = ()
rnf (Bar) = ()
rnf (Bub) = ()
|
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
This chapter assumes that DrIFT has already been installed and the environment variables set up. The installation is handled in Installation.
Briefly, the way DrIFT works is
Rules can be applied to any types defined using a data or
newtype statement. Rules can't be applied to types defined using
type, as this only produces a synonym for a type. Don't
try to use rules on type synonyms.
| 2.1 The Command Line | ||
| 2.2 Command Syntax | ||
| 2.3 Emacs DrIFT mode |
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
DrIFT processes standard Haskell scripts (suffix ‘.hs’) and
literate scripts (suffix ‘.lhs’). Currently, only literate code
using > is accepted: DrIFT doesn't understand the TeX style
of literate programming using \begin{code}.
If you've compiled up an executable from the source code (or are using Runhugs) to run DrIFT over a file type :-
DrIFT filename
Alternatively, for Hugs, use :-
runhugs DrIFT filename (run DrIFT over filename)
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
Commands to DrIFT are entered into Haskell code in the form of
annotations. DrIFT's annotations start with {-! and finish
with !-}. (This is so they don't clash with the compiler annotations
given to GHC or HBC). There are three forms of command.
{-! for type derive :
rule1,rule2,… !-})
This is the basic form of DrIFT command. It asks DrIFT to apply the
listed rules to the specified type. If the type is parameterised,
e.g. Maybe a, just enter the type name into the command, omitting
any type variables. DrIFT assumes that types given are currently in
scope, and will first search the current module. If it fails to find a
matching type definition, the prelude and any imported modules are also
searched. This is the only command which allows code to be generated
for a type defined in another module.
{-! derive :rule1,rule2,… !-})
This command is appended to the end of a data or newtype
definition, after the deriving clause, if present. It applies the listed
rules to the type it is attached to.
{-! global :rule1,rule2,… !-}
This command applies the listed rules
to all types defined within the module. Note that this command doesn't
cause code to be generated for types imported from other modules.
For an example of these commands in use, See section An Example.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
infix, import,newtype). It
doesn't matter what position they occur within the module.
>).
-- and {- .. -} in the usual way.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
For Emacs fans, Hans W Loidl hwloidl@dcs.gla.ac.uk has written a script which allows DrIFT to be run within a buffer.
The commands available are
M-x hwl-derive, C-c d d runs DrIFT over the current
buffer, and then updates the buffer.
M-x hwl-derive-insert-standalone, C-c d s inserts a
template for a standalone command into the current buffer at the
cursor position.
M-x hwl-derive-insert-local, C-c d l inserts a template
for an abbreviated command.
M-x hwl-derive-insert-global, C-c d g inserts a template
for a global command
In `hugs-mode' these functions are also available vie a menu item in the hugs menu.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
Heres a listing of the rules that come pre-defined with DrIFT. If you want a more detailed idea of how they work, their definitions are in the file ‘StandardRules.hs’, and are (fairly) well documented. In the following list the highlighted text is the name of the rule, as used in commands. The naming convention for rules is names starting with a capital generate an instance for the class of the same name. Sets of functions are generated by a name beginning with a lower case letter.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
The classes Eq, Ord, Enum, Show, Read & Bounded are described in the Haskell report as being derivable; DrIFT provides rules for all these.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
Originally, NFData (for Normal Form evaluation strategies)
was the only other class to have a rule. But now, there are rules for
many more classes from 3rd-party libraries, e.g. XmlContent
from HaXml, Binary from nhc98, Term from Strafunski,
FunctorM for Generics, Observable for HOOD debugging,
Typeable for dynamics, and so on. For a full list, use the
--list command-line option.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
newtype.
For a type
newtype Foo a = F a,un produces the function
unFoo :: Foo a -> a.
For a type
data Foo = Bar | Bub, is generates
isBar :: Foo -> BoolandisBub :: Foo -> Bool.
For a type
data Foo a = F{bar :: a,bub :: Int}has generates
hasbar :: Foo a-> Boolandhasbub :: Foo a -> Bool.
For a type
data Foo a = F{bar :: a, bub ::Int}update generates
bar_u :: (a -> a) -> Foo a -> Foo aand
bub_u :: (Int -> Int) -> Foo a -> Foo awhich apply a function to a field of a record, and then return the updated record. If the value does not have the given field then the value is returned unchanged.
bar_s :: a -> Foo a -> Foo aandbub_s ::Int -> Foo a -> Foo aare also generated, and are used to set the value of a field in a record.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
Programmers who only wish to use the pre-defined rules in DrIFT don't need to read or understand the following section. However, as well as using the supplied rules, users are encouraged to add their own. There is a stub module ‘UserRules.hs’ in the source, to which rules can be added.
If a compiled version of DrIFT is being used, the program will then
have to be recompiled before the new rules can be used. However, if the
Runhugs standalone interpreter is used, this is not necessary. Due to
the way Runhugs searches for modules to load, a user may have many
copies of the UserRules module. The UserRules module in the current
directory will be loaded first. If that is not present, then the
HUGSPATH environment variable is searched for the module. So it is
possible to have a default UserRules module, and specialised ones for
particular projects.
| 4.1 The Basic Idea | ||
| 4.2 How is a Type Represented? | ||
| 4.3 Pretty Printing | ||
| 4.4 Utilities | ||
| 4.5 Adding a new rule |
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
A rule is a tuple containing a string and a function. The string is the name of the rule, and is used in commands in an input file. The function maps between the abstract representation of a datatype and text to be output (A sort of un-parser, if you like). The best way to understand this is to have a look at the existing rules in ‘StandardRules.hs’. This module is quite well documented.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
A type is represented within DrIFT using the following data definition.
>data Statement = DataStmt | NewTypeStmt deriving (Eq,Show)
>data Data = D { name :: Name, -- type name
> constraints :: [(Class,Var)],
> vars :: [Var], -- Parameters
> body :: [Body],
> derives :: [Class], -- derived classes
> statement :: Statement}
> | Directive
> | TypeName Name deriving (Eq,Show)
>type Name = String
>type Var = String
>type Class = String
|
A Data type represents one parsed data or newtype
statement. These are held in a D constructor record (the
Directive and TypeName constructors are just used internally by
DrIFT). We'll now examine each of the fields in turn.
name holds the name of the new datatype as a string.
constraints list the type constraints for the type variables of
the new type. e.g. for data (Eq a) => Foo a = F a, the value of
constraints would be [("Eq","a")].
vars contains a list of the type variables in the type. For the
previous example, this would simply be ["a"] .
body is a list of the constructors of the type, and the
information associated with them. We'll come back to this in a moment.
derives lists the classes that the type an instance of though
using the deriving clause.
statement indicates whether the type was declared using a
newtype or data statement
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
>data Body = Body { constructor :: Constructor,
> labels :: [Name],
> types :: [Type]} deriving (Eq,Show)
>type Constructor = String
|
The body type holds information about one of the constructors of a type.
constructor is self-explanatory. labels holds the names
of labels of a record. This will be blank if the constructor isn't a
record. types contains a representation of the type of each
value within the constructor. The definition of Type is as
follows.
>data Type = Arrow Type Type -- fn > | Apply Type Type -- application > | Var String -- variable > | Con String -- constructor > | Tuple [Type] -- tuple > | List Type -- list > deriving (Eq,Show) |
Few of the deriving rules supplied have actually needed to use this type information, which I found quite surprising. If you do find you need to use it, one example is the Haskell2Xml rule.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
Instead of producing a string as output, rules produce a value of type
Doc. This type is defined in the Pretty Printing Library implemented
by Simon Peyton-Jones. The pretty printer ensures that the code is
formatted for readability, and also handles problems such as
indentation. Constructing output using pretty printing combinators is
easier and more structured than manipulating strings too. For those
unfamiliar with these combinators, have a look at the module
‘Pretty.lhs’ and the web page http://www.cse.ogi.edu/~simonpj/
or for more detail the paper The Design of a Pretty Printing
Library, J. Hughes
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
Upon the pretty printing library, DrIFT defines some more formatting functions which make regularly occurring structures of code easier to write. These structures include simple instances, blocks of code, lists, etc. The utilities are in the module ‘RuleUtils.hs’ and should be self explanatory.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
A rule has type type Rule = (String,Data -> Doc). Once you have
written your mapping function and chosen an appropriate name for the
rule, add this tuple to the list userRules :: [Rule]
in module ‘UserRules.hs’. Recompile if necessary. DrIFT will then call this rule when
its name occurs in a command in an input file.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
DrIFT isn't a large or complicated application, so it shouldn't be too hard for anyone to get it up and running. For the platform you want to install for, read the corresponding section below, then see Environment Variables
| 5.1 GHC | ||
| 5.2 Hugs | ||
| 5.3 Runhugs | ||
| 5.4 Environment Variables | ||
| 5.5 Installing the Emacs DrIFT Mode |
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
the automake script should automatically detect any ghc or nhc installation and
use that to build and install DrIFT. First run ./configure . To
compile, type make all. The executable produced ‘DrIFT’ can then
be installed with make install.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
The DrIFT code comes as a set of Haskell modules. You want to copy all
these to somewhere in your HUGSPATH, then you can load and run
DrIFT in any directory.
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
Edit the first line of the the file ‘DrIFT’ to point to your copy
of runhugs. Copy ‘DrIFT’ to somewhere on your PATH, and
the remainder of the source (‘*.hs’,‘*.lhs’) to a directory in your HUGSPATH
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
In you environment set DERIVEPATH to the list of directories you
wish derive to search for modules / interfaces.
DERIVEPATH is quite fussy about the format the list should take :-
For instance
good - /users/nww/share/hugs/lib:/users/nww/share/hugs/lib/hugs
bad - /users/nww/share/hugs/lib/: /users/nww/share/hugs/lib/hugs/
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
Edit ‘derive.el’ so that the variable hwl-derive-cmd contains your
copy of the DrIFT executable.
Place ‘derive.el’ into a directory on your load-path, byte-compile it and put the following command into your ‘.emacs’ file:
(load "derive")
| [ < ] | [ > ] | [ << ] | [ Up ] | [ >> ] | [Top] | [Contents] | [Index] | [ ? ] |
| [Top] | [Contents] | [Index] | [ ? ] |
| [Top] | [Contents] | [Index] | [ ? ] |
This document was generated on February, 10 2008 using texi2html 1.78.
The buttons in the navigation panels have the following meaning:
| Button | Name | Go to | From 1.2.3 go to |
|---|---|---|---|
| [ < ] | Back | Previous section in reading order | 1.2.2 |
| [ > ] | Forward | Next section in reading order | 1.2.4 |
| [ << ] | FastBack | Beginning of this chapter or previous chapter | 1 |
| [ Up ] | Up | Up section | 1.2 |
| [ >> ] | FastForward | Next chapter | 2 |
| [Top] | Top | Cover (top) of document | |
| [Contents] | Contents | Table of contents | |
| [Index] | Index | Index | |
| [ ? ] | About | About (help) |
where the Example assumes that the current position is at Subsubsection One-Two-Three of a document of the following structure:
This document was generated on February, 10 2008 using texi2html 1.78.