A small strongly, statically typed compile-to-JS language with basic extensible records and type-safe blocks
Bitcoin donations are gratefully accepted at 14ZhCym28WDuFhocP44tU1dBpCzjX1DvhF.
Current Features
- Full Type Inference
- Higher Kinded Polymorphism
- Support for basic Javascript types
- Extensible records
- Type-safe blocks with for/while/assignment etc.
- Functions with multiple arguments
- Pattern matching
- Simple FFI
- Modules
- Rank N Types
- Do Notation
- Optional tail-call elimination
- Type Classes
Try It!
Try PureScript in the browser here.
License
See the LICENSE file
Contributing
Contributions are always encouraged! Please see CONTRIBUTING.md for guidelines.
- Join the discussion on FreeNode at #purescript.
- Follow @purescript on Twitter for development updates
Installation
Install the Haskell Plaform, then cabal install
Usage
The psc executable takes a list of PureScript source files as arguments and by default writes out its errors or output to the console.
The following options are supported:
-s --stdin
Read input from standard input instead of from files
-o --output
Write the generated Javascript to the specified file
-e --foreign imports
Write a list of foreign imports declarations to the specified file in addition to generating Javascript output
--runtime-type-checks
Generate simple runtime type checks for function arguments with simple types.
--tco
Perform tail-call elimination on the generated Javascript.
--no-prelude
Do not include the Prelude in the generated Javascript.
--magic-do
Overload the `do` keyword to inline calls to `bind` for the `Eff` monad, to generate more efficient code.
--run-main
Generate a call to `Main.main` after all other generated Javascript.
Motivation
I was looking for a simple functional language which would compile to JavaScript and have the following characteristics:
- Generate simple readable Javascript
- Provide the ability to compile to tight loops if necessary
- Reasonable type system
- Ideally, written in a programming language I would enjoy working in
- Provides a simple interface to existing Javascript code
I didn't find exactly what I was looking for, so I wrote PureScript. It doesn't have everything right now, but it should serve as a simple core on which to develop new ideas.
PureScript is not designed to be a general-purpose programming language. The primary use case is as a generator for purely-functional core libraries, with the main application code written in another language.
PureScript can also be seen as a trade-off between a theoretically ideal language and one which generates reasonably high performance code.
Hello, PureScript!
As an introductory example, here is the usual "Hello World" written in PureScript:
module Main where import Trace main = trace "Hello, World!"
which compiles to the following Javascript, ignoring the Prelude:
var Main; (function (Main) { var main = trace("Hello, World!"); Main.main = main; })(Main = Main || {});
The following command will compile and execute the PureScript code above:
psc input.purs --run-main | nodejs
Another Example
The following code defines a Person data type and a function to generate a string representation for a Person:
data Person = Person { name :: String, age :: Number } foreign import numberToString :: Number -> String showPerson :: Person -> String showPerson (Person o) = o.name ++ ", aged " ++ numberToString(o.age)
Line by line, this reads as follows:
Personis a data type with one constructor, also calledPerson- The
Personconstructor takes an object with two properties,namewhich is aString, andagewhich is aNumber
- The
- The
numberToStringfunction is written in Javascript, and converts aNumberto itsStringrepresentation - The
showPersonfunction takes aPersonand returns aString showPersonworks by case analysis on its argument, first matching the constructorPersonand then using string concatenation and object accessors to return its result.
The generated Javascript looks like this:
var Person = function (value) { return { ctor: 'Person', value: value }; }; function showPerson(_1) { if (_1.ctor === "Person") { return _1.value.name + ", aged " + numberToString(_1.value.age); }; throw "Failed pattern match"; };
Type System
The type system defines the following types:
- Primitive Types
- Number
- String
- Boolean
- Arrays
- E.g.
[String],[[Number]]
- E.g.
- Records
- E.g.
{ foo :: String, bar :: Number }
- E.g.
- Tagged Unions
- E.g.
data Foo a = Foo | Bar String
- E.g.
- Functions
- E.g.
Number -> String - E.g.
(Number, Number) -> Number - Functions can have zero or more arguments
- E.g.
- Polymorphic types
- E.g.
forall a. a -> a
- E.g.
Primitive Types
The three primitive types String, Number and Boolean correspond to their Javascript equivalents at runtime.
PureScript supports the same binary and unary operations on primitive types as Javascript, with the following exceptions:
- String concatenation is denoted
++to differentiate it from numeric addition,+ - PureScript's
==and!=correspond to Javascript's strong equality tests===and!==
Examples:
num = 1 + 2 * 3 str = "Hello, " ++ "World!" bool = 1 > 2 || true
Arrays
PureScript arrays correspond to Javascript arrays at runtime, but all elements must have the same type.
Array literals look like Javascript array literals: [1, 2, 3]
Array elements can be read using array index notation arr !! index
Records
PureScript records correspond to Javascript objects.
Record literals look like Javascript object literals: { foo: "Foo" }
Properties can be read by using dot notation: o.foo
Tagged Unions
Tagged unions consist of one or more constructors, each of which takes zero or one arguments.
Tagged unions can only be created using their constructors, and deconstructed through pattern matching (see later).
For example:
data Foo a = Foo | Bar String runFoo Foo = "It's a Foo" runFoo (Bar s) = "It's a Bar. The string is " ++ s test = runFoo Foo ++ runFoo (Bar "Test")
In the example, Foo is a tagged union type which has two constructors. It's first constructor Foo takes no argument, and it's second Bar takes one, which must be a String.
runFoo is an example of pattern matching on a tagged union type to discover its constructor, and the last line shows how Foos are constructed.
Functions
Functions in PureScript can have zero or more arguments in general, just like in Javascript.
Functions are introduced by using a backslash followed by a list of argument names:
which would correspond to the Javascript
function test1(a) { return function (b) { return a + b; } }
Multiple argument functions can be introduced by wrapping the arguments in parentheses, and separating them with commas:
which generates
function test1(a, b) { return a + b; }
In the case of a function with no arguments, the parentheses may be omitted, as follows:
which would correspond to the Javascript function test2() { return 100; }
Multiple-argument and single-argument syntax can be mixed, as follows:
test3 = \a (b, c) d -> a + b + c + d
which generates
function test3(a) { return function (b, c) { return function (d) { return a + b + c + d; } } }
Functions are applied by providing their arguments inside parentheses:
A special case is made in the case of functions with one argument. These functions can be applied without parentheses, and function application associates to the left:
-- has type Number -> Number -> Number -> Number addThree = \a b c -> a + b + c -- has type Number -> Number -> Number addThree 1 -- has type Number -> Number addThree 1 2 -- has type Number addThree 1 2 3
Polymorphic Types
Expressions defined at the top level may have polymorphic types.
Here is an example:
identity is inferred to have (polymorphic) type forall t0. t0 -> t0. This means that for any type t0, identity can be given a value of type t0 and will give back a value of the same type.
A type annotation can also be provided:
identity :: forall a. a -> a identity x = x
Functions may also be polymorphic in row types or type variables with other kinds (see "Kind System"):
addProps o = o.foo + o.bar
Here, addProps is inferred to have type forall r. { foo :: Number, bar :: Number | r } -> Number. That is, it can take any type which has properties Foo and Bar, and any other record properties.
So, the following compiles:
addProps { foo: 1, bar: 2, baz: 3 }but the following does not:
since the bar property is missing.
Again, a type annotation can be provided if necessary.
Rank N Types
It is also possible for the forall quantifier to appear on the left of a function arrow, inside types record fields and data constructors, and in type synonyms.
In most cases, a type annotation is necessary when using this feature.
As an example, we can pass a polymorphic function as an argument to another function:
poly :: (forall a. a -> a) -> Boolean poly f = (f 0 < 1) == f true
Notice that the polymorphic function's type argument is instantiated to both Number and Boolean.
An argument to poly must indeed be polymorphic. For example, the following fails:
test = poly (\n -> n + 1)
since the skolemized type variable a does not unify with Number.
Type Inference
All types can be inferred, but annotations can optionally be provided.
Kind System
There are two primitive kinds, the kind * of types and the kind ! of effects.
For each kind k there is also a kind # k of rows, with types of kind k. For example # * is the kind of rows of types, as used to define records, and # ! is the kind of rows of effects, used to define the monad Eff of extensible effects.
Higher kinded types are also supported. That is, a type variable can refer to not only a type or a row, but a type constructor, or row constructor etc.
Whitespace
Syntax is whitespace sensitive. The general rule of thumb is that declarations which span multiple lines should be indented past the column on which they were first defined on their subsequent lines.
That is, the following is valid:
But this is not:
Blocks
Blocks are collections of statements wrapped in braces { ... }. Blocks must return a value of the same type on every branch of execution.
The following types of statement are supported:
- Variable introduction
- Variable assignment
- For loops
- For-each loops
- While loops
- If-Then-Else statements
Here is an example of a power function defined using a block:
pow n p = { var m = n; for (i <- 0 until p) { m = m * n; } return m; }
Blocks enable local mutation of their variables, but mutation is not allowed in general. The type system prevents mutable variables from escaping their scope.
That is, while the example above is valid, the following does not compile:
incr n = { n = n + 1; return n; }
The variable n is not mutable, and so the assignment in the first line of the do block is not allowed.
This function can be rewritten as follows:
incr n = { var m = n; m = m + 1; return m; }
For Loops
For loops look like this:
total = { var n = 0; for (i <- 0 until 10) { n = n + i; } return n; }
The bounds 0 and 10 are inclusive and exclusive respectively.
For-Each Loops
For each loops loop over the elements in an array using the Object.forEach method. A polyfill may be required for some browsers:
total arr = { var n = 0; foreach (i in arr) { n = n + i; } return n; }
While Loops
The syntax of a while loop is similar to a foreach loop:
log2 n = { var count = 0; var m = n; while (m > 1) { m = m / 2; count = count + 1; } return count; }
If-Then-Else Statements
Else branches are optional, and may contain further if statements, just as in Javascript:
collatz n = { var count = 0; var m = n; while (m > 1) { if (m % 2 == 0) { m = m / 2; } else { m = m * 3 + 1; } count = count + 1; } return count; }
If-Then-Else Expressions
The if, then and else keywords can also be used to create conditional expressions. In this case, the else block is always required.
For example,
conditional = if 2 > 1 then "ok" else "oops"
Do Notation
The do keyword introduces simple syntactic sugar for monadic expressions.
Here is an example, using the maybe monad:
data Maybe a = Nothing | Just a instance Monad Maybe where ret = Just (>>=) Nothing _ = Nothing (>>=) (Just a) f = f a isEven n | n % 2 == 0 = Just {} isEven _ = Nothing evenSum a b = do n <- a m <- b let sum = n + m isEven sum ret sum
isEven adds two values of type Maybe Number and returns their sum, if the sum is even. If the sum is odd, evenSum returns Nothing.
This example illustrates the following aspects of do notation:
- The corresponding type constructor must be an instance of the
Prelude.Monadtype class, which defines theretand>>=functions. - Statements can have the following form:
a <- xwhich desugars tom.bind x (\a -> ...)let a = xwhich desugars to(\a -> ...)(x)xwhich desugars tom.bind x (\_ -> ...)or justxif this is the last statement.
Not illustrated here, but equally valid is the use of a binder on the left hand side of <- or =. For example:
test arr = do (x:y:_) <- arr ret (x + y)
A pattern match failure will generate a runtime exception, just as in the case of a regular case statement.
Pattern Matching
Pattern matching deconstructs a value to bring zero or more expressions into scope. Pattern matches are introduced with the case keyword.
Pattern matches have the following general form
case value of pattern -> result -- ... pattern -> result
Pattern matching can also be used in the declaration of functions, as we have already seen:
fn pattern_1 ... pattern_n = result
Any of the above types of pattern are also valid when introducing functions. In addition, patterns can also be grouped in parentheses to introduce multiple-arugument functions. For example,
example x (y, z) w = x * y + z * w
The following pattern types are supported:
- Wildcard pattern
- Literal patterns
- Variable pattern
- Array patterns
- Cons patterns
- Record patterns
- Named patterns
- Guards
Patterns need not be exhaustive. A pattern match failed exception will be thrown at runtime if no pattern matches the input.
Wildcard Patterns
The wilcard _ matches any input and brings nothing into scope:
Literal Patterns
Literal patterns are provided to match on primitives:
case booleanValue of true -> 0 false -> 1 case stringValue of "Foo" -> 0 _ -> 1 case numericValue of 0 -> 0 _ -> 1
Variable Pattern
A variable pattern matches any input and binds that input to its name:
case foo(x) of result -> bar(result)
Array Patterns
Array patterns match an input which is an array, and bring its elements into scope.
For example:
f = \arr -> case arr of [x] -> x [x, y] -> x * y + f xs _ -> 0
Here, the first pattern only matches arrays of length one, and brings the first element of the array into scope.
The second pattern matches arrays with two elements, and brings the first and second elements into scope.
Cons Patterns
The head and tail of a non-empty array can be matched by using a cons pattern:
add = \arr -> case arr of [] -> 0 x : xs -> x + add xs
: associates to the right:
addPairs = \arr -> case arr of x : y : xs -> x * y + addPairs xs _ -> 0
Record Patterns
Record patterns match an input which is a record, and bring its properties into scope.
f = \o -> case o of { foo = "Foo" } -> o.bar _ -> 0
Nested Patterns
The patterns above can be combined to create larger patterns.
For example:
f = \o -> case o of { arr = x : _, take = "car" } -> x { arr = _ : x : _, take = "cadr" } -> x _ -> 0
Named Patterns
Named patterns bring additional names into scope when using nested patterns. Any pattern can be named by using the @ symbol:
f = \arr -> case arr of a@(_ : _ : _) -> true a -> false
Here, in the first pattern, any array with two or more elements will be matched and bound to the variable a.
Guards
Guards are used to impose additional constraints inside a pattern using boolean-valued expressions, and are introduced with a pipe after the pattern:
evens = \arr -> case arr of [] -> 0 x : xs | x % 2 == 0 -> 1 + evens xs _ : xs -> evens xs
When defining a function, guards appear after all patterns:
greater x y | x > y = true greater _ _ = false
Type Synonyms
For convenience, it is possible to declare a synonym for a type using the type keyword. Type synonyms can include type arguments.
For example:
type Foo = { foo :: Number, bar Number } addFoo :: Foo -> Number addFoo = \o -> o.foo + o.bar
Record Updates
Properties on records can be updated using the following syntax: o { key = value, ..., key = value }
For example, the following function increments the foo property on its argument:
incr = \o -> o { foo = o.foo + 1 }
The generated Javascript assumes the existence of a method called Object.extend such that Object.extend(o, p) takes an object o and generates a shallow copy of o including the properties of p. A simple JQuery implementation of this specification is
Object.prototype.extend = function(o, p) { return $.extend({}, o, p); };
Operators
In addition to the standard operators, user-defined infix operators can be created by enclosing names in parentheses:
E.g. to create a synonym for string concatenation:
(<>) = \s1 s2 -> s1 ++ s2 greeting = "Hello" <> "World!"
Regular functions can be used as operators by enclosing their names in backticks:
foo = \x y -> x * y + y test = 10 `foo` 20
Fixity declarations can associate a precedence level, which is a natural number, to a user-defined operator, and specify which way it associates:
Modules
Modules are introduced using the module keyword. All code must be placed in a module.
Introduce a module as follows, with a list of declarations:
Names may be qualified by using a dot:
foo = A.foo
bar = A.B.bar
All the names in a module can be aliased using the import declaration:
You can also limit which names are going to be aliased:
Foreign Function Interface
The foreign import keyword declares a value which is defined in Javascript, and its type:
foreign import pow :: (Number, Number) -> Number
To declare a new type with no constructors, use foreign import data and provide the kind:
foreign import data IO :: * -> * foreign import console :: { log :: String -> IO {} }
To alias a name of a field defined on a Javascript type to a PureScript function, use foreign import member. For example, to define a function length which accesses the length member of an array:
foreign import member "length" length :: forall a. [a] -> Number
Type Classes
PureScript has basic support for type classes via the class and instance keywords.
Type class instances must be of the form String, Number, Boolean, [t] or C t1 ... tn, and only a single type class parameter is supported.
Type class instances are resolved based on the order in which they appeared in the source files. In particular, overlapping instances are permitted.
class Show a where show :: a -> String instance Show String where show s = s instance Show Boolean where show true = "true" show false = "false" instance (Show a) => Show [a] where show arr = "[" ++ showArray arr ++ "]" showArray :: forall a. (Show a) => [a] -> String showArray [] = "" showArray [x] = show x showArray (x:xs) = show x ++ ", " ++ showArray xs example = show [true, false]
