Metalanguage and simple types¶

NASSLLI 2022: implementing semantic compositionality¶

Kyle Rawlins, kgr@jhu.edu ¶

Johns Hopkins University, Department of Cognitive Science¶

Goals for today:

Look at some more interesting examples
Look in more detail at the metalanguage implementation
Type inference in simple types

Composition systems, recap¶

What is an implemented composition system?

A set of composition operations
1. Python: implemented via a Python function that constructs a new metalanguage object given some input(s)
2. Combinator: implemented via a metalanguage combinator
A set of classes that describe object language $\Leftrightarrow$ metalanguage mappings, abstracted as Composables
A set of functions that, given some Composables, do a (brute-force) search over possible valid combinations
And, of course, the metalanguage itself
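A minimal sketch of the brute-force search idea, in plain Python rather than the lambda notebook's own classes. Everything here is a hypothetical stand-in chosen for illustration: a composable is a `(name, type, value)` triple, types are `"e"`, `"t"`, or a `(domain, range)` tuple, and `fa`, `pm`, and `compose` are invented names.

```python
E, T = "e", "t"
ET = (E, T)

def fa(f, x):
    """Function application: f of type <a,b> applied to x of type a."""
    _, f_type, f_val = f
    _, x_type, x_val = x
    if isinstance(f_type, tuple) and f_type[0] == x_type:
        return f_type[1], f_val(x_val)
    return None  # this operation does not apply

def pm(f, g):
    """Predicate modification: conjoin two <e,t> predicates."""
    _, f_type, f_val = f
    _, g_type, g_val = g
    if f_type == ET and g_type == ET:
        return ET, (lambda x: f_val(x) and g_val(x))
    return None

# (name, function, symmetric?) -- symmetric operations are tried in one order only
OPERATIONS = [("FA", fa, False), ("PM", pm, True)]

def compose(a, b):
    """Try every operation in every relevant order; keep whatever type-checks."""
    results = []
    for op_name, op, symmetric in OPERATIONS:
        orders = [(a, b)] if symmetric else [(a, b), (b, a)]
        for left, right in orders:
            out = op(left, right)
            if out is not None:
                new_type, new_val = out
                name = f"[{left[0]} {right[0]}]"
                results.append((op_name, (name, new_type, new_val)))
    return results

# Toy denotations, assumed for illustration only.
gray = ("gray", ET, lambda x: x in {"Kaline"})
cat = ("cat", ET, lambda x: x in {"Kaline", "Julius"})
kaline = ("kaline", E, "Kaline")

(op1, gray_cat), = compose(gray, cat)
(op2, sentence), = compose(kaline, gray_cat)
print(op1, op2, sentence)  # PM FA ('[[gray cat] kaline]', 't', True)
```

Each call returns a list of composition paths, so an ambiguous combination would simply yield more than one result.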

The combinator trick¶

Many composition operations that have been proposed in the literature can be represented as combinators.

The initial core Heim & Kratzer system is essentially the system determined by:

FA: $\lambda f . \lambda x . f(x)$ (also called the A combinator)
NN: $\lambda x . x$ (also called the I combinator)
PM: $\lambda f_{\langle e,t \rangle} . \lambda g_{\langle e,t \rangle} . \lambda x_e . f(x) \wedge g(x)$

(However, note that the first two are given here without types!)
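As a rough illustration, the three combinators can be mirrored as curried Python functions. Python does no type checking here, so this sketch ignores the typing question entirely; the toy denotations for `gray` and `cat` are assumptions made up for the example.

```python
# Curried Python counterparts of the three combinators (untyped).
FA = lambda f: lambda x: f(x)
NN = lambda x: x
PM = lambda f: lambda g: lambda x: f(x) and g(x)

# Toy <e,t> denotations, assumed for illustration only.
gray = lambda x: x in {"Kaline"}
cat = lambda x: x in {"Kaline", "Julius"}

gray_cat = PM(gray)(cat)
print(FA(gray_cat)("Kaline"))  # True
print(FA(gray_cat)("Julius"))  # False
print(NN(cat) is cat)          # True
```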

A more interesting example to play with¶

Switch to notebook for this:

In [ ]:

%%lamb
||cat|| = L x_e: Cat(x)
||gray|| = L x_e: Gray(x)
||kaline|| = Kaline_e
||julius|| = Julius_e
||inP|| = L x_e : L y_e : In(y, x) # `in` is a reserved word in python
||texas|| = Texas_e
||isV|| = L p_<e,t> : p # `is` is a reserved word in python
||fond|| = L x_e : L y_e : Fond(y, x)

In [60]:

of = lang.Item("of", content=None)
a = lang.Item("a", content=None)
binder = lang.Binder(5)
t5 = lang.Trace(5)

display(of, a, binder, t5)

$[\![\mathbf{\text{of}}]\!]^{}\text{ [vacuous]}$

$[\![\mathbf{\text{a}}]\!]^{}\text{ [vacuous]}$

$[\![\mathbf{\text{5}}]\!]^{}\text{ [vacuous]}$

$[\![\mathbf{\text{t}}_{5}]\!]^{}_{e} \:=\: $${var5}_{e}$

In [61]:

kaline * (isV * (a * (gray * cat)))

Out[61]:

1 composition path. Result:
[0]: $[\![\mathbf{\text{[[isV [a [gray cat]]] kaline]}}]\!]^{}_{t} \:=\: $$({Gray}({Kaline}_{e}) \wedge{} {Cat}({Kaline}_{e}))$

In [62]:

kaline * (isV * ((a * (gray * cat)) * (inP * texas)))

Out[62]:

1 composition path. Result:
[0]: $[\![\mathbf{\text{[[isV [[a [gray cat]] [inP texas]]] kaline]}}]\!]^{}_{t} \:=\: $$(({Gray}({Kaline}_{e}) \wedge{} {Cat}({Kaline}_{e})) \wedge{} {In}({Kaline}_{e}, {Texas}_{e}))$

In [63]:

kaline * (isV * (a * ((gray * cat) * (inP * texas)
                       * (binder * (t5 * (fond * (of * julius)))))))

Out[63]:

1 composition path. Result:
[0]: $[\![\mathbf{\text{[[isV [a [[[gray cat] [inP texas]] [5 [[fond [of julius]] t5]]]]] kaline]}}]\!]^{}_{t} \:=\: $$((({Gray}({Kaline}_{e}) \wedge{} {Cat}({Kaline}_{e})) \wedge{} {In}({Kaline}_{e}, {Texas}_{e})) \wedge{} {Fond}({Kaline}_{e}, {Julius}_{e}))$

Another example¶

Depending on time, let's look at the Neo-Davidsonian fragment

Metalanguage objects¶

All metalanguage objects are subclasses of class TypedExpr. Subclasses add on:

Syntactic information: name of expression, part structure
Behaviors: type constraints, reduction / simplification code
Rendering information (infix/prefix, mathjax)

There are essentially three main kinds of metalanguage objects: terms, operators, and binding operators. Functions are a special case of binding operators.

A quick note on terms¶

Terms come in two kinds: variables and constants.

Only variables can be bound.
Convention (Prolog-ish): variables are lowercase, constants are uppercase.

Introspection on metalanguage objects¶

We looked yesterday at an example like this:

In [28]:

formula = %te p_t & (q_t | ~r_t)
formula

Out[28]:

$({p}_{t} \wedge{} ({q}_{t} \vee{} \neg{} {r}_{t}))$

In [29]:

display(formula.op) # typed expressions (may) have an operator
display(formula.type) # typed expressions have a type
display(list(formula)) # typed expressions have parts
display(formula.__class__) # the python type of `formula`

'&'

$t$

[p_t, (q_t | ~r_t)]

lamb.meta.BinaryAndExpr

Let's look at a part:

In [34]:

formula[0]

Out[34]:

${p}_{t}$

In [37]:

display(formula[0].op) # Terms put the variable name in the `op` field
display(formula[0].type)
display(len(formula[0])) # no parts
display(formula[0].__class__)

'p'

$t$

lamb.meta.TypedTerm

Abstract Syntax Trees (ASTs)¶

Concept from programming language design: an AST is a representation of the abstract syntactic structure of a formal language.
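For comparison, Python exposes the AST of its own syntax through the standard `ast` module. A small sketch, using a Python rendering of the formula from above (the variable names are just placeholders; nothing is evaluated):

```python
import ast

# Parse an expression (not a statement) and inspect the resulting tree.
tree = ast.parse("p and (q or not r)", mode="eval")
root = tree.body

print(type(root).__name__)     # BoolOp
print(type(root.op).__name__)  # And
print(ast.dump(root))          # the full nested structure as a string
```

As with `formula` above, the root node has an operator and a list of parts, and the parts are themselves nodes.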

In [26]:

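def collect_ast(x):
    """Convert a metalanguage expression into a nested list for svgling."""
    # label this node by its operator, then recurse into each part
    result = [x.op] + [collect_ast(sub) for sub in list(x)]
    if len(result) > 1:
        # complex node: add an empty root label so the operator is drawn as a child
        result = [""] + result
    return result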

In [44]:

display(formula)
svgling.draw_tree(collect_ast(formula))

$({p}_{t} \wedge{} ({q}_{t} \vee{} \neg{} {r}_{t}))$

Out[44]:

In [42]:

f2 = %te L x_e : L y_e : P(x) & Q(x)
display(f2)
svgling.draw_tree(collect_ast(f2))

INFO (meta): Coerced guessed type for 'P_t' into <e,t>, to match argument 'x_e'
INFO (meta): Coerced guessed type for 'Q_t' into <e,t>, to match argument 'x_e'

$\lambda{} x_{e} \: . \: \lambda{} y_{e} \: . \: ({P}({x}) \wedge{} {Q}({x}))$

Out[42]:

In [43]:

f3 = %te L x_e : Forall y_e : y <=> x
display(f3)
svgling.draw_tree(collect_ast(f3))

$\lambda{} x_{e} \: . \: \forall{} y_{e} \: . \: ({y} = {x})$

Out[43]:

Some basic logical inference, nothing sophisticated (contributions welcome!):

Behaviors¶

Complex formula-building
Reduction
Type inference / consistency
Simplification (rudimentary)

More simplification/reduction examples:

In [62]:

f4 = %te True & False
f4.simplify()

Out[62]:

${False}_{t}$

In [66]:

f5 = %te x << (Set y: Cat(y))
f5

INFO (meta): Coerced guessed type for 'Cat_t' into <e,t>, to match argument 'y_e'

Out[66]:

$({x}_{e} \in{} \{y_{e}\:|\: {Cat}({y}_{e})\})$

In [68]:

f5.reduce()

Out[68]:

${Cat}({x}_{e})$

Type inference part 1¶

The most involved part of the metalanguage implementation is type inference. Plan for discussing:

Some lambda calculus theory
Implementing simple type inference
Unification

On Thursday, we will revisit this topic, and talk about polymorphic type inference.

A bit more on the lambda calculus¶

Useful references

Carpenter 1998, Type-logical semantics
SEP entry on the lambda calculus

Desiderata for implementation:

Reduction is guaranteed to terminate (strong normalization) and to converge on a single result (confluence)
Efficient reduction strategies
Working with formulas involves decidable problems.

(I won't discuss the last of these, but just to tantalize you: in the untyped lambda calculus, it is undecidable whether two lambda expressions are equivalent. This is arguably the first undecidable problem, discovered by Church.)

An example (untyped): $\lambda x . x(x)$

Reduction 1 (converges):

$(\lambda x . x(x))(\lambda x . x)$
$(\lambda x . x)(\lambda x . x)$
$\lambda x . x$

ok...

Reduction 2 (does not converge):

$(\lambda x . x(x))(\lambda x . x(x))$
$(\lambda x . x(x))(\lambda x . x(x))$
...

The untyped lambda calculus is not strongly normalizing. No guarantee that reduction will converge at all!
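The two reductions above can be replayed with a small untyped reducer. This is a sketch written for these notes, not the lambda notebook's implementation: the tuple encoding of terms, the leftmost-outermost strategy, and the `max_steps` cutoff are all assumptions made for illustration.

```python
import itertools

# Terms as tuples: ("var", name), ("lam", name, body), ("app", fun, arg).
def var(n): return ("var", n)
def lam(n, body): return ("lam", n, body)
def app(f, a): return ("app", f, a)

def free_vars(t):
    if t[0] == "var":
        return {t[1]}
    if t[0] == "lam":
        return free_vars(t[2]) - {t[1]}
    return free_vars(t[1]) | free_vars(t[2])

def subst(t, name, value):
    """Capture-avoiding substitution t[name := value]."""
    if t[0] == "var":
        return value if t[1] == name else t
    if t[0] == "app":
        return app(subst(t[1], name, value), subst(t[2], name, value))
    bound, body = t[1], t[2]
    if bound == name:
        return t  # name is rebound here, so there is nothing to substitute
    if bound in free_vars(value):
        # rename the bound variable so it cannot capture a free variable of value
        avoid = free_vars(value) | free_vars(body) | {name}
        fresh = next(f"{bound}{i}" for i in itertools.count()
                     if f"{bound}{i}" not in avoid)
        body, bound = subst(body, bound, var(fresh)), fresh
    return lam(bound, subst(body, name, value))

def step(t):
    """One leftmost-outermost beta step, or None if t is in beta-normal form."""
    if t[0] == "app":
        if t[1][0] == "lam":
            return subst(t[1][2], t[1][1], t[2])
        for i in (1, 2):
            reduced = step(t[i])
            if reduced is not None:
                return t[:i] + (reduced,) + t[i + 1:]
    elif t[0] == "lam":
        reduced = step(t[2])
        if reduced is not None:
            return lam(t[1], reduced)
    return None

def normalize(t, max_steps=50):
    """Reduce to normal form; return None if the step budget runs out."""
    for _ in range(max_steps):
        nxt = step(t)
        if nxt is None:
            return t
        t = nxt
    return None

self_app = lam("x", app(var("x"), var("x")))  # the term from the example
identity = lam("x", var("x"))

print(normalize(app(self_app, identity)) == identity)  # True
print(normalize(app(self_app, self_app)))              # None
```

The second call never makes progress: each step rewrites the term to itself, so only the step budget stops it.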

Simple types, again¶

Let $\mathbf{BasType}$ be a non-empty set of basic types. (E.g. $\{e,t\}$)
If $\sigma \in \mathbf{BasType}$, then $\sigma \in \mathbf{Typ}$.
If $\sigma, \tau \in \mathbf{Typ}$, then $\langle \sigma, \tau \rangle \in \mathbf{Typ}$.
Nothing else is a type.
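The recursive definition translates directly into a membership test. A minimal sketch, assuming basic types are written as strings and $\langle \sigma, \tau \rangle$ as a Python 2-tuple (this encoding and the names `is_type` and `show` are choices made here, not the notebook's own type classes):

```python
BAS_TYPES = {"e", "t"}

def is_type(x):
    """True iff x is in Typ: a basic type, or a pair <sigma, tau> of types."""
    if isinstance(x, str):
        return x in BAS_TYPES
    return isinstance(x, tuple) and len(x) == 2 and all(is_type(p) for p in x)

def show(ty):
    """Render a type in angle-bracket notation."""
    return ty if isinstance(ty, str) else f"<{show(ty[0])},{show(ty[1])}>"

et = ("e", "t")
print(is_type(et), show((et, et)))  # True <<e,t>,<e,t>>
print(is_type(("e", "s")))          # False: s is not a basic type
print(is_type(("e", "t", "t")))     # False: nothing else is a type
```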

Lambda calculus: syntax (Carpenter version)¶

For any symbol $v$ where $v$ is $x,y,z$ with an arbitrary number of primes, and type $\tau$, $v_\tau \in \mathbf{Var}_\tau$.
Let $\mathbf{Var} = \bigcup_{\tau \in \mathbf{Typ}} \mathbf{Var}_\tau$
For any $\tau$, $\mathbf{Var}_\tau \subseteq \mathbf{Term}_\tau$. (Countably infinite.)
For any $\tau$, $\mathbf{Con}_\tau \subseteq \mathbf{Term}_\tau$. (Assume $\mathbf{Var}$ and $\mathbf{Con}$ are disjoint.)
If $\alpha$ is a term of type $\sigma$, and $v_\tau$ a variable, then $\lambda v_\tau . (\alpha)$ is a term of type $\langle \tau,\sigma \rangle$.
If $\alpha$ is a term of type $\langle \sigma,\tau \rangle$, and $\beta$ is a term of type $\sigma$, then $(\alpha(\beta))$ is a term of type $\tau$.
Nothing else is a term.
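Read as an algorithm, these clauses compute the type of any well-formed term. A sketch under assumed encodings: terms are tuples, types are strings or `(domain, range)` tuples, and `TypeMismatchError` is a local stand-in class defined for this example.

```python
class TypeMismatchError(Exception):
    """Local stand-in exception for this sketch."""

# Terms: ("var", name, type), ("con", name, type),
#        ("lam", (name, type), body), ("app", fun, arg).
def type_of(term):
    kind = term[0]
    if kind in ("var", "con"):
        return term[2]                    # terms carry their type
    if kind == "lam":
        (_, var_type), body = term[1], term[2]
        return (var_type, type_of(body))  # <tau, sigma>
    if kind == "app":
        fun_type, arg_type = type_of(term[1]), type_of(term[2])
        if not (isinstance(fun_type, tuple) and fun_type[0] == arg_type):
            raise TypeMismatchError(f"cannot apply {fun_type} to {arg_type}")
        return fun_type[1]
    raise ValueError(f"not a term: {term!r}")

cat = ("con", "Cat", ("e", "t"))
x = ("var", "x", "e")

print(type_of(("app", cat, x)))                       # t
print(type_of(("lam", ("x", "e"), ("app", cat, x))))  # ('e', 't')
```

Applying `x` to `cat` instead raises `TypeMismatchError`, since `e` is not a functional type.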

Lambda calculus inference as rewrite rules¶

$\alpha$-reduction: $\lambda v_\tau . \alpha \Rightarrow_\alpha \lambda v'_\tau . \alpha{}[v_\tau := v'_\tau]$

if $v'_\tau$ is not free in $\alpha$, and $v'_\tau$ is free for substitution for $v_\tau$ in $\alpha$.

$\beta$-reduction: $(\lambda v_\tau . \alpha)(\beta) \Rightarrow_\beta \alpha{}[v_\tau := \beta]$

if $\beta$ is free for substitution for $v_\tau$ in $\alpha$ and $\beta \in \mathbf{Term}_\tau$.
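A worked example of why the side conditions matter, assuming a constant $R$ of type $\langle e,\langle e,t \rangle\rangle$ (chosen here for illustration):

$(\lambda x_e . \lambda y_e . R(x)(y))(y_e)$

Here $y_e$ is not free for substitution for $x_e$ in the body, because the inner $\lambda y_e$ would capture it; naive substitution would give the unintended $\lambda y_e . R(y)(y)$. Renaming the bound variable first makes $\beta$-reduction applicable:

$\lambda y_e . R(x)(y) \Rightarrow_\alpha \lambda z_e . R(x)(z)$

$(\lambda x_e . \lambda z_e . R(x)(z))(y_e) \Rightarrow_\beta \lambda z_e . R(y)(z)$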

Strong normalization¶

A $\beta$-redex is a subterm of the form: $(\lambda v . \alpha)(\beta)$ for $\alpha,\beta \in \mathbf{Term}$
A term is in $\beta$-normal form if it contains no $\beta$-redexes.
A term has a $\beta$-normal form if there is a (finite) rewrite path to a $\beta$-normal form.

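Strong normalization: a lambda calculus (viewed as a rewrite system) is strongly normalizing if every rewrite path from every term is finite, so that every term reaches a $\beta$-normal form no matter which redex is reduced first. (If every term merely has a $\beta$-normal form, the system is weakly normalizing.)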

Church-Rosser theorem¶

If $\alpha \Rightarrow \beta$ and $\alpha \Rightarrow \gamma$ (reading $\Rightarrow$ as reduction in zero or more steps), then there is some $\delta$ such that $\beta \Rightarrow \delta$ and $\gamma \Rightarrow \delta$.

Intuition: order of application doesn't matter!
If normalizing is possible, there is at most one normal form.
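A small worked example, with $f$ and $a$ as arbitrary constants chosen for illustration: the term below contains two redexes, and both reduction orders arrive at the same normal form.

Outer redex first:

$(\lambda x . f(x))((\lambda y . y)(a)) \Rightarrow f((\lambda y . y)(a)) \Rightarrow f(a)$

Inner redex first:

$(\lambda x . f(x))((\lambda y . y)(a)) \Rightarrow (\lambda x . f(x))(a) \Rightarrow f(a)$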

Strong normalization and simple types¶

All this to say:

We want to be working with a type system that allows for strong normalization.
Simply typed lambda calculus is a very straightforward system that is provably strongly normalizing.

Type checking for function-argument combinations¶

Back to the Carpenter definition: If $\alpha$ is a term of type $\langle \sigma,\tau \rangle$, and $\beta$ is a term of type $\sigma$, then $(\alpha(\beta))$ is a term of type $\tau$

Implementation looks pretty straightforward. Given some LFun f and TypedExpr a:

If f.type[0] == a.type, then return ApplicationExpr(f, a) (of type f.type[1])
Otherwise, raise a TypeMismatchError

Constructing functions¶

Important detail: we also need to ensure that variables are used consistently!

In [53]:

%te L x_t : P_<e,t>(x_e)

ERROR (parsing): Parsing of typed expression failed with exception:
ERROR (parsing): Binding operator expression has unparsable body, in string 'L x_t : P_<e,t>(x_e)' (Type mismatch: 'x_e'/e and type t conflict (Failed to unify types across distinct instances of term))

In [55]:

%te p_t & Q_<e,t>(p_e)

ERROR (parsing): Parsing of typed expression failed with exception:
ERROR (parsing): Type mismatch: 'p_t'/t and type e conflict (Failed to unify types across distinct instances of term)
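The consistency check behind these errors can be sketched as a single pass over all instances of each term name. This is a simplification for illustration only: it assumes one flat namespace (no scoping for bound variables), and `check_term_types` and the local `TypeMismatchError` class are hypothetical names, not the notebook's API.

```python
class TypeMismatchError(Exception):
    """Local stand-in exception for this sketch."""

def check_term_types(instances):
    """Take (name, type) pairs, one per occurrence of a term.

    Returns a name -> type mapping, or raises if a name occurs with two types.
    """
    env = {}
    for name, ty in instances:
        seen = env.setdefault(name, ty)  # first occurrence fixes the type
        if seen != ty:
            raise TypeMismatchError(f"'{name}': {seen} and {ty} conflict")
    return env

# p_t & Q_<e,t>(x_e): every name is used at one type
ok = check_term_types([("p", "t"), ("Q", ("e", "t")), ("x", "e")])
print(ok)

# p_t & Q_<e,t>(p_e): p occurs at both t and e
try:
    check_term_types([("p", "t"), ("Q", ("e", "t")), ("p", "e")])
except TypeMismatchError as err:
    print("rejected:", err)
```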

Type checking and type inference¶

Type inference: given two types $a,b$, what (if any) single type $c$ is equivalent to $a$ and $b$?

Combining $a$ and $b$ in this way is sometimes referred to as type unification.

In the simply-typed lambda calculus, type inference is the same thing as type checking. If $a$ and $b$ are equal, they can be unified as $a$ (or $b$), otherwise, they cannot be unified.

Term unification¶

Given two terms ${t_1}_\alpha$ and ${t_2}_\beta$, a unification is a valid substitution that produces a single ${t_3}_\gamma$ that is equivalent to $t_1$ and $t_2$.

Reduction as unification: reduction involves unifying a variable of some type $\alpha$ (indicated by the $\lambda$ term) with an argument of some type $\beta$, and substituting the result for the variable in the scope of the lambda term.