Ray Jackendoff
Program in Linguistics, MS 013
Brandeis University
Waltham, MA 02454 USA
tel: 617-484-5394
fax :617-484-0164
Short abstract:
The goal of this study to reintegrate the theory of generative grammar into the
cognitive sciences. Generative grammar was correct to focus on the child's
acquisition of language as its central problem, leading to the hypothesis of an
innate Universal Grammar. However,
generative grammar was mistaken to assume that the syntactic component is the
sole course of combinatoriality, and that everything else is
"interpretive." The proper approach is a parallel architecture, in
which phonology, syntax, and semantics are autonomous generative systems,
linked by interface components. The
parallel architecture leads to an integration within linguistics, and to a far
better integration with the rest of cognitive neuroscience. It fits naturally into the larger
architecture of the mind/brain and permits a properly mentalistic theory of
semantics. It leads to a view of
linguistic performance in which the rules of grammar are directly involved in
processing. Finally, it leads to a
natural account of the incremental evolution of the language capacity.
1.
Introduction
In the 1960s, when I became a graduate student in linguistics,
generative grammar was the hot new topic. Everyone from philosophers to psychologists to anthropologists to
educators to literary theorists was reading about transformational
grammar. But by the late 1970s, the
bloom was off the rose, although most linguists didn’t realize it; and by the
1990s, linguistics was arguably far on the periphery of the action in cognitive
science. To some extent, of course,
such a decline in fortune was simply a matter of fashion and the arrival of new
methodologies such as connectionism and brain imaging. However, there are deeper reasons for
linguistics’ loss of prestige, some historical and some scientific.
The basic questions I want to take up here, then, are:
What was right about generative grammar in the
1960s, such that it held out such promise?
What was wrong about it, such that it didn’t
fulfill its promise?
How can we fix it, so as to restore its value to the
other cognitive sciences?
The goal is to integrate linguistics with the other cognitive sciences,
not to eliminate the insights achieved by any of them. To understand language and the brain, we
need all the tools we can get. But
everyone will have to give a little in order for the pieces to fit together
properly.
The position developed in Foundations of Language is that the
overall program of generative grammar was correct, as was the way this program
was intended to fit in with psychology and biology. However, a basic technical mistake at the heart of the formal
implementation, concerning the overall role of syntax in the grammar, led to
the theory being unable to make the proper connections both within linguistic
theory and with neighboring fields. Foundations
of Language develops an alternative, the parallel architecture,
which offers far richer opportunities for integration of the field. In order to understand the motivation for
the parallel architecture, it is necessary to go through some history.
2. Three founding themes of generative grammar
The remarkable first chapter of Noam Chomsky’s Aspects of the Theory
of Syntax (1965) set the agenda for everything that has happened in
generative linguistics since. Three
theoretical pillars support the enterprise:
mentalism, combinatoriality, and acquisition.
Mentalism. Before Aspects, the predominant view
among linguists – if it was even discussed – was that language is something
that exists either as an abstraction, or in texts, or in some sense “in the
community” (the latter being the influential view of Saussure (1915), for
example). Chomsky urged the view that
the appropriate object of study is the linguistic system in the mind/brain of
the individual speaker. According to
this stance, a community has a common language by virtue of all speakers in the
community having essentially the same linguistic system in their minds/brains.1
The term most often used for this linguistic system is “knowledge”,
perhaps an unfortunate choice. However,
within the theoretical discourse of the time, the alternative was thinking of
language as an ability, a “knowing how” in the sense of Ryle (1949), which
carried overtones of behaviorism and stimulus-response learning, a sense from
which Chomsky with good reason wished to distance himself. It must be stressed, though, that whatever
term is used, the linguistic system in a speaker’s mind/brain is deeply
unconscious and largely unavailable to introspection, in the same way that our
processing of visual signals is deeply unconscious. Thus language is a kind of mind/brain property hard to associate
with the term “knowledge”, which commonly implies accessibility to
introspection. Foundations of
Language compromises with tradition by systematically using the term f-knowledge
(‘functional knowledge’) to describe whatever is in speakers’ heads that
enables them to speak and understand their native language(s).
There still are linguists, especially those edging off toward semiotics
and hermeneutics, who reject the mentalist stance and assert that the only
sensible way to study language is in terms of the communication between
individuals (a random example is Dufva and Lähteenmäki 1996). But on the whole, the mentalistic outlook of
generative grammar has continued to be hugely influential throughout
linguistics and cognitive neuroscience.
More controversial has been an important distinction made in Aspects
between the study of competence – a speaker’s f-knowledge of language –
and performance, the actual
processes (viewed computationally or neurally) taking place in the mind/brain
that put this f-knowledge to use in speaking and understanding sentences. I think the original impulse behind the
distinction was methodological convenience.
A competence theory permits linguists to do what they have always done,
namely study phenomena like Bulgarian case marking and Turkish vowel harmony,
without worrying too much about how the brain actually processes them. Unfortunately, in response to criticism
from many different quarters (especially in response to the collapse of the
derivational theory of complexity as detailed in e.g. Fodor, Bever, and Garrett
19xx), linguists have tended to harden the distinction into a firewall: competence theories were taken to be immune
to evidence from performance. And so
began a gulf between linguistics and
the rest of cognitive science that has persisted until the present.
Foundations
does not abandon the competence-performance distinction, but does return it to
its original status as a methodological rather than ideological
distinction. Although the innovations
in Foundations are largely in the realm of competence theory, one of
their important consequences is that there is a far closer connection to
theories of processing, as well as the possibility of a two-way dialogue
between competence and performance theories.
We return to this issue in section 9.3.
Combinatoriality: The earliest published work
in generative grammar, Chomsky’s Syntactic Structures (1957), began with
the observation that a language contains an arbitrarily large number of
sentences. Therefore, in addition to
the finite list of words, a characterization of a language must contain a set
of rules (or a grammar) that collectively describes or
“generates” the sentences of the language.
Syntactic Structures showed that the rules of natural language
cannot be characterized in terms of a finite-state Markov process, nor in terms
of a context-free phrase structure grammar.
Chomsky proposed that the appropriate form for the rules of a natural
language is a context-free phrase structure grammar supplemented by
transformational rules. Not all
subsequent traditions of generative grammar (e.g. Head-Driven Phrase Structure
Grammar (Pollard and Sag 1994) and Lexical-Functional Grammar (Bresnan 1982,
2001)) have maintained the device of transformational rules; but they all
contain machinery designed to overcome the shortcomings of context-free
grammars pointed out in 1957.2
Transferred into the mentalistic framework of 1965, the consequence of
combinatoriality is that speakers of the language must have rules of language
(or mental grammars) in their heads as part of their f-knowledge. Again there is a certain amount of
controversy arising from the term “rules”.
Rules of grammar in the sense of generative grammar are not like any of
the sorts of rules or laws in ordinary life:
rules of etiquette, rules of chess, traffic laws, or laws of
physics. They are unconscious
principles that play a role in the production and understanding of
sentences. Again, to ward off improper
analogies, Foundations uses the term f-rules for whatever the
combinatorial principles in the head may be.
Generative linguistics leaves open how directly the f-rules are involved in processing, but, as suggested
above, the unfortunate tendency among linguists has been not to care. The theory in Foundations, though,
makes it possible to regard the rules as playing a direct role in processing;
see again section 9.3.
An important reason for the spectacular reception of early generative
grammar was that it went beyond merely claiming that language needs rules: it offered rigorous formal techniques for
characterizing the rules, based on approaches to the foundations of mathematics
and computability developed earlier in the century. The technology suddenly made it possible to say lots of
interesting things about language and ask lots of interesting questions.
For the first time ever it was possible to provide detailed descriptions
of the syntax of natural languages (not only English but German, French,
Turkish, Mohawk, Hidatsa, and Japanese were studied early on). In addition, generative phonology took off
rapidly, adapting elements of Prague School phonology of the 1930s to the new
techniques. With Chomsky and Halle’s
1968 Sound Pattern of English as its flagship, generative phonology
quickly supplanted the phonological theory of the American structuralist
tradition.
Acquisition: Mentalism and combinatoriality together lead
to the crucial question: How do
children get the f-rules into their heads?
Given that the f-rules are unconscious, parents and peers cannot verbalize
them; and even if they could, children would not understand, since they don’t
know language yet. The best the
environment can do for a language learner is provide examples of the language
in a context. From there on it is up to
the language learner to construct the principles on his or her own –
unconsciously of course.
Chomsky asked the prescient question:
what does the child have to “(f-)know in advance” in order to accomplish
this feat? He phrased the problem in
terms of the “poverty of the stimulus”:
many different generalizations are consistent with the data presented to
the child, but the child somehow comes up with the “right” one, i.e. the one
that puts him or her in tune with the generalizations of the language
community. I like to put the problem a
bit more starkly: The whole community
of linguists, working together for decades with all sorts of crosslinguistic
and psycholinguistic data unavailable to children, has still been unable to
come up with a complete characterization of the grammar of a single natural
language. Yet every child does it by
the age of ten or so. Children don’t
have to make the choices we do: for
instance they don’t have to decide whether the “right” choice of grammar is in
the style of transformational grammar, the Minimalist Program, Optimality
Theory, Role and Reference Grammar, Tree-Adjoining Grammar, Cognitive Grammar,
connectionist networks, or some as yet unarticulated alternative. They already f-know it in advance.
One of the goals of linguistic theory, then, is to solve this “Paradox
of Language Acquisition” by discovering what aspects of linguistic f-knowledge
are not learned, but rather form the basis for the child’s
learning. The standard term for the
unlearned component is Universal Grammar or UG, a term that again
perhaps carries too much unwanted baggage.
In particular, UG should not be confused with universals of
language: it is rather what shapes the
acquisition of language. I prefer to
think of it as a toolkit for constructing language, out of which the child (or
better, the child’s brain) f-selects tools appropriate to the job at hand. If the language in the environment happens
to have a case system (like German), UG will help shape the child’s acquisition
of case; if it has a tone system (like Mandarin), UG will help shape the
child’s acquisition of tone. But if the
language in the environment happens to be English, which lacks case and tone,
these parts of UG will simply be silent.
What then is the source of language universals? Some of them will indeed be determined by
UG, for instance the overall “architecture” of the grammatical system: the parts of the mental grammar and their
relations (of which much more below).
Other universals, especially what are often called “statistical” or
“implicational” universals, may be the result of biases imposed by UG. For instance, UG may say that if a language
has a case system, the simplest such systems are thus-and-so; these will be
widespread systems crosslinguistically; they will be acquired earlier by
children; and systems may tend to change toward them over historical time. Other universals may be a consequence of the
functional properties of any relatively efficient communication system: for instance, the most frequently used
signals tend to be short. UG doesn’t
have to say anything about these universals at all; they will come about
through the dynamics of language use in the community (a process which of
course is not very well understood).
If UG is not learned, how does the child acquire it? The only alternative is through the
structure of the brain, which is determined through a combination of genetic
inheritance and the biological processes resulting from expression of the
genes, the latter in turn determined by some combination of inherent structure
and environmental input. Here
contemporary science is pretty much at an impasse. We know little about how genes determine brain structure and
nothing about how the details of brain structure determine anything about
language structure, even aspects of language as simple as speech sounds. Filling out this part of the picture is a
long-term challenge for cognitive neuroscience. It is premature to reject the hypothesis of Universal Grammar, as
some have (e.g. Elman et al. 1996 and Deacon 1997), arguing that we don’t know
how genes could code for language acquisition.
After all, we don’t know how genes code for birdsong or sexual behavior
or sneezing either, but we don’t deny that there is a genetic basis behind
these.
There next arises the question of how much of UG is a human cognitive
specialization for language and how much is a consequence of more general
capacities. The question has often been
oversimplified to a binary decision between language being entirely special or
entirely general, with a strong bias inside generative linguistics towards the
former and outside generative linguistics towards the latter. The truth of the matter undoubtedly lies
somewhere in between. To be sure, many
people (including myself) would find it satisfying if a substantial part of
language acquisition were a consequence of general human cognitive factors; but
the possibility of some specialization overlaying the general factors must not
be discounted. My view is that we
cannot determine what is general and what is special until we have comparable
theories of other cognitive capacities, including other learned
cognitive capacities. To claim that
language is parasitic on, say, motor control, perhaps because both have
hierarchical and temporal structure (this seems to be the essence of
Corballis’s (1991) position) – but without stating a theory of the f-knowledge
involved in motor control – is to
coarsen the fabric of linguistic theory to the point of unrecognizability. The closest approach to a comparable theory
is the music theory of Lerdahl and Jackendoff 1983, which displays some
striking parallels and some striking differences with language.
Of course, if UG – the ability to learn language – is in part a human
cognitive specialization, it must be determined by some specifically human
genes, which in turn had to have come into existence sometime since the hominid
line separated from the other great apes.
One would therefore like to be able to tell some reasonable story about
how it could be shaped by natural selection or other evolutionary
processes. We return to this issue in
section 9.4.
This approach to the acquisition of language has given rise to a
flourishing tradition of developmental research (references far too numerous to
mention) and a small but persistent tradition in learnability theory (e.g.
Wexler and Culicover 1980, Baker and McCarthy 1981). And certainly, even if the jury is not yet in on the degree to
which language acquisition is a cognitive specialization, there have been all
manner of phenomena investigated that bear on the issue, for instance:
My impression is that, while there are questions about
all of these cases, en masse they offer an overwhelming case for some degree of
genetic specialization for language learning in humans.
These three foundational issues of generative grammar
– mentalism, combinatoriality, and acquisition – have stood the test of time;
if anything they have become even more important over the years in the rest of
cognitive science. It is these three
issues that connect linguistics intimately with psychology, brain science, and
genetics. Much of the promise of
generative linguistics arose from this new and exciting potential for
scientific unification.
3. The broken
promise: Deep Structure would be the key to the mind
A fourth major point of Aspects, and the one
that seeped most deeply into the awareness of the wider public, concerned the
notion of Deep Structure. A basic claim
of the 1965 version of generative grammar was that in addition to the surface
form of sentences, i.e. the form we hear, there is another level of syntactic
structure, called Deep Structure, which expresses underlying syntactic
regularities of sentences. For
instance, a passive sentence like (1a)
has a Deep Structure in which the noun phrases are in the order of the
corresponding active (1b).
(1) a. The bear was chased by the lion.
b. The lion chased the bear.
Similarly, a question such as (2a) has a Deep
Structure closely resembling that of the corresponding declarative (2b).
(2) a. Which martini did Harry drink?
b. Harry drank that martini.
In the years preceding Aspects, the question
arose of how syntactic structure is connected to meaning. Following a hypothesis first proposed by
Katz and Postal (1964), Aspects made the striking claim that the
relevant level of syntax for determining meaning is Deep Structure.
In its weakest version, this claim was only that
regularities of meaning are most directly encoded in Deep Structure, and this
can be seen in (1) and (2). However,
the claim was sometimes taken to imply much more: that Deep Structure IS meaning, an interpretation that
Chomsky did not at first discourage.3 And this was the part of generative
linguistics that got everyone really excited.
For if the techniques of transformational grammar lead us to meaning, we
can uncover the nature of human thought.
Moreover, if Deep Structure is innate – being dictated by Universal
Grammar – then linguistic theory gives us unparalleled access to the essence of
human nature. No wonder everyone
wanted to learn linguistics.
What happened next was that a group of generative
linguists, notably George Lakoff, John Robert Ross, James McCawley, and Paul
Postal, pushed very hard on the idea that Deep Structure should directly encode
meaning. The outcome, the theory of
Generative Semantics (e.g. McCawley 1968, Postal 1970, Lakoff 1971), increased
the “abstractness” and complexity of Deep Structure, to the point that the
example Floyd broke the glass was famously posited to have eight
underlying clauses, each corresponding to some feature of the semantics. All the people who admired Aspects
for what it said about meaning loved Generative Semantics, and it swept the
country. But Chomsky himself reacted
negatively, and with the aid of his then-current students (full
disclosure: present author included),
argued vigorously against Generative Semantics. When the dust of the ensuing“Linguistics Wars” cleared around
1973 (Newmeyer 1980, Harris 1993, Huck and Goldsmith 1995), Chomsky had won –
but with a twist: he no longer claimed
that Deep Structure was the sole level that determines meaning (Chomsky
1972). Then, having won the battle, he
turned his attention, not to meaning, but to relatively technical constraints
on movement transformations (e.g. Chomsky 1973, 1977).
The reaction in the larger community was shock: for one thing, at the fact that the
linguists had behaved so badly; but more substantively, at the sense that there
had been a “bait and switch.” Chomsky
had promised Meaning with a capital M and then had withdrawn the
offer. Many researchers, both inside
and outside linguistics, turned away from generative grammar with disgust,
rejecting not only Deep Structure but mentalism, innateness, and sometimes even
combinatoriality. And when, later in
the 1970s, Chomsky started talking about meaning again, in terms of a syntactic
level of Logical Form (e.g. Chomsky 1981), it was too late: the damage had been done. From this point on, the increasingly
abstract technical apparatus of generative grammar was of no interest to more than
a tiny minority of cognitive scientists, much less the general public.
Meanwhile, various nonChomskyan traditions of
generative grammar developed, most notably Relational Grammar (Perlmutter
1983), Head-Driven Phrase Structure Grammar (Pollard and Sag 1987, 1994),
Lexical-Functional Grammar (Bresnan 1982, 2001), Formal Semantics (Partee 1976,
Heim and Kratzer 19xx), Optimality Theory (Prince and Smolensky 1993),
Construction Grammar (Fillmore et al. 1988, Goldberg 1995), and Cognitive
Grammar (Lakoff 1987, Langacker 1987, Talmy 2000). On the whole, these approaches to linguistics (with the possible
exception of Cognitive Grammar) have made even less contact with philosophy,
psychology, and neuroscience than the recent Chomskyan tradition. My impression is that many linguists have
simply returned to the traditional concerns of the field: describing languages,
with as little theoretical and cognitive baggage as possible. While this is perfectly fine – particularly
since issues of innateness don’t play too big a role when you’re trying to record
an endangered language before its speakers all die – the sense of excitement
and danger that comes from participating in the integration of fields has
become attenuated.
4. The
scientific mistake: syntactocentrism
So much for pure intellectual history. We now turn to what I think was an important
mistake at the core of generative grammar, one that in retrospect lies behind
much of the alienation of linguistic theory from the cognitive sciences. Chomsky did demonstrate that language
requires a generative system that makes possible an infinite variety of
sentences. However, he explicitly
assumed, without argument (1965: 16, 17, 75, 198), that generativity is
localized in the syntactic component of the grammar – the construction of
phrases from words – and that phonology
(the organization of speech sounds) and semantics (the organization of meaning)
are purely “interpretive”, that is, that their combinatorial properties are
derived strictly from the combinatoriality of syntax.
In 1965 this was a perfectly reasonable view. The important issue at that time was to show
that something in language is generative. Generative syntax had provided powerful new tools, which were
yielding copious and striking results.
At the time, it looked as though phonology could be treated as a sort of
low-level derivative of syntax: the syntax gets the words in the right order,
then phonology massages their pronunciation to adjust them to their local
environment. As for semantics,
virtually nothing was known: the only
things on the table were the rudimentary proposals of Katz and Fodor (1963) and
some promising work by people such as Bierwisch (1967, 1969) and Weinreich
(1966). So the state of the theory
offered no reason to question the assumption that all combinatorial complexity
arises from syntax.
Subsequent shifts in mainstream generative linguistics
stressed major differences in outlook.
But one thing that remained unchanged was the assumption that syntax is
the sole source of combinatoriality.
Figure 1 diagrams the architecture of components in three major stages
of Chomskyan syntactic theory: the Aspects theory, Principles and
Parameters (or Government-Binding) Theory (Chomsky 1981), and the Minimalist
Program (Chomsky 1995). The arrows
denote direction of derivation.

Figure 1. Architecture of Chomsky's theories over the years.
These shifts alter the components of syntax and their
relation to sound and meaning. What
remains constant throughout, though, is that (a) there is an initial stage of
derivation in which words or morphemes are combined into syntactic structures;
(b) these structures are then massaged by various syntactic operations; and (c)
certain syntactic structures are shipped off to phonology/phonetics to be
pronounced and other syntactic structures are shipped off to “semantic
interpretation” to be understood. In
short, syntax is the source of all linguistic organization.
I believe that this assumption of “syntactocentrism” –
which, I repeat, was never explicitly grounded – was an important mistake at the heart of the field.4 The correct approach is to regard linguistic
structure to be the product of a number of parallel but interacting generative
capacities – at the very least, one each for phonology, syntax, and
semantics. As we will see, elements of
such a “parallel architecture” have been implicit in practice in the field for
years. What is novel in the present
work is bringing these practices out into the open, stating them as a
foundational principle of linguistic organization, and exploring the
large-scale consequences.
5. Phonology
as an exemplar of the parallel architecture
An unnoticed crack in the assumption of
syntactocentrism appeared in the middle to late 1970s, when the theory of
phonology underwent a major seachange.
Before then, the sound system of language had been regarded essentially
as a sequence of speech sounds. Any
further structure, such as the division into words, was thought of as simply
inherited from syntax. However, beginning with work such as Goldsmith (1979)
and Liberman and Prince (1977), phonology rapidly came to be thought of as
having its own autonomous structure, in fact multiple structures or tiers. Figure 2 provides a sample, the structure of
the phrase the big apple. The
phonological segments appear at the bottom, as terminal elements of the
syllabic tree.

Figure 2. Phonological structure of the big apple.
There are several innovations here. First, syllabic structure is seen as
hierarchically organized. At the center
of the syllable (notated as σ) is a syllabic nucleus (notated N),
which is usually a vowel but sometimes a syllabic consonant such as the l
in apple. The material following
the nucleus is the syllabic coda (notated C); this groups with the
nucleus to form the rhyme (notated R), the part involved in
rhyming. In turn, the rhyme groups with
the syllabic onset (notated O) to form the entire syllable. Syllables are grouped together into larger
units such as feet and phonological words (here, the bracketing subscripted Wd). Notice that in Figure 1, the word the
does not constitute a phonological word on its own; it is attached (or
cliticized) onto the word big.
Finally, phonological words group into larger units such as phonological
phrases. Languages differ in their
repertoire of admissible nuclei, onsets, and codas, but the basic hierarchical
organization and the principles by which strings of segments are divided into syllables
are universal. (It should also be
mentioned that signed languages have parallel syllabic organization, except
that the syllables are built out of manual rather than vocal constituents
(Klima and Bellugi 1979, Fischer and Siple 1990).)
These hierarchical structures are not built out of
syntactic primitives such as nouns, verbs, and determiners; their units are
intrinsically phonological. In
addition, the structures, though hierarchical, are not recursive.5 Thus the principles governing these structures
are not derivable from syntactic structures; they are an autonomous system of
generative rules.
Next consider the metrical grid in Figure 2. Its units are beats, notated as
columns of xs. A column with
only one x is a weak beat, and more xs in a column indicate a
relatively stronger beat. Each beat is
associated with a syllable; the strength of a beat indicates the relative
stress on that syllable, so that for example in Figure 2 the first syllable of apple
receives maximum stress. The basic principles
of metrical grids are in part autonomous of language: they also appear, for instance, in music (Lerdahl and Jackendoff
1983), where they are associated with notes instead of syllables. Metrical grids place a high priority on
rhythmicity: an optimum grid presents an alternation of strong and weak beats,
as is found in music and in much poetry.
On the other hand, the structure of syllables exerts an influence on the
associated metrical grid: syllables
with heavy rhymes (i.e. containing a coda or a long vowel) “want” to be
associated with relatively heavy stress.
The stress rules of a language concern the way syllabic structure comes
to be associated with a metrical grid; languages differ in ways that are now
quite well understood (e.g. Kager 1995, Halle and Idsardi 1995).
Again, metrical grids are built of nonsyntactic
units. As they are to some degree
independent of syllabic structure, they turn out to be a further autonomous
“tier” of phonological structure.
At a larger scale of phonological organization we find
prosodic units over which intonation contours are defined. These are comparable in size to syntactic
phrases but do not coincide with them.
Here are two examples.
(3) Syntactic
bracketing:
[Sesame
Street] [is [a production [of [the Children’s Television Workshop]]]]
Prosodic
bracketing (two pronunciations):
a. [Sesame Street is a production of] [the
Children’s Television Workshop]
b. [Sesame Street] [is a production] [of the
Children’s Television Workshop]
(4) Syntactic
bracketing
[This]
[is [the cat [that chased [the rat [that ate [the cheese]]]]]]
Prosodic
bracketing:
[This is
the cat] [that chased the rat] [that ate the cheese]
The two pronunciations of (3) are both acceptable, and
other prosodic bracketings are also possible.
However, the choice of prosodic bracketing is not entirely free, since
for instance [Sesame] [Street is a production of the] [Children’s Television
Workshop] is an impossible phrasing.
Now notice that the first constituent of (3a) and the second constituent
of (3b) do not correspond to any syntactic constituent. We would be hard pressed to know what
syntactic label to give to Sesame Street is a production of. But as an intonational constituent it
is perfectly fine. Similarly in (4),
the syntax is relentlessly right-embedded, but the prosody is flat and
perfectly balanced into three parts.
Again, the first two constituents of the prosody do not correspond to
syntactic constituents of the sentence.
The proper way to deal with this lack of correspondence
is to posit a phonological category of Intonational Phrase, which plays a role
in the assignment of intonation contours and the distribution of stress
(Beckman and Pierrehumbert 1986, Ladd 1996).
Intonation Phrases are to some degree correlated with syntax; their
boundaries tend to be at the beginning of major syntactic constituents; but
their ends do not necessarily correlate with the ends of the
corresponding syntactic constituents.
At the same time, intonational phrases have their own autonomous
constraints, in particular a strong preference for rhythmicity and parallelism
(as evinced in (2) for example), and a preference for saving the longest
prosodic constituent for the end of the sentence.6
Another example of mismatch between syntax and
phonology comes from contractions such as I’m and Lisa’s (as in Lisa’s
a doctor). These are clearly
phonological words, but what is their syntactic category? It is implausible to see them either as noun
phrases that incidentally contain a verb or to see them as verbs that
incidentally contain a noun. Keeping
phonological and syntactic structure separate allows us to say the natural
thing: they are phonological words that
correspond to two separate syntactic constituents.
(5) Syntactic
structure: [NP I] [V
(a)m] [NP Lisa] [V
(i)s]
Phonological
structure: [Wd
I’m] [Wd Lisa’s]
Since every different sentence of the language has a
different phonological structure, and since phonological structures cannot be
derived from syntax, the usual arguments for combinatoriality lead us to the
conclusion that phonological structure is generative. However, in addition to the generative principles that describe
these structures, it is necessary to introduce a new kind of principle into the
grammar, what might be called “correspondence rules” or “interface rules.” These rules (I revert to the standard term
“rules” rather than being obsessive about “f-rules”) regulate the way the
independent structures correspond with each other. For instance, the relation between syllable weight and metrical
weight is regulated by an interface rule between syllabic and metrical
structure; the relation between syntactic and intonational constituents is
regulated by an interface rule between syntactic and prosodic structure.
An important property of interface rules is they don’t
“see” every aspect of the structures they are connecting. For instance, the rules that connect
syllabic content to metrical grids are totally insensitive to syllable onset:
universally, stress rules care only about what happens in the rhyme. Similarly, although the connection between
syntax and phonology “sees” certain syntactic boundaries, it is insensitive to
the depth of syntactic embedding, Moreover, syntactic structure is totally
insensitive to the segmental content of the words it is arranging (e.g. there
is no syntactic rule that applies only to words that begin with b). Thus interface rules implement not
isomorphisms between the structures they relate, but rather only partial
homomorphisms.
This is not to say that we should think of speakers as
thinking up phonological and syntactic structures independently in the hope
they can be matched up by the interfaces.
That would be the same sort of mistake as thinking that speakers start
with the symbol S and generate a syntactic tree, finally putting in
words so they know what the sentence is about.
At the moment we are not thinking in terms of production; rather we are
stating the principles (of “competence”) in terms of which sentences are
well-formed. We will get back to how
this is related to processing in section 9.3.
Now the main point of this section. This view of phonological structure,
developed in the late 1970s and almost immediately adopted as standard, is
deeply subversive of the syntactocentric assumption that all linguistic
combinatoriality originates in syntax.
According to this view, phonological structure is not just a passive
hand-me-down derived from low-level syntax: it has its own role in shaping the
totality of linguistic structure. But
at the time of these changes, no great commotion was made about this most
radical aspect of the new phonology.
Phonologists for the most part were happy to get on with exploring this
exciting way of doing things, and for them, the consequences for syntax didn’t
matter. Syntacticians, for their part,
simply found phonology irrelevant to
their concerns of constraining movement rules and the like, especially since
phonology had now developed its own arcane technical machinery. So neither subdiscipline really took notice;
and as the technologies diverged, the relation between syntax and phonology
became a no-man’s-land (or perhaps only a very-few-man’s-land). Tellingly, as far as I can determine, in all
of Chomsky’s frequent writings on the character of the human language capacity,
there is no reference at all to post-1975 phonology – much less to the
challenge that it presents to his overall syntactocentric view of language.
6. The
syntax-semantics interface
I have treated the developments in phonology first
because it is less controversial. But
in fact the same thing happened in semantics.
Over the course of the 1970s and 1980s, several radically different
approaches to semantics developed: within
linguistics, at least Formal Semantics (growing out of formal logic)(Partee
1976, Heim and Kratzer 19xx), Cognitive Grammar (Lakoff 1987, Langacker 1987,
Talmy 2000), and Conceptual Semantics (Jackendoff 1983, 1990, Pinker 1989,
Pustejovsky 1995), plus approaches within computational linguistics and
cognitive psychology. Whatever their
differences, all these approaches take meaning to be deeply combinatorial. None of them take the units of semantic
structure to be syntactic units such as NPs and VPs; rather, the units are
intrinsically semantic entities like objects, events, actions, properties, and
quantifiers.7 Therefore,
whichever semantic theory we choose, it is necessary to grant semantics an
independent generative organization, and it is necessary to include in the
theory of grammar an interface component that correlates semantic structures
with syntactic and phonological structures.
In other words, the relation of syntax to semantics is qualitatively parallel
to the relation of syntax to phonology.
However, apparently no one pointed out the challenge to syntactocentrism
– except the Cognitive Grammarians, who mostly went to the other extreme and
denied syntax any independent role, and who have been steadfastly
ignored by mainstream generative linguistics.
The organization of phonological structure into
semi-independent tiers finds a parallel in semantics as well. Linguistic meaning can be readily partialled
into two independent aspects. On one
hand there is what might be called “propositional structure”: who did what to whom and so on. For instance, in The bear chased the lion,
there is an event of chasing in which the bear is the chaser and the lion is
“chasee”. On the other hand, there is
also what is now called “information structure”: the partitioning of the message into old vs. new information,
topic vs. comment, presupposition vs. focus, and so forth. We can leave the propositional structure of
a sentence intact but change its information structure, by using stress (6a-c)
or various focusing constructions (6d-f):
(6) a. The BEAR chased the lion.
b. The bear chased the LION.
c. The bear CHASED the lion.
d. It was the bear that chased the lion.
e. What the bear did was chase the lion.
f. What happened to the lion was the bear chased
it.
Thus the propositional structure and the information
structure are orthogonal dimensions of meaning, and can profitably be regarded
as autonomous tiers. (Foundations
proposes a further split of propositional structure into descriptive and
referential tiers, an issue too complex for the present context.)
Like the interface between syntax and phonology, that
between syntax and semantics is not an isomorphism. Some aspects of syntax make no difference in semantics. For instance, the semantic structure of a
language is the same whether or not the syntax marks subject-verb agreement,
verb-object agreement, or nominative and accusative case. The semantic structure of a language does
not care whether the syntax calls for the verb to be after the subject (as in
English), at
the end of the clause (as in Japanese), or second in a main clause and final in
a subordinate clause (as in German). As
these aspects of syntax are not correlated with or derivable from semantics,
the interface component disregards them.
Similarly, some aspects of semantics have little if
any systematic effect in syntax. Here
are a few well-known examples.
(7) a. Where is my hat?
b. (Now,
Billy:) What’s the capital of New York?
c. Would
you please open the window?
d. Is the
Pope Catholic?
(8) a. Jill
jumped until the alarm went off.
b. Jill slept until the alarm went off.
c. Jill jumped when the alarm went off.
The standard account of this contrast (Talmy 2000,
Verkuyl 1993, Pustejovsky 1995, Jackendoff 1997) is that the meaning of until
is to set a temporal bound on an ongoing process. When the verb phrase already denotes an ongoing process, such as
sleeping, all is well. But when the
verb phrase denotes an action that has a natural temporal ending, such as
jumping, then its interpretation is “coerced” into repeated action – a
sort of ongoing process – which in turn can have a temporal bound set on it by until. For present purposes, the point is that the
sense of repetition arises from semantic combination, without any direct
syntactic reflex. (On the other hand,
there are languages such as American Sign Language that have a grammatical
marker of iteration; this will have to be used in the translation of (8a).)
(9) a. [One
waitress says to another]:
The ham sandwich wants another cup of
coffee.
[Interpretation: ‘the person who
ordered/is eating the ham sandwich...’]
b. Chomsky is on the top shelf next to
Plato.
[Interpretation: ‘the book by
Chomsky ...’]
Such cases of “reference transfer” contain no syntactic
reflex of the italicized parts of the interpretation. One might be tempted to dismiss these phenomena as “mere
pragmatics”, hence outside the grammatical system. But this proves impossible, because reference transfer can have
indirect grammatical effects. A clear
example involves imagining that Richard Nixon went to see the opera Nixon in
China (yes, a real opera!), and what happened was that:
(10) Nixon was astonished to see himself sing
a foolish duet with Pat.
The singer of the duet, of course, is the actor
playing Nixon; thus the interpretation of himself involves a
reference transfer. However, we cannot
felicitously say that what happened next was that:
(11) *(Up on stage,) Nixon was astonished to
see himself get up and walk out.
That is, a reflexive pronoun referring to the acted
character can have the real person as antecedent, but not vice versa
(Fauconnier 1985, Jackendoff 1992).
Since the use of reflexive pronouns is central to grammar, reference
transfer cannot be seen as “extragrammatical.”
(12) Everyone in this room knows at least two
languages.
a. ‘John knows English and French; Sue
knows Hebrew and Hausa; ....”
b. ‘... namely, Mandarin and Navajo.’
Should there be two different syntactic structures
associated with these two interpretations?
Chomsky 1957 said no; Chomsky 1981 said yes; Generative Semantics said
yes; I am inclined to say no (Jackendoff 1996, Foundations chapter
12). The problem with finding two
different syntactic structures is that it requires systematic and drastic
distortions of the syntactic tree that never show up in the surface syntax of
any language. The problem with having
only one syntactic structure is that it makes the syntax-semantics interface
more complex. The point to be made here
is that the scope of quantification may well be a further example of the
“dirtiness” of the interface between syntax and semantics; this continues to be
an important issue in linguistic theory.
In each of these cases, a syntactocentric theory is
forced to derive the semantic distinctions from syntactic distinctions. Hence it is forced into artificial solutions
such as empty syntactic structure and elaborate movement, which have no
independent motivation beyond providing grist for the semantics. On the other hand, if the semantics is
treated as independent from syntax but correlated with it, it is possible to
permit a less than perfect correlation; it is then an empirical issue to
determine how close the match is.
If we abandon syntactocentrism, it is logically
possible that there are aspects of semantics that have no impact on syntax but do
have an effect on phonology through a direct phonology-semantics
interface. Such a treatment is
attractive for the correlation between prosody and information structure. For instance, the differences among (6a-c)
do not show up in syntax at all – only in the stress and intonation in
phonology, and in the focus-presupposition relations in semantics. In a syntactocentric theory, one is forced
to generate these sentences with a dummy syntactic element [+Focus], which
serves only to correlate phonology and meaning and does not affect word order
or inflection. (Such was the approach
in Jackendoff (1972), for instance.)
But this element does no work in syntax per se; it is only present in
order to account for the correlation between phonology and semantics. By introducing a direct
phonology-to-semantics interface sensitive to this correlation, we can account
for it with minimal extra machinery; but of course this requires us to
abandon syntactocentrism.
7. The
outcome: Parallel architecture
The argument so far has been that theoretical thinking
in both phonology and semantics has proceeded in practice as though their
structures are due to independent generative capacities. What has attracted far less notice among
syntacticians, phonologists, and semanticists alike is that such an
organization logically requires the grammar to contain interface components
that correlate the independent structures.
Carrying this observation through the entire architecture of grammar, we
arrive at an overall picture like Figure 3: the grammar contains multiple sets
of formation rules (the “generative” components), each determining its own
characteristic type of structure, and the structures are linked or correlated
by interface components.

Figure 3. The parallel archetecture.
In the syntactocentric architecture, a sentence is
well-formed when its initial syntactic tree is well-formed and all the steps of
derivation from this to phonology and semantics are well-formed. In the parallel architecture, a sentence is
well-formed when all three of its structures – phonological, syntactic, and
semantic – are independently well-formed and a well-formed correspondence among
them has been established by the interfaces.
One of the primary interface rules between phonology
and syntax is that the linear order of units in phonology corresponds to the
linear order of the corresponding units in syntax. One of the primary interface rules between syntax and semantics
is that a syntactic head (such as a verb, noun, adjective, or preposition)
corresponds to a semantic function, that the syntactic arguments of the head
(subject, object, etc.) correspond to the arguments of the semantic
function. The consequence of these two
primary interface principles is that for the most part, syntax has the linear
order of phonology but the embedding structure of semantics.
An illustration of some of these properties of the
parallel architecture appears in Figure 4, the structure of the phrase the
cats. The three independent
structures are displayed side by side;8 the subscripting
indicates the connections established by the interfaces between the parts of
the three structures. For example, the
clitic pronounced xc is
coindexed with the determiner in the syntax and the with definiteness feature
in semantics. Notice that the lowest
nodes in the syntactic tree are syntactic features, not the customary notation the
cat-s. The reasons for this are
explained in the next section.

Figure 4. The structure of the cats in the parallel architecture.
The overall architecture laid out in Figure 3 provides
a model within which many different theories of grammar can be embedded and
compared. For instance, Figure 3 does
not dictate whether the syntactic formation rules are along the lines of
transformational grammar, the Minimalist Program, Head-Driven Phrase Structure
Grammar, or many other alternatives.
Moreover, the syntactocentric framework is a version
of Figure 3 in which the phonological and semantic formation rules are null, so
that everything in phonological and semantic structures is determined only by
their interfaces with syntax. The
framework favored by many in Cognitive Linguistics minimizes or even eliminates
the syntactic formation rules, so that syntax is determined entirely by
meaning.
The organization into parallel generative components
is not new here. In addition to the
innovations in phonology discussed in section 3, Lexical-Functional Grammar
divides syntax into two tiers, c-structure and f-structure; Autolexical Syntax
(Sadock 1991) has a different division into morphosyntax and phrasal syntax;
Role and Reference Grammar (Van Valin and LaPolla 1997) has, in addition to a
morphosyntax/phrasal syntax division, the propositional/ information tier
division in semantics, with interfaces going every which way among the
tiers. In other words, various elements
of this architecture are widely present in the literature. What is novel here is recognizing that this
organization runs through the entire grammar, from phonology through semantics
(and further, as we will see in section 9).9
It might well be argued that the standard
syntactocentric framework has served the field well for forty years. Why should anyone want to give it up? A reply might come in five parts. First, no one has ever argued for the
syntactocentric model. In Aspects,
it was explicitly only an assumption, which quickly hardened into dogma and
then became part of the unstated background.
By contrast, the parallel architecture now has been argued for,
in part based on well-established results in phonology and semantics, which
have never played a role in syntactocentric argumentation.
Second, an advocate might argue that the
syntactocentric model is a priori simpler:
why should we admit so many different components into the grammar? The reply would be that the choice among
theories must be determined by empirical adequacy as well as a priori
simplicity. If the parallel
architecture allows a more perspicuous account of, say, intonation contours or
the relation of focus to stress, these are arguments in its favor.
A third point concerns the relation of syntax and
semantics. Since syntax is now not
responsible for determining every semantic distinction, it is to some degree
liberated from semantics and can therefore be considerably simplified. However, some compensating complexity must
be introduced into the syntax-semantics interface, so it doesn’t disappear from
the grammar entirely. It now becomes an
empirical question how to parcel the complexity out, and this question can be
addressed; it is not just an issue of opinion or preference (see e.g.
Jackendoff 1992, Culicover 1999, Culicover and Jackendoff 1995, 1997,
1999). At the same time, syntax does
not go away entirely (as opponents of Chomsky would often like). The syntax of a language still has to say
where the verb goes, whether the verb agrees with the subject, how to form
relative clauses and questions, and so on.
The differences among languages in these respects are not predictable
from semantics, and children have to learn them.
A fourth point concerns the nature of Universal
Grammar. In the parallel architecture,
the issues of acquisition and innateness don’t go away, they are exactly the
same, namely, How does the child acquire the grammar of its native language on
the basis of environmental evidence?
However, as just suggested, the sorts of questions that most often arise
concern the balance of power among components.
We don’t find ourselves invariably asking: what do we have to add to syntax to account for such-and-such a
phenomenon? Rather, we find ourselves
asking: in which component does this
phenomenon belong? Is it a fact of
syntax, of semantics, or of the interfaces?
And to what extent is it realistic to attribute such a bias to the child
learning the language?
A final point concerns not linguistic structure itself
but its connection to the rest of the theory of the brain/mind. On the face of it (at least in my opinion),
one should favor approaches that permit theoretical integration. Section 9 will show four ways that the
parallel architecture invites such integration but the syntactocentric theory
does not.
8. The lexicon
and the words vs. rules controversy
Every theory of language has to take a word to be a
stored complex of phonological, syntactic, and semantic features or structure;
commonly the store of words is called the lexicon. .
However, theories differ in the role of the lexicon in the construction
of sentences. In all of the
syntactocentric architectures shown in Figure 1, words are inserted into
syntactic trees at the beginning of a syntactic derivation, at the point when
syntactic trees are being built and before they begin to be manipulated and fed
to phonology and semantics. Thus the
traditional notation for trees in Figure 5a is actually intended as an
abbreviation of Figure 5b, in which the lexical items are spelled out in
full. The consequence is that the
syntax is carrying around with it all the phonological and semantic features of
words, which are totally invisible to syntactic rules and are of use to the
grammar only when handed over and “interpreted” by the proper component.

Figure 5a. Traditional notion for the cat. b. What the traditional notation abbreviates.
The parallel architecture, by contrast, insists that each
kind of feature belongs only in its own structure. Thus the traditional syntactic notation in Figure 5 is formally
incoherent, because it has phonological and semantic features in a syntactic
structure. Thus it is formally
impossible to insert full lexical items into syntactic structure. How then do words get into linguistic
structures? The answer is that each of
the three structures making up a word is inserted into its proper structure,
and each of them carries with it an index that connects it to the others. So, for example, the word cat is
notated as Figure 6; its contribution to the larger structure in Figure 4
should be evident.

Figure 6. The structure of the word cat.
Thus a word is best regarded as a type of interface
rule that establishes partial correspondences among pieces of phonological,
syntactic, and semantic structure (each piece in turn conforming to the
formation rules of its own component).
In other words, the language does not consist of a lexicon plus
rules of grammar; rather, lexical items are among the rules of grammar –
very particular rules to be sure, but rules nonetheless.
This treatment of the lexicon offers an attractive
account for a number of previously troublesome phenomena. For instance, consider an idiom such as kick
the bucket. This can be treated as
a lexically listed VP that is coindexed with phonology in the normal way, but
which lacks indexes connecting the individual words to semantics: instead, the
VP as a whole is coindexed with the semantic structure DIE. As a consequence, the individual words kick,
the, and bucket do not contribute individually to meaning. This is precisely what an idiom is supposed
to be: a stored unit in which the words
do not have their normal meaning.
A sort of converse is found in irregular
morphology. Consider something like the
irregular plural feet. It has to
be listed syntactically as a plural noun, and the two syntactic parts are
coindexed in the normal way to semantics:
the word denotes multiple entities of the type FOOT. However, the syntactic parts are not
connected in normal fashion to phonology; rather the whole syntactic complex is
coindexed with the undifferentiated lump feet in phonology.
Notice by contrast how the regular plural is
coded in Figure 4. The regular plural consists
of a piece of meaning, namely plurality, plus a piece of syntax, namely an
affix attached to nouns, plus a piece of phonology, namely a suffix s or
z or cz, the
choice determined contextually. That
is, the regular plural has all the same parts as a word, and it determines a
connection between them. We can notate
this as a lexical item along the lines of Figure 7. (The italicized bits denote contextual features that determine
how this item is combined with its environment.) The contribution of this item to the overall structure in Figure
4 is entirely parallel to the contribution of the word cat.

Figure 7. The English regular plural as a lexical item.
This view of regular
morphology puts a new and unexpected spin on the by now hoary “words vs. rules”
controversy (e.g. Rumelhart and McClelland 1986, Elman et al. 1996, Pinker
1999). Traditionally, the issue is
taken to be:
On the present view,
words are rules – interface rules that help connect phonological,
syntactic, and semantic structures. And
Figure 7, the “rule” for the regular English plural affix, is qualitatively no
different. Its contextual features are
qualitatively not unlike those of, say, transitive verbs. It combines with nouns the same way a
transitive verb combines with its object.
Thus the formation of regular plurals is an instance of ordinary
combinatoriality. In this approach, the
issue comes to be restated like this:
I submit that even to the most committed of
connectionists, this latter way of framing the question can hardly be
objectionable.
In fact, the advocates of rules, such as Pinker, have
not made the case nearly as strong as it can be. The connectionist argument has been to the effect: We can make a device that learns all English
past tenses without making use of a rule, and we can find evidence from
acquisition and processing that supports this account. The best version of the anti-connectionist
argument has been: Connectionist
modeling offers important innovations over standard models of language in
dealing with case-by-case learning and analogy for the irregular past
tenses. But -- you still need rule
learning to account for children’s acquisition of regular past tense, and we
can find evidence from acquisition and processing that supports this
account. The problem is that the debate
has often been framed as though only the past tense were at issue, while the
subtext behind the connectionist position is that if this can be learned
without rules, then it is a good bet that the rest of language can be too.
But not only the past tense is at stake. To deal with the whole of language, it is
necessary to account for the creative formation of things like verb phrases and
relative clauses – linguistic entities that cannot be listed in the
lexicon. On the present view, the way
that the regular past tense affix combines with verbs is exactly like the way
nouns combine with relative clauses and the way noun phrases combine with verbs
and the way subordinate clauses combine with main clauses – it is just another
case of free combinatoriality. In the
decade and a half since the original connectionist past tense model, there has
been no demonstration that the model scales up to acquisition of the full free
combinatoriality of language – the issue that grounds generative
linguistics.
At the same time, notice that within the parallel
architecture, the terms of the dispute become far less contentious. The regular past tense is no longer a
qualitatively different phenomenon from words:
words are a type of rule, and the posited regular past tense morpheme,
Figure 7, is in the relevant respects just like a word. (It differs only in that it is grammatically
smaller and it requires a word as its grammatical host.) So the issue is only whether there is such a
separate lexical item, not whether there are two wholly different kinds of
linguistic animal, namely words and rules. Thus in the end the fate of the past
tense doesn’t seem like such a big deal.
9. Four ways the
parallel architecture helps integrate linguistics with cognitive neuroscience
The parallel architecture may be an intriguing
technical alternative to the Chomskyan orthodoxy in linguistics, but is there
any reason why it should be of interest to anyone other than linguists?
The end of the previous section may have begun to offer some hints. This section will sketch out a little more fully
some ways in which the parallel architecture offers opportunities to unify
linguistics with the other cognitive sciences.
9.1. The place
of the parallel framework in the larger architecture of the mind/brain.
To sum up the larger picture: The parallel
architecture claims that language is organized into a number of
semi-independent combinatorial systems, each of which has its own organizing
principles. These systems are linked by
systems of interface principles.
Interface principles establish a correlation between pieces of structure
in two (or more) of the combinatorial systems.
Some interface principles deal with large-scale and general
correspondences such as the parallel between linear order in syntax and in
phonology. On the other hand, some of
them are extremely specialized, for instance individual words, idioms, and
regular affixes. The interface principles
as a whole do not implement an isomorphism between the structures they
connect. Rather they implement a
partial homomorphism, a “dirty” correspondence in which not all parts of the
structures in question are correlated, and in which many-to-many mappings are
altogether common.
This conception of the interfaces within language is
perfectly in tune with the way linguistic structures connect to the rest of the
mind. Consider how phonology interacts
with the auditory system in speech perception and with the motor system in
speech production. As is well known (to
the dismay of fifty years of computer scientists working on automated speech
recognition), the mapping between a frequency analysis of the speech signal and
the phonological structure of an utterance is frighteningly complex. In particular, some aspects of the speech
signal play no role in phonological structure and must be factored out, for
instance the individual timbre of the speaker’s voice, the speaker’s tone of
voice, and the speed of production, not to mention ambient noise. These aspects of the speech signal are put
to use for other cognitive purposes, but not for speech. Moreover, having factored all these things
out from the acoustic signal, still not every part of the phonological
structure is predictable from what is left:
most prominently, word boundaries are not present as pauses in the
signal. Thus the
auditory-to-phonological mapping has the same general characteristics as the
interfaces inside language: it
establishes a “dirty” correspondence between certain aspects of two disparate
mental structures.
Speech production has similar properties. Not every aspect of phonological structure
corresponds to an aspect of the motor control involved in operating the vocal
tract. In particular, word boundaries
do not correspond at all consistently to pauses in production. And not every aspect of motor control is
controlled by phonological structure.
For instance, one can talk intelligibly with a pipe in one’s mouth,
which hugely distorts the motor commands involved in speech without changing
the phonological structure a bit. And
of course the same muscles in the vocal tract are used for chewing, swallowing,
and so on. Without going into more
detail, it should be clear that again the same sort of interface is in play
here.
Next consider the visual system. Beyond the very early levels of vision,
there is little detailed theory of the f-knowledge involved in vision – the
necessary levels of representation and so on (I take Marr (1983) to have been
attempting to lay out such a theory, but the enterprise has been largely
abandoned since his death). On the
other hand, the neuroscience of vision reveals a qualitatively similar picture: numerous independent brain areas, each
specializing in a particular aspect of vision such as shape, motion, color, and
spatial relations, each interacting with certain others by dedicated pathways,
and no area where “it all comes together” to form a full representation of the
visual field. This has precisely the
flavor of the parallel architecture in linguistics, where the notion of a
“sentence” or “phrase” is distributed among several structures, communicating
with each other via dedicated interfaces, as shown in Figure 4.10
A syntactocentric architecture, by comparison, shows
no resemblance to the rest of the mind/brain.
A master “computational system” that generates syntactic structures,
which in turn determine phonological structures and meanings, simply has no
known parallel in the brain. Even the
connection of language to speech is markedly different from the connections
among the components inside of language.
9.2. The role
of semantics
Another important advantage of the parallel
architecture is the connection of semantics to the rest of the mind/brain. Foundations of Language (chapters 9
and 10) advocates that if generative grammar is to truly adopt the mentalist
stance, this stance must be applied to meaning as well. According to this stance, the basic function
of language is to convert thoughts into communicable form; the virtue of human
language over other natural communication systems is that it is so broad in the
range of messages it can convey. Each
of the indefinitely many sentences of a language conveys a different
thought. Since not all these thoughts
can be stored in a single head, it is necessary that thoughts be constructed
combinatorially. Therefore a goal for
semantic theory is to uncover the combinatorial system underlying human
concepts. Such a goal converges with important
trends in psychology and philosophy.
However, another influential strain in semantics (and
the predominant one in Anglo-American philosophy, dating back to Frege (1892)
and shared by people as different as David Lewis (1972), Hilary Putnam (1976),
John Searle (1980), and Jerry Fodor (1987)) takes it that semantics is the
study of the connection of language to the world. On this view, a proper semantics has to be
concerned above all with how the noise kFt is
connected with cats. How language users
make that connection is quite a different issue (and to many semanticists, not
of interest). There is no room here to
begin the critique of this view; chapters 9 and 10 of Foundations take
up the argument in detail. My overall
conclusion is that even if it is worthwhile undertaking such a “realist”
semantics, the enterprise of discovering how language users do it is also
worthwhile. I don’t care whether you
call the latter enterprise semantics or shmenatics or whatever: it’s this enterprise whose central issues
intercalate naturally with those of generative linguistics, cognitive
psychology, and neuroscience. Just to
be clear, I will call this enterprise conceptualist semantics.
Conceptualist semantics requires us to rethink the
traditional issue of reference, which takes as its starting point the
unshakeable intuition that the phrase my cat does indeed pick out an
individual in the world. In a mentalist
linguistic theory, the language user’s linguistic system connects the phonological string /maykFt/ to the concept of a feline animal, and to the
concept of this feline animal being possessed by the speaker of the
phrase. How then does the language user
get from there to the actual individual out there in the world? The answer in brief is that it isn’t just
language users who have to connect something in their head to a sense of
individuals in the world: any organism
with a visual system about like ours (e.g. babies and apes) has precisely the
same problem. The environment acting on
the visual system produces some set of activations in the brain, resulting in
the organism experiencing real objects out there. In other words, conceptualist semantics allows us to recognize
that the problem of reference is not a problem about language, it’s at
bottom a problem about perception and cognition which has to be
solved by psychology and neuroscience.
By contrast, conventional realist theories of reference divorce
reference from the mind and make no contact whatsoever with research on
perception.
In order for the system of meaning to be influenced by
perception, of course, there has to be an interface between conceptual/semantic
structure and the “upper end” of the perceptual systems, where “the world”
(i.e. the perceiver’s conceptualization of the physical world) is organized in
terms of stable three-dimensional objects that are located in space with
respect to the perceiver and each other.
This interface too can be shown to have the standard characteristics: it
is a partial homomorphism between the quasi-algebraic format in which
linguistic meanings are encoded and the quasi-geometric/topological format(s)
in which spatial understanding is encoded.
Thus at the semantic end of the language faculty, just as at the phonological
end, the relation between language and the rest of the mind is of the same
general character as the interfaces within the language faculty itself.
Studying the conceptual system as a combinatorial
system leads to the same questions about acquisition as studying syntax. How does the child learning language acquire
the meanings of all those thousands of words on the basis of experience, both
perceptual and linguistic? What
perceptual biases and innate structures does the child bring to the task of
interpreting the world? Here conceptualist
semantics makes contact with a rich literature on word and concept learning and
its innate bases (Bloom 2000, Carey 1985, Spelke et al. 1994, Baillargeon 1986,
Gleitman and Landau 1994, Keil 1989, to mention only a few parochial examples). Moreover, since humans doubtless share with
monkeys and apes at least the parts of the conceptual system dealing with
physical space and perhaps some of the parts dealing with social relations and
other minds, conceptualist semantics further makes contact with research on
primate cognition (Premack 1976, Tomasello 2000, Hauser 2000, Povinelli 2001,
Cheney and Seyfarth 1990, Köhler 1927).
Again, these are issues that conventional realist
semantics cannot address. Nor are they
particularly accessible to semantics studied in a syntactocentric linguistic
theory. For if the combinatorial
properties of semantics were completely attributable to the combinatorial
properties of syntax, then it would be impossible for nonlinguistic organisms
to have combinatorial thoughts. There
are of course important strains of philosophy that have embraced this view,
identifying the capability for thought with the capability for overt language
(Descartes comes to mind, for instance).
But I think contemporary cognitive neuroscience has outgrown such a
view, and linguistics ought to be able to follow suit gracefully.
9.3. The
relation of grammar to processing
A theory of linguistic competence is supposed to
simply define the permissible structures in the language, without saying how
those structures are produced in real time.
However, as pointed out in
section 1, a competence theory ought to lend itself to being embedded in a
theory of processing: we ought to be
able to say how the f-knowledge that constitutes the competence theory is
actually put to use.
There turns out to be an inherent structural reason
why competence has to be isolated from performance in the syntactocentric view
of language. If we flatten out and
simplify all the syntactocentric architectures in Figure 1, they all have a
logical directionality proceeding outward from syntax in the middle:
(13) Logical
directionality of syntactocentric architecture
sound Z phonology
Z syntax Y meaning
What I mean by logical directionality is that the
possible phonological structures and meanings cannot be determined without
first determining syntactic structures.
Syntacticians may insist that they are being “metaphorical” when they
talk about things happening “before” and “after” other things in a derivation;
but the logical dependence is there nevertheless. Now contrast this to the logical
directionality of language processing:
language perception goes consistently from left to right, and language
production from right to left.
(14) a. Logical directionality of language
perception
sound Y phonology Y syntax Y meaning
b. Logical directionality of language
production
sound Z phonology
Z syntax Z meaning
Hence there is no way that the logical directionality
in (13) can serve the purposes of both perception and production. Going from syntax to phonology in (13) seems
inherently like production – but only part of production; going from
syntax to semantics in (13) seems inherently like perception – but only part of
it.
The parallel architecture, by contrast, is inherently nondirectional. The “information flow” between sound and
meaning is through the sequence of interfaces, each of which is a system of
correlations between two structures, not a derivation of one from the other. The correlations can be used in either
direction (which is why they are drawn with double arrows in Figure 3). This makes it possible to think of speech
perception as a process where structures are activated first at the auditory
end of the chain of structures, “clamped” by the environmental input. The interfaces propagate activation
rightward through the chain, each interface principle creating a partial
resonance between the structures it connects.
Eventually the structured activation reaches semantic structure, at
which point it can interact with the hearer’s understanding of the context to
produce the understanding of the heard utterance. Similarly, in speech production, the speaker begins with a
thought to convey, i.e. meaning is “clamped” by the speaker’s communicative
intent. Then the interface principles
propagate activation leftward through the chain, eventually activating motor
control of the vocal tract and producing speech. Crucially, except for the auditory and vocal parts of the chain,
the very same structures and the very same interface principles are invoked in
perception and production, just in opposite directions.
There is no need in this system for all of one level
to be totally processed before activation of the next level sets in. Any activation of a level, no matter how
incomplete, if it can be detected by the next interface, will start to
propagate to the next level in the chain.
Thus processing can be thought of as “incremental” or “opportunistic”
rather than rigidly regulated. In
addition, since the interfaces are trying to achieve “resonance”, i.e. optimal
mapping between levels, there is ample room in the processing theory for
feedback in processing – semantics affecting syntactic processing in
perception, and vice versa in production.
A crucial tenet of this theory, though, is that the
rules of grammar are the only source of information flow in language processing. For example, knowledge of context cannot
directly affect phonological processing, because there are no interface rules
that directly relate contextual understanding to phonological structure. On the other hand, context can indirectly
affect phonological processing – via the interfaces linking them through
semantics and syntax. The prediction is
that such feedback will take effect some time after constraints directly from
phonology, because it has to go up the chain of interfaces and down again. On the whole such a prediction seems
consistent with the experimental literature (Cutler and Clifton 1999, Levelt
1989); Foundations works out
many details, in particular the relation of long-term memory to working memory
during language processing.
The role of the lexicon in the processing theory is
entirely parallel to that in the competence theory. Recall that words are little interface rules, providing partial
routes for mapping between sound and meaning.
Now consider the logic of language perception. The auditory system and the interface from audition to phonology
produce some string of speech sounds in the hearer’s head, and this activates a
call to the lexicon: “Do any of you
guys in there sound like this?”
And various items raise their hand, i.e. get activated. At this point the processor has no way of
knowing which of these items is semantically appropriate, because no contact
has yet been made with semantics.
However, each item over time activates a connection to potential
syntactic and semantic structures, which can be integrated with previous words
and with context to determine which candidate word makes most sense in context. This scenario corresponds precisely to the
results in lexical access experiments (Swinney 1979, Tanenhaus et al. 1979), in
which every possible sense of a given phonological string is activated at
first, later to be pruned down by semantic context.
A parallel story can be told for speech
production. The speaker has activated
some conceptual structure that s/he wishes to communicate. The first step is to call the lexicon: “Do any of you guys in there mean this?” And various items raise their hand. All the lexical retrieval and speech error
literature now comes to bear in showing us the flow of information from this
point to actual vocal production; for the most part it proves to correspond
nicely to the options made possible by the components of the parallel framework
(Levelt 1989, 1999).
It is significant that the parallel architecture
accords words a very active role in determining the structure of sentences, in
concurrence with evidence from the psycholinguistic literature. By contrast, the syntactocentric
architecture views words as much more passive:
they simply ride around at the bottom of syntactic trees, while the
derivational rules of syntax do all the interesting work. Thus again, in the area of the lexicon, the
syntactocentric framework again makes it hard to connect competence and
performance.
The conclusion here is that the parallel architecture
permits a far closer relation between competence and performance theories. The rules of the language, including the
words, are posited to be precisely what the processing system uses in
constructing mappings between sound and meaning. This opens the door for a two-way dialogue between linguistics
and psycholinguistics. Linguistics has
always dictated the structures that psycholinguistics should be
investigating. But now there is the
possibility that psycholinguistic experiments may help determine what component
of the grammar is responsible for a particular phenomenon. For instance, PiZango et al. (1999) have shown that aspectual coercion
(e.g. the sense of repetition in Jill jumped until the alarm went off,
example (8a) above) causes a processing load at a point in time that is
appropriate to semantic, not syntactic processing. This result conforms to the theoretical claim that aspectual
coercion is a matter of adjusting semantic well-formedness, not a matter of
syntactic deletion of an iterative morpheme.
In short, the idealization of a competence theory is not a rigid
abstraction, it is rather a convenient methodological move, to be bridged
freely when the occasion arises. (More
detail on the relation of rules to processing is offered in Foundations,
chapter 7.)
9.4. Evolution
of language
Let us return to a point from section 2. If Universal Grammar is a human cognitive
specialization, it has to be transmitted by genes that have emerged in the
course of our evolutionary divergence from the chimpanzees. Of course the actual evidence for the
evolution of the language faculty is practically nonexistent. There is some evidence about the evolution
of the human vocal tract (Fitch 2000), but the ability to make speech sounds is
only one element of language – and of course there are signed languages, which
don’t involve speech at all. In
addition, it has begun to look like many of the mechanisms for auditory
perception are already in place in other mammals (Hauser, Chomsky, and Fitch
2002). But the real issue is: how did the ability to systematically map
combinations of concepts into sequences of speech sounds and back again develop
in our species, and how did the ability to learn such systematic
combinatorial mappings develop?
In the absence of evidence, we would like at least to
be able to tell a plausible story about the emergence of Universal Grammar, an
important aspect of which is the overall architecture of language. In particular, we would not like to have to
explain language through miraculous emergence, given that (as argued by Pinker
and Bloom 1990) it has the hallmarks of being shaped by natural selection. Pinker and Bloom, however, do not offer any
concrete proposals as to how language evolved.
As is well known, Chomsky himself has been notably evasive on the issue
of the evolution of the language faculty, often seeming to cast aspersions on
the theory of natural selection (Newmeyer 1998 collects representative
quotes). Chomsky is correct that other
factors besides natural selection play a role in evolution, for instance
environmental pressure and the biochemistry of proteins. Nevertheless there is nothing in these other
factors that provides any helpful hints on what brought about the emergence of
language.
The logic of the syntactocentric architecture suggests
a reason why such evasion has been necessary.
The problem is in providing a route for incremental evolution, such that
some primitive version of the faculty could still be useful to the
organism. In the syntactocentric
architecture, everything depends on syntax.
Meaning cannot have evolved before syntax, because its structure is
totally dependent on the syntactic structure from which it is derived. For the same reason, phonological structure
cannot have evolved before syntax. Thus
the complexity of syntax had to evolve before the complexity of the other components. But what would confer an adaptive advantage
on a syntactic faculty that just generated meaningless and imperceptible
syntactic structures? And what would
enable children to acquire such syntactic structures if there were no
perceptible output to which they could attach it? We quickly see that, at this very crude level at least, the
syntactocentric theory is stuck: there
is no logical way to build it incrementally, such that the earlier stages are
useful.
The parallel architecture offers a better
alternative. The system of concepts
that language expresses is an independent generative component in the
mind/brain. Since it is taken to exist,
in some degree at least, in other primates as well, it also could have existed
in our ancestors, prior to language.
That is, our ancestors had interesting thoughts, but lacked any way to
say them: meaning therefore would be
the first generative component of language to emerge (a similar view is urged
by Hauser 2000).
Most speculation on language evolution goes on to say
that the earliest stage would have been the symbolic use of simple
vocalization, without grammatical organization. Such a stage is theoretically impossible in the syntactocentric
theory, since even single-word utterances have to arise from syntactic
structure. But such a stage is quite
natural in the parallel architecture:
it consists of stored associations of vocalizations and concepts, i.e. a
“paleo-lexicon.” Lexical items that can
serve on their own as utterances still exist in modern language, for instance hello,
oops, ouch, and gadzooks.
The provision for them in language might be viewed as an evolutionary
relic of this earliest stage.
Assuming that there would be an adaptive advantage to
a larger number of signals, a regimentation of vocalization along the lines of
phonological structure would be the next generative component of language to
emerge. Phonological organization in
effect digitizes vocalizations, making a large vocabulary reliably
discriminable and learnable. (Proto-)words
at this point would be simply duples of phonological and semantic structure,
without syntax.
A next innovation might be the provision of
concatenating words into larger utterances.
However, when words are concatenated, the issue arises of how the
meanings of words in a string are related to each other semantically. In a string like eat apple Fred, it
is pretty clear on pragmatic grounds that Fred is eating the apple and not the
reverse. But pragmatics can only go so
far: in chase lion bear, who is
the chaser? Something as elaborate as
English syntax is not entirely necessary to fix this. One can actually get considerable mileage from simple functional
principles of linear word order. For
example, the principle “Agent First” would tell us that the lion is chasing the
bear and not the reverse. Such a
principle is a straight phonology-to-semantics mapping, relating linear order
to semantic function. And principles
like this appear to be widespread in pidgin languages (Bickerton 1981) and the
grammars of speakers who have acquired their languages late in life, after the
sensitive period (Klein and Perdue 1997).
Finally, principles like Agent First have their
limitations too. One can imagine the
capacity for modern syntactic structure evolving last, as a way of
making more complex semantic relations among the words of an utterances more
precisely mappable to linear word order in phonology. That is, syntax comes along in evolution as a refinement, a
“supercharger” of a preexisting interface between phonology and semantics. This seems exactly appropriate to its
function within the parallel architecture.
In short, the parallel architecture cannot tell us
exactly how language evolved – I don’t think anything can ever tell us
that. But it does afford a far more
plausible hypothesis than the syntactocentric architecture (Foundations,
chapter 8 develops this story in considerably more detail). Thus the parallel architecture opens the
door for linguistics to participate far more fully in the mainstream of
evolutionary psychology, yet another desirable connection.
10.
Conclusions
Putting this all together, the parallel architecture
makes it possible both to internally integrate linguistic theory, establishing
the proper relation between phonology, syntax, semantics, and the lexicon, and
also to integrate linguistic theory more comprehensively with the brain and
with biology. In addition, by
liberating semantics from its syntactic shackles, the parallel architecture
makes it possible to develop a fully psychological theory of meaning and its
relation to perception. As observed
above, these points of connection were precisely what early generative grammar
promised but ultimately couldn’t bring off; I have tried to show here why
syntactocentrism was a major reason behind this disappointment.
Of course, to propose a new architecture only begins
the work. It opens major questions
about exactly what components the grammar requires and what interfaces connect
them. Vast numbers of phenomena have
been studied in the context of the traditional architecture; to what extent can
the analyses proposed there be duplicated or even improved upon? In particular, a thorough overhaul of
syntactic theory is necessary, in order to overcome decades of accretions
motivated solely by syntactocentric
assumptions (Culicover and Jackendoff (forthcoming) begin to undertake this
task). Perhaps the hardest part of all
this will be maintaining a sense of global integration, keeping the subdomains
of the field in closer touch than they have recently been.
But linguistics alone cannot sustain the weight of the
inquiry. We need all the help we can
get from every possible quarter. And in
return, one would hope that linguistic theory might be a more fruitful source of
evidence and puzzles for other fields.
Above all, my aspiration for Foundations is that it can help
encourage the necessary culture of collaboration.
References
Baillargeon, Renée (1986) Representing the existence and the location of hidden objects: Object permanence in 6- and 8-month-old
infants. Cognition 23, 21-41.
Baker, C. L., and John J. McCarthy (eds.) (1981) The logical problem of language
acquisition. Cambridge, MA: MIT Press.
Beckman, Mary, and Janet Pierrehumbert (1986)
Intonational structure in English and Japanese. Phonology 3, 255-309. Bellugi et al. 1994
Bellugi, Ursula, Howard Poizner, and Edward S. Klima
(1989) Language, modality, and the
brain. Trends in Neurosciences
12, 380-388.
Bickerton, Derek (1981) Roots of language.
Ann Arbor, MI: Karoma.
Bierwisch, Manfred (1967) Some semantic universals of German adjectivals. Foundations of Language 3, 1-36.
Bierwisch, Manfred (1969) On certain problems of semantic representation. Foundations of Language 5, 153-184.
Bloom, Paul (2000) How children learn the meanings
of words. Cambridge, Mass.: MIT
Press.
Bresnan, Joan W. (ed.) (1982) The mental
representation of grammatical relations.
Cambridge, MA: MIT Press.
Bresnan, Joan W. (2001) Lexical-functional syntax. Oxford: Blackwell.
Carey, Susan (1985) Conceptual Change in Childhood. MIT Press.
Cheney, Dorothy, and Robert Seyfarth (1990) How monkeys see the world. Chicago:
University of Chicago Press.
Chomsky, Noam (1957)
Syntactic structures. The
Hague: Mouton.
(1965) Aspects of the theory of syntax. Cambridge, MA: MIT Press.
(1966) Cartesian Linguistics. New York:
Harper & Row.
(1972) Studies on Semantics in Generative
Grammar. The Hague: Mouton.
(1973) Constraints on transformations. In S. Anderson and P. Kiparsky (eds.), A
Festschrift for Morris Halle, 232-286.
New York: Holt, Rinehart &
Winston.
(1977) On wh-movement. In P. Culicover, T. Wasow, and A. Akmajian (eds.), Formal
syntax, 71-132. New York: Academic Press.
(1981) Lectures on government and binding. Dordrecht: Foris.
(1995) The Minimalist Program. Cambridge, MA: MIT Press.
Chomsky, Noam, and Morris Halle (1968) The sound pattern of English. New York, Harper & Row.
Clahsen, Harald, and Mayella Almazan (1998) Syntax and morphology in Williams
syndrome. Cognition 68, 167-198.
Corballis, Michael C. (1991) The lopsided ape.
Oxford: Oxford University Press,
Culicover, Peter W. (1999) Syntactic nuts: Hard
cases in syntax. Oxford: Oxford
University Press.
Culicover, Peter, and Ray Jackendoff (1995) Something else for the binding
theory. Linguistic Inquiry 26,
249-275.
(1997) Semantic subordination despite syntactic
coordination. Linguistic Inquiry
28, 195-217.
(1999) The view from the periphery: The English correlative conditional. Linguistic Inquiry 30, 543-571.
(forthcoming)
Minimal syntax.
Curtiss, Susan (1977)
Genie: A linguistic study of
a modern-day "wild child".
New York, Academic Press.
Cutler, Anne, and Charles Clifton, Jr. (1999) Comprehending spoken language: A blueprint of the listener. In Colin M. Brown and Peter Hagoort (eds.), The
neurocognition of language, 123-166.
Oxford: Oxford University Press.
Deacon, Terrence W. (1997) The Symbolic Species.
New York: Norton.
Degraff, Michel (ed.) (1999) Language creation and language change: Creolization, diachrony, and development. Cambridge, MA: MIT Press.
Dufva, H., and M. Lähteenmäki (1996) But who killed Harry? A dialogical approach
to language and consciousness. Pragmatics
and Cognition 4, 35-53.
Elman, Jeffrey (1990)
Finding structure in time. Cognitive
Science 14, 179-211.
Elman, Jeffrey, Elizabeth Bates, Mark Johnson, Annette
Karmiloff-Smith, Domenico Parisi, and Kim Plunkett (1996) Rethinking innateness: A connectionist perspective on development. Cambridge, MA: MIT Press.
Fauconnier, Gilles (1985) Mental spaces: Aspects
of meaning constrution in natural language. Cambridge, MA: MIT Press.
Fillmore, Charles, Paul Kay, and Mary Catherine
O'Connor (1988) Regularity and idiomaticity in grammatical constructions: The case of let alone. Language 64, 501-538.
Fischer, Susan D.,
and Patricia Siple (eds.) (1990) Theoretical issues in sign language
research 1. Chicago: University of
Chicago Press.
Fitch, W. Tecumseh (2000) The evolution of speech:
A comparative review. Trends
in Cognitive Sciences 4, 258-267.
Flynn, Suzanne, and Wayne O'Neill (eds.) (1988) Linguistic theory in second language
acquisition. Dordrecht: Reidel.
Fodor, Jerry (1987)
Psychosemantics: The problem
of meaning in the philosophy of mind.
Cambridge, MA: MIT Press.
Fodor, Jerry, Thomas Bever, and Merrill Garrett
(1974) The Psychology of Language. New York:
McGraw-Hill.
Frege, Gottlob (1892)
Über Sinn und Bedeutung. Zeitschrift
für Philosophie und philosophische Kritik 100, 25-50. English translation in Peter Geach and Max
Black (eds.), Translations from the philosophical writings of Gottlob Frege,
56-78. Oxford: Blackwell, 1952.
Gleitman, Lila R., and Barbara Landau (eds.)
(1994) The acquisition of the
lexicon. Cambridge, MA: MIT Press.
Goldberg, Adele (1995) Constructions: A Construction Grammar approach to argument
structure. Chicago: University of Chicago Press.
Goldsmith, John (1976) Autosegmental phonology.
Doctoral dissertion, MIT. New
York: Garland Press, 1979.
Gopnik, Myrna (1999)
Some evidence for impaired grammars.
In R. Jackendoff, P. Bloom, and K. Wynn (eds.), Language, Logic, and
Concepts, 263-283. Cambridge,
Mass.: MIT Press.
Halle, Morris, and William Idsardi (1995) Stress and metrical structure. In J. Goldsmith (ed.), Handbook of
theoretical phonology, 403-443.
Oxford: Blackwell.
Harris, Randy Allen (1993) The Linguistics Wars.
New York: Oxford University
Press.
Hauser, Marc D. (2000) Wild Minds: What
Animals Really Think. New
York: Henry Holt.
Hauser, Marc D., Noam Chomsky, and Tecumseh Fitch
(2002) Science
Heim, Irena, and Angelica Kratzer (199x)
Huck, Geoffrey, and John Goldsmith (1995) Ideology and linguistic theory. Chicago:
University of Chicago Press.
Jackendoff, Ray (1972) Semantic interpretation in generative grammar. Cambridge, MA: MIT Press.
(1983) Semantics and cognition. Cambridge, MA: MIT Press.
(1992) Mme. Tussaud meets the binding theory. Natural Language and Linguistic Theory
10.1, 1-31.
(1990) Semantic structures. Cambridge, MA: MIT Press.
(1996) The proper treatment of measuring out,
telicity, and possibly even quantification in English. Natural Language and Linguistic Theory
14, 305-354.
(1997) The architecture of the language faculty. Cambridge, MA: MIT Press.
Kager, René (1995)
The metrical theory of word stress.
In J. Goldsmith (ed.), Handbook of theoretical phonology,
367-402. Oxford: Blackwell.
Katz, Jerrold, and Jerry Fodor (1963) The structure of a semantic theory. Language 39, 170-210.
Katz, Jerrold, and Paul M. Postal (1964) An integrated theory of linguistic
descriptions. Cambridge, MA: MIT Press.
Kegl, Judy, Ann Senghas, and Marie Coppola (1999) Creations through contact: Sign language emergence and sign language
change in Nicaragua. In DeGraff 1999,
179-237.
Keil, Frank C. (1989)
Concepts, kinds, and cognitive development. Cambridge, MA: MIT Press.
Klein, Wolfgang, and Clive Perdue (1997)
The Basic Variety, or: Couldn't language be much simpler? Second Language Research 13, 301-347.
Klima, Edward S., and Ursula Bellugi (1979) The signs of language. Cambridge, MA: Harvard University Press.
Köhler, Wolfgang (1927) The mentality of apes, Routledge & Kegan Paul.
Ladd, D. Robert (1996) Intonational phonology.
Cambridge: Cambridge University Press.
Lakoff, George (1971)
On generative semantics. In D.
Steinberg and L. Jakobovits (eds.), Semantics: An interdisciplinary reader in philosophy, linguistics, and
psychology, 232-296. New York: Cambridge University Press.
Lakoff, George (1987) Women, fire, and dangerous
things. Chicago: University of Chicago Press.
Lamb, Sydney (1966) Outline of Stratificational
Grammar. Washington: Georgetown
University Press.
Langacker, Ronald (1987) Foundations of cognitive grammar, Vol. 1. Stanford:
Stanford University Press.
Lenneberg, Eric (1967) Biological foundations of language. New York:
John Wiley & Sons.
Lerdahl, Fred, and Ray Jackendoff (1983) A Generative Theory of Tonal Music. Cambridge, MA: MIT Press.
Levelt, W. J. M. (1989) Speaking.
Cambridge: MIT Press.
(1999) Producing spoken language: A blueprint of the speaker. In Colin M. Brown and Peter Hagoort (eds.), The
neurocognition of language, 83-122.
Oxford: Oxford University Press.
Liberman, Mark, and Alan Prince (1977) On stress and linguistic rhythm. Linguistic Inquiry 8, 249-336.
Marcus, Gary (2001)
The algebraic mind. Cambridge, MA: MIT Press.
Marr, David (1982)
Vision. San
Francisco: Freeman.
McCawley, James D. (1968) Lexical insertion in a transformational grammar without deep
structure. In B. Darden, C.-J. N.
Bailey, and A. Davison (eds.) Papers from the fourth meeting of the Chicago Linguistic
Society. Chicago: University of Chicago Department of
Linguistics.
Newmeyer, Frederick J. (1980) Linguistic theory in America: The first quarter-century of
transformational-generative grammar.
New York: Academic Press.
(1998) On the supposed 'counterfunctionality' of
Universal Grammar: some evolutionary
implications, in James Hurford, Michael Studdert-Kennedy, and Chris Knight
(eds.) Approaches to the evolution of language, 305-319. Cambridge:
Cambridge University Press.
Newport, Elissa (1990) Maturational constraints on language learning. Cognitive Science 14, 11-28.
Nunberg, Geoffrey (1979) The non-uniqueness of semantic solutions: Polysemy.
Linguistics and Philosophy 3, 143-184.
Partee, Barbara, ed. (1976) Montague Grammar.
New York, Academic Press.
Perlmutter, David M. (ed.) (1983) Studies in Relational Grammar 1. Chicago:
University of Chicago Press.
PiZango, Maria
Mercedes, Edgar Zurif, and Ray Jackendoff (1999) Real-time processing implications of enriched composition at the
syntax-semantics interface. Journal
of Psycholinguistic Research, 28, 395-414.
Pinker, Steven (1989)
Learnability and cognition:
The acquisition of argument structure. Cambridge, MA: MIT Press
(1999) Words and rules. New York: Basic Books.
Pinker, Steven, and Paul Bloom (1990) Natural language and natural selection, Behavioral
and Brain Sciences 13, 707-726
Pollard, Carl, and Ivan Sag (1987) Information-based syntax and semantics. Stanford: Center for the Study of Language
and Information.
(1994) Head-Driven Phrase Structure Grammar. Chicago: University of Chicago Press.
Postal, Paul M.
(1970) On the surface verb
"remind." Linguistic
Inquiry 1, 37-120.
Povinelli, Daniel (2002) Folk Physics for Apes. Oxford: Oxford University Press.
Premack, David (1976)
Intelligence in ape and man.
Hillsdale, NJ: Erlbaum.
Prince, Alan, and Paul Smolensky (1993). Optimality Theory: Constraint Interaction
in Generative Grammar. Rutgers
University Center for Cognitive Science, Piscataway, NJ.
Pustejovsky, James (1995) The generative lexicon.
Cambridge, MA: MIT Press.
Putnam, Hilary (1975)
The meaning of 'meaning.' In
Keith Gunderson (ed.), Language, mind, and knowledge, 131-193. Minneapolis: University of Minnesota Press.
Rumelhart, David, and James McClelland (1986) On learning the past tense of English
verbs. In J. McClelland, D. Rumelhart,
and the PDP Research Group, Parallel distributed processing, Vol. 2,
216-271. Cambridge, MA: MIT Press.
Ryle, Gilbert (1949)
The concept of mind.
Chicago: University of Chicago
Press.
Sadock, Jerrold (1991) Autolexical syntax.
Chicago: University of Chicago
Press.
Saussure, Ferdinand de (1915) Cours de linguistique générale,
edited by Charles Bally and Albert Sechehaye.
English translation: Course
in General Linguistics. New
York: Philosophical Library, 1959.
Savage-Rumbaugh, Sue, Stuart Shanker, and Talbot
Taylor (1998) Apes, language, and
the human mind. Oxford University
Press.
Searle, John (1980) Minds, brains, and programs. Behavioral and Brain Sciences 3,
417-424.
Seidenberg, Mark S., and Laura Petitto (1978) Signing behavior in apes: A critical review. Cognition l7, 177-215.
Spelke, Elizabeth, Gary Katz, Susan Purcell, Sheryl
Ehrlich, and Karen Breinlinger (1994)
Early knowledge of object motion:
Continuity and inertia. Cognition
51, 131-176.
Swinney, David (1979) Lexical access during sentence
comprehension: (Re)consideration of context effects. Journal of Verbal Learning and Verbal Behavior 18, 645-659.
Talmy, Leonard (2000) Toward a Cognitive Semantics. Cambridge, Mass.: MIT Press.
Tanenhaus, M. K., M. J. Spivey-Knowlton, K. M.
Eberhard, and J. C. Sedivy (1995).
Integration of Visual and Linguistic Information in Spoken Language
Comprehension. Science 268,
1632-4.
Terrace, H. (1979) Nim. New York, Knopf.
Tomasello, Michael (ed.) (2000) Primate cognition (special
issue). Cognitive Science 24.3.
Van Valin, Robert, and Randy LaPolla (1997) Syntax:
Structure, meaning and function.
Cambridge: Cambridge Unviersity
Press.
Verkuyl, Henk (1993)
A theory of aspectuality: The
interaction between temporal and atemporal structure. Cambridge:
Cambridge University Press.
Weinreich, Uriel (1966) Explorations in semantic theory.
In T. Sebeok (ed.), Current trends in linguistics, vol. 3. The Hague:
Mouton. Reprinted in U.
Weinreich (1980), On semantics, 99-201.
Philadelphia: University of
Pennsylvania Press.
Wexler, Kenneth, and Peter Culicover (1980) Formal principles of language acquisition. Cambridge, MA: MIT Press.
Zurif, Edgar (1990)
Language and the brain. In Dan
Osherson and Howard Lasnik (eds.), An introduction to cognitive science,
Vol. 1: Language, 177-198. Cambridge, MA: MIT Press.
Acknowledgment
Foundations of Language was written during 1999-2000, while I was a Fellow at
the Wissenschaftskolleg zu Berlin, a year for which I am profoundly
grateful. Both the book and this
version of it were supported in part by NIH Grant 03660 to Brandeis
University. I also must thank Marc
Hauser and Dan Dennett for their help and encouragement on this version.
Notes
1. “Essentially the same” is a matter of perspective. When we are talking about “English speakers” as a whole we can treat them all as essentially the same. But if we’re talking about dialect differences, dialect contact, or language change, we can just as easily switch to treating different speakers as having (slightly) different linguistic systems in their heads. And of course when we’re talking about language acquisition, we take it for granted that the young child has a different system than the adults.
2. To some extent Chomsky’s point has been lost on the larger cognitive neuroscience community. For instance, the widely cited connectionist parser of Elman (1990) is a variation of a finite-state Markov device, and is subject to some of the same objections raised by Chomsky in 1957. See Marcus 2001 and Pinker 1999 for extensive discussion.
3. For example: “The deep structure that expresses the meaning is common to all languages, so it is claimed [by the Port-Royal grammarians – who of course did not use the term “deep structure”], being a simple reflection of the forms of thought” (Chomsky 1966, 35).
4. Some opponents of generative grammar (for instance some Cognitive Grammarians) have rightly objected to syntactocentrism, but proposed instead that all properties of language are derivable from meaning. I take this to be equally misguided, for reasons that should be evident as we proceed.
5. A standard mark of recursivity is a constituent occurring within another constituent of the same type. For instance, a clause can appear within another clause: The man who comes from New York is tall; and a noun phrase can appear within a noun phrase: the king of the Cannibal Islands. In phonology this sort of situation does not occur nearly so freely: in particular, a syllable cannot occur within another syllable.
6. Interestingly, Chomsky (1965) brings up an example like (4) and analyzes the prosody is a fact of performance: speakers don’t pronounce the sentence in accordance with its syntactic structure. This is about the only way he can analyze it, given that he does not have independent principles of intonational constituency at his disposal. Contemporary theory allows us to say (correctly, I believe) that (4) is well-formed both syntactically and prosodically, with a well-formed correspondence between the two structures.
7. It is important to distinguish two interpretations of “syntactic” here. In the broader sense, every combinatorial system has a syntax: mathematics, computer languages, music, and even phonology and semantics. In the narrower sense of technical linguistics, “syntactic” denotes the organization of units such as NPs, VPs, and prepositions. I am reserving “syntactic” for this narrower sense and using “combinatorial” for the broader sense.
8. For the semantics I have used the Conceptual Structure notation of Jackendoff (1983, 1990); readers invested in other frameworks should feel free to substitute their own notations.
9. Stratificational Grammar (Lamb 1966) also proposed a thoroughgoing organization into independent generative components linked by interfaces.
10. The
lexicon, a large collection of
learned arbitrary associations between very particular bits of structure, also
has parallels in other domains of memory.
For instance, it is an arbitrary fact that the sound kFt means
‘feline animal’, and the child must learn it from the environment. Now consider the association between the
appearance of foods and their tastes.
It is similarly arbitrary (from the point of view of the organism) that
something that looks like a cucumber tastes the way it does, and organisms
learn probably hundreds or thousands of such associations. (There are even “ambiguous” looking
foods: think of mashed potatoes and
vanilla ice cream.)