Taalportaal - the digital language portal

[+] I. General introduction

Dutch is an official language in the Netherlands, Belgium-Flanders, Surinam, Aruba and the Netherlands Antilles. With at least 22 million native speakers (Wikipedia.org 2024), it is one of the world's major languages. It is taught and studied in more than 135 universities in 40 countries (taalunie.org/dossiers/68: January 9, 2021). Dutch is also one of the best-researched living languages; research on it has had a major and continuing impact on the development of formal linguistic theory, and it plays an important role in various other types of linguistic research. However, much information remains hidden in scientific publications: some is embedded in theoretical discussions of interest or accessible only to certain groups of formal linguists, some is more or less outdated in the light of more recent findings and theoretical developments, some is buried in publications with limited distribution, and some is simply inaccessible to large groups of readers because it is written in Dutch. This calls for a comprehensive, scientifically based description of the grammar of Dutch that is accessible to a wider international audience: the Syntax of Dutch (SoD) aims to fill this gap in the field of syntax. The Language Portal (https:\\taalportaal.org), of which the SoD is a part, provides an even more comprehensive description of the grammar of Dutch by including similar descriptions of its phonology and morphology; cf. Van der Wouden et al. (2016).

[+] II. Main objective

The main objective of SoD is to present a synthesis of the currently available syntactic knowledge of Dutch. It provides a comprehensive overview of relevant research on Dutch, presenting not only the results of earlier approaches to the language, but also the results of formal linguistic research over the last seven decades, since the advent of generative grammar, which are often not found in other reference works. It should be emphasized, however, that SoD is primarily concerned with language description and not with linguistic theory; the reader will generally look in vain for critical evaluations of theoretical proposals made to explain particular problems. Although SoD addresses many of the central issues in current syntactic theory, it does not provide an introduction to it. Readers interested in such an introduction are referred to one of the many existing introductory textbooks, or to handbooks like The Wiley Blackwell Companion to Syntax, edited by Martin Everaert & Henk van Riemsdijk, or The Cambridge Handbook of Generative Syntax, edited by Marcel den Dikken. A commendable publication that aims to provide a description of Dutch in a more theoretical setting is The Syntax of Dutch by Jan-Wouter Zwart in the Cambridge Syntax Guides series.

[+] III. Intended readership

SoD is not aimed at a specific group of linguists, but at a more general readership. The intention is to produce a reference work that will be accessible to a wide audience with some training in linguistics and/or related disciplines, and that will assist all researchers interested in questions of Dutch syntax. Although we did not originally target this audience, the descriptions we provide have been proven to be accessible to advanced students of language and linguistics. Specifying our audience in this way means that we have tried to avoid the jargon of specific theoretical frameworks by using, as much as possible, the lingua franca commonly used by linguists. When we use a term that we believe is not part of the lingua franca, we provide a brief clarification of that term in the glossary. When we occasionally discuss theoretical matters, we try to do so in places where it does not interrupt the discussion of the data, although such digressions may include new data if necessary to make our point.

[+] IV. Object of description

The object of description is aptly described by the title of this work, Syntax of Dutch. This title suggests a number of ways in which the empirical domain is limited; this will be elaborated here by briefly discussing the two terms syntax and Dutch.

[+] A. Syntax

Syntax is the branch of linguistics that studies how words are combined into larger phrases and, ultimately, sentences. This means that we do not systematically discuss the internal structure of words (which is the domain of morphology) or the way sentences are used in discourse (which is the domain of discourse studies and pragmatics): we digress into such matters only if it is useful for describing the syntactic properties of the language. For example, Chapter N14 contains an extensive discussion of deverbal nominalization, but only because this morphological process is relevant to the discussion of the complementation of nouns in Chapter N15. And Section N21.1.3 shows that the difference in word order between the two examples in (1) is related to the preceding discourse: when the sentence is pronounced in a neutral way (i.e. without a contrastive accent), the object Marie can only precede clausal adverbs such as waarschijnlijkprobably if it refers to a person already mentioned in (or implied by) the preceding discourse.

1

a.	Jan	heeft	waarschijnlijk	Marie	gezien.	Marie = discourse new
	Jan	has	probably	Marie	seen
	'Jan has probably seen Marie.'

b.	Jan heeft	Marie	waarschijnlijk	gezien.	Marie = discourse old
	Jan has	Marie	probably	seen
	'Jan has probably seen Marie.'

Our goal of describing the internal structure of phrases and sentences implies that we focus on competence (the internalized grammar of native speakers) rather than performance (the actual use of language). SoD will pay considerable attention to certain aspects of meaning and will also discuss phonological aspects such as stress and intonation where these are relevant (e.g. in the context of word order phenomena such as those in (1)). The reason for this is that current formal grammar assumes that the output of the syntactic module of the grammar consists of objects (sentences) that relate form and meaning. Furthermore, formal syntax has been quite successful in establishing and describing a large number of constraints on this relationship. A prime example of this is the formulation of the so-called binding theory, which (among other things) accounts for the fact that referential pronouns such as hemhim and anaphoric pronouns such as zichzelfhimself differ in the domain within which they can (or must) find an antecedent. For instance, the examples in (2), where the intended antecedents of the pronouns are italicized, show that referential object pronouns such as hemhim cannot have an antecedent within their minimal clause, whereas anaphoric pronouns such as zichzelfhimself must have an antecedent within their minimal clause; cf. Section N22 for a detailed discussion.

2

a.	Jan denkt [Clause	dat	Peter hem/*zichzelf	bewondert].
	Jan thinks	that	Peter him/himself	admires
	'Jan thinks that Peter admires him [= Jan].'

b.	Jan denkt [Clause	dat	Peter	zichzelf/*hem	bewondert].
	Jan thinks	that	Peter	himself/him	admires
	'Jan thinks that Peter admires himself [= Peter].'

[+] B. Dutch

SoD provides a syntactic description of what we will loosely refer to as Standard Dutch, although we are aware that there are many problems with this term; cf. Milroy (2001). First, the term Standard Dutch is often used to refer to the written language and the more formal registers, which are widely perceived as more prestigious than the colloquial uses of the language. Second, the term suggests that there is an invariant language system shared by a large group of speakers. Third, the term bears the connotation that some, often unnamed, authority is able to determine what should or should not be considered part of the language, or what should or should not be considered proper language use.

SoD does not provide a description of this prestigious, invariant, externally determined language system. The reason for this is that knowledge of this system is not related to the competence (i.e. internalized grammar) of individual language users, but “is the product of a series of educational and social factors which have overtly impinged on the linguistic experiences of individuals, prescribing the correctness/incorrectness of certain constructions” (Adger & Trousdale 2007). Instead, the notion of standard language in SoD should be understood more neutrally as an idealization referring to certain properties of linguistic competence that can be assumed to be shared by individual speakers of the language. Our notion of standard Dutch, which will henceforth be written with a single capital letter, differs from the traditional notion of Standard Dutch, written with two capitals, in that it may include properties that would be rejected by language teachers, and it may exclude certain properties that are explicitly taught as part of the standard language. To put the latter in more technical terms, our notion of standard Dutch refers to the core grammar of Dutch (i.e. those aspects of the language system that arise spontaneously in the language-learning child through exposure to utterances in the standard language) and excludes the periphery (those properties of the standard language that are explicitly taught at a later age). This does not mean that we ignore the more peripheral aspects completely, but it should be kept in mind that they have a special status and may have properties alien to the core system.

A distinguishing feature of standard Dutch is that it can be used by speakers of different dialects; speakers of such dialects sometimes have to acquire it as a second language at a later age, i.e. in a way similar to a foreign language (although this may be rare in the context of Dutch). This implies that there is no contradiction in distinguishing different varieties of standard Dutch. A largely similar view is taken in Haeseryn et al. (1997: Section 0.6.2), where the fourfold distinction in (3) is made when it comes to geographically determined variation.

3

a.

Standard Dutch

b.

regional variety of standard Dutch

c.

regional variety of Dutch

d.

dialect of Dutch

The types in (3b&c) are characterized by certain properties that are found only in certain larger but geographically limited regions. The distinction between the two types of regional varieties is defined by Haeseryn at al. (1997) by appealing to the perception of the properties in question by other speakers of the standard language: if the majority of these speakers do not consider the property in question to be characteristic of a particular geographical region, the property belongs to a regional variety of standard Dutch; if the property in question is unknown to some speakers of the standard language or is considered to be characteristic of a particular geographical region, it belongs to a regional variety of Dutch. We will not adopt the distinction between the types in (3b) and (3c), since we are not aware of any large-scale perception studies that could help us distinguish between the two varieties in question. We will therefore simply merge the two categories into one, as in (4).

4

a.

standard Dutch

b.

regional variety of standard Dutch

c.

dialect of Dutch

We think it is helpful to think of the types of language variety in (4) in terms of grammatical properties that are common to the competence of groups of speakers. This means that we can see standard Dutch as a set of properties that are part of the competence of all speakers of the language. Examples of such properties in the nominal domain are that non-pronominal noun phrases are not morphologically case-marked and that the word order within noun phrases is such that nouns normally follow attributively used adjectives but precede PP-modifiers and that articles precede attributive adjectives (if present); cf. (5a). Relevant properties within the clausal domain are that finite verbs occupy the co-called second position in main clauses, while non-finite verbs cluster on the right side of the clause, as in (5b), and that finite verbs join the clause-final non-finite verbs in embedded clauses, as in (5c).

5

a.		de	oude	man	in de stoel	word order within noun phrases
		the	old	man	in the chair

b.	Jan	heeft	de man	een lied	horen	zingen.	verb second/clustering
	Jan	has	the man	a song	hear	sing
	'Jan has heard the man sing a song.'

c.	dat	Jan	de man	een lied	heeft	horen	zingen.	verb clustering
	that	Jan	the man	a song	has	hear	sing
	'that Jan has heard the man sing a song.'

Regional varieties of Dutch are then defined as having a set of additional properties that are part of the competence of larger subgroups of speakers. Such properties define certain special characteristics of the variety in question, but do not normally give rise to linguistic outputs that are inaccessible to speakers of other varieties; see the discussion of (6) below for a typical example. Dialects can be thought of as having a set of additional properties that are part of the competence of a smaller group of speakers in a restricted geographical area. Such properties may be alien to speakers of standard Dutch and may give rise to linguistic outputs that are not immediately accessible to other speakers of Dutch; see example (8) below for a possible case. This way of thinking about the typology in (4) allows us to use language types in a more gradual way, which may do more justice to the situation we actually find. It also makes it possible to define varieties of Dutch along different (e.g. geographical and social) dimensions.

The examples in (6) provide an example of a property that belongs to regional varieties of Dutch: speakers of northern varieties of Dutch require that the indefinite direct object boekenbooks precede all verbs in clause-final position, while many speakers of southern varieties of Dutch (especially those spoken in the Flemish part of Belgium) will also allow the object to permeate the verb sequence, provided that it precedes the main verb.

6

a.	dat	Jan <boeken>	wil <*boeken>	kopen.	Northern Dutch
	that	Jan books	wants	buy
	'that Jan wants to buy books.'

b.	dat	Jan <boeken>	wil <boeken>	kopen.	Southern Dutch
	that	Jan books	wants	buy
	'that Jan wants to buy books.'

Dialects of Dutch can differ from standard Dutch in many ways. For example, several dialects have morphological agreement between the subject and the complementizer, as illustrated in (7) by examples taken from Van Haeringen (1939); see Haegeman (1992), Hoekstra & Smit (1997), Zwart (1997), Barbiers et al. (2005) and the references cited there for more examples and detailed discussion. Complementizer agreement is a typical dialect feature: it does not occur in (the regional varieties of) standard Dutch.

7

a.	Assg	Wim	kompsg,	mot	jə	zorgə	dat	je	tuis	ben.
	when	Wim	comes	must	you	make.sure	that	you	at.home	are
	'When Wim comes, you must make sure to be home.'

b.	Azzəpl	Kees en Wim	komməpl, mot	jə	zorgə	dat	je	tuis	ben.
	when	Kees and Wim	come	must	you make.sure	that	you	home	are
	'When Kees and Wim come, you must make sure to be home.'

The examples in (8) illustrate another feature that belongs to a certain set of dialects. Speakers of most varieties of Dutch would agree that the use of possessive datives is possible only in a limited set of constructions: whereas possessive datives are possible in constructions such as (8a), in which the possessee is embedded in a °complementive PP, they are excluded in constructions such as (8b), in which the possessee is a direct object. Constructions such as (8b), if understood at all, are perceived as belonging to certain eastern and southern dialects, which is indicated here by a percentage sign.

8

a.	Marie zet	Peter/hempossessor	het kind	op de kniepossessee.
	Marie puts	Peter/him	the child	onto the knee
	'Marie puts the child on Peterʼs/his knee.'

b.	%	Marie wast	Peter/hempossessor	de handenpossessee.
		Marie washes	Peter/him	the hands
		'Marie is washing Peterʼs/his hands.'

Note that the typology in (4) should allow for certain dialectal features to become part of certain regional varieties of Dutch, as indeed seems to be the case for possessive datives of the type in (8b); cf. Cornips (1994). This again shows that it is not possible to draw sharp dividing lines between regional varieties and dialects, which emphasizes that we are dealing with dynamic systems; cf. the discussion of (4) above. For our limited purpose, however, the proposed distinctions seem sufficient.

The reader should bear in mind that the description of the types of Dutch in (4) in terms of properties of the competence of groups of speakers implies that standard Dutch does not refer to a language in the traditional sense; it refers only to a subset of properties common to all non-dialectal varieties of Dutch. The selection of one of these varieties as Standard Dutch in the more traditional sense is a political rather than a linguistic enterprise and therefore need not concern us here. For practical reasons, however, we will focus on the (prestigious) regional variety of Dutch spoken in the northwestern part of the Netherlands. One reason is that the authors who have contributed to SoD are all native speakers of this variety; this focus allows them to simply appeal to their own intuitions to determine whether or not this variety exhibits a particular property. A second reason is that this variety seems to be close to the varieties that have been discussed in the linguistic literature on Standard Dutch (in the traditional sense). This does not mean that we will not discuss other varieties of Dutch, but we will only do so if we have reason to believe that they behave differently. Unfortunately, however, not much is known about the syntactic differences between the various varieties of Dutch. Since this not part of our present goal to solve this problem, we would like to ask the reader to limit the judgments made in SoD to speakers of the Northwestern variety, unless noted otherwise. In the vast majority of cases, the other varieties of Dutch will show identical or similar behavior, since the behavior in question reflects properties that are part of standard Dutch (in the technical sense). However, this cannot be taken for granted, as it cannot be ruled out with certainty at this stage that they may occasionally reflect properties of the regional variety spoken by the author(s).

[+] V. The data collection

Section IV has stated that the main goal of SoD is to provide an up-to-date syntactic description of standard Dutch, i.e. a description of the way in which words are combined into larger phrases and sentences in core syntax. This means that we aim to describe (in a more or less informal way) the competence of native speakers rather than their performance. In order to do this, we will make extensive use of acceptability judgments on constructed examples ready made for the syntactic problems at hand. For practical reasons, these judgments will be based on the existing literature, the intuition of the researcher, or linguistically trained informant(s).

Not so long ago, it seemed unnecessary to motivate the use of acceptability judgments, now sometimes referred to as the intuitionist approach to the study of syntax. The first edition of SoD was criticized for using this method of data collection, however. The various reviews which appeared in Nederlandse Taalkunde/Dutch Linguistics (issues 19/1-2 and 21/2) left no doubt that some reviewers consider the intuitionist approach inadequate: a reference grammar such as Syntax of Dutch should preferably be based on corpus data. The assumption underlying this claim seems to be that introspection data are not (reliable) empirical data and are therefore inherently inferior to corpus data based on actual performance, or “real language” in the words of De Hoop (2016). The rejoinder to these reviews in Broekhuis (2016) aimed at showing that these reviewers were too optimistic about the potential of corpus linguistics: at least some of their suggested improvements and corrections of the proposed descriptions in SoD turned out to be problematic themselves on two counts: they were either based on “raw” corpus data or there were shortcomings in the linguistic annotation of the corpora used.

The rejection of the intuitionist approach is not universal and is probably related to the fact that the research interests of some reviewers are better served by corpus research. This can be seen from the fact that other reviewers explicitly praised the use of the intuitionist approach in SoD (cf. Smessaert 2014), and some corpus linguists consider acceptability judgments to be of similar value as corpus data; cf. the discussion between Schoenmakers & Van Hout (2024) and Weerman (2024). For example, Odijk (2020: 12) emphasizes that data from corpora and data collected in artificial experimental settings (including introspection) should all be considered empirical data: “all relevant evidence should be considered, and no form or source of evidence has a privileged status” (our translation). Although we agree with Odijk’s statement in principle, there are reasons to think that the intuitionist approach is in fact better suited for collecting synchronic syntactic data than corpus research (evidently not for linguistic research in e.g. diachronic syntax, language variation, and language acquisition; here, corpus research is indispensable). This seems to justify a more general comparison of the two methods of data collection. The following discussion is largely based on earlier discussions by Newmeyer (1983) and Kübler and Zinsmeister (2015), and has previously been published in a slightly different form as Broekhuis (2020).

[+] A. Competence and performance

The suitability of the two methods of data collection under discussion (i.e. the use of acceptability judgments on constructed examples or the mining of corpora) for SoD is closely related to the type of data needed to study synchronic syntax. Studying the internal organization of phrases/sentences is not an easy task, and progress has been slow but steady since the introduction of generative grammar in the mid-1950s. The complexity of syntactic structures is not immediately apparent from actual utterances but must be brought to light in an indirect way by applying a series of sophisticated and sometimes controversial syntactic tests. This is used in generative grammar as evidence for postulating a certain amount of tacit knowledge in the mind/brain of the language user, needed for the production and processing of such structures. Although relatively little is known about the form of syntactic knowledge in the mind/brain, a great deal is known about the presumably universal principles that determine what are (un)grammatical, i.e. (im)possible, syntactic structures. The set of universal principles determining syntactic structure is thought to be the innate part of the adult language user’s internal language or competence that determines the properties of his language and is also thought to be instrumental in language acquisition. If this is true, then the study of the syntax of a particular language L is not only of interest in itself, but also crucial for the study of the universal mechanisms underlying the human ability to produce, process, and acquire language.

Some linguists deny the existence of competence in the generative sense: a relatively stable knowledge state that emerges in the language-learning child at a relatively early age (say, before adolescence). For example, the usage-based theory:

“[…] incorporates the basic insight that usage has an effect on linguistic structure. It thus contrasts with the generative paradigm’s focus on competence to the exclusion of performance and rather looks to evidence from usage for the understanding of the cognitive organization of the grammar” (Bybee & Beckner, 2010).

However, even if the “basic insight that usage has an effect on linguistic structure” were beyond doubt, it would not mean that the universal principles determining syntactic structure discovered by generative research should be abandoned. The reason is that these principles are claimed to hold for any adult syntactic knowledge system about a language, regardless of whether such a system should be considered a stable or a more flexible knowledge state: if such systems were flexible, the universal principles would simply hold for all successive stages.

Given the empirical and explanatory success attributed to universal principles by generative linguists, it is clearly insufficient to dismiss these principles simply by pointing to the supposedly flexible nature of the grammatical system, as is done in Van de Velde (2014:89); one would at least expect an attempt at refuting these principles on empirical grounds. While there are many generative studies attempting this as part of the normal scientific practice of theory evaluation, we are not aware of any usage-based study that specifically aims to do so.

We want to argue that the postulation of such universal principles as an innate part of the knowledge state of language users is empirically grounded. This implies that the distinction between competence and performance is also meaningful for individual speakers: competence refers to the mechanisms within the individual that are necessary for the production and processing of the abstract linguistic objects that generative grammar refers to as phrases/sentences; performance refers to everything related to the use of utterances built on these abstract objects. Since it is not important for the present discussion what the substance of competence is, the reader can simply think of it in a fairly theory-neutral way as a set of hypotheses about the internal structure of phrases (including sentences), as such relevant to all linguists interested in syntax. For this reason, we present SoD as a tool for all linguists, regardless of their theoretical background.

[+] B. What kind of data is needed for competence research?

Competence enables the individual speaker to produce and process structured linguistic objects. Syntactic research aims at modelling the knowledge state of the adult speaker and by comparing the knowledge states of larger groups of native speakers of the same or different languages at uncovering the universal principles that enable the child to acquire natural language. Achieving these goals is a step-by-step process: based on the analysis of a necessarily limited number of syntactic phenomena in a limited number of languages, hypotheses are proposed about the universal principles to which linguistic objects are subject. These universal principles can exert an influence on the analysis of the original data set by making a selection among the competing analyses which can, in principle, explain it. They can also make predictions about other, unrelated syntactic phenomena as well as the syntactic behavior of languages not previously considered. This leads to a gradual expansion of empirical coverage, which may lead to revised or new hypotheses about the universal principles. By going through this process again and again, we hope to increase our insight into the universal properties of the competence of speakers of different languages.

The data considered in competence research can be selected rather arbitrarily: any set that the researcher considers suitable for fruitful investigation will do. However, there are two concepts that may help narrow down the data set for further research, namely acceptability and grammaticality; cf. Newmeyer (1983:§2.2). Grammaticality is the easier concept to define, since it is a theoretical term: a syntactic object is grammatical if the language user is predicted to be able to produce it, and ungrammatical if the language user is predicted not to be able to produce it. However, the term is still problematic because competence theory is constantly being updated, which implies that certain objects may be characterized as grammatical at one stage of the theory and as ungrammatical at another (and vice versa). The term acceptability is not a theoretical term, but refers to the feelings/intuitions that language users have about utterances. Of course, these feelings/intuitions depend on the speaker’s competence, but they can also be affected by many other factors, like the interpretability of the utterance, the usability of the utterance in the intended context, and language norms. Moreover, acceptability has little to do with actual use, since speakers may produce utterances that they find unacceptable upon closer inspection; this is a first reason why “raw” data collected by corpus research are less suitable for syntactic research.

The reason why the notions of acceptability and grammaticality are helpful in restricting the data set is that there appears to be a consensus that at least those utterances frequently produced in colloquial speech and judged acceptable by native speakers should be included in the set of grammatical sentences; likewise, that utterances that occur infrequently (or never) in colloquial speech and are judged unacceptable by native speakers should be included in the set of ungrammatical sentences. We will illustrate this by considering two factors that affect the acceptability of linguistic forms, grammaticality and interpretability. Figure 1 shows that these factors divide any data set into four, possibly empty, subsets (conveniently ignoring the many other factors that may affect acceptability judgments):

Figure 1: Two important factors affecting acceptability

The nature of subset 1 (grammatical and interpretable) will be unproblematic for most readers, as these are the forms that all native speakers will characterize as acceptable, i.e. as belonging to their language; we can therefore expect these forms to occur in the speech produced by native speakers. The same will hold for subset 4 (ungrammatical and uninterpretable), as these are the forms that all native speakers will characterize as unacceptable; we can expect these forms not to occur in the speech of native speakers (although they may occur in the speech of non-native speakers). Subset 2 (interpretable but ungrammatical) is a mixed bag that may include incomplete utterances, anacoluthon, speech errors of many kinds, and so on. Subset 3 (grammatical but uninterpretable) is more problematic, since the decision to place a particular form in this subset depends on the grammar adopted. For example, whether Chomsky’s famous example Colorless green ideas sleep furiously belongs to this subset depends on whether the grammar contains selection restrictions such as “the verb sleep takes an animate subject”: if it does, the sentence belongs to subset 4; if it does not, the sentence belongs to subset 3. There are many forms for which it can be difficult to decide where they belong. For example, native speakers of Dutch occasionally place pseudo-participles (adjectives with the appearance of a participle) such as geliefdpopular after the verbs in clause-final position, as in (9b).

9

a.	Marie zegt	dat	zijn boeken	geliefd	zijn.
	Marie says	that	his books	popular	are
	'Marie says that his books are popular.'

b.

?

Marie zegt dat zijn boeken zijn geliefd.

Some researchers may place (9b) in subset 2, while others may think that it belongs to subset 1, or that it illustrates an ongoing language change. The fact that the choice depends on the syntactic theory one supports has led to the methodological guideline that such examples should not be used to evaluate theories: the prevailing theory at the time should decide whether the example should be considered grammatical or not. Thus, the theory should preferably be based only on clear (i.e. undisputedly grammatical/ungrammatical) cases. This would avoid ad hoc additions to the theory or the elimination of otherwise well-motivated mechanisms from the theory on shaky grounds; however, we should note that there are incidental examples where the study of unclear cases has yielded important theoretical insights, such as the study of parasitic gaps). This means that competence research should use the clear cases from subset 1 and 4 for theory evaluation and exclude the cases from subset 2 and 3 for this purpose. So far, the set of clear cases (much of which remains to be explored) has provided sufficient material for new and challenging research; we think that there is reason to believe that it will continue to do so for a long time to come. We may hope that by the time the set of clear cases is exhausted, we will have enough information to be able to say something meaningful about the less clear cases. Unfortunately, corpus research is not going to help solve the problem of unclear cases either, since showing that a particular construction is attested in a corpus says little or nothing about its place in Figure 1. Therefore, the best we can do in our syntactic description is to distinguish between unquestionably grammatical and ungrammatical examples (i.e. the cases that should be accounted for by any syntactic theory) and the unclear cases (whose grammaticality should be determined by the prevailing theory of the time).

[+] C. The validity of introspection data

Critics of Syntax of Dutch disapprove of the use of introspection data because they are thought to be subjective and unreliable in the sense that acceptability judgments can be theoretically biased. There seems to be little ground for distrusting introspection data, as the few available studies suggest that the reliability of data sets based on introspection is not significantly lower than that of those based on corpora: where the two can be compared, convergence seems to be the rule rather than the exception; cf. Sprouse & Almeida 2010. One reason for this is that there are various checks and balances on the introspection data found in the literature, to the extent that they are not truly subjective but intersubjective: a researcher who is not entirely sure of his judgments will usually consult others for their judgments in order to sharpen his own. Moreover, data are usually peer-reviewed before publication and may be subject to post-publication scrutiny in subsequent work on the same topic. When in doubt, there is always the option of investigating the forms in question through more controlled research (such as large-scale intuition research). Faulty introspection data will usually be corrected before the research is published, or have little chance of surviving because they are simply forgotten over time as no one cites them.

It is important to keep in mind that the judgments in SoD are not about the grammaticality, but about the acceptability of the examples in question. Examples without diacritics are considered perfectly acceptable by native speakers, and one would probably (but not necessarily) want to mark them as grammatical in any theoretical account of the language, i.e. to place them in subset 1 in Figure 1. Examples preceded by an asterisk are considered unacceptable by native speakers, and one would probably (but again not necessarily) want to mark them as ungrammatical in any theoretical account of the language, i.e. to place them in subset 4 in Figure 1. The unclear cases in subsets 2 and 3 are usually preceded by a question mark, but since not all examples in these subsets are the same in that some are felt to be more deviant than others, we will occasionally make finer distinctions.

*Fully unacceptable form

*?Probably an unacceptable form

??May be an unacceptable or a disfavored form

?Disfavored form

(?)Slightly marked but probably acceptable form

no markingPerfectly acceptable form

The diacritics above thus express graded judgments within a given set of examples: examples without marking are clearly acceptable and those with an asterisk are clearly unacceptable, with various cases in between. Recall that the deviation of a given example does not necessarily mean that we should consider it ungrammatical; if there is good reason for assuming that the example is deviant for e.g. semantic or pragmatic reasons, or that it belongs not to the core but to the periphery of the grammar, the example is marked with a dollar sign $. If we are dealing with an example that is unacceptable in one reading, but acceptable in another, unintended reading, the number sign # is used. The percentage sign % is used when we know that there is speaker variation in judgments among speakers of standard Dutch. What does not seem very useful is to argue about the relative judgments of examples in different sets of examples. For example, the diacritical mark ?? does not refer to a degree of unacceptability that can be objectively assigned to examples of a certain kind: it is just a warning flag, signaling “this example is pretty bad and one should seriously consider that it might be ungrammatical”. The same goes for the diacritical mark (?), which means something like “this example is pretty good and should probably be considered grammatical”. Ultimately, it is the syntactic analysis that decides on grammaticality.

[+] D. How intuitionist researchers compile data sets for their research

This subsection will show that compiling a data set using the intuitionist approach is usually a fairly organized process. This will be illustrated using the data set compiled in Broekhuis and Den Dikken (2018). Their investigation starts with the simple observation that the PP tot aan1 het einde aan2 toe up to the end can be reduced by omitting aan1, aan2, and/or toe. By systematically going through all possible combinations, we are able to construct the data set in (10).

10

a.		tot	het einde
		to	the end

a'.		tot	aan1	het einde
		to	on	the end

b.		tot	het einde	toe
		to	the end	to

b'.		tot	aan1	het einde	toe
		to	on	the end	to

c.		tot	het einde	aan2	toe
		to	the end	on	to

		cʹ.	tot	aan1	het einde	aan2	toe
		to	on	the end	on	to

d.	*	tot	het einde	aan2
		to	the end	on

d'.	*	tot	aan1	het einde	aan2
		to	on	the end	on

The primeless examples show that aan2 and toe are both optional, but that toe must be present if aan2 is overtly realized. The contrast between the primeless and primed examples shows that aan1 is optional in all acceptable primeless cases. Applying the standard use of parentheses to indicate optionality, we can refer collectively to all and only the acceptable forms in (10) using the string tot (aan) het einde ((aan) toe).

The fact that there are subtle meaning differences between the acceptable forms in (10) is taken as evidence for the claim that the syntactic structures of these forms are different. However, they all have the same distribution; they can all be used only as adverbial phrases of time and place, as in the primeless examples in (11). This is arguably related to the fact that the preposition tot cannot be omitted in these examples; with toe present, the omission of tot leads to unacceptability, as indicated by the asterisk in the singly primed examples; with toe absent, the result is unacceptable or leads to the loss of the intended “up to” interpretation, as indicated by the hash sign in the doubly primed examples.

11

a.	Jan	heeft	tot	(aan)	het einde	((aan)	toe)	geslapen.	time
	Jan	has	to	on	the end	on	to	slept
	'Jan has slept up to the end (of e.g. the meeting).'

a'.

*

Jan heeft (aan) het einde (aan) toe geslapen.

a''.

Jan heeft *(#aan) het einde geslapen.

b.	Jan	heeft	het gras	tot	(aan)	het einde	((aan)	toe)	verwijderd.	spatial
	Jan	has	the grass	to	on	the end	on	to	removed
	'Jan has removed the grass up to the end (e.g. from the garden path)'

b'.

*

Jan heeft het gras (aan) het einde (aan) toe verwijderd.

b''.

Jan heeft het gras *(#aan) het einde verwijderd.

Given the independently established fact that the lexical heads of phrases must be overtly realized in non-ellipsis contexts (due to the principle of recoverability), the examples in (11) can be taken as evidence for the claim that tot is the head in all PP-constructions, which takes the remainder of the PPs as its complement, as in (12).

12

[PP tot [(aan) het einde ((aan) toe)]]

Another crucial observation for the syntactic analysis is given in (13): the string aan het einde in the acceptable primed examples in (10) can be replaced by the adverbial proforms daarthere and danthen.

13

a.

tot aan het einde

a'.

tot daar/dan

b.

tot aan het einde toe

b'.

tot daar/dan toe

c.

tot aan het einde aan toe

c'.

tot daar/dan aan toe

Given the independently established fact that such a substitution is only possible with phrases, the examples in (13) show that aan het einde is a PP embedded in the complement of tot. On the assumption that toe is a postposition, this leads to the syntactic structures in the primeless examples in (14), where the PPs are numbered according to their depth of embedding. The structures in the primed examples follow from the plausible assumption that the NP het einde is in a similar position as the PP aan het einde.

14

a.

[PP1 tot [PP2 aan het einde]]

a'.

[PP tot [NP het einde]]

b.

[PP1 tot [PP2 [PP3 aan het einde] toe]]

b'.

[PP1 tot [PP2 [NP het einde] toe]]

c.

[PP3 tot [PP2 [PP3 aan het einde] aan toe]]

c'.

[PP3 tot [PP2 [NP3 het einde] aan toe]]

To arrive at a full analysis of the paradigm, we still need to explain the occurrence of aan2 in the (c)-examples and the unacceptability of the (d)-examples in (10), but we refer the reader to the article for the full analysis, as the above is sufficient for our limited purpose of showing how the intuitionist researcher assembles the data set. The main finding is that the research question in a sense dictates how the data set should be expanded to find a proper answer; it is not easy to see how corpus research could be helpful in assembling the paradigms needed to answer (let alone formulate) the relevant question.

[+] E. What kind of data are found in corpora?

Since the rejection of introspection data is not based on actual research, the suggestion that linguistic research should be based on corpus research simply reflects a prejudice of some researchers in favor of corpus data. An important question is whether this prejudice is justified in terms of the suitability of the data found in corpora for synchronic syntactic research. The answer is clearly negative, as can be seen from the following three quotes from the careful discussion in Kübler and Zinsmeister’s (2015:17) handbook on corpus linguistics of “three shortcomings of corpus data that users need to be aware of”.

1. The competence-performance dichotomy: “[...] corpus data are always performance data in the sense that they are potentially distorted by external factors such as incomplete sentences, repetitions, and corrections in conceptionally spoken language, or text-genre related properties in conceptionally written language such as nominalizations in law-related texts.”

The competence researcher who relies on corpus data is thus forced to weed out the unclear data in order to meet the methodological requirement that competence theory be based on clear cases (cf. Subsection VB), which can only be done by appealing to ... intuition.

2. The completeness of the data set: “[...] corpora are always limited; even web-based mega corpora comprise only a limited number of sentences. This means that there is a certain chance that the web pages with the linguistically interesting variant might just have been omitted by chance when the corpus was created.”

The situation is even more serious, as Kübler and Zinsmeister (2015:166) add that linguistically annotated corpora cannot be used to search for rare examples. The competence researcher relying on corpus data is thus forced to supplement the data set with the missing cases, which can only be done by appealing to ... intuition.

3. The third quote concerns negative evidence: “[...] corpus data will never provide negative evidence in contrast to acceptability judgements – even if there are ungrammatical examples in corpora. This means that corpus data do not allow us to test the limits of what belongs to the language and what does not.”

Given that establishing “the limits of what belongs to the language and what does not” is the core business of the competence researcher, this quote suggests that the competence researcher who relies on corpus data can only do his job if he supplements the data set with the negative evidence, which can only be done by appealing to ... intuition.

[+] F. The relation between competence and corpus research

The conclusion to be drawn from the quotes from Kübler and Zinsmeister (2015:17) can only be that without an appeal to introspection data, there can be no competence research. Corpus data is inherently unsatisfactory for this kind of research: it can at best be used for adjusting intuitionist claims about clear data by showing that an allegedly unacceptable example occurs with a high frequency, or that an allegedly acceptable example does not occur at all, but even then, introspection is needed. On the other hand, it is clear that intuitionist research has had a major impact on corpus research in the guise of the tag sets used: “the most common type of linguistic annotation is part-of-speech annotation which can be applied automatically with high accuracy, and which is already very helpful for linguistic analysis despite its shallow character” (Kübler and Zinsmeister (2015:17). After all, part-of-speech annotation is based on the results of centuries of intuitionist linguistic research and thus transfers all the presumed subjectivity and unreliability of the intuitive approach to corpus research. An additional problem is that the shallow nature of the (partly outdated) annotation means that it requires a lot of creative thinking on the part of the corpus researcher to select the data relevant to specific research questions, and even then, the results are questionable when it comes to evaluating competence research; cf. Broekhuis’ (2016) remarks on the corpus study in Van Bergen and De Swart (2010). These problems can only be solved by developing more sophisticated annotation systems, which can only be provided by ... intuitionist research.

[+] VI. Organization of the material

SoD is divided into four main parts that focus on the four lexical categories: verbs, nouns, adjectives, and adpositions. Lexical categories have denotations and usually take arguments: nouns denote sets of entities, verbs denote states of affairs (activities, processes, etc.) in which these entities may be involved, adjectives denote properties of entities, and adpositions denote (temporal or spatial) relations between entities.

Of course, the four lexical categories do not exhaust the set of word classes; there are also functional categories like complementizers, articles, numerals, and quantifiers. Such elements normally play a role in phrases headed by the lexical categories: articles, numerals, and quantifiers are normally part of noun phrases, and complementizers are part of clauses (i.e. verbal phrases). For this reason, these functional elements will be discussed in relation to the lexical categories.

The first four main parts of SoD are entitled Xs and X phrases, where X stands for one of the lexical categories; each part discusses one lexical category and the ways in which it combines with other elements (e.g. arguments and functional categories) to form constituents. Furthermore, these four parts of SoD all have more or less the same overall organization, in the sense that they contain (one or more) chapters on the topics discussed in the subsections below.

[+] A. Characterization and classification

Each main part begins with an introductory chapter that provides a general characterization of the lexical category under discussion by describing some of its most salient properties. The reader will find here not only a brief overview of the syntactic properties of these lexical categories, but also relevant discussions of morphology (e.g. the inflection of verbs and adjectives) and semantics (e.g. the aspectual and tense properties of verbs). The introductory chapter will also discuss ways in which the lexical categories can be divided into smaller natural subclasses.

[+] B. Internal syntax

Most of the work deals with the internal structure of the projections of lexical categories/heads. These projections can be divided into two subdomains, the lexical domain and the functional domain. Taken together, the two domains are sometimes referred to as the extended projection of the lexical head in question; cf. Grimshaw (1991). We will see that there is good reason to assume that the lexical domain is embedded in the functional domain, as in (15), where LEX stands for the lexical heads V, N, A or P, and F stands for one or more functional heads like the article dethe or the complementizer datthat.

15

[functional ... f ... [lexical .... lex .....]]

The lexical domain of a lexical head is the part of its projection that affects its denotation. The denotation of a lexical head can be affected by its complements and its modifiers, as can be easily seen in the examples in (16).

16

a.		Jan leest.
		Jan reads

b.		Jan leest	een krant.
		Jan reads	a newspaper

c.		Jan leest	nauwkeurig.
		Jan reads	carefully

The phrase een krant lezento read a newspaper in (16b) denotes a smaller set of states of affairs than the phrase lezento read in (16a), and so does the phrase nauwkeurig lezento read carefully in (16c). The elements in the functional domain do not affect the denotation of the lexical head, but provide different kinds of additional information.

[+] 1. The lexical domain I: Argument structure

Lexical heads function as predicates, which means that they normally take arguments, i.e. they enter into so-called thematic relations with entities that they semantically imply. For example, intransitive verbs normally take an agent as their subject; transitive verbs normally take an agent and a theme, which are syntactically realized as their subject and their object, respectively; and verbs like wachtento wait normally take an agent, which is realized as their subject, and a theme, which is realized as a prepositional complement.

17

a.		JanAgent	lacht.	intransitive verb
		Jan	laughs

b.		JanAgent	weet	een oplossingTheme.	transitive verb
		Jan	knows	a solution

c.		JanAgent	wacht	op de postbodeTheme.	verb with PP-complement
		Jan	waits	for the postman

Although this is often less obvious with nouns, adjectives, and prepositions, it is possible to describe examples like those in (18) in the same terms. The bracketed phrases can be seen as predicates which are predicated of the noun phrase Jan, which we can therefore call their logical subject (we use small caps to distinguish this term from the term nominative subject of the clause). Furthermore, the examples in (18) show (i) that the noun vriend can be combined with a PP-complement that provides further information needed to identify the person with whom the subject Jan is in a relationship of friendship, (ii) that the adjective trotsproud can optionally take a PP-complement that identifies the subject matter of which the subject Jan is proud, and (iii) that the preposition onderunder can take a nominal complement that refers to the location of its subject Jan.

18

a.		Jan is [een vriend	van Peter].
		Jan is a friend	of Peter

b.		Jan is [trots	op zijn dochter].
		Jan is proud	of his daughter

c.		Marie stopt	Jan [onder	de dekens].
		Marie puts	Jan under	the blankets

The fact that the italicized phrases are complements is somewhat obscured by the fact that there are certain contexts in which they can easily be omitted (e.g. when they express information that the addressee can infer from the linguistic or non-linguistic context). However, the fact that they are always semantically implied shows that they are semantically selected by the lexical head.

[+] 2. The lexical domain II: Modification

The projection consisting of a lexical head and its arguments can be modified in various ways. For instance, the examples in (19) show that the projection of the verb wachtento wait can be modified by different adverbial phrases: they indicate when and where the state of affairs of Jan waiting for his father took place.

19

a.	Jan wachtte	gisteren	op zijn vader.	time
	Jan waited	yesterday	for his father
	'Jan waited for his father yesterday.'

b.	Jan wacht	op zijn vader	bij het station.	place
	Jan waits	for his father	at the station
	'Jan is waiting for his father at the station.'

The examples in (20) show that the lexical projections of nouns, adjectives and prepositions can also be modified; the modifiers are italicized.

20

a.		Jan is een vroegere vriend	van Peter.
		Jan is a former friend	of Peter

b.		Jan is erg trots	op zijn dochter.
		Jan is very proud	of his daughter

c.		Marie stopt	Jan diep	onder de dekens.
		Marie puts	Jan deep	under the blankets

[+] 3. The functional domain

Projections of the lexical heads may contain various elements that are not arguments or modifiers and thus do not affect the denotation of the head noun. Such elements simply provide additional information about the denotation. Examples of such functional categories are articles, numerals, and quantifiers, which we find in the nominal phrases in (21).

21

a.		Jan is de/een	vroegere	vriend	van Peter.	article
		Jan is the/a	former	friend	of Peter

b.		Peter heeft	twee/veel	goede vrienden.	numeral/quantifier
		Jan has	two/many	good friends

That functional categories provide additional information about the denotation of the lexical domain can be easily demonstrated by these examples. For example, the use of the definite article de in (21a) expresses that the (contextually determined) set denoted by the phrase vroegere vriend van Peter has a single member; the use of the indefinite article een, on the other hand, is compatible with there being more members in this set. Similarly, the use of the numeral tweetwo in (21b) expresses that there are exactly two members in the set, and the quantifier veelmany expresses that the set is large.

The functional elements found in verbal projections are tense (generally expressed as inflection on the finite verb) and complementizers. The semantic contribution of tense is that it provides information about the temporal location of the eventuality expressed by the lexical projection. The contribution of the complementizers in (22) concerns the illocutionary type of the expression: while dat in (22a) introduces embedded declarative clauses, ofwhether introduces embedded interrogative clauses.

22

a.	Jan zegt [dat Marie ziek	is].	declarative
	Jan says that Marie ill	is
	'Jan says that Marie is ill.'

b.	Jan vroeg [of	Marie ziek	is].	interrogative
	Jan asked whether	Marie ill	is
	'Jan asked whether Marie is ill.'

Since functional categories provide information about the lexical domain, they are often assumed to be part of a functional domain dominating (i.e. built on top of) the lexical domain; cf. (15) above. This functional domain is generally assumed to have a complex structure and to be highly relevant to word order: functional heads are assumed to project, just like lexical heads, thereby creating so-called specifier positions that can be used as landing sites for movement. A familiar case is wh-movement, which is assumed to target the specifier position in the projection of the complementizer; this explains why, in colloquial Dutch, wh-movement can result in the interrogative phrase being placed to the immediate left of the complementizer ofwhether. This is shown in (23b), where the trace t indicates the original position of the moved wh-element and the index i is just a convenient way to indicate that the two positions are related (i.e. to indicate that the wh-phrase is the direct object of the embedded clause). The discussion of word-order phenomena will therefore play a prominent role in the chapters devoted to the functional domain.

23

a.	Jan zegt	[dat	Marie een boek van Louis Couperus	gelezen	heeft].
	Jan says	that	Marie a book by Louis Couperus	read	has
	'Jan said that Marie has read a book by Louis Couperus.'

b.	Jan vroeg	[wati	(of)	Marie ti	gelezen	heeft].
	Jan asked	what	whether	Marie	read	has
	'Jan asked what Marie has read.'

While (relatively) much is known about the functional domain of verbal and nominal projections, research on the functional domain of adjectival and prepositional phrases is still in its infancy. For this reason, the reader will find separate chapters on this topic only in the parts on verbs and nouns.

[+] C. External syntax

The discussions of the four lexical categories will be concluded with a look at the external syntax of their projections, i.e. an examination of how such projections can be used in larger structures. For example, adjectives can be used as complementives (predicative complements of verbs), as attributive modifiers of noun phrases, and also as adverbial modifiers of verb phrases.

24

a.		Die auto	is snel.	complementive use
		that car	is fast

b.		een	snelle	auto	attributive use
		a	fast	car

c.	De auto reed	snel	weg.	adverbial use
	the car drove	quickly	away
	'The car drove away quickly.'

Since the external syntax of the adjectival phrases in (24) can in principle also be described as the internal syntax of the verbal/nominal projections containing these phrases, this may lead to some redundancy. For example, complementives are discussed in Section V2.2 as part of the internal syntax of the verbal projection, but also in Sections N21.2, A28, and P35.2 as part of the external syntax of nominal, adjectival, and adpositional phrases. We have allowed this redundancy because it makes it possible to simplify the discussion of the internal syntax of verb phrases in V2.2: nominal, adjectival, and adpositional complementives behave differently in a number of ways, and discussing them all in Section V2.2 would have obscured the discussion of the properties of complementives in general. Of course, cross-references will inform the reader when a particular topic is discussed from the perspective of both internal and external syntax.

[+] VII. History of the project

The idea for the SoD project was initiated by Henk van Riemsdijk in 1992. A pilot study was carried out at Tilburg University in 1993, and a steering committee was set up after a meeting with interested parties from Dutch and Flemish institutions. However, it took another five years before a substantial grant was received from the Netherlands Organization of Scientific Research (NWO) in 1998. In 2010, the SoD project became part of a larger project initiated by Hans Bennis and Geert Booij, called the Language Portal, which provides information on all aspects of the grammar (i.e. phonology, morphology and syntax) of three languages (Dutch, Frisian and Afrikaans). The first edition of SoD was published in 8 volumes by Amsterdam University Press in 2012-2019 as part of the linguistic series Comprehensive Grammar Resources, both in hardcover and as epubs in open access; SoD is also accessible via the Language Portal (https://taalportaal.org) on the internet.

1. H. Broekhuis, N. Corver and R. Vos: Verbs and verb phrases, Vol I-III

2. H. Broekhuis, E. Keizer and M. den Dikken: Noun and noun phrases, volume I-II

3. H. Broekhuis: Adjectives and adjective phrases

4. H. Broekhuis: Adpositions and adpositional phrases

5. H. Broekhuis and N. Corver: Coordination and ellipsis

The present, thoroughly revised, expanded and updated second edition of SoD was prepared by Hans Broekhuis during 2022-2025, under the supervision of Norbert Corver. To emphasize the unity of the work, it is now published under their general editorship to emphasize the unity of the work. It is again available in hardcover, as a free epub, and as part of the Language Portal (https://taalportaal.org).

[+] VIII. Acknowledgments

Over the years, many Dutch linguists have commented on parts of the work presented here, and because we do not want to tire the reader with long lists of names, we simply thank the entire Dutch linguistic community, which also saves us the embarrassment of forgetting certain names. Nevertheless, we would like to make special mention of a number of people and institutions. The pilot study for the project, conducted from November 1993 to September 1994, was made possible by a grant from the Center for Language Studies and the University of Tilburg. This pilot study resulted in a project proposal, which was approved by The Netherlands Organisation for Scientific Research (NWO) in 1998 and enabled us to produce the main body of parts 2-4 mentioned in the previous subsection between May 1998 and May 2001. We would like to thank the members of the steering committee (chaired by Henk van Riemsdijk), which consisted of Hans Bennis, Martin Everaert, Liliane Haegeman, Anneke Neijt and Georges de Schutter, who all provided us with comments on substantial parts of the resulting manuscripts. The works in 2-4 could be prepared for publication between April 2008 and October 2010 thanks to a grant from the Truus und Gerrit van Riemsdijk-Stiftung: we are grateful to Carole Boster for copy-editing these works. Since November 2010, work on the SoD has continued at the Meertens Institute (KNAW) in Amsterdam as part of the Language Portal Dutch/Frisian project, again funded by The Netherlands Organisation for Scientific Research (NWO), which has enabled us to write the missing part on verbs and verb phrases. The Language Portal project was successfully completed with the presentation of the Language Portal in 2016. Since then, the Meertens Institute and Utrecht University have enabled Hans Broekhuis and Norbert Corver to continue their work on the SoD; this resulted in the additional volume on coordination and ellipsis, published in 2019, and now also in the publication of the second, completely revised edition of SoD. We are grateful for the financial and moral support of these institutions and thank them for the opportunity they have given us to develop the SoD. Special mention should be made of Frits Beukema, who meticulously proofread and edited the entire second edition and also provided us with valuable comments on the content of SoD; the remaining stylistic, grammatical, typographical, and formatting errors are due to changes made in the final stages of the revision process.

December 2025

Hans Broekhuis

Norbert Corver

Co-authors and editors of the Syntax of Dutch

References

Adger, David & Graeme Trousdale. 2007. Variation in English syntax: theoretical implications. English Language and Linguistics 11: 261-278.

Barbiers, Sjef et al. 2005. Syntactic atlas of the Dutch dialects, volume I. Amsterdam: Amsterdam University Press.

Broekhuis, Hans. 2020. Why I will not become a corpus linguist. The use of introspection data and corpus data in synchronic syntactic research. Nederlandse Taalkunde/Dutch Linguistics 25: 181-192.

Cornips, Leonie. 1994. Syntactische variatie in het Algemeen Nederlands van Heerlen. University of Amsterdam: PhD thesis.

Grimshaw, Jane. 1991. Extended projections. Ms. Brandeis University, Waltham, MA.

Haegeman, Liliane. 1992. Theory and description in generative syntax: a case study in West Flemish. Cambridge: Cambridge University Press.

Haeseryn, Walter et al. 1997. Algemene Nederlandse spraakkunst, 2nd, revised edition. Groningen: Nijhoff.

Hoekstra, Eric & Caroline Smit (eds). 1997. Vervoegde voegwoorden.Cahiers van het P.J. Meertens-Instituut 9. Amsterdam: P.J. Meertens-Instituut.

Milroy, James. 2001. Language ideologies and the consequences of standardization. Journal of Sociolinguistics 5: 530-555.

Schoenmakers, Gert-Jan & Roeland Van Hout. 2024. Consistency and variability in acceptability judgments from naive native speakers & Two roads diverged in a linguistic wood, and we have to travel both Nederlandse Taalkunde/Dutch Linguistics 29: 49-75; 82-88.

Smessaert, Hans. 2014. Telwoorden en kwantoren: syntaxis versus semantiek. Nederlandse Taalkunde/Dutch Linguistics 19: 77-86.

Van der Wouden, Ton et al. 2016. Het Taalportaal. Nederlandse Taalkunde//Dutch Linguistics 21: 157-167.

Van Haeringen, C.B. 1939. Congruerende voegwoorden. Tijdschrift voor Nederlandse Taal en Letterkunde 58: 161-176.

Weerman, Fred. 2024. Back to the future. Nederlandse Taalkunde/Dutch Linguistics 29: 76-81.

Wikipedia.org. 2024. List of countries and territories where Afrikaans or Dutch are official languages. en.wikipedia.org/w/index.php?title=List_of_countries_and_ territories_where_Afrikaans_or_Dutch_are_official_languages&oldid=1234155649 (accessed July 23, 2024).

Zwart, Jan-Wouter. 1997. Morphosyntax of verb movement. A minimalist approach to the syntax of Dutch. Dordrecht: Kluwer Academic Publishers.