Anna Szabolcsi

Anna Szabolcsi
Strong vs. Weak Islands

This overview ("case") was written for The Syntax Companion (http://www-uilots.let.uu.nl/syncom/ ) in 1998 and updated in 2002. I wish to thank Marcel den Dikken and Jenny Doetjes for comments. A shorter version has appeared as Anna Szabolcsi and Marcel den Dikken, "Islands". GLOT International 4/6 (1999), reprinted in Lisa Cheng and Rint Sybesma, eds., The Second GLOT State-of-the-Article Book, Mouton de Gruyter (2002).

Table of Contents

1. Introduction

2. Absolute versus selective islands

3. Strong islands

3.1 Classical strong islands

3.2 Types of explanation

3.3 Ways of salvaging strong island violations

3.4 More on Subjacency

3.4.1 Cinque 1990

3.4.2 Some intriguing similarities between SIs and WIs

3.4.3 Postal 1998

4. Weak Islands: A preview

5. (A) What extractions are sensitive to WIs?

5.1 (A1) Arguments versus adjuncts

5.2 (A2) Referential versus non-referential / Existential presupposition

5.3 (A3) Re-evaluating the role of D-linking

5.4 (A4) Individual versus non-individual, and how many-phrases

5.5 (A5) Functional readings and event-related readings

5.6 (A6) Individuals: is it being of type e that matters?

5.7 (A7) Split constructions

5.8 (A8) Negative polarity (NPI) licensing

5.9 (A9) Cross-sentential anaphora

5.10 (A10) Elements of anti-pronominal contexts

6. (B) What contexts constitute weak islands?

6.1 (B1) Wh-islands

6.2 (B2) Negatives and other affective operators

6.3 (B3) Response stance (e.g., negative) and non-stance (e.g., factive), in contrast to volunteered stance predicates

6.4 (B4) Extraposition islands

6.5 (B5) VP-adverbs

6.6 (B6) Scope islands

7. Theories

7.1 ECP and Subjacency for (A1)--(B1)

7.2 Relativized Minimality for (A2, A3)--(B1, B2, B3%, B4, B5%)

7.3 Monotonicity for (A3, A4)--(B1, B2, B3%, B5%)

7.4 From Relativized Minimality to a Scope Theory

7.4.1 A major problem with Relativized Minimality

7.4.2 Scopal intervention

7.4.3 Prospects for a revision of Relativized Minimality

7.5 The Scope Theory

7.5.1 An algebraic approach to scopal intervention, for (A3,A4,A5,A6,A7%)--(B1,B2,B3,B5,B6)

7.5.2 A Dynamic Semantic approach to scopal intervention (A4,A5,A7,A8,A9)--(B1,B2,B3,B5,B6)

7.5.3 How do the two scope theories compare?

8. Conclusions

Bibliography

1. Introduction

The fundamental work on islands is Ross 1967/1986. Ross distinguished between transformations that move a constituent and leave behind a pro-form (i.e., resumptive pronoun) and ones that "chop" a constituent and move it without leaving anything behind. An example of the former is left dislocation (that is, on Ross's analysis, left dislocation is not base generated). An example of chopping is topicalization. Of the two, only chopping operations are constrained by islands. Islands will be enclosed in angled brackets:

(1)

This kid, I must call <the teacher who punished him>.

(2)

*This kid, I must call <the teacher who punished _>.

Diagnosing islands has become more complex since Ross, because it is assumed that there exist invisible pro-forms (empty resumptive pronouns), in addition to overt ones. I will use "gap" as a pretheoretical term that covers both traces and empty pronouns.

Islands come in two main varieties: "weak" and "strong". Note that the weak/strong qualification does not correspond to degrees of ungrammaticality.

(3)

Strong islands:
No extraction is allowed (constructions with an appropriate resumptive pronoun may be allowed).

(4)

Weak islands (WI):
Some phrases can extract, others cannot extract.

The focus of this chapter is weak islands (WI). Strong islands are considered only to set the stage for the discussion of WIs.

Up until the late Eighties, nothing much beyond wh-islands had been thought to be a weak (selective) island. Beginning with Relativized Minimality, however, an ever growing range of weak islands have been recognized. Thus, theories of weak islands have mushroomed, each coming with a significant set of new data and important new connections to other domains. The review of weak islands will emphasize the correspondence between data sets and theories. Less attention will be given to proposals that primarily recast the theoretical account of some narrow range of data.

 

2. Absolute versus selective islands

Although the notion of strong versus weak islands as defined in (3) and (4) seems like a simple descriptive one, it is not. Consider the following examples involving wh-islands and adjunct islands:

(5)

a.

*About which topic did John ask <who was talking _>?

 

b.

*?Which topic did John ask <who was talking about _>?

 

c.

*How did John ask <who behaved _>?

(6)

a.

*About which topic did you leave <because Mary talked _>?

 

b.

*Which topic did you leave <because Mary talked about _>?

 

c.

*How did you leave <because Mary behaved _>?

These two sets might suggest that both wh-clauses and adjunct clauses are strong islands: the extraction of three different types of phrases was attempted and failed. On the other hand, it is easy to construct other examples in which some, but not other, phrases can extract. The percentage sign % indicates that there is variation among the speakers of English:

(7)

a.

%About which topic did John ask <whether to talk _>?

 

b.

Which topic did John ask <whether to talk about _>?

 

c.

*How did John ask <whether to behave _>?

(8)

a.

*About which topic did you leave <without talking _>?

 

b.

Which topic did you leave <without talking about _>?

 

c.

*How did you leave <without behaving _>?

The difference between the two sets involves Tense in the first place. While tensed clauses are themselves not islands, cf.

(9)

Which topic do you think that I talked about _?

we see that the presence of Tense may considerably strengthen other islands. This fact must be systematically controlled for in the evaluation of various islands. (Chomsky 1986 briefly touches upon the role of Tense. Cinque 1990 uses it as an important test, and Manzini 1992 proposes a theory of its behavior.)

Thus, the specific constructions in (5)-(6) may preclude the presence of any gaps, but this in itself does not mean that wh-complements and adjuncts per se are totally impenetrable.

Shall we conclude now that wh-clauses and adjuncts per se are merely weak islands? Note a fine distinction: while adverb extraction is blocked in both, the infinitival whether-clause allows the extraction of both DPs and PPs (at least for some speakers), while the gerundival adjunct clause allows only DP-extraction.

The interpretation of this situation is a theoretical, rather than descriptive, matter. Making the implicit assumption that the PP versus DP facts are highly consistent in each island type, Cinque 1990 argues that this difference warrants radically different analyses, because a DP gap may be an empty resumptive pronoun but a PP gap cannot be. Wh-islands are indeed weak (selective); adjunct-islands on the other hand are strong (absolute), and the DP-gap that they may contain is an empty resumptive pronoun.

Cinque's distinction will serve as a basic organizing principle in this chapter, because it is typically taken for granted in the weak island literature: theories of weak islands do not try to account for the islands that Cinque classifies as strong.

(10)

Cinque's diagnostic of strong versus weak islands:
Among the domains that do not allow all standard extractions, those that allow a PP-gap are weak islands, and those that can at best contain a DP-gap are strong islands (and their DP-gap is an empty pronoun).

This is not the only possible way to cut the cake. For instance, Postal 1997,1998, who uses the terms locked (absolute) versus unlocked (selective) island, does not attribute this much significance to PP versus DP. Following Ross, he assumes that extraction from unlocked islands always involves empty resumptive pronouns and subsumes Cinque's weak islands under this rubric.

A note on the data. Question formation, relativization, and topicalization are all instances of wh-movement. Because the bulk of the literature to be surveyed focuses on examples involving question formation, this chapter follows the same practice, unless the discussion of a particular work requires us to do otherwise.

 

3. Strong islands

3.1 Classical strong islands

While recent literature has identified a host of weak islands, the classical inventory, with the exception of certain wh-islands, consists of strong ones. What follows is a list, sometimes interrupted by comments.

Complex DP (with relative clause):

(11)

*Which kid must you call <the teacher who punished _>?

 

*Where must you call <the teacher who put the book _>?

(12)

*This kid, you must call <the teacher who punished _>.

 

*On the table, you must call <the teacher who put the book _>.

(13)

*What size shoes did you call <the man who wears _>?

(14)

*How did you call <the man who behaved _>?

Complex DP (with complement clause):

(15)

*Which man did you hear <the rumor that my dog bit _>?

 

*Where did you hear <the rumor that I put the book _>?

(16)

*This man, I heard <the rumor that my dog bit _>.

 

*On the table, I heard <the rumor that you put the book _>.

(17)

*What size shoes did you hear <the rumor that I wear _>?

(18)

*How did you hear <the rumor that I behaved _>?

But just as it was possible to arrange for a DP-gap in adjuncts by removing Tense, the same is possible with Complex DPs if the definiteness of the "head" is removed. (Pollard & Sag (1994, vol.2) assume, for this reason, that there is no need for a Complex DP constraint.) PP-gaps remain practically excluded.

(19)

a.

??What the police arrested <everyone who saw _> was this video. (Postal 1998)

 

b.

*I know in which building the police arrested <everyone who lives _>.

(20)

a.

??Which man did they consider <rumors that Bob would betray _>? (Rothstein 1988)

 

b.

*About which man did they consider <rumors that Bob would talk _>?

In distinction to Tense, Definiteness may create an island by itself (Fiengo and Higginbotham 1981, Manzini 1992):

Definites:

(21)

*Which man did you discover <Mary's poem about _>?

(22)

??Which man did you discover <the poem about _>?

(23)

Which man did you discover a poem about _?

See also the Case of Extraction from NP and complex NPs.

Subjects:

(24)

 

*Which man did <his visiting _> shock you?

(25)

 

*Which book do you believe <the first chapter of _> to be full of lies?

(26)

 

*Which man does <everyone who knows _> admire his sincerity?

Adjuncts (see the discussion of Tense above):

(27)

*Which topic did you leave <because Mary talked about _>?

(28)

a.

Which topic did you leave <without talking about _>?

 

b.

*About which topic did you leave <without talking _>?

Authier 1991 argued that adjuncts can contain CP gaps besides DP gaps; for an alternative analysis, see Postal 1994:

(29)

We suggest _ to our employees, <without actually requiring _ of them>, that they wear a tie.

Coordinate structures, unless extraction is across-the-board:

(30)

*Which man did you invite <Mary and _>?

(31)

*Which man did you invite <Mary and a friend of _>?

(32)

Which man did you invite <a friend of _ or a brother of _>?

However, DP-gaps are acceptable under various circumstances (see Postal 1998 for detailed discussion):

(33)

This is the beer that I <bought _, loaded _ into the car, went home, and then fell asleep>. (Jacobson 1996)

See also the Case of ATB phenomena. <<#10>>

The status of tensed constituent wh-complements seems to vary cross-linguistically. Some speakers of English reject extractions from them entirely, whereas others find them tolerable. Likewise, they are rejected by speakers of Dutch, whereas they are acceptable in Scandinavian languages (see Engdahl and Ejerhed 1982) and in Hungarian. (Hungarian has object pro-drop, but only for singulars, thus we know that the gap in (c) is not a dropped pronoun.) This variation is not well-understood (though see Bayer 1984, 1996:6.6.4).

Tensed constituent wh-complements (in some languages/dialects)

(34)

a.

%Which man did John ask <who invited>?

 

 

b.

*Welke

man

heb

jij

je

afgevraagd

<wie

- gezien

heeft>?

(Dutch)

 

 

which

man

have

you

self

asked

who

seen

has

 

 

 

c.

Mely

fiúkat

találgattad,

hogy

<ki látta _>?

(Hungarian)

 

 

which

boys

were-guessing-you

that

who saw

 

See also the Case of Extraction from wh-islands. <<#??>>

Left branches:
As (35) shows, it is often difficult to extract from the left branch of a structure (Ross 1967/1986):

(35)

*Which (man's) did you see <_ picture>?

But comparable examples are perfect in other languages:

(36)

Combien

as-tu

lu

<-

de

livres>?

(French)

 

how many

have-you

read

of

books

 

(37)

Cuius

legis

<_

librum>?

(Latin)

 

whose

read-you

 

book

 

 

(38)

Kinek

olvastad

<_

a könyvét>?

(Hungarian)

 

who-dative

read-you

 

the book+agr

Whether all the counterexamples are well-understood, the Left Branch condition seems like a less plausible generalization than the others. See Corver (1990).

Finally, some islands constrain left-extraction but not Right Node Raising, indicating that the latter may not be an extraction (Bouma 1987, Postal 1998):

(39)

I patted <the dog that bit _> and hit <the dog that adored _> the man who stole my beloved cat.

We will not discuss the relation between wh-in-situ and islands, see the Case of Wh-in-situ.

 

3.2 Types of explanation

The standard explanation of the islandhood of Complex DPs, Subjects, Wh-complements, and Left Branches is in terms of Subjacency: they are out because movement crosses more than one bounding node (barrier), see 3.4 and 7.1.

Adjunct islands are standardly explained by Huang's 1982 Condition on Extraction Domains, hence by the Empty Category Principle: an extraction domain needs to be properly governed. Likewise, Pesetsky 1982 subsumes the Coordinate Structure Constraint under the path containment version of the ECP, see 7.1..

Manzini 1992 is the only attempt to unify the effect of Definiteness and Tense with other locality phenomena: D and T block dependencies based on Case-addresses, which, in her theory, DPs otherwise rely on to escape from islands.

 

3.3 Ways of salvaging strong island violations

As has been mentioned, a strong island violation can sometimes be salvaged by resorting to the resumptive pronoun strategy (whether the pronoun is overt or empty).

Second, the gap in the island may be parasitic on a well-behaved trace of movement, e.g.:

(40)

Which papers did you file _ <without reading _>?

(41)

He is a man who <everyone who knows _> admires _.

See Engdahl 1983, Kayne 1983, Chomsky 1982,1986, Cinque 1990, among others.

Third, the island itself may be pied piped, overtly or at Logical Form:

(42)

a.

*Whose did you visit <_ brother's sister>?

 

b.

<Whose brother's sister> did you visit?

Pied piping at LF has been proposed for apparent violations of Complex DPs in wh-in-situ languages in Pesetsky 1987, and for apparent violations of a single adjunct island in Cinque 1990, for instance.

These matters are not discussed here further; see the Cases of Resumptive pronouns, Parasitic gaps, and Pied piping.

 

3.4 More on Subjacency

Subjacency is classically understood as a condition on movement and requires that movement not cross more than one bounding node. Bounding nodes were originally defined as a list: NP and S (=DP and IP) in English (Chomsky 1973, 1977), NP and S' (=DP and CP) in Italian (Rizzi 1978).

Chomsky 1986 redefines bounding nodes as barriers. An XP is a blocking category for α iff it is not theta-marked by a sister lexical head and dominates α. β is a barrier for α iff (i) β is a blocking category for α but not IP, or (ii) β is the first XP that dominates a blocking category for α. Chain-formation (which takes the place of movement) requires 1-subjacency: no more than one barrier may be crossed. (For an excellent introduction to "Barriers" as well as other theories of locality, see Roberts 1997.)

Adopting Rizzi's 1990 Relativized Minimality (the pertinent aspects of which will be discussed when we turn to Weak Islands), Cinque 1990 proposes two important changes in the understanding of strong islands. (i) Strong islands constrain binding chains, not necessarily movement, and (ii) Not only government but also binding chains require 0-subjacency (no barrier may be crossed).

 

3.4.1 Cinque 1990

We take a closer look at some aspects of Cinque's proposal, because they are fundamental in defining the division of labor between theories of strong and weak islands, as assumed in most of the literature.

Following Obenauer (1984/85), Cinque observes that both parasitic gaps and gaps inside strong islands are restricted to the category DP and takes this to indicate that these gaps are not variables, but A-bar bound empty pronominals: pro. This pro is unmoved in syntax but must move at LF (like some kind of abstract wh-phrase). But instead of moving on its own, it pied pipes the minimal island it is contained in. This accounts for the fact that, if all goes well, one strong island can be evaded.

Now recall the contrast between tensed and gerundival adjunct islands:

(43)

*Which topic did you leave <because Mary talked about _>?

(44)

Which topic did you leave <without talking about _>?

Bona fide cases of pied piping are also constrained by Tense (Nanni and Stillings 1978):

(45)

*They bought a car that their son might drive which was a surprise to them.

 

'they bought a car and the fact that their son might drive it was a surprise to them'

(46)

The elegant parties, to be admitted to one of which was a privilege, had usually been held at Delmonico's.

Cinque stipulates that Tensed Inflection weakly blocks the upward percolation of features. Thus, even one island cannot be pied piped by pro if it is tensed.

Cinque predicts that at most one island can be evaded: strong islands cannot be compounded:

(47)

*the book that we left Russia <without being arrested <after distributing _>>

The reason is that the movement of the pied piped domain continues to be sensitive to islands (can at best be parasitic on overt movement). Postal (1998) notes, though, that an adjunct island can be compounded with a complex DP (see further in 3.4.3):

(48)

It was Lucille that Mike went home <without criticizing <anyone who defended _>>.

(49)

It was Lucille that Mike criticized <everyone who went home <without defending _>>.

To summarize, the observation that certain islands only contain DP gaps leads Cinque to a theory according to which these islands are strong, and the DP gap is not a trace of movement but A-bar bound pro. This contrasts with Weak Islands, which may also contain PP gaps. The latter must be traces since, according to Cinque, human languages generally lack resumptive pronouns of category PP. The behavior of weak islands is explained by considerations that do not apply to strong islands, in Cinque's work and in the many other proposals this chapter will review. Thus, the DP versus PP distinction carries a great burden in deciding which data are to be accounted for by each theory.

 

3.4.2 Some intriguing similarities between SIs and WIs

While the division of labor so determined has proven very useful in the literature, it may be worth pointing out some intriguing similarities between the phrases that may or may not escape from the two types of islands.

First, there is dialectal variation among speakers of English regarding the acceptability of PP-extraction out of wh-islands (one of the weak islands). Unfortunately, no systematic empirical study of this variation exists, to my knowledge, wherefore it is difficult to assess its significance for Cinque's theory.

Second, consider the following contrast:

Wh-phrase associated with pro in a strong island:

(50)

 

Which politician did you go to England <after meeting _>?

(51)

 

*How much water did you make the pasta <after boiling _>?

Wh-phrase associated with a variable in a weak island:

(52)

a.

Which politician did John ask <whether to worry _>?

 

b.

*About which politician did John ask <whether to worry _>?

(53)

a.

*How much gravy did John ask <whether to cook _>?

 

b.

*With how much gravy did John ask <whether to cook _>?

In both cases, the wh-phrase needs to be referential in some sense. But the reasons are different. In the case of strong islands, the wh-phrase needs to be of the kind that resumptive pronouns tend to associate with; in the case of weak islands, it has to carry a referential index and corefer with its variable (see in 7.2). Cinque 1990 characterizes the suitable wh-phrases rather similarly in the two cases, but neither this book nor any subsequent work known to me addresses the question whether the two requirements are exactly identical. Cinque (pers. comm., 1998) has kindly suggested the following contrast between a weak and a strong island,

(54)

?Quanti pazienti volevi sapere <se ogni dottore avesse potuto visitare _>?

 

'How many patients did you want to know whether each doctor had been able to examine?'

(55)

*Quanti pazienti vuoi incontrare <ogni dottore che abbia visitato _>?

 

'How many patients do you want to meet each doctor who examined?'

but the exact generalization remains an open question. One proposal that attempts to unify weak and strong islands is Starke 2001.

 

3.4.3 Postal 1998

Perlmutter 1972 proposed that all extractions leave invisible resumptive pronouns, whether they are from islands or not. Obenauer (1984/85) proposed that all extractions from islands involve empty resumptive pronouns. Cinque (1990) developed a specific resumptive pronoun theory of apparent strong island violations. Postal 1998 proposes to generalize Cinque's idea to all islands and introduces many novel pieces of data.

Postal 1997 asks the fundamental question whether constituents are by default islands or by default not islands. The standard answer is that they are by default not, whence theories attempt to characterize islands as a natural class. In contrast, he argues that non-islands form a small natural class, characterizable in Arc-Pair Grammar.

Postal 1998 distinguishes locked (absolute) and unlocked (selective) islands. Locked islands permit only left dislocation (or operations that are like LD in the relevant respect). Unlocked islands permit some gaps and comprise both strong and weak islands in Cinque's terms: interrogative clauses, complements of factive verbs like regret, rationale clauses (go home (in order) to ...), clausal complements of certain nouns (formulate a plan to prove ...), adjuncts, relative clauses with quantifier heads, subjects, and certain coordinate structures (Postal's list is not intended to be exhaustive).

On Postal's analysis, every extraction from an unlocked island contains an invisible resumptive pronoun (RP). Such islands are unlocked because they permit the extraction of an RP. The mechanism is in many respects like Cinque's. The invisible RP in question requires control, hence it extracts to a position sister to the fronted phrase. This derives island-sensitivity. The reason why limited island violations are possible is that an RP controls a secondary invisible resumptive pronoun, notated as RPx, which extracts only to the left boundary of the lowest island from which the primary RP extracts. Tertiary RPs are excluded.

Postal assumes that RPs in unlocked islands are DPs. The fact that some selective islands permit PP gaps is acknowledged (fn. 15 to Ch. 3) but not addressed in any depth. But, unlike Cinque, Postal does not derive his reason for adopting an RP-analysis from category restrictions. Instead, the differential ability of phrases to occur in what he calls "anti-pronominal contexts" plays a major role.

Antipronominal contexts are ones that do not allow weak definite pronouns. One antipronominal context is the object of tell (as opposed to determine):

(56)

Mike was a spy. We could easily determine/*tell it.

A pseudo-clefted phrase can, whereas a topicalized one cannot, be linked to a gap in an anti-pronominal context:

(57)

What we could easily tell was that Mike was a spy.

(58)

*That Mike was a spy, we could easily tell.

This is explained if topicalization requires an RP, independently of whether it is from an island, whereas pseudo-clefting does not require one. Assume further that pseudo-clefting allows, but comparative extraction forbids an RP. Then, if extraction from an island invariably involves an RP, one arrives at the following typology:

(59)

 

 

A1 extractions

ok in antipron.context
ok from unlocked isl.
E.g.
negative fronting
restrictive relative
pseudo-clefting
what size N

 

 

A2 extractions

ok in antipron.context
* from unlocked isl.
E.g.
comparative
the more...the more...
such N as...
how big a N

B-extractions

* in antipron.context
ok from unlocked isl.
E.g.
topicalization
non-restrictive rel.
clefting
whose N

Note that A1-extractions are linked to an RP when escaping from an unlocked island, and they are not linked to an RP when occurring in an antipronominal context.

This typology predicts that when an antipronominal context is embedded in an otherwise unlocked island, no extraction is possible, since the two ingredients impose contradictory requirements. Consider, for instance, the existential there context. The examples indicate that it is antipronominal; it disallows topicalization (B), it permits negative fronting (A1), but when embedded in a factive island, it comes to disallow negative fronting, even though the latter is otherwise possible from a factive island:

(60)

*There are them in the bottle.

(61)

*Such chemicals, he thought there were _ in the bottle.

(62)

No such chemicals did he think that there were _ in the bottle.

(63)

*No such chemicals did he regret that there were _ in the bottle.

(64)

No such chemicals did he regret that we used _.

Postal identifies the following further antipronominal contexts (illustrated with the impossibility of topicalization):

(65)

Existential there:
No such chemicals did he believe that there were _ in the bottle.
* Such chemicals, he believed that there were _ in the bottle.

(66)

Change of color
* Green, he never painted the car _.

(67)

Name position
* Raphael, I would not name anybody _.

(68)

Inalienable possession
* His ear, I never touched him on _.

(69)

Predicate nominals
* That kind of surgeon, Frank never became _.

(70)

Adverbial NPs
* That reason, he resigned for _.

(71)

Extraposed PPs
* Such a scurrilous review, they published _ last year of his book.

(72)

Infinitival extraposition
* A definite wish, I did not perceive _ in Sylvia to retire.

(73)

Exceptive shifting
* Something dangerous, he might have handed _ to Rita other than the gun.

(74)

Temporal NPs
* That much time, Frank could never stay _ in Italy.

(75)

Idiomatic V+NP
* That much headway, which they made _ on the job,...

(76)

Place name as locative
* Argentina, our president is said to have been born in _.

Postal remains non-committal as to why each extraction type allows, forbids, or requires an RP. It would clearly be desirable to identify syntactic or semantic properties that entail either that an RP is forbidden or that an RP is required, and to analyze the permissive type as syntactically or semantically ambiguous. This would be especially tempting in connection with the wh-phrases how big a N (RP forbidden), whose N (RP required), and what size N (RP permitted), whose divergent behavior Postal notes.

Postal’s data, whether well-understood or not, provide some novel counterexamples to certain claims in the literature. Although many of the anti-pronominal contexts involve adjuncts or non-referential expressions, many others are argumental and/or referential, i.e. their island sensitivity is surprising. On the other hand, while Cinque takes the island-sensitivity of Italian negative phrases to support the referentiality generalization, (64) shows that negative fronting in English is not blocked by weak islands.

As was mentioned at the outset, Postal lumps all unlocked islands together and does not distinguish between Cinque's two subtypes, viz. weak islands and single strong islands. The restricted viability of such a position is in fact implicit in Cinque's work. The DPs that can escape from weak islands are described quite similarly to those that can escape from strong ones. If this is correct, then, as long as only DP-extraction is considered, we do not expect to be able, or need, to distinguish between the two subtypes. Recall that Cinque distinguishes the two subtypes with reference to the possibility of PP-extraction.

In support of his unitary analysis, P. Postal (p.c.) notes that the (b,c) examples below pose a problem for Cinque's dual analysis. Only one of the gaps being inside an island, the two members of the conjunction in (b) and the two individuals that make up the plural in (c) need to participate in two different filler--gap relations:

(77)

a.

Which cop/1 did Lucy believe Sam called _/1 and May faint after meeting _/1?

 

b.

Which cop/2 and which nurse/3 did respectively Lucy believe Sam called _/2 and May faint after meeting _/3?

 

c.

Which two cops/3i,j did respectively Lucy believe Sam called _/3i and May faint after meeting _/3j?

The island sensitivity of an extraction may not be fully equated with its inability to link to an RP, however, whether the latter property be a primitive or a derived one. As will be detailed in 5.6, collectively interpreted arguments of "one time only" predicates are sensitive to weak islands (in the sense that extraction out of a WI forces a multiple events reading on the naturally "one time only" predicate):

(78)

Which relatives do you <regret that you heard this rumor from _>?

(79)

??Which relatives do you <regret that you got this present from _>?

The same collective arguments can nevertheless antecede resumptive pronouns, in contrast to some amount phrases, for example:

(80)

a.

These relatives of mine, they gave me this present.

 

b.

These relatives of mine, I got this present from them.

(81)

*This much pain, Mary endured it.

Whether or not the unification of all unlocked islands that Postal proposes is eventually viable, in the rest of this chapter, I follow standard practice in excluding the islands that Cinque calls strong from the discussion of weak (selective) islands.

 

4. Weak Islands: A preview

The historical starting point is the assumption, made in Huang 1982, Lasnik and Saito 1984, 1992, Chomsky 1986, that the paradigmatic (if not the only) case of weak (selective) islands is wh-islands, and the expressions whose extraction is sensitive to WIs are adjuncts, as opposed to arguments.

Since "Barriers," a number of new theories of weak islands have appeared in quick succession. What makes this process especially interesting is the fact that practically each theory comes with a significant new set of data. The survey below will reflect this spirit. The critical data come in two dimensions:

(A) What extractions are sensitive to WIs?

(B) What induces a WI?

Much of the literature can be conveniently surveyed along the dimensions in (A) and (B). I will thus begin by drawing ever-widening circles of data, first for (A), then for (B). The third part of the discussion will review what theories account for what data sets.

Sections 5 and 6 are not simple "data surveys", though. Detailed and often theoretical arguments will be put forth as to why exactly the proposed generalizations hold. Also, when certain pieces of literature offer important analyses of WI-phenomena without proposing their own overall theory of WI, they will be summarized in these sections.

In both these sections and in section 7, where we turn to theories, we will see that the proposals are less and less syntactic. Some of them include pragmatic or semantic factors in the description of what expressions are WI-sensitive or WI-inducers, although they formulate the explanation in syntactic terms. Others even derive the explanation from semantics.

The data to be discussed are as follows:

Ad (A) What extractions are sensitive to WIs:

(A1) Arguments versus adjuncts
(A2) Referential versus non-referential
(A3) Re-evaluating the role of D-linking
(A4) Individual versus non-indvidual, and how many-phrases
(A5) Functional readings and event-related readings
(A6) Individuals: is it being type e that matters?
(A7) Split constructions
(A8) Negative polarity item (NPI) licensing
(A9) Cross-sentential anaphora
(A10) Elements of anti-pronominal contexts

Ad (B) What induces a WI:

(B1) Wh-islands
(B2) Negatives and other affective operators
(B3) Response stance and non-stance, versus volunteered stance predicates
(B4) Extraposition islands
(B5) VP-adverbs
(B6) Scope islands

Section 7 presents the theories that account for these data and generalizations. To anticipate, the division of labor is as follows. When a theory accounts for only some of the data falling under some generalization, the generalization is marked with a percentage sign (%):

ECP and Subjacency for (A1)--(B1)
Relativized Minimality for (A2, A7)--(B1, B2, B3%, B4, B5%)
Monotonicity for (A3, A4, A7%)--(B1, B2, B3%, B5%)
Scope Theory, algebraic version for (A3, A4, A5, A6, A7%)--(B1, B2, B3, B5, B6)
Scope Theory, dynamic semantic version (A4, A5, A7, A8, A9)--(B1, B2, B3, B5, B6)

 

5. (A) What extractions are sensitive to WIs?

This section will survey the main distinctions that have been found useful in describing what extractions are sensitive to WIs. To keep new information manageable, wh-islands will be used to illustrate the phenomena. Two other familiar WIs, negative and factive islands will be added where necessary (e.g. because subject extraction out of a wh-island may be independently ungrammatical). The full set of WIs is much greater, though, and will be presented when dimension (B) is addressed.

 

5.1 (A1) Arguments versus adjuncts

Huang 1982, Lasnik and Saito 1984, 1992, and Chomsky 1986 draw the distinction between arguments (claimed not to be sensitive to WIs) and adjuncts (claimed to be sensitive to them), e.g.

(82)

 

Which problem did John ask <how to phrase _>?

(83)

 

*How did John ask <which problem to phrase _>?

 

 

'what is the manner such that John asked which problem to phrase in that manner'

Besides how, why and, to a somewhat lesser extent, when are WI-sensitive adjuncts.

(84)

*Why did John ask <whether to fire him _>?

'What is the reason such that John asked whether it is a good reason for firing him'

(85)

??When did John ask <whether to fire him _>?

'What is the time such that John asked whether it is a good time for firing him'

Where does not fit the picture very well, because its WI-sensitivity does not depend much on whether it is subcategorized for (as with put) or not (as with read):

(86)

?Where did John ask <whether to put/read this book _>?

'What is the location such that John asked whether to put this book there / whether to read this book there'

The morphological constitution of these wh-words may also play a role. It has been noted that when counterparts of why and when have an articulated PP structure in a language, they extract better. Korean, Japanese, and Hungarian are cases in point.

 

5.2 (A2) Referential versus non-referential / Existential presupposition

A major revision of the argument/adjunct distinction is prompted by observations by Ross 1984, Kroch 1989, Comorovski 1989, Rizzi 1990, Cinque 1990, Obenauer 1992, and Kiss 1993. In a nutshell, the claim is that originating in an argument position is not enough: a successful extractee must also be referential in some sense.

Some subcategorized for, and thus presumably argumental, XPs are unexpectedly WI-sensitive:

(87)

*What did John ask whether these pearls cost _? (Ross)

cf. * These pearls cost.

(88)

*How did John ask whether to behave _? (Rizzi)

cf. *John behaved.

According to Rizzi 1990, amount and manner phrases may be arguments but they do not have the theta-roles of event-participants (referential theta-roles). On the other hand, as events take place in time and space, Rizzi surmises that the event specification may license a temporal and locative index that accounts for the fact, noted above, that such phrases are less sensitive to WIs than manners and reasons.

Based on the fact that expletive wh-phrases do not have referential theta-roles, Rizzi 1992 extends this reasoning to partial wh-movement, which is blocked by a negative island (for cross-linguistic variation, see Dayal 1994):

(89)

Was glaubst du (* nicht), mit wem Jakob jetzt spricht?

what think you (* not) with whom Jacob now talks

            'With whom do(n't) you think Jacob is talking now?'

Drawing from work by Kroch and Comorovski, Cinque 1990 makes a finer distinction that involves pragmatics: a referential wh-phrase, in addition to having a referential theta-role, needs to be Discourse-linked, i.e. drawn from a pre-established set.

Some of the most persuasive most persuasive examples involve how many-phrases:

(90)

*How many books are you wondering <whether to write _ next year>?

(91)

How many books on the list are they wondering <whether to publish _ next year>?

(92)

*How many points are you wondering <whether to earn _>?

(93)

How many points are the jurors debating <whether to take off _>?

The good examples involve a contextually established set of books or a specific range of points that figure skating jurors conventionally assign to mistakes in the program.

Similar contrasts are easy to construct with other wh-phrases, including adjuncts. For instance, although how-extraction out of a wh-island or a negative island is by default bad, it becomes rather acceptable given a contextually specified checklist of ways to solve the problem:

(94)

How are you wondering <whether to solve the problem _>?

(ok when choosing from salient checklist)

(95)

How did <no student solve the problem _>?

(ok when choosing from salient checklist)

In a similar spirit, Kiss 1993 assumes that specificity in the sense of Enç 1991 is a prerequisite for extraction out of a wh-island.

We might say that these proposals define WI-sensitivity in terms that combine syntax with pragmatics.

In the name of philological correctness, it should be mentioned that Kroch's influential paper apparently had more than one version, and the "official unpublished" version gives a different explanation from what Cinque relies on. According to Kroch, what saves (91) is not that the how many-phrase is D-linked but, rather, the fact that the context licenses the existential presupposition that there is a particular amount such that you are wondering whether to publish that amount.

A similar interpretive contrast is pointed out in Obenauer 1992. He characterizes it in terms of specificity and correlates it with a difference in agreement with the past participle. The split construction (combien... de fautes 'how many ... errors') never allows agreement and only the non-specific reading is available. In the non-split construction, on the specific reading agreement is optional: Dis-moi combien de fautes tu as fait / faites 'Tell me how many (of the typical, expected) errors you made', and on the non-specific reading, which asks about the number of errors, participle agreement is excluded.)

 

5.3 (A3) Re-evaluating the role of D-linking

The pragmatic argument is very powerful: indeed, almost any wh-phrase (save for why, perhaps) can be made immune to WIs by D-linking. But it has been argued that D-linking is not the discriminating factor.

Szabolcsi and Zwarts 1990, 1993 argue that the moral of the "salient checklist" examples is different from what Kroch and Cinque draw from them. The checklist effectively turns elements of a non-individuated domain into discrete individuals. D-linking may thus play an important role in transforming a domain but, they claim, it is not D-linking itself but the emergent set of individuals that is decisive.

Likewise, the pragmatic approach predicts that wh-the-hell can never be extracted out of a weak island, because it is "aggressively non-D-linked," in the words of Pesetsky 1987. Szabolcsi and Zwarts 1993 submit that it is an independent property of wh-the-hell that makes it difficult to extract from WIs. E.g.,

(96)

Who the hell saw John?

Unless a rhetorical question, the above is felicitous only if we have unquestionable evidence that someone saw John and merely wish to identify the person. The requirement of unquestionable evidence is often difficult to fulfil in the complex situations described by WI-violations:

(97)

??Who the hell are you wondering <whether to invite _>?

On the other hand, when such evidence is available, a WI-violation by wh-the-hell is acceptable. E.g., seeing someone rifling through a dictionary, we may felicitously ask,

(98)

What the hell do you still <not know <how to spell _>>?

Perhaps the clearest example demonstrating the significance of individuals, in distinction to D-linking, comes from Dobrovie-Sorin 1994. Clitic doubling in Romanian signals D-linking, and indeed, it enables a how many-phrase to extract from a factive island:

(99)

Pe cîte femei regreţi <că le ai iubit _>?

'how many (of the) women are such that you regret having loved them'

On the other hand, cîte femei 'how many women' can be extracted even if it is not doubled by le and, consequently, is not D-linked. It turns out that the critical factor is whether it is interpreted as quantifying over numbers of women (case (i), which is bad) or over individual women (case (ii), which is okay):

(100)

Cîte femei regreţi <că ai iubit _>?

(i)

* 'for what number, you regret having loved that number of women'

(ii)

'how many women are such that you regret having loved them'

These data lead to the conclusion that semantics, rather than pragmatics, plays the real role in the characterization of WI-sensitivity. (On the other hand, the WI-sensitivity of certain expressions in anti-pronominal contexts, e.g. place names in (76) remains a mystery.)

 

5.4 (A4) Individual versus non-individual, and how many-phrases

Szabolcsi and Zwarts argue that the individual / non-individual distinction is what sets apart WI-escaping which/what-phrases from manners, reasons, amounts, and other WI-sensitive expressions (when the latter are not individuated by contextual brute force). A very similar conclusion is reached by many authors specifically in connection with the ambiguity of how many-phrases.

Dobrovie-Sorin proposes to split QR into two Logical Form operations: NPR (the raising of noun phrases with a quantifier feature) and DR (the raising of determiners). The first leaves an individual variable and is immune to WIs. The second leaves a higher order, determiner-type variable and is sensitive to WIs. (NPR may be followed by DR but that does not matter in relation to island escaping.)

The distinction is of course highly reminiscent of the contrast between overt combien extraction and combien de N extraction (Obenauer 1984/85):

(101)

Combien as-tu (*beaucoup) consulté _ de livres?

how many have-you (*a lot) consulted _ of books

(102)

Combien de livres as-tu (beaucoup) consulté _?

how many of books have-you (a lot) consulted

The overt extraction of combien 'how many' is blocked by WIs, whereas the overt extraction of the full noun phrase is (generally) not; see also Rizzi 2000 for a similar case of "splitting" in Italian, involving wh - d'altro 'wh else'. The finer point that Dobrovie-Sorin makes is that combien de livres is in itself ambiguous between an amount and an individual reading, and the amount reading is absent when combien de livres is (grammatically) extracted out of a WI, exactly as was observed in connection with cîte femei:

(103)

Combien de livres as-tu consulté _?

ambiguous (DR / NPR)

(104)

Combien de livres as-tu beaucoup consulté _?

not ambiguous (*DR / NPR)

In fact, even overt combien de N extraction out of a WI may be ungrammatical when, as the object of a verb of creation, the phrase can only have an amount reading (Szabolcsi and Zwarts 1993). The contrast between this and grammaticality of (102) is taken to show that combien de livres is ambiguous:

(105)

Combien de cercles as-tu (*beaucoup) dessiné _?

how many circles have-you (*a lot) drawn

The same interpretive contrast holds for non-wh numeral phrases, whose amount reading is produced by DR, and whose individual reading is produced by NPR, at Logical Form (Dobrovie-Sorin 1994):

(106)

John read fifty books.

(i)

'John read books. Their number was fifty'

(ii)

'There are fifty books such that John read them'

(107)

John didn't read fifty books.

(i)

* 'John didn't read books. Their number was