Known issues

This page lists known issues in the annotation of the following diachronic English corpora:

Where no specific corpus is indicated, the issue affects both corpora.


ALSO (PPCEME)
ALSO is invariably tagged as ALSO. There may be instances that should instead be tagged as ADV.

Backwards gapping
Instances of backwards gapping are likely to be mistagged as ordinary gapping.

CODE (PPCEME)
CODE material is not always attached as high as possible.

Conjunction of unlike categories (PPCEME)
When unlike categories are conjoined at the word level, the CONJP required by the guidelines is likely to be missing.

Direct speech fragments (PPCEME)
Fragments of direct speech are likely to labelled as FRAG rather than QTP.

ELSE, ENOUGH
The PPCME2 and the PPCEME/PCEEC do not always agree on which instances of post-head ELSE and ENOUGH are tagged as adjectival (ADJR) or adverbial (ADVR).

Exceptional Case-Marking (ECM) versus object control

Fragments, direct speech (PPCEME)
See Direct speech fragments.

Infinitival adjuncts (PPCEME)
Infinitival adjuncts are likely to lack the -ADT dash tag.

Left-dislocation (-LFD)
In the PPCME2, PPs are annotated as left-dislocated only if the resumptive element is a matching PP (that is, if the preposition in the left-dislocated and in the resumptive element is identical, and if the objects of the prepositions corefer); there are only a handful of examples.
( (IP-IMP (VBI Thynk)
	  (ALSO eek)
	  (CP-THT (C that)
		  (IP-SUB (PP-LFD (P of)
				  (NP (SUCH swich)
				      (N seed)
				      (PP (P as)
					  (CP-CMP (WNP-1 0)
						  (C 0)
						  (IP-SUB (PP (P (CODE {of}))
							      (NP *T*-1))
							  (NP-SBJ (NS cherles))
							  (VBP spryngen))))))
			  (, ,)
			  (PP-RSP (P of)
				  (NP (SUCH swich) (N seed)))
			  (NP-SBJ=2 *exp*)
			  (VBP spryngen)
			  (NP-2 (NS lordes))))
	  (. .))
  (ID CMCTPARS,314.C1.1108))

In the PPCEME, the -LFD dash tag is used for such examples, but also much more liberally to indicate a relationship between a pre-subject PP and any coreferential PP or ADVP.

( (IP-MAT-SPE (PP-LFD (P Though)
		      (CP-ADV-SPE (C 0)
				  (IP-SUB-SPE (NP-SBJ (PRO I))
					      (VBP beare)
					      (NP-OB1 (N record))
					      (PP (P of)
						  (NP (PRO$ my) (N selfe))))))
	      (, ,)
	      (ADVP-RSP (ADV yet))
	      (NP-SBJ (PRO$ my) (N record))
	      (BEP is)
	      (ADJP (ADJ true)))
  (. :)) (ID AUTHNEW-E2-H,VIII,1J.1030))

Measure phrases (NP-MSR, QP) (PPCEME)
Measure phrase modifiers of nouns are likely to be mistagged as NP-POS rather than NP-MSR.
The distinction between NP-MSR and QP is not straightforward, and it is likely that some instances of one category are mistagged as the other. Searches for one category should therefore generally include the other.

Object control versus Exceptional Case-Marking (ECM)
See Exceptional Case-Marking (ECM) versus object control

Participial clauses (IP-PPL) (PPCME2)
The distinction between participial clauses functioning as adjuncts (IP-PPL) and complements (IP-PPL-OB1) is not implemented in the PPCME2. However, participial clauses functioning as complements are likely to be rare in that corpus.

Participial clauses versus reduced relative clauses
Reduced relatives (RRC) headed by participles are not always easy to distinguish from participial clauses (IP-PPL). It is wise in searches for one category to include the other.

Participles, adjectival (PPCEME)
Some adjectival uses of participles, notably passive participles, are likely to be mistagged as ADJ, contrary to the rule in Verbs and other categories.

Pied piping (PPCME2)
In cases of pied piping, the PPCME2 sometimes incorrectly marks only the highest (rather than every) maximal projection in Spec(CP) as a wh- constituent.

Proper nouns
Many inconsistencies and outright errors likely remain with respect to the tagging of proper nouns (NPR).
The guidelines for proper nouns of the form THE N OF NP (THE WAR OF THE ROSES) have the counterintuitive result that none of the nouns is tagged NPR.

Purpose infinitives (PPCEME)
Purpose infinitives are not always easy to distinguish from bare infinitives, and some infinitives that should be tagged as purpose infinitives are likely to lack the -PRP tag, particularly in connection with go and send.

Reduced relative clauses versus participial clauses
See Participial clauses versus reduced relative clauses.

Resumptive elements (-RSP)
See Left dislocation.

Right-node raising (PPCEME)
Not all instances of right-node raising are annotated with an index, particularly in the statutes.

Secondary predication versus small clauses
It is not always easy to distinguish instances of secondary predication from small clauses. See Secondary predicate NPs for a list of predicates that license NP-SPR, ADJP-SPR. See Small clauses for a list of predicates that license IP-SMC.

Single NP object with LIKE and similar verbs (LACK, NEED, WANT) (PPCME2)
In the PPCME2, the experiencer argument of LIKE (and similar verbs) in the ME LIKE(N) PEARS construction is often mistagged as NP-OB1 rather than NP-OB2.

Small clauses versus secondary predication
See Secondary predication versus small clauses.