( (METADATA (AUTHOR NICHOLAS_BACON_II:MALE:BROTHER:1543:26) (RECIPIENT NATHANIEL_BACON_I:MALE:BROTHER:1546?:23?) (LETTER BACON_001:E1:1569:AUTO:FAMILY_NUCLEAR)) (IP-MAT (CONJ nor) (NP-1 (D the) (N commyssion) (PP (P for) (NP (D the) (N pease)))) (NP-SBJ (PRO I)) (ADVP-TMP (ADV never)) (VBD harde) (PP (P of) (NP *ICH*-1)) (. .)) (ID BACON,I,7.001.5))
Field 1 name Field 2 gender Field 3 relationship (if any, if known) Field 4 date of birth Field 5 age (at time of writing)
For most correspondents both a first and last name are known, but
occasionally only one name is known. When only the last name is known,
a title is included when supplied (e.g., MRS_NECTON, COLONEL_STRODE),
but if no title is supplied, they are referred to by the single name
only (e.g., WADEHILL). Individuals known only by first name are
generally (early or foreign) royalty, or certain religious figures
(abbots, in particular). These are given either a number (EDWARD_IV),
a title (PRINCE MAURICE), or an epithet: JOHN_OF_NORTON,
JOHN_OF_TEWKESBURY, JOHN_OF_LANCASTER. The latter distinguishes three
John's: the Abbot of Norton, the Abbot of Tewkesbury, and the Duke of
Bedford.
Women
Women are generally referred to by the name they were using at the time of writing. If married, and their maiden name is known it is included in square brackets prefaced by N. for Nee (e.g. MARY_PEYTON[N.ASTON]). In general, the inclusion of a maiden name indicates a married state at the time of writing, but its absence indicates nothing, since not all maiden names are known.Women who wrote letters under more than one name are given both names separated by a slash (DOROTHY_OSBORNE/TEMPLE). In some cases, the two names are the woman's maiden and married name (DOROTHY_OSBORNE/TEMPLE), but they may also be two married names. In this case, if the maiden name is known, it is also included, if known (e.g., ELIZABETH_POYNINGS/BROWNE[N.PASTON]). The difference between DOROTHY_OSBORNE/TEMPLE and MARY_PEYTON[N.ASTON] is that Dorothy wrote both as Dorothy Osborne and as Dorothy Temple, whereas Mary wrote only as Mary Peyton (i.e. when married). Her maiden name is included only for information purposes. From the information given in the name (DOROTHY_OSBORNE/TEMPLE) alone, it is not possible to tell whether one of these names is a maiden name and one a married name or both married names (although from other information we know that Osborne is her maiden name and Temple her married name). For ELIZABETH_POYNINGS/BROWNE[N.PASTON], on the other hand, who wrote both as Elizabeth Poynings and Elizabeth Browne, we know from the name alone (because of the inclusion of the maiden name) that both of these are married names.
Royalty
Pre-Tudor kings are referred to by first name and number only (e.g. RICHARD_III, EDWARD_IV). Other members of the royal family are referred to by common epithets (RICHARD_OF_YORK, HUMPHREY_OF_LANCASTER). Tudor and Stuart kings and family are named Tudor or Stuart along with their number (e.g. HENRY_TUDOR_VIII, JAMES_STUART_I/VI). Queens are referred to in the same way as other women, with the exception of Mary Tudor, Queen of England (referred to as MARY_TUDOR_I), who has the same name as Mary Tudor, Queen of France (referred to as simply MARY_TUDOR).
Unknown recipient/Name unknown
In a small number of cases, nothing is known about the recipient of a letter. In these cases the recipient is listed as UNKNOWN_RECIPIENT. In line with our policy of uniquely identifying every individual, each unknown recipient is given a separate arabic number (e.g., UNKNOWN_RECIPIENT_1, UNKNOWN_RECIPIENT_2, etc.). When something about the identity of the recipient is known (usually by some distinguishing epithet given by the editor, or the content of the letter) e.g., 'sheriff of Norfok', but the name is not, the recipient is listed as NAME_UNKNOWN (or NAMES_UNKNOWN for groups such as the 'commissioners for recusancy'). As with unknown recipients, each unnamed individual/group is given a different number. The same unnamed individual or group as author or recipient of multiple letters is given the same number. The reason for distinguishing the two types is that for the latter, but not the former, additional personal information (API) is included in the associated information file (AIF). Note that the PRIVY COUNCIL is treated as an individual, i.e., a name, and not classified as an unnamed group.
Author birthdates are largely supplied by Helsinki. Recipient
birthdates not supplied by the edition or Helsinki come from the
on-line DNB. When no DOB is known, the field is empty (indicated, as
usual, by an underscore).
Age at time of writing (AUTHOR/RECIPIENT field 5)
This number is simply calculated by subtracting the date of birth (DOB) of the author/recipient
from the date of the letter. If either the letter
date or the DOB of the correspondent is uncertain (indicated by a
following ?), the age is likewise (e.g., 49?). When the date of the
letter is given as a decade (1500S), age is not calculated. If the DOB
or the date of the letter is unknown, the field is empty (indicated,
as usual, by an underscore).
Letter information
The following information is given for the letter:
Field 1
unique reference
Field 2
time period
Field 3
date of letter
Field 4
authenticity
Field 5
recipient classification
Unique reference (LETTER field 1)
Each letter in the corpus is uniquely identified by a combination of
the filename and letter number (e.g., ARUNDEL_001).
Time period (LETTER field 2)
The time period of the letter is identified according to the Helsinki
Corpus time periods, as follows:
This information is intended to make it possible to compare PCEEC data
to that from the PPCME2 and PPCEME.
Time period
Dates
M4
1420-1500
E1
1500-1569
E2
1570-1639
E3
1640-1710
Date of letter (LETTER field 3)
This information is taken from the Helsinki 'text identifier' parameter
(Q). When the year of the letter is known it is given precisely. If
the date is conjectural it is followed by a question mark. When a
precise year cannot be suggested the decade is given, indicated by S,
as follows: 1500S and 1600S indicate the first decade of each century,
1510S indicates 1510-1519, etc. If even the decade is uncertain, a
question mark follows the decade (e.g., 1530S?).
Authenticity (LETTER field 4)
Three values are supplied for the authenticity of the letter. This is
a simplified version of the Helsinki 'authenticity code' found in
the 'text identifier'
parameter (Q).
FAMILY_NUCLEARThe information in this field comes from the Helsinki 'recipient type' code found in the 'text identifier'
FAMILY_OTHER
FAMILY_SERVANT
FRIEND
OTHER
The three nodes included in the METADATA node (AUTHOR, RECIPIENT, and LETTER) are searched in the same way as CODING strings are searched. The query below searches column 1 of the CODING string for the value x.
query: (CODING column 1 x) <-- note space between 'column' and '1'
CS2 difference alert! There must be a space between the column and column number in CS2 CODING queries (see the Conversion guide). |
query: (AUTHOR column 2 MALE)This query will find all the tokens in which the author is male. To search for other values as well, simply join search functions with AND.
CS2 difference alert! It is not necessary to join multiple search calls one at a time with right-branching parentheses in CS2 (see the Conversion guide) |
query: (AUTHOR column 2 MALE) AND (RECIPIENT column 2 FEMALE) AND (LETTER column 2 E2)This query finds all men writing to women in period E2. The following query finds all women writing to women.
query: (AUTHOR column 2 [1]FEMALE) AND (RECIPIENT column 2 [2]FEMALE)Note the use of prefix indices. Indices must be used to distinguish different referents of the same search term when searching the METADATA, just as with any other type of search.
To find all women writing to men or men to women, use the new CS2 OR function.
query: ((AUTHOR column 2 MALE) AND (RECIPIENT column 2 FEMALE)) OR ((AUTHOR column 2 FEMALE) AND (RECIPIENT column 2 MALE))metadata can be searched in conjunction with structure.
query: (AUTHOR column 2 FEMALE) AND (RECIPIENT column 2 MALE) AND (IP* iDoms NP-VOC)This query finds all tokens written by women to men, in which vocatives are used.
( (CODING BACON_001:E1:1569:AUTOGRAPH:FAMILY_NUCLEAR:NICHOLAS_BACON_II:MALE:BROTHER:1543:26:NATHANIEL_BACON_I:MALE:BROTHER:1546?:23?:16%) (IP-MAT (NP-SBJ (PRO I)) (HVP have) (VBN reseyved) (ALSO also) (NP-OB1 (PRO$ my) (N hose)) (. .)) (ID BACON,I,7.001.3))This coding string contains all the information in the metadata nodes, with the columns from the LETTER node first, followed by the AUTHOR and then the RECIPIENT node, as follows:
Column 1 author name Column 2 author gender Column 3 author relationship (if any) Column 4 author date of birth Column 5 author age (at time of writing) Column 6 recipient name Column 7 recipient gender Column 8 recipient relationship (if any) Column 9 recipient date of birth Column 10 recipient age (at time of writing) Column 11 letter unique reference Column 12 letter time period Column 13 date of letter Column 14 letter authenticity Column 15 recipient classification
Searching the metadata CODING string is done in the same way as for any other CODING string (see the section on searching the CODING node in the CorpusSearch Lite Manual). For information on how to add coding to the string, see the section on the coding function.
Note that if you want to use any values from the coding string as input to varbrul, you will have to recode them as single character values, since varbrul will not accept multi-character columns.