| Principles of transcription: general | General principles of transcription, including details of what is and is not captured, and the order in which it is represented |
| Regularization: silent | Features which the WWP silently regularizes, including details of spacing, delimiters, type size, and typography |
| Regularization: <orig> | Explicit regularization using <orig> |
| Punctuation: general | Transcription of punctuation, including treatment of hard and soft hyphens |
| Punctuation and elements | Position of punctuation relative to element boundaries |
| Transcription of primary sources | Use of elements from the TEI tagset on transcription of primary sources |
| Features omitted from transcription | Use of <gap> to encode explicit omissions from the transcription, and cases where silent omission is allowed |
| Typography: I, J, U and V, general | Transcription and encoding of early typography using <orig> |
| Tagging the letter, tagging the word | Application of <sic>, <orig>, and <abbr> at the word and letter level |
| Typography: recognizing difficult letter forms | Discussion of specific letterforms in the WWP collection, including long s, and disambiguation of I and J, U and V |
| Special characters: entity references | Use of entity references for special characters, boilerplate, and decorative features of the text |
| Special characters: ordinary characters requiring special treatment | Further detail on ordinary characters which must be encoded with entity references in particular contexts or because they serve special functions |
| Special characters: brevigraphs and diacritical marks | Using entity references to transcribe brevigraphs and characters with diacritical marks |
| Special characters: miscellaneous | Details of various kinds of special characters not covered elsewhere |
| Ellipsis | Encoding of ellipsis using the entity reference … |
| Roman numerals | Transcription of roman numerals, and regularization of roman numeral dates |
| Errors in the original | Encoding of errors in the document source using <sic>; situations where corr= is and is not used; distinguishing between error and old spelling |
| Sequencing errors | Encoding of errors in sequencing, such as scene or page numbering |
| Reading order | Discussion of the principle of “reading order” to guide the order of transcription in cases where the text flow contains parallel or non-sequential segments |
| Handwriting: the hand= attribute and the <hand> element | Identification of handwriting using <hand> and the hand= attribute |
| Handwriting: additions and deletions | Encoding handwritten additions and deletions using <add>, <addSpan>, <del>, and <gap> |
| Unclear text | Handling damaged, unclear, or illegible text, including missing or deleted letters, damage to the original, or lack of clarity in the reproduction, using <sic>, <del>, <unclear>, <supplied>, and <gap> |
| Gap: general | General notes on the use of <gap> to encode material omitted from transcription |
| Gap: use of the extent attribute | Detailed notes on the use of the extent= attribute on <gap> to indicate the extent of text being omitted from transcription |
| Gap: use of the extent attribute, advanced | Excruciatingly detailed information on the use of the extent= attribute on <gap> to encode the signature sequences of pages omitted from transcription |