Entries relating to Transcription issues:
Principles of transcription: general
    General principles of transcription, including details of what is and is not captured, and the order in which it is represented
Regularization: silent
    Features which the WWP silently regularizes, including details of spacing, delimiters, type size, and typography
Regularization: <orig>
    Explicit regularization using <orig>
Punctuation: general
    Transcription of punctuation, including treatment of hard and soft hyphens
Punctuation and elements
    Position of punctuation relative to element boundaries
Transcription of primary sources
    Use of elements from the TEI tagset on transcription of primary sources
Features omitted from transcription
    Use of <gap> to encode explicit omissions from the transcription, and cases where silent omission is allowed
Typography: I, J, U and V, general
    Transcription and encoding of early typography using <orig>
Tagging the letter, tagging the word
    Application of <sic>, <orig>, and <abbr> at the word and letter level
Typography: recognizing difficult letter forms
    Discussion of specific letterforms in the WWP collection, including long s, disambiguation of I and J, U and V
Special characters: entity references
    Use of entity references for special characters, boilerplate, and decorative features of the text
Special characters: ordinary characters requiring special treatment
    Further detail on ordinary characters which must be encoded with entity references in particular contexts or because they serve special functions
Special characters: brevigraphs and diacritical marks
    Using entity references to transcribe brevigraphs and characters with diacritical marks
Special characters: miscellaneous
    Details of various kinds of special characters not covered elsewhere
Ellipsis
    Encoding of ellipsis using the entity reference &hellip;
Roman numerals
    Transcription of roman numerals, and regularization of roman numeral dates
Errors in the original
    Encoding of errors in the document source using <sic>; situations where corr= is and is not used; distinguishing between error and old spelling
Sequencing errors
    Encoding of errors in sequencing, such as scene or page numbering
Reading order
    Discussion of the principle of “reading order” to guide the order of transcription in cases where the text flow contains parallel or non-sequential segments
Handwriting: the hand= attribute and the <hand> element
    Identification of handwriting using <hand> and the hand= attribute
Handwriting: additions and deletions
    Encoding handwritten additions and deletions using <add>, <addSpan>, <del>, and <gap>
Unclear text
    Handling damaged, unclear, or illegible text, including missing or deleted letters, damage to the original, or unclarity in the reproduction, using <sic>, <del>, <unclear>, <supplied>, and <gap>
Gap: general
    General notes on the use of <gap> to encode material omitted from transcription
Gap: use of the extent attribute
    Detailed notes on the use of the extent= attribute on <gap> to indicate the extent of text being omitted from transcription
Gap: use of the extent attribute, advanced
    Excruciatingly detailed information on the use of the extent= attribute on <gap> to encode the signature sequences of pages omitted from transcription
