WWP Supporting Materials
This page gathers the handouts and supporting materials developed to accompany the WWP’s program of workshops and seminars. Topics include text encoding with TEI, advanced topics such as TEI customization and XSLT, and text analysis with word embedding models. Many of the handouts are short crib sheets intended for classroom use, where a quick printable reference is helpful; they are not intended to substitute for the much fuller references that are readily available on the web. All materials are published here under a Creative Commons Attribution-ShareAlike 4.0 International license, and we encourage people to adapt and improve them.
Text Encoding with TEI
- Element List for Basic Encoding: source, HTML, PDF
  A glossed list of the most common elements that we use and talk about during the class
- Introductory oXygen Exercise: HTML
  An introductory exercise covering the basic concepts of XML and the essentials of the oXygen XML editor
- Oxygen Crib Sheet: source, HTML
  A beginner’s crib sheet for getting started with the oXygen XML editor
- Document Analysis Worksheet: source, HTML
  A set of questions to guide the document analysis process
- Real-world encoding samples: a variety of samples from the WWP and other projects, including TEI/XML samples, project descriptions, and other materials.
- TEI Exercise Package: tarball or zip file
Customizing and Publishing TEI Data
- CSS Crib Sheet: source, HTML
  A beginner’s crib sheet for getting started with CSS
- Another CSS Crib Sheet: source, HTML
  A basic crib sheet for getting started with CSS
- Element List for ODD-writing: source, HTML
  A glossed list of the basic elements used in writing ODD customizations
- Element List for RelaxNG: source, HTML
  A glossed list of the basic RelaxNG elements used in writing ODD customizations
- Roma Exercise: source, HTML
  A step-by-step exercise for using Roma to customize the TEI
Text Analysis
- Word Vectors Primer: A set of tutorials for understanding and working with word vectors.
- Women Writers Vector Toolkit: A web interface for exploring word embedding models, accompanied by glossaries, sources, case studies, and sample assignments.
- Word Vector Walkthroughs: A GitHub repository with the code used to train the models in the WWVT, as well as downloadable walkthroughs for training models.
- Data Preparation Guide and Checklist: A detailed guide to preparing a corpus for work with word embedding models.
- Model Validation and Assessment Checklist: A guide to assessing and validating research with word embedding models.
- Sample Model and Query Log: A guide to keeping track of settings, queries, and results when experimenting with word embedding models.
- Corpus Search Guide: A guide with strategies for examining a corpus and some suggested tools to make this easier.