Digital Humanities Workbench |
Home page > Tools > XML > Text Encoding Initiative Text Encoding InitiativeBasically anyone can develop their own XML codeset for every type of text. In the world of XML, it is preferable to work with standard specifications wherever possible, in order to promote optimal document interchange and the development of processing software. The Text Encoding Initiative (TEI) is a consortium working on the development and maintenance of a standard for the digital representation of texts used as research material in the humanities. In that context, XML specifications have been developed for many textual structures (such as critical editions, prose, poetry, text corpora, lexicons, older manuscripts). TEI is now seen as the de facto standard for annotating electronic texts in the humanities and is also widely used by libraries, museums and publishers. Examples of projects within the humanities that use TEI's XML definitions can be found in an overview on the TEI website. These XML specifications have been codified in a so-called TEI DTD, which also comes in a simplified version: TEI Lite. TEI DTDs can be easily expanded for specific purposes. This is useful when working in a project with standard text structures (such as poetry or text corpora) in which certain project-specific elements need to be annotated. An important part of any document that is annotated based on the TEI DTD is the so-called TEI header. This can include the following metadata:
More information
Text Encoding Initiative (TEI) Homepage |
Other topics in this section: Introduction Basic principles Examples Using XML Further information |