Word lists for MECS-WIT-encoded transcriptions are produced from a transcription's alpha text, which is essentially a text containing all the orthographically correct words in the transcription. Alpha texts are language-dependent: a transcription containing both German and English passages, for example, will produce two alpha texts, one German and one English.
The alpha texts generated from a transcription may then be used to produce word lists, which can be ordered alphabetically or by frequency, in ascending or descending order. They are thus also a source of statistical data about the transcriptions.
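As a rough illustration of the orderings described above, the following Python sketch builds a word list from an alpha text. It is not the MECS-WIT software: the function name `word_list`, its parameters, and the assumption that an alpha text is a plain string of whitespace-separated words are all hypothetical and chosen only for this example. The sample sentences are placeholder data.

```python
from collections import Counter


def word_list(alpha_text, order="alphabetical", descending=False):
    """Return (word, count) pairs for one alpha text.

    Hypothetical helper: assumes the alpha text is a plain string of
    whitespace-separated, orthographically correct words.
    """
    counts = Counter(alpha_text.split())
    if order == "alphabetical":
        # Case-insensitive comparison; real collation rules differ by language.
        key = lambda item: item[0].casefold()
    elif order == "frequency":
        # Words with equal counts are ordered by the word itself.
        key = lambda item: (item[1], item[0].casefold())
    else:
        raise ValueError("order must be 'alphabetical' or 'frequency'")
    return sorted(counts.items(), key=key, reverse=descending)


# One alpha text per language, as described above (placeholder data).
alpha_texts = {
    "German": "die Welt ist alles was der Fall ist",
    "English": "the world is all that is the case",
}

for language, text in alpha_texts.items():
    print(language, word_list(text, order="frequency", descending=True))
```

Keeping one word list per language mirrors the language dependence of alpha texts. The counts returned alongside each word are the kind of simple statistical data that such lists can supply.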
The Wittgenstein Archives, Allégt. 27, N-5007 Bergen, Norge
+47 55 58 94 74
+47 55 58 94 70
wab@hit.uib.no