3.5. The two versions of ICE-EA For the purposes of the different requirements of analyses, we decided on two versions of the corpus. 1) The complete version in rich text format (rtf) format, which, in addition to the texts of 2000 words, contains the full versions of the texts and all tagging. The additional text, quotes, passages in a different (usually East African) language (code-switching) and editorial comment are accessible but in hidden font. This allows the user to retrieve the complete text, to have immediate access to the source and to take account of deviations. 2) The reduced version as text only (ASCII), which consists of just the 2000-word texts and the element-attached tagging. Markers indicating text units, subtexts and headings as well as all editorial comment are omitted. This allows easier access for frequency counts and concordance. Quotes and sentences in a different language are omitted. Hesitations, repetitions and minimal responses were not included in the final count.