The British Academic Spoken English (BASE) corpus

Tagged texts

The tagged transcripts of the BASE corpus are available as XML files, in zipped folders. To download the data, click on one of the following links which will enable you to either open or save a zipped folder containing the XML files of all lectures and seminars for one of the academic divisions in the corpus. In addition to the files, the BASE DTD is included in the folder and it must always be present in the same folder as any of the XML files that is viewed.