“software tools for the extraction, processing and analysis of linguistic data”

CASE XML Conversion Tool

We are currently working on an exciting project to record English spoken by students in academic institutions around the world. The Corpus of Academic Spoken English (CASE) is being compiled by a team of researchers at Trier University of Applied Sciences. Birmingham City University is one partner providing students for the project, and we are also developing software to support the analysis of the transcribed spoken data.

Our CASE XML Conversion tool converts the project's default mark-up (based on discourse analysis notation) to a bespoke XML schema, encapsulating all of the original information in a machine readable form. The XML versions of the transcripts will enable additional levels of computational analysis. For example, XPath searches enable features of the texts to be found with relative ease, and frequency information can be extracted about these features. We hope that the machine readable XML will assist with the analysis as the project progresses.

Find out more about the CASE Project on the CASE Project website.

The CASE XML Conversion Tool has been developed into the XTranscript system. Find out more about XTranscript and convert your transcripts into XML.