The Open University
Browse

CODA corpus Release 1.0

Download (588.64 kB)
dataset
posted on 2023-02-07, 13:12 authored by Paul PiwekPaul Piwek

The CODA corpus contains approximately 700 turns of human-authored expository dialogue (by Mark Twain and George Berkeley) which has been aligned with monologue that expresses the same information as the dialogue. 


The monologue side is annotated with Coherence Relations (RST). 


The dialogue side is annotated with Dialogue Act tags.

Funding

CODA: COherent Dialogue Automatically generated from text

Engineering and Physical Sciences Research Council

Find out more...

History

Usage metrics

    Faculty of Science, Technology, Engineering and Mathematics (STEM)

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC