The CODA corpus contains approximately 700 turns of human-authored expository dialogue (by Mark Twain and George Berkeley) which has been aligned with monologue that expresses the same information as the dialogue.
The monologue side is annotated with Coherence Relations (RST).
The dialogue side is annotated with Dialogue Act tags.
Funding
CODA: COherent Dialogue Automatically generated from text
Engineering and Physical Sciences Research Council