The annotation scheme of the Turkish Discourse Bank and an evaluation of inconsistent annotations

Download
2010-7-15-16
Zeyrek Bozşahin, Deniz
Sevdik- Çallı , Ayışığı
Ögel- Balaban, Hale
Yalçınkaya, İhsan
Turan, Ümit Deniz
In this paper, we report on the annotation procedures we developed for annotating the Turkish Discourse Bank (TDB), an effort that extends the Penn Discourse Tree Bank (PDTB) annotation style by using it for annotating Turkish discourse. After a brief introduction to the TDB, we describe the annotation cycle and the annotation scheme we developed, defining which parts of the scheme are an extension of the PDTB and which parts are different. We provide inter-coder reliability calculations on the first and second arguments of some connectives and discuss the most important sources of disagreement among annotators.

Suggestions

An Assessment of Explicit Inter- and Intra-sentential Discourse Connectives in Turkish Discourse Bank
Zeyrek Bozşahin, Deniz (null; 2018-05-07)
The paper offers a quantitative and qualitative analysis of explicit inter- and intra-sentential discourse connectives in Turkish Discourse Bank, or TDB version 1.1, a multi-genre resource of written Turkish manually annotated at the discourse level following the goals and principles of Penn Discourse TreeBank. TDB 1.1 is a 40K-word corpus involving all major discourse relation types (explicit discourse relations at intra- and inter-sentential positions, implicit discourse relations, alternative lexicalizat...
Annotating Subordinators in the Turkish Discourse Bank
Zeyrek Bozşahin, Deniz; Bozşahin, Hüseyin Cem; Sevdik-Çallı, Ayışığı; Demirşahin, Işın; Aktaş, Berfin; Yalçınkaya, İhsan; Ögel, Hale (ACL-IJCNLP, Linguistic Annotation Workshop III; 2009)
In this paper we explain how we annotated subordinators in the Turkish Discourse Bank (TDB), an effort that started in 2007 and is still continuing. We introduce the project and describe some of the issues that were important in annotating three subordinators, namely kars¸ın, ragmen ˘ and halde, all of which encode the coherence relation Contrast-Concession. We also describe the annotation tool.
Pair Annotation as a Novel Annotation Procedure: The Case of Turkish Discourse Bank
Demirşahin, Işın; Zeyrek Bozşahin, Deniz (Springer, 2017-01-01)
In this chapter, we provide an overview of Turkish Discourse Bank, a resource of ∼ 400,000 words built on a sub-corpus of the 2-million-word METU Turkish Corpus annotated following the principles of Penn Discourse Tree Bank. We first present the annotation framework we adopted, explaining how it differs from the annotation of the original language, English. Then we focus on a novel annotation procedure that we have devised and named pair annotation after pair programming. We discuss the advantages it has ...
Pair Annotation as a Novel Annotation Procedure: The Case of Turkish Discourse Bank
Demirşahin, Işın; Zeyrek Bozşahin, Deniz (2017-6-17)
In this chapter, we provide an overview of Turkish Discourse Bank, a resource of ∼∼400,000 words built on a sub-corpus of the 2-million-word METU Turkish Corpus annotated following the principles of Penn Discourse Tree Bank. We first present the annotation framework we adopted, explaining how it differs from the annotation of the original language, English. Then we focus on a novel annotation procedure that we have devised and named pair annotation after pair programming. We discuss the advantages it has of...
Assessment of the Turkish discourse bank and a cascaded model to automatically identify discursive phrasal expressions in Turkish
Sevdik Çallı, Ayışığı Başak; Zeyrek Bozşahin, Deniz; Department of Cognitive Sciences (2015)
This thesis presents a methodology for an overall assessment of the Turkish Discourse Bank (TDB), a linguistic resource where discourse relations overtly expressed by discourse connectives have been identified and annotated with the two arguments they relate. We provide a quantitative and qualitative assessment of the TDB in order to establish the reliability of this discourse resource for Turkish and suggest that our methodology can be utilized for reliability evaluations of other annotated corpora. Our qu...
Citation Formats
D. Zeyrek Bozşahin, A. Sevdik- Çallı, H. Ögel- Balaban, İ. Yalçınkaya, and Ü. D. Turan, “The annotation scheme of the Turkish Discourse Bank and an evaluation of inconsistent annotations,” 2010, p. 282, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/36049.