Automated coherence detection with term-distance path extraction of the co-occurrence matrix of a document

2015
Ağın, Halil
This thesis adopts distributional (frequency-based) semantics as the theoretical framework for quantifying textual coherence. Distributional semantics represents discourse sections as vectors in a semantic space, where each dimension is the frequency count of a co-occurring word in the text. Textual coherence is quantified by measuring the cosine between the vectors of successive sentences (cf. Latent Semantic Analysis, LSA). The common assumption underlying LSA-based studies is that the frequency of word co-occurrence can serve as a cohesive cue for quantifying textual coherence, which leads to analyses based on a term-document matrix. In this thesis, the spatial distance between co-occurring words is considered as a new frequency event of cohesive cues, and a document-distance matrix, derived from the term-document matrix, is introduced. The thesis proposes that the document-distance matrix representation of co-occurring words in adjacent sentences of a text can be used to quantify textual coherence. Two mathematical functions are suggested for deriving the document-distance matrix, along with two algorithms that operate on it. The mathematical functions operate on the document-document matrix (itself a derivation of the term-document matrix) to derive the document-distance matrix. The algorithms measure the coherence of a text by operating on the newly introduced document-distance matrices.
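
As an illustration of the baseline idea the abstract refers to (cosine values of vectors of successive sentences), the following is a minimal sketch and not the thesis's method. It assumes plain bag-of-words term-frequency vectors without the dimensionality reduction that LSA applies, and it does not implement the document-distance matrix, whose definition is not given here. The function names `sentence_vector`, `cosine`, `adjacent_cosines`, and `coherence_score` are hypothetical, as is the choice to average the cosines into one score.

```python
import math
import re
from collections import Counter


def sentence_vector(sentence):
    """Term-frequency vector of one sentence (assumed tokenization: lowercase words)."""
    return Counter(re.findall(r"[a-z']+", sentence.lower()))


def cosine(u, v):
    """Cosine of the angle between two sparse term-frequency vectors."""
    dot = sum(u[t] * v[t] for t in u.keys() & v.keys())
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0  # an empty sentence shares nothing with its neighbour
    return dot / (norm_u * norm_v)


def adjacent_cosines(sentences):
    """Cosine between each sentence and the one that follows it."""
    vectors = [sentence_vector(s) for s in sentences]
    return [cosine(a, b) for a, b in zip(vectors, vectors[1:])]


def coherence_score(sentences):
    """Mean adjacent-sentence cosine (averaging is an assumption of this sketch)."""
    sims = adjacent_cosines(sentences)
    return sum(sims) / len(sims) if sims else 0.0


if __name__ == "__main__":
    text = ["The cat sat.", "The cat slept.", "Stocks fell sharply."]
    print(adjacent_cosines(text))
    print(coherence_score(text))
```

In this sketch, sentences that share many words score close to 1 and sentences with no shared words score 0, so a sharp drop between neighbours marks a possible break in coherence.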

Citation Formats
H. Ağın, “Automated coherence detection with term-distance path extraction of the co-occurrence matrix of a document,” M.S. - Master of Science, Middle East Technical University, 2015.