A hierarchical representation of form documents for identification and retrieval

2000-01-27
Duygulu, Pınar
Atalay, Mehmet Volkan
In this paper, we present a logical representation for form documents to be used for identification and retrieval. A hierarchical structure is proposed to represent the logical structure of a form by using lines. The approach is top-down and no domain knowledge such as the preprinted data or filled-in data is used. Logically same forms are associated to the same hierarchical structure. This representation can handle geometrical modifications and slight variations.
7th Annual Document Recognition and Retrieval Conference

Suggestions

A hierarchical representation of form documents for identification and retrieval
Duygulu, Pinar; Atalay, Mehmet Volkan (Springer Science and Business Media LLC, 2003-12-01)
In this paper, we present a logical representation for form documents to be used for identification and retrieval. A hierarchical structure is proposed to represent the structure of a form by using lines and the XY-tree approach. The approach is top-down and no domain knowledge such as the preprinted data or filled-in data is used. Geometrical modifications and slight variations are handled by this representation. Logically identical forms are associated to the same or similar hierarchical structure. Identi...
A heuristic algorithm for hierarchical representation of form documents
Duygulu, P; Atalay, Mehmet Volkan; Dincel, E (1998-08-20)
In this paper our aim is to develop a logical representation for form documents. We propose a hierarchical structure to represent the logical layout of a form by using lines. The approach is top-down and no domain knowledge such as the preprinted data or filled-in data is used. Logically same but physically different forms are associated to the same hierarchical tree. This representation can handle geometrical modifications and slight variations.(1).
A Computerized Method for Delay Risk Assessment Based on Fuzzy Set Theory using MS Project (TM)
BUDAYAN, Cenk; Dikmen Toker, İrem; Birgönül, Mustafa Talat; Ghaziani, Aydin (2018-08-01)
In this paper, we present a computerized method as an add-in to MS Project (TM) for risk identification and estimation of delay. A delay analysis ontology is incorporated so that the users can identify causes of delay at the activity level and assess frequency and impact of risks using their previous experience, expertise and judgement. Testing and validation findings demonstrate that the software is easy to use and gives reliable results. Decision-makers can use this software during risk identification and...
A framework for ranking and categorizing medical documents
Al Zamıl, Mohammed GH. I.; Betin Can, Aysu; Department of Information Systems (2010)
In this dissertation, we present a framework to enhance the retrieval, ranking, and categorization of text documents in medical domain. The contributions of this study are the introduction of a similarity model to retrieve and rank medical textdocuments and the introduction of rule-based categorization method based on lexical syntactic patterns features. We formulate the similarity model by combining three features to model the relationship among document and construct a document network. We aim to rank ret...
A novel rank-based classifier combination scheme for speaker identification
Altincay, H; Demirekler, Mübeccel (2000-06-09)
In this paper, we propose a novel rank-based classifier combination scheme under uncertainty for speaker identification (SI). The combination is based on a heuristic method that uses Dempster-Shafer theory of evidence under some conditions. The method is based on the extraction of first and R-th level ranking statistics. Using these statistics, the pat tern classes are clustered into model sets where the classes in these sets share set specific properties. Some of these model sets are used to reflect the st...
Citation Formats
P. Duygulu and M. V. Atalay, “A hierarchical representation of form documents for identification and retrieval,” SAN JOSE, CA, 2000, vol. 3967, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/54956.