Prediction of Protein-Protein Interaction Relevance of Articles Using References

2009-09-16
Calli, Cagatay
Classifying documents as protein-protein interaction (PPI) relevant or not is the first step towards extracting meaningful PPI data from article content. Currently, this classification step is handled manually by expert curators. A number of text-mining methods have been proposed to tackle this problem, using abstracts without references. We propose that article references contain important information that can be used to enhance these previous techniques. We trained an SVM classifier solely based on reference links extracted from Biocreative II data to test the effect of references. Our approach includes a feature selection method based on reference count imbalance between positive and negative examples. Classification results on Biocreative II test and Biocreative II.5 training datasets show that even simple referential information extracted from papers can be effective for predicting protein interaction.

Suggestions

Fuzzy data representation and querying in XML database
Ustunkaya, Ekin; Yazıcı, Adnan; George, Roy (2007-02-01)
Real-world information including subjective opinions and judgments need imprecise data to be modeled for representation and querying in databases. The Extensible Markup Language (XML) has become a de-facto standard for data modeling and exchange in recent years. Efforts on modeling imprecision and representing such data in XML have not been fully developed. In this paper, an XML based fuzzy data representation and querying system is presented. Complex and imprecise data are represented using a fuzzy extensi...
Analysis of electronic signature in Turkey from the legal and economic perspectives and the awareness level in the country
İskender, Gökhan; Koçyiğit, Altan; Department of Information Systems (2006)
As in the case of other information technologies, the best way of obtaining efficient results from electronic signature application is integrating it to the legal and economic systems and increasing the awareness level of technology in the society. This thesis performs the legal and economic analyses of electronic signature in Turkey and measures the awareness level in the society. The analyses performed in the thesis show that electronic signature is not legally established in Turkey even the legal base is...
Improving Oil-Rate Estimate in Capacitance/Resistance Modeling Using the Y-Function Method for Reservoirs Under Waterflood
Temizel, Cenk; Artun, Emre; Yang, Zhengming (Society of Petroleum Engineers (SPE), 2019-08-01)
Capacitance/resistance modeling (CRM) is an empirical waterflood modeling technique based on the signal correlations between injection rates and gross production rates. CRM can satisfactorily estimate the gross (liquid) production rate. The oil-production-rate forecast is based on fitting the empirical oil fractional-flow model, the Leverett (1941) oil fractional-flow model, or the Koval (1963) model to the historical production data. We observed that the oil-production-rate forecast in this approach is les...
Towards domain oriented semi automated model matching for supporting data exchange
Hongjun, Wang; Akinci, Burcu; Garrett, Jim; Akin, Ömer; Turkaslan Bulbul, Tanyel; Gürsel Dino, İpek (null; 2004-06-04)
The process of m atching data represented in two different data models is a long - standing issue in the exchange of data between different software systems. While the traditional manual matching approach cannot meet today 's demands on data exchange, research shows that a fully automated generic approach for model matching is not likely, and generic semi-automated approaches are not easy to implement. In this paper, we present an approach that focuses on matching data models in a specific domain. The appro...
Determination of three-phase relative permeabilty values by using an artificial neural network model
Karaman, T; Demiral, BMR (Informa UK Limited, 2004-08-01)
In this study, an artificial neural network (ANN) tool, which uses the data obtained from a pore network (PN) model, was developed in order to obtain three-phase relative permeability values. During the development of this ANN tool, four different stages were implemented in which ANN structures were changed in order to find the best architecture that would predict the oil isoperms correctly. By using the data obtained from the PN model, training was implemented and the prediction power of that tool was test...
Citation Formats
C. Calli, “Prediction of Protein-Protein Interaction Relevance of Articles Using References,” 2009, p. 189, Accessed: 00, 2020. [Online]. Available: https://hdl.handle.net/11511/64043.