Dbchem A database query based solution for the chemical compound and drug name recognition task

2013-10-03
Ata, Çağlar
Can, Tolga
We propose a method, named DBCHEM, based on database queries for the chemical compound and drug name recognition task of the BioCreative IV challenge. We prepared a database with 145 million entries containing compound and drug names, their synonyms, and molecular formulas. PubChem Power User Gateway (PUG) system is used to construct the database. Candidate chemical and drug names are identified by using an English dictionary as a list of stop words. All candidates are queried in the compound database. We integrated a small number of heuristic rules into this query based approach. DBCHEM attained 58% precision and 71% recall on the development set with a total running time of 14 minutes for 3500 articles.

Suggestions

Facile synthesis of novel 7-aminofuro- and 7-aminothieno[2,3-d]pyridazin-4(5H)-one and 4-aminophthalazin-1(2H)-ones
Koza, Gani; Keskin, Selbi; Ozer, Merve Sinem; Cengiz, Betul; ŞAHİN, Ertan; Balcı, Metin (2013-01-07)
We hereby report the synthesis of a novel class of compounds, 7-aminofuro- and 7-aminothieno[2,3-d]pyridazin-4(5H)-one and 4-aminophthalazin-1(2H)-ones starting from methyl 2-(2-methoxy-2-oxoethyl)furan- and thiophene-3-carboxylate and methyl 2-(2-methoxy-2-oxoethyl)benzoate. The ester functionalities connected directly to the aromatic ring were regiospecifically converted to the acid, whereas methylene groups were oxidized to the corresponding ketoesters. Reaction of the ketoesters with hydrazine provided ...
Specializing for predicting obesity and its co-morbidities
Goldstein, Ira; Uzuner, Oezlem (Elsevier BV, 2009-10-01)
We present specializing, a method for combining classifiers for multi-class classification. Specializing trains one specialist classifier per class and utilizes each specialist to distinguish that class from all others in a one-versus-all manner. It then supplements the specialist classifiers with a catch-all classifier that performs multi-class classification across all classes. We refer to the resulting combined classifier as a specializing classifier.
Implicit motif distribution based hybrid computational kernel for sequence classification
Atalay, Mehmet Volkan (Oxford University Press (OUP), 2005-04-15)
Motivation: We designed a general computational kernel for classification problems that require specific motif extraction and search from sequences. Instead of searching for explicit motifs, our approach finds the distribution of implicit motifs and uses as a feature for classification. Implicit motif distribution approach may be used as modus operandi for bioinformatics problems that require specific motif extraction and search, which is otherwise computationally prohibitive.
DEEPScreen: high performance drug-target interaction prediction with convolutional neural networks using 2-D structural compound representations
Rifaioğlu, Ahmet Süreyya; Nalbat, Esra; Atalay, Mehmet Volkan (2020-03-07)
The identification of physical interactions between drug candidate compounds and target biomolecules is an important process in drug discovery. Since conventional screening procedures are expensive and time consuming, computational approaches are employed to provide aid by automatically predicting novel drug-target interactions (DTIs). In this study, we propose a large-scale DTI prediction system, DEEPScreen, for early stage drug discovery, using deep convolutional neural networks. One of the main advantage...
MDeePred: Novel multi-channel protein featurization for deep learning-based binding affinity prediction in drug discovery
Rifaioglu, A.S.; Atalay, R. Cetin; Kahraman, Deniz Cansen; DOĞAN, TUNCA; Martin, M.; Atalay, Mehmet Volkan (2021-03-01)
Motivation: Identification of interactions between bioactive small molecules and target proteins is crucial for novel drug discovery, drug repurposing and uncovering off-target effects. Due to the tremendous size of the chemical space, experimental bioactivity screening efforts require the aid of computational approaches. Although deep learning models have been successful in predicting bioactive compounds, effective and comprehensive featurization of proteins, to be given as input to deep neural networks, r...
Citation Formats
Ç. Ata and T. Can, “Dbchem A database query based solution for the chemical compound and drug name recognition task,” 2013, vol. 2, Accessed: 00, 2021. [Online]. Available: http://www.biocreative.org/resources/publications/chemdner-proceed-publications/.