Improving Biochemical Named Entity Recognition Using PSO Classifier Selection and Bayesian Combination Methods

Loading...
Thumbnail Image

Date

Journal Title

Journal ISSN

Volume Title

Publisher

IEEE Computer Soc

Access Rights

info:eu-repo/semantics/closedAccess

Abstract

Named Entity Recognition (NER) is a basic step for large number of consequent text mining tasks in the biochemical domain. Increasing the performance of such recognition systems is of high importance and always poses a challenge. In this study, a new community based decision making system is proposed which aims at increasing the efficiency of NER systems in the chemical/drug name context. Particle Swarm Optimization (PSO) algorithm is chosen as the expert selection strategy along with the Bayesian combination method to merge the outputs of the selected classifiers as well as evaluate the fitness of the selected candidates. The proposed system performs in two steps. The first step focuses on creating various numbers of baseline classifiers for NER with different features sets using the Conditional Random Fields (CRFs). The second step involves the selection and efficient combination of the classifiers using PSO and Bayesisan combination. Two comprehensive corpora from BioCreative events, namely ChemDNER and CEMP, are used for the experiments conducted. Results show that the ensemble of classifiers selected by means of the proposed approach perform better than the single best classifier as well as ensembles formed using other popular selection/combination strategies for both corpora. Furthermore, the proposed method outperforms the best performing system at the Biocreative IV ChemDNER track by achieving an F-score of 87.95 percent.

Description

Keywords

Named entity recognition, conditional random fields, particle swarm optimization, classifier combination, Bayesian combiner

Journal or Series

Ieee-Acm Transactions on Computational Biology and Bioinformatics

WoS Q Value

Scopus Q Value

Volume

14

Issue

6

Citation

Endorsement

Review

Supplemented By

Referenced By