Chemical disease relation extraction through the combination of multiple mention levels: RelSCAN

dc.contributor.authorOnye, Stanley Chika
dc.contributor.authorDimililer, Nazife
dc.contributor.authorAkkeles, Arif
dc.date.accessioned2026-02-06T18:21:56Z
dc.date.issued2022
dc.departmentDoğu Akdeniz Üniversitesi
dc.description.abstractChemical-induced disease (CID) relation extraction has been pivotal in the understanding of biological processes. A CID relation between a chemical and disease entity may be extracted either from a single sentence or from two or more adjacent sentences. We use 'intrasentence level' to refer to the mention of the desired entities in the same sentence and 'intersentence level' to refer to the mention of these entities in two or more adjacent sentences. This study proposes a three-phase architecture for extracting CID relations from biomedical literature by considering both sentence levels and additionally the combination of these two sentence levels which we describe as the 'joint level'. In phase 1, we construct relation instances at the intra-and intersentence levels which are subsequently combined to form the joint level. In phase 2, we extracted features specifically for an individual relation instance at the three levels. At each of these levels, we trained three classifier models that consist of the combination of two classifiers. We used the training dataset for training and later classified the CID relation instances using the test dataset. Phase 3 consists of two steps; in step 1, the classifier outputs from both the intra-and intersentence levels are combined and in step 2, the results from step 1 are combined with the results from the classifier trained at joint level using a prediction probability-based voting algorithm to determine the final result. Using the BioCreative V corpus for validation, we obtain results that outperform all the state-of-the-art systems for CID relation extraction on the standard chemical-disease relation corpus.
dc.identifier.doi10.55730/1300-0632.3936
dc.identifier.endpage2253
dc.identifier.issn1300-0632
dc.identifier.issn1303-6203
dc.identifier.issue6
dc.identifier.orcid0000-0003-2175-8577
dc.identifier.scopus2-s2.0-85142255062
dc.identifier.scopusqualityQ2
dc.identifier.startpage2237
dc.identifier.trdizinid1142566
dc.identifier.urihttps://doi.org/10.55730/1300-0632.3936
dc.identifier.urihttps://search.trdizin.gov.tr/tr/yayin/detay/1142566
dc.identifier.urihttps://hdl.handle.net/11129/9546
dc.identifier.volume30
dc.identifier.wosWOS:000884407400016
dc.identifier.wosqualityQ3
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.indekslendigikaynakTR-Dizin
dc.language.isoen
dc.publisherTubitak Scientific & Technological Research Council Turkey
dc.relation.ispartofTurkish Journal of Electrical Engineering and Computer Sciences
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/openAccess
dc.snmzKA_WoS_20260204
dc.subjectRelation extraction
dc.subjecttext mining
dc.subjectchemical disease relation
dc.subjectdecision -making
dc.titleChemical disease relation extraction through the combination of multiple mention levels: RelSCAN
dc.typeArticle

Files