A novel dissimilarity metric based on feature-to-feature scatter frequencies for clustering-based feature selection in biomedical data

dc.contributor.authorSheikhi, Ghazaal
dc.contributor.authorAltincay, Hakan
dc.date.accessioned2026-02-06T18:50:56Z
dc.date.issued2021
dc.departmentDoğu Akdeniz Üniversitesi
dc.description.abstractFilter feature selection methods have been extensively used for dimensionality reduction in biomedical data analysis. In this article, a novel dissimilarity metric based on feature-to-feature (F2F) scatter frequencies is proposed for clustering-based filter feature selection. The proposed metric is computed by obtaining the feature-level ranks of samples and identifying the features which assign close ranks to each sample. The order of ranking is determined for each feature by class labels. Samples are represented as a set of affinity sets containing features having rank differences less than a predefined proximity window size. The F2F dissimilarity of a pair of features is computed using the frequency of their appearance in different affinity sets. Features are then clustered into distinct groups using F2F dissimilarity metric. From each cluster, the feature having the highest relevance score is selected. The experiments conducted on ten biomedical datasets confirmed the effectiveness of the proposed method in improving classification performance. Notably, the proposed method outperforms the widely used schemes in terms of both classification under ROC curve and stability.
dc.identifier.doi10.1111/coin.12470
dc.identifier.endpage1889
dc.identifier.issn0824-7935
dc.identifier.issn1467-8640
dc.identifier.issue4
dc.identifier.scopus2-s2.0-85107874169
dc.identifier.scopusqualityQ1
dc.identifier.startpage1865
dc.identifier.urihttps://doi.org/10.1111/coin.12470
dc.identifier.urihttps://hdl.handle.net/11129/15129
dc.identifier.volume37
dc.identifier.wosWOS:000661289800001
dc.identifier.wosqualityQ3
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherWiley
dc.relation.ispartofComputational Intelligence
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.snmzKA_WoS_20260204
dc.subjectdissimilarity
dc.subjectfeature clustering
dc.subjectfeature selection
dc.subjectrepresentative feature
dc.subjectscatter frequency
dc.titleA novel dissimilarity metric based on feature-to-feature scatter frequencies for clustering-based feature selection in biomedical data
dc.typeArticle

Files