Comparison of normalization techniques for metasearch

dc.contributor.authorSever, H
dc.contributor.authorTolun, MR
dc.date.accessioned2026-02-06T18:17:05Z
dc.date.issued2002
dc.departmentDoğu Akdeniz Üniversitesi
dc.description2nd International Conference on Advances in Information Systems -- OCT 23-25, 2002 -- IZMIR, TURKEY
dc.description.abstractIt is well-known fact that the combination of the retrieval outputs of different search systems in response to a query, known as metasearch, improves performance on average, provided that these combined systems (1) have compatible outputs, (2) produce accurate probability of relevance estimates of documents, and (3) be independent of each other. The objective of a normalization technique is to target the first requirement, i.e., document scores of different retrieval outputs are brought into a common scale so that document scores can be comparable across combined retrieval outputs. This has been a recent subject of researches in metasearch and information filtering fields. In this paper, we present a different perspective on multiple evidence combination and investigate various normalization techniques, mostly ad-hoc in nature, with a special focus on the SUM, which shifts minimum scores to zero and then scales their summation to one. This formal approach is equivalent to normalize the distribution of scores of all documents in a retrieval output by dividing them by their sample mean. We have made extensive experiments using ad hoc tracks of third and fifth TREC collections and CLEF'00 database. We argue that (1) the normalization method SUM is consistently better than the other traditionally proposed ones when combining outputs of search systems operating on a single database; (2) the SUM for combination of outputs of search systems operating on mutually exclusive databases is still valuable alternative to the one weighting score distributions of documents by their databases' size.
dc.description.sponsorshipFdn Dokuz Eylul Univ,Sci & Tech Res Council Turkey
dc.identifier.endpage143
dc.identifier.isbn3-540-00009-7
dc.identifier.issn0302-9743
dc.identifier.issn1611-3349
dc.identifier.orcid0000-0002-8478-7220
dc.identifier.orcid0000-0002-8261-0675
dc.identifier.scopus2-s2.0-84951832990
dc.identifier.scopusqualityQ3
dc.identifier.startpage133
dc.identifier.urihttps://hdl.handle.net/11129/8801
dc.identifier.volume2457
dc.identifier.wosWOS:000181470200013
dc.identifier.wosqualityN/A
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherSpringer-Verlag Berlin
dc.relation.ispartofAdvances in Information Systems
dc.relation.publicationcategoryKonferans Öğesi - Uluslararası - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.snmzKA_WoS_20260204
dc.subjectRepresentations
dc.titleComparison of normalization techniques for metasearch
dc.typeConference Object

Files