Comparison of normalization techniques for metasearch

Sever, H; Tolun, MR

dc.contributor.author	Sever, H
dc.contributor.author	Tolun, MR
dc.date.accessioned	2026-02-06T18:17:05Z
dc.date.issued	2002
dc.department	Doğu Akdeniz Üniversitesi
dc.description	2nd International Conference on Advances in Information Systems -- OCT 23-25, 2002 -- IZMIR, TURKEY
dc.description.abstract	It is a well-known fact that combining the retrieval outputs of different search systems in response to a query, known as metasearch, improves performance on average, provided that the combined systems (1) have compatible outputs, (2) produce accurate estimates of the probability of relevance of documents, and (3) are independent of each other. The objective of a normalization technique is to address the first requirement, i.e., document scores of different retrieval outputs are brought onto a common scale so that they are comparable across the combined retrieval outputs. This has been a recent subject of research in the metasearch and information filtering fields. In this paper, we present a different perspective on multiple evidence combination and investigate various normalization techniques, mostly ad hoc in nature, with a special focus on SUM, which shifts the minimum score to zero and then scales the scores so that they sum to one. This formal approach is equivalent to normalizing the distribution of scores of all documents in a retrieval output by dividing them by their sample mean. We have conducted extensive experiments using the ad hoc tracks of the third and fifth TREC collections and the CLEF'00 database. We argue that (1) the SUM normalization method is consistently better than the other traditionally proposed ones when combining outputs of search systems operating on a single database; (2) for combining outputs of search systems operating on mutually exclusive databases, SUM is still a valuable alternative to the method that weights the score distributions of documents by their databases' sizes.
dc.description.sponsorship	Fdn Dokuz Eylul Univ,Sci & Tech Res Council Turkey
dc.identifier.endpage	143
dc.identifier.isbn	3-540-00009-7
dc.identifier.issn	0302-9743
dc.identifier.issn	1611-3349
dc.identifier.orcid	0000-0002-8478-7220
dc.identifier.orcid	0000-0002-8261-0675
dc.identifier.scopus	2-s2.0-84951832990
dc.identifier.scopusquality	Q3
dc.identifier.startpage	133
dc.identifier.uri	https://hdl.handle.net/11129/8801
dc.identifier.volume	2457
dc.identifier.wos	WOS:000181470200013
dc.identifier.wosquality	N/A
dc.indekslendigikaynak	Web of Science
dc.indekslendigikaynak	Scopus
dc.language.iso	en
dc.publisher	Springer-Verlag Berlin
dc.relation.ispartof	Advances in Information Systems
dc.relation.publicationcategory	Conference Item - International - Institutional Faculty Member
dc.rights	info:eu-repo/semantics/closedAccess
dc.snmz	KA_WoS_20260204
dc.subject	Representations
dc.title	Comparison of normalization techniques for metasearch
dc.type	Conference Object
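
The SUM normalization described in the abstract (shift the minimum score to zero, then scale so the scores sum to one) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `sum_normalize` and `combine`, the dictionary representation of a retrieval output, the handling of all-equal scores, and the choice to fuse runs by adding normalized scores are all assumptions made for illustration, since the record does not specify them.

```python
from typing import Dict, Iterable


def sum_normalize(scores: Dict[str, float]) -> Dict[str, float]:
    """Shift the minimum score to zero, then scale so the scores sum to one."""
    if not scores:
        return {}
    lowest = min(scores.values())
    shifted = {doc: s - lowest for doc, s in scores.items()}
    total = sum(shifted.values())
    if total == 0:
        # Assumption: when every score is equal there is nothing to rank,
        # so all documents get zero.
        return {doc: 0.0 for doc in scores}
    return {doc: s / total for doc, s in shifted.items()}


def combine(runs: Iterable[Dict[str, float]]) -> Dict[str, float]:
    """Hypothetical fusion step: add each document's normalized scores across runs."""
    fused: Dict[str, float] = {}
    for run in runs:
        for doc, s in sum_normalize(run).items():
            fused[doc] = fused.get(doc, 0.0) + s
    return fused


# Two made-up retrieval outputs on very different score scales.
run_a = {"d1": 1.0, "d2": 3.0}
run_b = {"d1": 10.0, "d2": 30.0, "d3": 20.0}
fused = combine([run_a, run_b])
ranking = sorted(fused, key=fused.get, reverse=True)
```

After normalization both runs lie on the same scale, so a document's scores can be added across runs regardless of each system's original score range.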

Collections

WoS Indexed Publications Collection
Scopus Indexed Publications Collection
