Information Retrieval Using the Reduced Row Echelon Form of a Term-Document Matrix

dc.contributor.author: Parali, Ufuk
dc.contributor.author: Zontul, Metin
dc.contributor.author: Ertugrul, Duygu Celik
dc.date.accessioned: 2026-02-06T18:21:36Z
dc.date.issued: 2019
dc.department: Doğu Akdeniz Üniversitesi
dc.description.abstract: Retrieving information relevant to a user's query is becoming more difficult because of the large amount of information on the web. This study presents a new algorithm, the reduced row echelon form IR method (rrefIR), which achieves higher average similarity precision than conventional information retrieval (IR) algorithms and returns more relevant, noise-free documents. For dimension reduction, the proposed algorithm applies singular value decomposition (SVD) to the reduced row echelon form, obtained with the Gauss-Jordan method, of the covariance of the term-document matrix (TDM). The rrefIR algorithm outperforms the LSI and COV algorithms in terms of average similarity precision under the Jaro-Winkler, Overlap, Tanimoto, and Jaccard similarity measures. The reason for the better IR performance is the set of linearly independent basis vectors obtained by the Gauss-Jordan operation. This basis set can be considered as the generating roots of the vector space spanned by the TDM. Using these vectors strengthens the latent semantic characteristics of the SVD phase of the proposed IR algorithm.
dc.identifier.doi: 10.3966/160792642019072004004
dc.identifier.endpage: 1046
dc.identifier.issn: 1607-9264
dc.identifier.issn: 2079-4029
dc.identifier.issue: 4
dc.identifier.orcid: 0000-0003-1380-705X
dc.identifier.scopus: 2-s2.0-85071716301
dc.identifier.scopusquality: Q2
dc.identifier.startpage: 1037
dc.identifier.uri: https://doi.org/10.3966/160792642019072004004
dc.identifier.uri: https://hdl.handle.net/11129/9386
dc.identifier.volume: 20
dc.identifier.wos: WOS:000483464100004
dc.identifier.wosquality: Q4
dc.indekslendigikaynak: Web of Science
dc.indekslendigikaynak: Scopus
dc.language.iso: en
dc.publisher: Library & Information Center, Nat Dong Hwa Univ
dc.relation.ispartof: Journal of Internet Technology
dc.relation.publicationcategory: Article - International Refereed Journal - Institutional Faculty Member
dc.rights: info:eu-repo/semantics/closedAccess
dc.snmz: KA_WoS_20260204
dc.subject: Information retrieval
dc.subject: Gauss-Jordan
dc.subject: SVD
dc.subject: Similarity measures
dc.title: Information Retrieval Using the Reduced Row Echelon Form of a Term-Document Matrix
dc.type: Article
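
The pipeline named in the abstract (covariance of the TDM, Gauss-Jordan reduction to reduced row echelon form, then SVD) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify how the covariance is formed, how many singular vectors are kept, or how queries are scored. The sketch assumes a term-by-term covariance from `numpy.cov`, keeps the top `k` right singular vectors of the nonzero RREF rows, and ranks documents by cosine similarity in the projected space. The names `rref`, `rank_documents`, `tdm`, `query`, and `k`, and the toy data, are all hypothetical.

```python
import numpy as np


def rref(matrix, tol=1e-10):
    """Reduced row echelon form via Gauss-Jordan elimination with partial pivoting."""
    a = np.array(matrix, dtype=float)
    rows, cols = a.shape
    pivot_row = 0
    for col in range(cols):
        if pivot_row == rows:
            break
        # Choose the largest remaining entry in this column as the pivot.
        best = pivot_row + int(np.argmax(np.abs(a[pivot_row:, col])))
        if abs(a[best, col]) < tol:
            continue  # no usable pivot in this column
        a[[pivot_row, best]] = a[[best, pivot_row]]
        a[pivot_row] /= a[pivot_row, col]
        # Eliminate the pivot column from every other row.
        for r in range(rows):
            if r != pivot_row:
                a[r] -= a[r, col] * a[pivot_row]
        pivot_row += 1
    a[np.abs(a) < tol] = 0.0
    return a


def rank_documents(tdm, query, k=2):
    """Rank documents against a query (assumed scoring scheme, see lead-in)."""
    cov = np.cov(tdm)  # assumption: term-by-term covariance, rows are terms
    basis = rref(cov)
    basis = basis[np.any(basis != 0.0, axis=1)]  # linearly independent rows
    _, _, vt = np.linalg.svd(basis, full_matrices=False)
    proj = vt[:k]          # top-k right singular vectors, in term space
    docs = proj @ tdm      # documents in the reduced space
    q = proj @ query       # query in the reduced space
    denom = np.linalg.norm(q) * np.linalg.norm(docs, axis=0) + 1e-12
    sims = (q @ docs) / denom  # cosine similarity per document
    return np.argsort(-sims), sims


# Toy term-document matrix: 5 terms (rows) by 4 documents (columns).
tdm = np.array([
    [2, 0, 1, 0],
    [1, 3, 0, 0],
    [0, 2, 0, 1],
    [1, 0, 2, 0],
    [0, 0, 1, 3],
], dtype=float)
query = np.array([1, 0, 0, 1, 0], dtype=float)

order, sims = rank_documents(tdm, query, k=2)
print("ranking:", order)
print("similarities:", sims)
```

The nonzero rows of the RREF span the same row space as the covariance matrix, which is one way to read the "linearly independent basis vectors" mentioned in the abstract. The article evaluates with Jaro-Winkler, Overlap, Tanimoto, and Jaccard measures; cosine similarity is used here only to keep the sketch short.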
