Farsi document image recognition system using word layout signature
| dc.contributor.author | Ergun, Cem | |
| dc.contributor.author | Norozpour, Sajedeh | |
| dc.date.accessioned | 2026-02-06T18:24:45Z | |
| dc.date.issued | 2019 | |
| dc.department | Doğu Akdeniz Üniversitesi | |
| dc.description.abstract | In this paper, a new representation of Farsi words is proposed to present the keyword spotting problems in Farsi document image retrieval. In this regard, we define a signature for each Farsi word based on the word connected component layout. The mentioned signature is shown as boxes, and then, by sketching vertical and horizontal lines, we construct a grid of each word to provide a new descriptor. One of the advantages of this method is that it can be used for both handwritten and machine-printed texts. Finally, to evaluate the performance of our system in comparison to other methods, a database that contains 19,582 printed Farsi words is examined, and after applying this approach, a recall rate of 98.1% and a precision rate of 94.3% are obtained. | |
| dc.identifier.doi | 10.3906/elk-1804-92 | |
| dc.identifier.endpage | 1488 | |
| dc.identifier.issn | 1300-0632 | |
| dc.identifier.issn | 1303-6203 | |
| dc.identifier.issue | 2 | |
| dc.identifier.orcid | 0000-0002-5766-9966 | |
| dc.identifier.scopus | 2-s2.0-85065815882 | |
| dc.identifier.scopusquality | Q2 | |
| dc.identifier.startpage | 1477 | |
| dc.identifier.trdizinid | 336808 | |
| dc.identifier.uri | https://doi.org/10.3906/elk-1804-92 | |
| dc.identifier.uri | https://search.trdizin.gov.tr/tr/yayin/detay/336808 | |
| dc.identifier.uri | https://hdl.handle.net/11129/10353 | |
| dc.identifier.volume | 27 | |
| dc.identifier.wos | WOS:000463355800056 | |
| dc.identifier.wosquality | Q3 | |
| dc.indekslendigikaynak | Web of Science | |
| dc.indekslendigikaynak | Scopus | |
| dc.indekslendigikaynak | TR-Dizin | |
| dc.language.iso | en | |
| dc.publisher | Tubitak Scientific & Technological Research Council Turkey | |
| dc.relation.ispartof | Turkish Journal of Electrical Engineering and Computer Sciences | |
| dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | |
| dc.rights | info:eu-repo/semantics/openAccess | |
| dc.snmz | KA_WoS_20260204 | |
| dc.subject | Farsi document image retrieval | |
| dc.subject | word spotting | |
| dc.subject | word layout signature | |
| dc.subject | optical character recognition | |
| dc.title | Farsi document image recognition system using word layout signature | |
| dc.type | Article |










