HMZDupFinder: a robust computational approach for detecting intragenic homozygous duplications from exome sequencing data

dc.contributor.authorDu, Haowei
dc.contributor.authorDardas, Zain
dc.contributor.authorJolly, Angad
dc.contributor.authorGrochowski, Christopher M.
dc.contributor.authorJhangiani, Shalini N.
dc.contributor.authorLi, He
dc.contributor.authorLupski, James R.
dc.date.accessioned2026-02-06T18:48:58Z
dc.date.issued2024
dc.departmentDoğu Akdeniz Üniversitesi
dc.description.abstractHomozygous duplications contribute to genetic disease by altering gene dosage or disrupting gene regulation and can be more deleterious to organismal biology than heterozygous duplications. Intragenic exonic duplications can result in loss-of-function (LoF) or gain-of-function (GoF) alleles that when homozygosed, i.e. brought to homozygous state at a locus by identity by descent or state, could potentially result in autosomal recessive (AR) rare disease traits. However, the detection and functional interpretation of homozygous duplications from exome sequencing data remains a challenge. We developed a framework algorithm, HMZDupFinder, that is designed to detect exonic homozygous duplications from exome sequencing (ES) data. The HMZDupFinder algorithm can efficiently process large datasets and accurately identifies small intragenic duplications, including those associated with rare disease traits. HMZDupFinder called 965 homozygous duplications with three or less exons from 8,707 ES with a recall rate of 70.9% and a precision of 16.1%. We experimentally confirmed 8/10 rare homozygous duplications. Pathogenicity assessment of these copy number variant alleles allowed clinical genomics contextualization for three homozygous duplications alleles, including two affecting known OMIM disease genes EDAR (MIM# 224900), TNNT1(MIM# 605355), and one variant in a novel candidate disease gene: PAAF1. Graphical Abstract
dc.description.sponsorshipUS National Institutes of Health, National Human Genome Research Institute (NHGRI)/National Heart, Lung and Blood Institute (NHLBI) [UM1 HG006542]; NHGRI [U54 HG003273, UM1 HG008898]; NHGRI Genomic Research Elucidates Genetics of Rare disease (GREGoR) consortium [U01 HG011758]; National Institute of General Medical Sciences (NIGMS) [R01 GM132589, R01 GM106373]; National Institute for Neurological Disorders and Stroke (NINDS) [R35 NS105078]; Rett Syndrome Research Trust (RSRT); International Rett Syndrome Foundation (IRSF) [3701-1]; Doris Duke Charitable Foundation [2023-0235]; NINDS [K23 NS125126-01A1]; Medical Genetics Research Fellowship Program through the United States National Institute of Health [T32 GM007526-42]; Child Neurologist Career Development Program K12 and MDA Development Grant [873841]; [NHGRI K08 HG008986]; National Human Genome Research Institute [U01HG011758] Funding Source: NIH RePORTER; National Institute of General Medical Sciences [R01GM132589] Funding Source: NIH RePORTER; National Institute of Neurological Disorders and Stroke [R35NS105078] Funding Source: NIH RePORTER
dc.description.sponsorshipUS National Institutes of Health, National Human Genome Research Institute (NHGRI)/National Heart, Lung and Blood Institute (NHLBI) [UM1 HG006542 to the Baylor Hopkins Center for Mendelian Genomics] (in part); NHGRI [U54 HG003273]; NHGRI [UM1 HG008898]; NHGRI Genomic Research Elucidates Genetics of Rare disease (GREGoR) consortium [U01 HG011758, NHGRI K08 HG008986 to J.E.P.]; National Institute of General Medical Sciences (NIGMS) [R01 GM132589 to C.M.B.C. and R01 GM106373]; National Institute for Neurological Disorders and Stroke (NINDS) [R35 NS105078]; D.P. is supported by Rett Syndrome Research Trust (RSRT); International Rett Syndrome Foundation (IRSF) [3701-1]; Doris Duke Charitable Foundation [2023-0235]; NINDS [K23 NS125126-01A1]; D.M. was supported by a Medical Genetics Research Fellowship Program through the United States National Institute of Health [T32 GM007526-42]; D.G.C. was supported by the Child Neurologist Career Development Program K12 and MDA Development Grant [873841]. Funding for open access charge: NHGRI Genomic Research Elucidates Genetics of Rare disease (GREGoR) consortium [U01 HG011758, NHGRI K08 HG008986 to J.E.P.].
dc.identifier.doi10.1093/nar/gkad1223
dc.identifier.issn0305-1048
dc.identifier.issn1362-4962
dc.identifier.issue4
dc.identifier.orcid0000-0002-5663-1157
dc.identifier.orcid0000-0002-1766-5311
dc.identifier.orcid0000-0002-3884-7720
dc.identifier.orcid0000-0002-3927-2711
dc.identifier.orcid0000-0003-2387-3122
dc.identifier.pmid38153174
dc.identifier.scopus2-s2.0-85186412068
dc.identifier.scopusqualityQ1
dc.identifier.urihttps://doi.org/10.1093/nar/gkad1223
dc.identifier.urihttps://hdl.handle.net/11129/14647
dc.identifier.volume52
dc.identifier.wosWOS:001132579800001
dc.identifier.wosqualityQ1
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakPubMed
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherOxford Univ Press
dc.relation.ispartofNucleic Acids Research
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı
dc.rightsinfo:eu-repo/semantics/openAccess
dc.snmzKA_WoS_20260204
dc.subjectCopy-Number Variation
dc.subjectCnv
dc.titleHMZDupFinder: a robust computational approach for detecting intragenic homozygous duplications from exome sequencing data
dc.typeArticle

Files