-
Identifying associations of de novo noncoding variants with autism through integration of gene expression, sequence, and sex information Genome Biol. (IF 10.1) Pub Date : 2025-06-04
Runjia Li, Jason ErnstWhole-genome sequencing (WGS) data has facilitated genome-wide identification of rare noncoding variants. However, elucidating these variants’ associations with complex diseases remains challenging. A previous study utilized a deep-learning-based framework and reported a significant brain-related association signal of autism spectrum disorder (ASD) detected from de novo noncoding variants in the Simons
-
DNA G-quadruplex structures act as functional elements in α- and β-globin enhancers Genome Biol. (IF 10.1) Pub Date : 2025-06-04
Colm Doyle, Krzysztof Herka, Sean M. Flynn, Larry Melidis, Somdutta Dhir, Stefan Schoenfelder, David Tannahill, Shankar BalasubramanianEnhancer elements interact with target genes at a distance to modulate their expression, but the molecular details of enhancer–promoter interaction are incompletely understood. G-quadruplex DNA secondary structures (G4s) have recently been shown to co-occur with 3D chromatin interactions; however, the functional importance of G4s within enhancers remains unclear. In this study, we identify novel G4
-
Transcriptional analysis of metastatic hormone-naïve prostate cancer primary tumor biopsies reveals a relevant role for SOX11 in prostate cancer cell dissemination Genome Biol. (IF 10.1) Pub Date : 2025-06-03
Natalia Martin-Martin, Saioa Garcia-Longarte, Jon Corres-Mendizabal, Uxue Lazcano, Ianire Astobiza, Laura Bozal-Basterra, Nicolas Herranz, Hielke van Splunder, Onintza Carlevaris, Mikel Pujana-Vaquerizo, María Teresa Blasco, Ana M. Aransay, Antonio Rosino, Julian Tudela, Daniel Jimenez, Alberto Martinez, Andrei Salca, Aida Santos-Martín, Sofía Rey, Aitziber Ugalde-Olano, David Gonzalo, Mariona GrauperaMetastatic hormone-naïve prostate cancer (mHNPC) is an infrequent form of this tumor type that is characterized by metastasis at the time of diagnosis and accounts for up to 50% of prostate cancer-related deaths. Despite the extensive characterization of localized and metastatic castration-resistant prostate cancer, the molecular characteristics of mHNPC remain largely unexplored. Here, we provide
-
DeepGFT: identifying spatial domains in spatial transcriptomics of complex and 3D tissue using deep learning and graph Fourier transform Genome Biol. (IF 10.1) Pub Date : 2025-06-03
Shuli Sun, Jixin Liu, Guojun Li, Bingqiang LiuThe rapid advancements in spatially resolved transcriptomics (SRT) enable the characterization of gene expressions while preserving spatial information. However, high dropout rates and noise hinder accurate spatial domain identification for understanding tissue architecture. We present DeepGFT, a method that simultaneously models spot-wise and gene-wise relationships by integrating deep learning with
-
A resource for whole-body gene expression map of human tissues based on integration of single cell and bulk transcriptomics Genome Biol. (IF 10.1) Pub Date : 2025-06-03
Mengnan Shi, Loren Méar, Max Karlsson, María Bueno Álvez, Andreas Digre, Rutger Schutten, Borbala Katona, Jimmy Vuu, Emil Lindström, Feria Hikmet, Han Jin, Meng Yuan, Xiangyu Li, Hong Yang, Xiya Song, Evelina Sjöstedt, Fredrik Edfors, Per Oksvold, Kalle von Feilitzen, Martin Zwahlen, Mattias Forsberg, Fredric Johansson, Jan Mulder, Tomas Hökfelt, Yonglun Luo, Lynn Butler, Wen Zhong, Adil MardinogluNew technologies enable single-cell transcriptome analysis, mapping genome-wide expression across the human body. Here, we present an extended analysis of protein-coding genes in all major human tissues and organs, combining single-cell and bulk transcriptomics. To enhance transcriptome depth, 31 tissues were analyzed using a pooling method, identifying 557 unique cell clusters, manually annotated
-
Publisher Correction: Analyzing the relationship of RNA and DNA methylation with gene expression Genome Biol. (IF 10.1) Pub Date : 2025-06-03
Shangqian Xie, Darren Hagen, Gabrielle M. Becker, Kimberly M. Davenport, Katie A. Shira, Morgan R. Stegemiller, Jacob W. Thorne, Sarem Khilji, Denise Konetchy, Patricia Villamediana, Brenda M. Murdoch, Stephanie D. McKayPublisher Correction: Genome Biol 26, 140 (2025) https://doi.org/10.1186/s13059-025-03617-3 Following publication of the original article [1], the authors identified typesetting error, whereby the Jaccard index has a subset 1 in the numerator, however the absolute value bars were missing in the denominator. Incorrect index: Jaccard index = \(\frac{\left|{G}_{1}\bigcap {G}_{j}\right|}{{G}_{i}\bigcup
-
Differential epigenetic regulation by blue and UV-A light reveals the key role of CsSDG36-mediated H3K4 methylation in leaf development and secondary metabolism in Camellia sinensis Genome Biol. (IF 10.1) Pub Date : 2025-06-02
Pu Wang, Hong Zhang, Yongli Yin, Yue Ge, Binrui Chen, Jing Hu, Yu Wang, Dejiang Ni, Fei GuoThere is a Chinese proverb that good tea comes from high mountains with clouds and mists, suggesting the important impact of environmental factors on the development and secondary metabolism in tea plants. However, the epigenetic mechanism involved is still unclear. High altitude results in light enhancement with a higher retention of short-wavelength light in cloudy conditions, suggesting the key
-
mNSF: multi-sample non-negative spatial factorization Genome Biol. (IF 10.1) Pub Date : 2025-06-02
Yi Wang, Kyla Woyshner, Chaichontat Sriworarat, Genevieve Stein-O’Brien, Loyal A. Goff, Kasper D. HansenAnalyzing multi-sample spatial transcriptomics data requires accounting for biological variation. We present multi-sample non-negative spatial factorization (mNSF), an alignment-free framework extending single-sample spatial factorization to multi-sample datasets. mNSF incorporates sample-specific spatial correlation modeling and extracts low-dimensional data representations. Through simulations and
-
Microbial inoculants modulate the rhizosphere microbiome, alleviate plant stress responses, and enhance maize growth at field scale Genome Biol. (IF 10.1) Pub Date : 2025-06-01
Davide Francioli, Ioannis D. Kampouris, Theresa Kuhl-Nagel, Doreen Babin, Loreen Sommermann, Jan H. Behr, Soumitra Paul Chowdhury, Rita Zrenner, Narges Moradtalab, Michael Schloter, Joerg Geistlinger, Uwe Ludewig, Günter Neumann, Kornelia Smalla, Rita GroschField inoculation of crops with beneficial microbes is a promising sustainable strategy to enhance plant fitness and nutrient acquisition. However, effectiveness can vary due to environmental factors, microbial competition, and methodological challenges, while their precise modes of action remain uncertain. This underscores the need for further research to optimize inoculation strategies for consistent
-
Rapid reprogramming and stabilization of homoeolog expression bias in hexaploid wheat biparental populations Genome Biol. (IF 10.1) Pub Date : 2025-05-28
Marek Glombik, Ramesh Arunkumar, Samuel Burrows, Sophie Louise Mogg, Xiaoming Wang, Philippa BorrillDifferences in the relative level of expression of homoeologs, known as homoeolog expression bias, are widely observed in allopolyploids. While the evolution of homoeolog expression bias through hybridization has been characterized, on shorter timescales such as those found in crop breeding programs, the extent to which homoeolog expression bias is preserved or altered between generations remains elusive
-
Non-cyanobacterial diazotrophs support the survival of marine microalgae in nitrogen-depleted environment Genome Biol. (IF 10.1) Pub Date : 2025-05-28
Udita Chandola, Marinna Gaudin, Camille Trottier, Louis-Josselin Lavier-Aydat, Eric Manirakiza, Samuel Menicot, Erik Jörg Fischer, Isabelle Louvet, Thomas Lacour, Timothée Chaumier, Atsuko Tanaka, Georg Pohnert, Samuel Chaffron, Leïla TirichineNon-cyanobacteria diazotrophs (NCDs) are shown to dominate in surface waters shifting the long-held paradigm of cyanobacteria dominance. This raises fundamental questions on how these putative heterotrophic bacteria thrive in sunlit oceans. The absence of laboratory cultures of these bacteria significantly limits our ability to understand their behavior in natural environments and, consequently, their
-
The impact of PCR duplication on RNAseq data generated using NovaSeq 6000, NovaSeq X, AVITI, and G4 sequencers Genome Biol. (IF 10.1) Pub Date : 2025-05-28
Natalia Zajac, Ioannis S. Vlachos, Sija Sajibu, Lennart Opitz, Shuoshuo Wang, Sridar V. Chittur, Christopher E. Mason, Kevin L. Knudtson, John M. Ashton, Hubert Rehrauer, Catharine AquinoTranscriptome sequencing (RNA-seq) is a powerful technology for gene expression profiling. Selection of optimal parameters for cDNA library generation is crucial for acquisition of high-quality data. In this study, we investigate the impact of the amount of RNA and the number of PCR cycles used for sample amplification on the rate of PCR duplication and, in consequence, on the RNA-seq data quality
-
AMHY and sex determination in egg-laying mammals Genome Biol. (IF 10.1) Pub Date : 2025-05-27
Linda Shearwin-Whyatt, Jane Fenelon, Hongshi Yu, Andrew Major, Zhipeng Qu, Yang Zhou, Keith Shearwin, James Galbraith, Alexander Stuart, David Adelson, Guojie Zhang, Michael Pyne, Stephen Johnston, Craig Smith, Marilyn Renfree, Frank GrütznerEgg-laying mammals (monotremes) evolved multiple sex chromosomes independently of therian mammals and lack the sex-determining gene SRY. The Y-localized anti-Müllerian hormone gene (AMHY) is the candidate sex-determination gene in monotremes. Here, we describe the evolution of monotreme AMHX and AMHY gametologues and for the first time, investigate their expression during gonad sexual differentiation
-
Author Correction: Efficient precise knockin with a double cut HDR donor after CRISPR/Cas9-mediated double-stranded DNA cleavage Genome Biol. (IF 10.1) Pub Date : 2025-05-27
Jian-Ping Zhang, Xiao-Lan Li, Guo-Hua Li, Wanqiu Chen, Cameron Arakaki, Gary D. Botimer, David Baylink, Lu Zhang, Wei Wen, Ya-Wen Fu, Jing Xu, Noah Chun, Weiping Yuan, Tao Cheng, Xiao-Bing ZhangAuthor Correction: Genome Biol 18, 35 (2017) https://doi.org/10.1186/s13059-017-1164-8 Following publication of the original article [1], the authors identified an error in Figure 2: the last two flowcharts in Figure 2b were inadvertently duplicated. This error does not affect the main results and conclusions of the paper. The incorrect Fig. 2 is given below The correct Fig. 2 is given below Zhang
-
Precise measurement of molecular phenotypes with barcode-based CRISPRi systems Genome Biol. (IF 10.1) Pub Date : 2025-05-25
Joseph H. Lobel, Nicholas T. IngoliaGenome-wide CRISPR-Cas9 screens have untangled regulatory networks driving diverse biological processes. Their success relies on interrogating specific molecular phenotypes and distinguishing key regulators from background effects. Here, we realize these goals by optimizing CRISPR interference with barcoded expression reporter sequencing (CiBER-seq) to dramatically improve the sensitivity and scope
-
A standardized framework for robust fragmentomic feature extraction from cell-free DNA sequencing data Genome Biol. (IF 10.1) Pub Date : 2025-05-23
Haichao Wang, Paulius D. Mennea, Yu Kiu Elkie Chan, Zhao Cheng, Maria C. Neofytou, Arif Anwer Surani, Aadhitthya Vijayaraghavan, Emma-Jane Ditter, Richard Bowers, Matthew D. Eldridge, Dmitry S. Shcherbo, Christopher G. Smith, Florian Markowetz, Wendy N. Cooper, Tommy Kaplan, Nitzan Rosenfeld, Hui ZhaoFragmentomics features of cell-free DNA represent promising non-invasive biomarkers for cancer diagnosis. A lack of systematic evaluation of biases in feature quantification hinders the adoption of such applications. We compare features derived from whole-genome sequencing of ten healthy donors using nine library kits and ten data-processing routes and validated in 1182 plasma samples from published
-
Analyzing the relationship of RNA and DNA methylation with gene expression Genome Biol. (IF 10.1) Pub Date : 2025-05-22
Shangqian Xie, Darren Hagen, Gabrielle M. Becker, Kimberly M. Davenport, Katie A. Shira, Morgan R. Stegemiller, Jacob W. Thorne, Sarem Khilji, Denise Konetchy, Patricia Villamediana, Brenda M. Murdoch, Stephanie D. McKayDNA 5-methylcytosine (5mC) and RNA N6-methyladenosine (m6A) methylation are prevalent modifications in eukaryotes, both playing crucial roles in gene regulation. Recent studies have explored their crosstalk and impact on transcription. However, the intricate relationships among 5mC, m6A, and gene expression remain incompletely elucidated. We collect data on 5mC, m6A, and gene expression from samples
-
Incorporating scale uncertainty in microbiome and gene expression analysis as an extension of normalization Genome Biol. (IF 10.1) Pub Date : 2025-05-22
Michelle Pistner Nixon, Gregory B. Gloor, Justin D. SilvermanStatistical normalizations are used in differential analyses to address sample-to-sample variation in sequencing depth. Yet normalizations make strong, implicit assumptions about the scale of biological systems, such as microbial load, leading to false positives and negatives. We introduce scale models as a generalization of normalizations, which allows researchers to model potential errors in these
-
A survey of sequence-to-graph mapping algorithms in the pangenome era Genome Biol. (IF 10.1) Pub Date : 2025-05-22
Yingbo Cui, Chenchen Peng, Zeyu Xia, Canqun Yang, Yifei GuoA pangenome can reveal the genetic diversity across different individuals simultaneously. It offers a more comprehensive reference for genome analysis compared to a single linear genome that may introduce allele bias. Pangenomes are often represented as genome graphs, making sequence-to-graph mapping a fundamental task for pangenome construction and analysis. Numerous sequence-to-graph mapping algorithms
-
H3K4me3 binding ALFIN-LIKE proteins recruit SWR1 for gene-body deposition of H2A.Z Genome Biol. (IF 10.1) Pub Date : 2025-05-21
Linhao Xu, Yafei Wang, Xueying Li, Qin Hu, Vanda Adamkova, Junjie Xu, C. Jake Harris, Israel AusinThe H2A.Z histone variant is highly enriched over gene bodies, playing an essential role in several genome-templated processes, including transcriptional regulation and epigenetic patterning across eukaryotes. Deposition of H2A.Z is mediated by the SWR1 remodeling complex. How SWR1 is directed to gene bodies is largely unknown. Here, we show that ALFIN-LIKE (AL) proteins are responsible for H2A.Z gene
-
Revealing long-range heterogeneous organization of nucleoproteins with 6mA footprinting by ipdTrimming Genome Biol. (IF 10.1) Pub Date : 2025-05-21
Wentao Yang, Xue Qing Wang, Fan Wei, Jingqi Yu, Yifan Liu, Yali DouEnabled by long-read sequencing technologies, particularly Single Molecule, Real-Time sequencing, N6-methyladenine (6mA) footprinting is a transformative methodology for revealing the heterogenous and dynamic distribution of nucleosomes and other DNA-binding proteins. Here, we present ipdTrimming, a novel 6mA-calling pipeline that outperforms existing tools in both computational efficiency and accuracy
-
SMOPCA: spatially aware dimension reduction integrating multi-omics improves the efficiency of spatial domain detection Genome Biol. (IF 10.1) Pub Date : 2025-05-21
Mo Chen, Ruihua Cheng, Jianuo He, Jun Chen, Jie ZhangTechnological advances have enabled us to profile multiple omics layers with spatial information, significantly enhancing spatial domain detection and advancing a variety of biomedical research fields. Despite these advancements, there is a notable lack of effective methods for modeling spatial multi-omics data. We introduce SMOPCA, a Spatial Multi-Omics Principal Component Analysis method designed
-
Evaluating topography of mutational signatures with SigProfilerTopography Genome Biol. (IF 10.1) Pub Date : 2025-05-20
Burçak Otlu, Ludmil B. AlexandrovThe mutations found in a cancer genome are shaped by diverse processes, each displaying a characteristic mutational signature that may be influenced by the genome’s architecture. While prior analyses have evaluated the effect of topographical genomic features on mutational signatures, there has been no computational tool that can comprehensively examine this interplay. Here, we present SigProfilerTopography
-
CMImpute: cross-species and tissue imputation of species-level DNA methylation samples across mammalian species Genome Biol. (IF 10.1) Pub Date : 2025-05-20
Emily Maciejewski, Steve Horvath, Jason ErnstThe large-scale application of the mammalian methylation array has substantially expanded the availability of DNA methylation data in mammalian species. However, this data captures only a small portion of species-tissue combinations. To address this, we develop CMImpute (Cross-species Methylation Imputation), a method based on a conditional variational autoencoder, to impute DNA methylation representing
-
Tumor ploidy determination in low-pass whole genome sequencing and allelic copy number visualization using the Constellation Plot Genome Biol. (IF 10.1) Pub Date : 2025-05-20
Sarah H. Johnson, James B. Smadbeck, Roman M. Zenka, Michael T. Barrett, Athanasios Gaitatzes, Arnav Solanki, Angela B. Florio, Mitesh J. Borad, John C. Cheville, George VasmatzisPloidy determination across the genome has been challenging for low-pass-WGS tumor-only samples. We present BACDAC, a method that calculates tumor ploidy down to 1.2X effective tumor coverage. Allele fraction patterns displayed in the Constellation Plot verify tumor ploidy and reveal subclonal populations. BACDAC outputs a metric, 2N+LOH, that when combined with ploidy better distinguishes near-diploid
-
CloseRead: a tool for assessing assembly errors in immunoglobulin loci applied to vertebrate long-read genome assemblies Genome Biol. (IF 10.1) Pub Date : 2025-05-20
Yixin Zhu, Corey Watson, Yana Safonova, Matt Pennell, Anton BankevichDespite tremendous advances in long-read sequencing, some structurally complex and repeat-rich genomic regions remain challenging to assemble. Furthermore, we lack tools to assess local assembly quality, making it hard to identify problems and assess progress. Here we develop a new approach “CloseRead” for visualizing local assembly quality and diagnosing errors using multiple metrics. We apply CloseRead
-
Urbanization and genetic homogenization in the medieval Low Countries revealed through a ten-century paleogenomic study of the city of Sint-Truiden Genome Biol. (IF 10.1) Pub Date : 2025-05-20
Owyn Beneker, Ludovica Molinaro, Meriam Guellil, Stefania Sasso, Helja Kabral, Biancamaria Bonucci, Noah Gaens, Eugenia D’Atanasio, Massimo Mezzavilla, Hélios Delbrassine, Linde Braet, Bart Lambert, Pieterjan Deckers, Simone Andrea Biagini, Ruoyun Hui, Sara Becelaere, Jan Geypen, Maxim Hoebreckx, Birgit Berk, Petra Driesen, April Pijpelink, Philip van Damme, Sofie Vanhoutte, Natasja De Winter, LehtiProcesses shaping the formation of the present-day population structure in highly urbanized Northern Europe are still poorly understood. Gaps remain in our understanding of when and how currently observable regional differences emerged and what impact city growth, migration, and disease pandemics during and after the Middle Ages had on these processes. We perform low-coverage sequencing of the genomes
-
The gene regulatory mechanisms shaping the heterogeneity of venom production in the Cape coral snake Genome Biol. (IF 10.1) Pub Date : 2025-05-19
Pedro G. Nachtigall, Brett R. Hamilton, Taline D. Kazandjian, Paolo Stincone, Daniel Petras, Nicholas R. Casewell, Eivind A. B. UndheimVenoms and their associated glands and delivery structures have evolved numerous times among animals. Within these venom systems, the molecular, cellular, and morphological components interact and co-evolve to generate distinct, venom phenotypes that are increasingly recognized as models for studying adaptive evolution. However, toxins are often unevenly distributed across venom-producing tissues in
-
RobusTAD: reference panel based annotation of nested topologically associating domains Genome Biol. (IF 10.1) Pub Date : 2025-05-19
Yanlin Zhang, Rola Dali, Mathieu BlanchetteTopologically associating domains (TADs) are fundamental units of 3D genomes and play essential roles in gene regulation. Hi-C data suggests a hierarchical organization of TADs. Accurately annotating nested TADs from Hi-C data remains challenging, both in terms of the precise identification of boundaries and the correct inference of hierarchies. While domain boundary is relatively well conserved across
-
New insights into the cold tolerance of upland switchgrass by integrating a haplotype-resolved genome and multi-omics analysis Genome Biol. (IF 10.1) Pub Date : 2025-05-14
Bingchao Wu, Dan Luo, Yuesen Yue, Haidong Yan, Min He, Xixi Ma, Bingyu Zhao, Bin Xu, Jie Zhu, Jing Wang, Jiyuan Jia, Min Sun, Zheni Xie, Xiaoshan Wang, Linkai HuangSwitchgrass (Panicum virgatum L.) is a bioenergy and forage crop. Upland switchgrass exhibits superior cold tolerance compared to the lowland ecotype, but the underlying molecular mechanisms remain unclear. Here, we present a high-quality haplotype-resolved genome of the upland ecotype “Jingji31.” We then conduct multi-omics analysis to explore the mechanism underlying its cold tolerance. By comparative
-
Deciphering the role of RNA in regulating CTCF’s DNA binding affinity in leukemia cells Genome Biol. (IF 10.1) Pub Date : 2025-05-12
Judith Hyle, Wenjie Qi, Mohamed Nadhir Djekidel, Wojciech Rosikiewicz, Beisi Xu, Chunliang LiCTCF, a highly studied transcription factor, is essential for chromatin interaction maintenance. Several independent studies report that CTCF interacts with RNAs in vitro and in cells. Yet continuous debates about the authenticity of the RNA-binding affinity of CTCF and its biological role remain in large part due to limited research techniques available, such as CLIP-seq. Here, we investigate RNA’s
-
Parent-of-origin regulation by maternal auts2 shapes neurodevelopment and behavior in fish Genome Biol. (IF 10.1) Pub Date : 2025-05-09
Antoine Emile Clément, Constance Merdrignac, Sergi Roig Puiggros, Dorine Sévère, Aurélien Brionne, Thomas Lafond, Thaovi Nguyen, Jérôme Montfort, Cervin Guyomar, Alexandra Dauvé, Amaury Herpin, Denis Jabaudon, Violaine Colson, Florent Murat, Julien BobeParental experience can influence progeny behavior through gamete-mediated non-genetic inheritance, that is, mechanisms that do not involve changes in inherited DNA sequence. However, underlying mechanisms remain poorly understood in vertebrates, especially for maternal effects. Here, we use the medaka, a model fish species, to investigate the role of auts2a, the ortholog of human AUTS2, a gene repressed
-
Endometrial tumorigenesis involves epigenetic plasticity demarcating non-coding somatic mutations and 3D-genome alterations Genome Biol. (IF 10.1) Pub Date : 2025-05-09
Sebastian Gregoricchio, Aleksandar Kojic, Marlous Hoogstraat, Karianne Schuurman, Suzan Stelloo, Tesa M. Severson, Tracy A. O’Mara, Marjolein Droog, Abhishek A. Singh, Dylan M. Glubb, Lodewyk F. A. Wessels, Michiel Vermeulen, Flora E. van Leeuwen, Wilbert ZwartThe incidence and mortality of endometrial cancer (EC) is on the rise. Eighty-five percent of ECs depend on estrogen receptor alpha (ERα) for proliferation, but little is known about its transcriptional regulation in these tumors. We generate epigenomics, transcriptomics, and Hi-C datastreams in healthy and tumor endometrial tissues, identifying robust ERα reprogramming and profound alterations in
-
ChromActivity: integrative epigenomic and functional characterization assay based annotation of regulatory activity across diverse human cell types Genome Biol. (IF 10.1) Pub Date : 2025-05-09
Tevfik Umut Dincer, Jason ErnstWe introduce ChromActivity, a computational framework for predicting and annotating regulatory activity across the genome through integration of multiple epigenomic maps and various functional characterization datasets. ChromActivity generates genomewide predictions of regulatory activity associated with each functional characterization dataset across many cell types based on available epigenomic data
-
Most human DNA replication initiation is dispersed throughout the genome with only a minority within previously identified initiation zones Genome Biol. (IF 10.1) Pub Date : 2025-05-09
Jamie T. Carrington, Rosemary H. C. Wilson, Eduardo de La Vega, Sathish Thiyagarajan, Tom Barker, Leah Catchpole, Alex Durrant, Vanda Knitlhoffer, Chris Watkins, Karim Gharbi, Conrad A. NieduszynskiThe identification of sites of DNA replication initiation in mammalian cells has been challenging. Here, we present unbiased detection of replication initiation events in human cells using BrdU incorporation and single-molecule nanopore sequencing. Increases in BrdU incorporation allow us to measure DNA replication dynamics, including identification of replication initiation, fork direction, and termination
-
Epigenome profiling reveals distinctive regulatory features and cis-regulatory elements in pepper Genome Biol. (IF 10.1) Pub Date : 2025-05-08
Hongji Yang, Guorong Yu, Zhidong Lv, Tonghui Li, Xi Wang, Ying Fu, Zhangsheng Zhu, Guangjun Guo, Hang He, Ming Wang, Guochen Qin, Feng Liu, Zhenhui Zhong, Yan XuePepper (Capsicum annuum) is one of the earliest and most widely cultivated vegetable crops worldwide. While the large and complex genome of pepper severely hampered the understanding of its functional genome, it also indicates a rich yet unexplored reservoir of regulatory elements (REs). In fact, variations in the REs represent a major driving force in evolution and domestication in both plants and
-
ZAR1 and ZAR2 orchestrate the dynamics of maternal mRNA polyadenylation during mouse oocyte development Genome Biol. (IF 10.1) Pub Date : 2025-05-08
Yu-Ke Wu, Ruibao Su, Zhi-Yan Jiang, Yun-Wen Wu, Yan Rong, Shu-Yan Ji, Jingwen Liu, Zhuoyue Niu, Zhiyi Li, Yuanchao Xue, Falong Lu, Heng-Yu FanDuring meiosis, the oocyte genome keeps dormant for a long time until zygotic genome activation. The dynamics and homeostasis of the maternal transcriptome are essential for maternal-to-zygotic transition. Zygotic arrest 1 (ZAR1) and its homolog, ZAR2, are RNA-binding proteins that are important for the regulation of maternal mRNA stability. Smart-seq2 analysis reveals drastically downregulated maternal
-
Peak analysis of cell-free RNA finds recurrently protected narrow regions with clinical potential Genome Biol. (IF 10.1) Pub Date : 2025-05-08
Pengfei Bao, Taiwei Wang, Xiaofan Liu, Shaozhen Xing, Hanjin Ruan, Hongli Ma, Yuhuan Tao, Qing Zhan, Efres Belmonte-Reche, Lizheng Qin, Zhengxue Han, Minghui Mao, Mengtao Li, Zhi John LuCell-free RNAs (cfRNAs) can be detected in biofluids and have emerged as valuable disease biomarkers. Accurate identification of the fragmented cfRNA signals, especially those originating from pathological cells, is crucial for understanding their biological functions and clinical value. However, many challenges still need to be addressed for their application, including developing specific analysis
-
Development of a whole-exome sequencing kit to facilitate porcine biomedical research Genome Biol. (IF 10.1) Pub Date : 2025-05-08
Vishwaarth Vijayakumar, Tanvi Joshi, Lobna Elkhadragy, Lawrence B. Schook, Ron C. Gaba, Mohammed El-Kebir, Kyle M. SchachtschneiderIt is important for porcine models to replicate gene mutations present in human diseases to improve the translatability of animal studies. In this study, the high efficacy of a whole exome sequencing kit was demonstrated for the improved pig reference genome (Sus scrofa 11.1) to profile biomedically relevant swine breeds and enable high-depth sequencing required for intratumor heterogeneity profiling
-
ONTraC characterizes spatially continuous variations of tissue microenvironment through niche trajectory analysis Genome Biol. (IF 10.1) Pub Date : 2025-05-08
Wen Wang, Shiwei Zheng, Sujung Crystal Shin, Joselyn Cristina Chávez-Fuentes, Guo-Cheng YuanRecent technological advances enable mapping of tissue spatial organization at single-cell resolution, but methods for analyzing spatially continuous microenvironments are still lacking. We introduce ONTraC, a graph neural network-based framework for constructing spatial trajectories at niche-level. Through benchmarking analyses using multiple simulated and real datasets, we show that ONTraC outperforms
-
Determining the functional relationship between epigenetic and physical chromatin domains in Drosophila Genome Biol. (IF 10.1) Pub Date : 2025-05-08
Sandrine Denaud, Gonzalo Sabarís, Marco Di Stefano, Giorgio-Lucio Papadopoulos, Bernd Schuettengruber, Giacomo CavalliThe tight correlation between topologically associating domains (TADs) and epigenetic domains in Drosophila suggests that the epigenome contributes to define TADs. However, it is still unknown whether histone modifications are essential for TAD formation and structure. By either deleting or shifting key regulatory elements needed to establish the epigenetic signature of Polycomb TADs, we show that
-
Predicting adenine base editing efficiencies in different cellular contexts by deep learning Genome Biol. (IF 10.1) Pub Date : 2025-05-08
Lucas Kissling, Amina Mollaysa, Sharan Janjuha, Nicolas Mathis, Kim F. Marquart, Yanik Weber, Woohyun J. Moon, Paulo J. C. Lin, Steven H. Y. Fan, Hiromi Muramatsu, Máté Vadovics, Ahmed Allam, Norbert Pardi, Ying K. Tam, Michael Krauthammer, Gerald SchwankAdenine base editors (ABEs) enable the conversion of A•T to G•C base pairs. Since the sequence of the target locus influences base editing efficiency, efforts have been made to develop computational models that can predict base editing outcomes based on the targeted sequence. However, these models were trained on base editing datasets generated in cell lines and their predictive power for base editing
-
Long-read sequencing reveals novel isoform-specific eQTLs and regulatory mechanisms of isoform expression in human B cells Genome Biol. (IF 10.1) Pub Date : 2025-05-08
Yuya Nagura, Mihoko Shimada, Ryoji Kuribayashi, Ko Ikemoto, Hiroki Kiyose, Arisa Igarashi, Tadashi Kaname, Motoko Unoki, Akihiro FujimotoGenetic variations linked to changes in gene expression are known as expression quantitative loci (eQTLs). The identification of eQTLs helps to understand the mechanisms governing gene expression. However, prior studies have primarily utilized short-read sequencing techniques, and the analysis of eQTLs on isoforms has been relatively limited. In this study, we employ long-read sequencing technology
-
Polygraph: a software framework for the systematic assessment of synthetic regulatory DNA elements Genome Biol. (IF 10.1) Pub Date : 2025-05-06
Avantika Lal, Laura Gunsalus, Anay Gupta, Tommaso Biancalani, Gokcen EraslanThe design of regulatory elements is pivotal in gene and cell therapy, where DNA sequences are engineered to drive elevated and cell-type specific expression. However, the systematic assessment of synthetic DNA sequences without robust metrics and easy-to-use software remains challenging. Here, we introduce Polygraph, a Python framework that evaluates synthetic DNA elements, based on features like
-
PRESCOTT: a population aware, epistatic, and structural model accurately predicts missense effects Genome Biol. (IF 10.1) Pub Date : 2025-05-06
Mustafa Tekpinar, Laurent David, Thomas Henry, Alessandra CarbonePredicting the functional impact of point mutations is a critical challenge in genomics. PRESCOTT reconstructs complete mutational landscapes, identifies mutation-sensitive regions, and categorizes missense variants as benign, pathogenic, or variants of uncertain significance. Leveraging protein sequences, structural models, and population-specific allele frequencies, PRESCOTT surpasses existing methods
-
SlideCNA: spatial copy number alteration detection from Slide-seq-like spatial transcriptomics data Genome Biol. (IF 10.1) Pub Date : 2025-05-02
Diane Zhang, Åsa Segerstolpe, Michal Slyper, Julia Waldman, Evan Murray, Robert Strasser, Jan Watter, Ofir Cohen, Orr Ashenberg, Daniel Abravanel, Judit Jané-Valbuena, Simon Mages, Ana Lako, Karla Helvie, Orit Rozenblatt-Rosen, Scott Rodig, Fei Chen, Nikhil Wagle, Aviv Regev, Johanna KlughammerSolid tumors are spatially heterogeneous in their genetic, molecular, and cellular composition, but recent spatial profiling studies have mostly charted genetic and RNA variation in tumors separately. To leverage the potential of RNA to identify copy number alterations (CNAs), we develop SlideCNA, a computational tool to extract CNA signals from sparse spatial transcriptomics data with near single
-
Four near-complete genome assemblies reveal the landscape and evolution of centromeres in Salicaceae Genome Biol. (IF 10.1) Pub Date : 2025-05-02
Yubo Wang, Lulu Zhao, Deyan Wang, Kai Chen, Tiannan Luo, Jianglin Luo, Chengzhi Jiang, Zhoujian He, Heng Huang, Jiaxiao Xie, Yuanzhong Jiang, Jianquan Liu, Tao MaCentromeres play a crucial role in maintaining genomic stability during cell division. They are typically composed of large arrays of tandem satellite repeats, which hinder high-quality assembly and complicate our efforts to understand their evolution across species. Here, we use long-read sequencing to generate near-complete genome assemblies for two Populus and two Salix species belonging to the
-
SpaNorm: spatially-aware normalization for spatial transcriptomics data Genome Biol. (IF 10.1) Pub Date : 2025-04-29
Agus Salim, Dharmesh D. Bhuva, Carissa Chen, Chin Wee Tan, Pengyi Yang, Melissa J. Davis, Jean Y. H. YangNormalization of spatial transcriptomics data is challenging due to spatial association between region-specific library size and biology. We develop SpaNorm, the first spatially-aware normalization method that concurrently models library size effects and the underlying biology, segregates these effects, and thereby removes library size effects without removing biological information. Using 27 tissue
-
Medieval genomes from eastern Iberia illuminate the role of Morisco mass deportations in dismantling a long-standing genetic bridge with North Africa Genome Biol. (IF 10.1) Pub Date : 2025-04-28
Gonzalo Oteo-Garcia, Marina Silva, M. George B. Foody, Bobby Yau, Alessandro Fichera, Llorenç Alapont, Pierre Justeau, Simão Rodrigues, Rita Monteiro, Francesca Gandini, María Luisa Rovira Gomar, Albert Ribera i Lacomba, Josep Pascual Beneyto, Valeria Mattiangeli, Daniel G. Bradley, Ceiridwen J. Edwards, Maria Pala, Martin B. RichardsThe Islamic influence on the Iberian Peninsula left an enduring cultural and linguistic legacy. However, the demographic impact is less well understood. This study aims to explore the dynamics of gene flow and population structure in eastern Iberia from the early to late medieval period through ancient DNA. Our comprehensive genomic analysis uncovers gene flow from various Mediterranean regions into
-
Rewiring of SINE-MIR enhancer topology and Esrrb modulation in expanded and naive pluripotency Genome Biol. (IF 10.1) Pub Date : 2025-04-28
Nadia Omega Cipta, Yingying Zeng, Ka Wai Wong, Zi Hao Zheng, Yao Yi, Tushar Warrier, Jian Zhou Teo, Jia Hao Jackie Teo, Yee Jiun Kok, Xuezhi Bi, Reshma Taneja, Derrick Sek Tong Ong, Jian Xu, Florent Ginhoux, Hu Li, Yih-Cherng Liou, Yuin-Han LohThe interplay between 3D genomic structure and transposable elements (TE) in regulating cell state-specific gene expression program is largely unknown. Here, we explore the utilization of TE-derived enhancers in naïve and expanded pluripotent states by integrative analysis of genome-wide Hi-C-defined enhancer interactions, H3K27ac HiChIP profiling and CRISPR-guided TE proteomics landscape. We find
-
Multi-ancestry whole genome sequencing analysis of lean body mass Genome Biol. (IF 10.1) Pub Date : 2025-04-28
Xiaoyu Zhang, Kuan-Jui Su, Bodhisattwa Banerjee, Ittai Eres, Yi-Hsiang Hsu, Carolyn J. Crandall, Rajashekar Donaka, Zhe Han, Rebecca D. Jackson, Hanhan Liu, Zhe Luo, Braxton D. Mitchell, Chuan Qiu, Qing Tian, Hui Shen, Ming-Ju Tsai, Kerri L. Wiggins, Hanfei Xu, Michelle Yau, Lan-Juan Zhao, Xiao Zhang, May E. Montasser, Douglas P. Kiel, Hong-Wen Deng, Ching-Ti Liu, David KarasikLean body mass is a crucial physiological component of body composition. Although lean body mass has a high heritability, studies evaluating the genetic determinants of lean mass (LM) have to date been limited largely to genome-wide association studies (GWAS) and common variants. Using whole genome sequencing (WGS)-based studies, we aimed to discover novel genetic variants associated with LM in population-based
-
InSituCor: exploring spatially correlated genes conditional on the cell type landscape Genome Biol. (IF 10.1) Pub Date : 2025-04-24
Patrick Danaher, Dan McGuire, Lidan Wu, Michael Patrick, David Kroeppler, Haiyan Zhai, Deniz G. Olgun, Dennis Gong, Jingyi Cao, William L. Hwang, Joachim Schmid, Joseph M. BeechemIn spatial transcriptomics data, spatially correlated genes promise to reveal high-interest phenomena like cell–cell interactions and latent variables. But in practice, most spatial correlations arise from the spatial arrangement of cell types, obscuring the more interesting relationships we hope to discover. We introduce InSituCor, a toolkit for discovering modules of spatially correlated genes. InSituCor
-
Variant effect predictor correlation with functional assays is reflective of clinical classification performance Genome Biol. (IF 10.1) Pub Date : 2025-04-22
Benjamin J. Livesey, Joseph A. MarshUnderstanding the relationship between protein sequence and function is crucial for accurate classification of missense variants. Variant effect predictors (VEPs) play a vital role in deciphering this complex relationship, yet evaluating their performance remains challenging for several reasons, including data circularity, where the same or related data is used for training and assessment. High-throughput
-
Weighted 2D-kernel density estimations provide a new probabilistic measure for epigenetic age Genome Biol. (IF 10.1) Pub Date : 2025-04-22
Juan-Felipe Perez-Correa, Thomas Stiehl, Riccardo E. Marioni, Janie Corley, Simon R. Cox, Ivan G. Costa, Wolfgang WagnerEpigenetic aging signatures provide insights into human aging, but traditional clocks rely on linear regression of DNA methylation levels, assuming linear trajectories. This study explores a non-parametric approach using 2D-kernel density estimation to determine epigenetic age. Our weighted model achieves similar predictive accuracy as conventional clocks and provides a variation score reflecting the
-
miRglmm: a generalized linear mixed model of isomiR-level counts improves estimation of miRNA-level differential expression and uncovers variable differential expression between isomiRs Genome Biol. (IF 10.1) Pub Date : 2025-04-22
Andrea M. Baran, Arun H. Patil, Ernesto Aparicio-Puerta, Seong-Hwan Jun, Marc K. Halushka, Matthew N. McCallMicroRNA-seq data is produced by aligning small RNA sequencing reads of different microRNA transcript isoforms, called isomiRs, to known microRNAs. Aggregation to microRNA-level counts discards information and violates core assumptions of differential expression methods developed for mRNA-seq data. We establish miRglmm, a differential expression method for microRNA-seq data, that uses a generalized
-
Zero-shot evaluation reveals limitations of single-cell foundation models Genome Biol. (IF 10.1) Pub Date : 2025-04-18
Kasia Z. Kedzierska, Lorin Crawford, Ava P. Amini, Alex X. LuFoundation models such as scGPT and Geneformer have not been rigorously evaluated in a setting where they are used without any further training (i.e., zero-shot). Understanding the performance of models in zero-shot settings is critical to applications that exclude the ability to fine-tune, such as discovery settings where labels are unknown. Our evaluation of the zero-shot performance of Geneformer
-
Hmgb2 improves astrocyte to neuron conversion by increasing the chromatin accessibility of genes associated with neuronal maturation in a proneuronal factor-dependent manner Genome Biol. (IF 10.1) Pub Date : 2025-04-17
Priya Maddhesiya, Tjasa Lepko, Andrea Steiner-Mezzardi, Julia Schneider, Veronika Schwarz, Juliane Merl-Pham, Finja Berger, Stefanie M. Hauck, Lorenza Ronfani, Marco Bianchi, Tatiana Simon, Anthodesmi Krontira, Giacomo Masserdotti, Magdalena Götz, Jovica NinkovicDirect conversion of reactive glial cells to neurons is a promising avenue for neuronal replacement therapies after brain injury or neurodegeneration. The overexpression of neurogenic fate determinants in glial cells results in conversion to neurons. For repair purposes, the conversion should ideally be induced in the pathology-induced neuroinflammatory environment. However, very little is known regarding
-
DNA methylation dynamics play crucial roles in shaping the distinct transcriptomic profiles for different root-type initiation in rice Genome Biol. (IF 10.1) Pub Date : 2025-04-17
Wei Jiang, Zhou Zhou, Xiaoying Li, Yu Zhao, Shaoli ZhouMonocots possess a fibrous root system comprising an embryonic root, crown roots, and lateral roots. The distinct cellular origins highlight the diversity of the initiation mechanism. To date, the distinct initiation mechanisms have been poorly studied. In this study, we conduct a comprehensive transcriptome and DNA methylome assay of these root types during their initiation. Our findings indicate
-
Bayesian multi-study non-negative matrix factorization for mutational signatures Genome Biol. (IF 10.1) Pub Date : 2025-04-16
Isabella N. Grabski, Lorenzo Trippa, Giovanni ParmigianiMutational signatures are typically identified from tumor genome sequencing data using non-negative matrix factorization (NMF). However, existing NMF techniques only decompose a single dataset, limiting rigorous comparisons of signatures across conditions. We propose a Bayesian NMF method that jointly decomposes multiple datasets to identify signatures and their sharing pattern across conditions. We
-
Guidelines for releasing a variant effect predictor Genome Biol. (IF 10.1) Pub Date : 2025-04-15
Benjamin J. Livesey, Mihaly Badonyi, Mafalda Dias, Jonathan Frazer, Sushant Kumar, Kresten Lindorff-Larsen, David M. McCandlish, Rose Orenbuch, Courtney A. Shearer, Lara Muffley, Julia Foreman, Andrew M. Glazer, Ben Lehner, Debora S. Marks, Frederick P. Roth, Alan F. Rubin, Lea M. Starita, Joseph A. MarshComputational methods for assessing the likely impacts of mutations, known as variant effect predictors (VEPs), are widely used in the assessment and interpretation of human genetic variation, as well as in other applications like protein engineering. Many different VEPs have been released, and there is tremendous variability in their underlying algorithms, outputs, and the ways in which the methodologies