Selected publications

Submitted
* represents equal contribution
  • Tangqi Fang, Xiao Wang, Zixuan Xiao, Shengqi Hang, Ghulam Murtaza, Jinrui Yang, Hanwen Xu, Anupama Jha, William Stafford Noble, Sheng Wang. Evo2HiC: a multimodal foundation model for integrative analysis of genome sequence and architecture. bioRxiv, 2025.
    [BibTeX]

  • Shengqi Hang, Xiao Wang, Ghulam Murtaza, Anupama Jha, Bingfei Wen, Tangqi Fang, Jacob Sanders, Sheng Wang, William Stafford Noble. Puget predicts gene expression across cell types using sequence and 3D chromatin organization data. bioRxiv, 2025.
    [BibTeX]

Published
  • Anupama Jha, Borislav Hristov, Xiao Wang, Sheng Wang, William J. Greenleaf, Anshul Kundaje, Erez L. Aiden, Alessandro Bertero, William Stafford Noble. Prediction and functional interpretation of inter-chromosomal genome architecture from DNA sequence with TwinC. Nature Communications, in press, 2026.
    [BibTeX]

  • Xiao Wang, Yaqi Zhang, Shubham Ray, Anupama Jha, Tangqi Fang, Shengqi Hang, Slava Doulatov, William Stafford Noble, Sheng Wang. A generalizable Hi-C foundation model for chromatin architecture, single-cell and multi-omics analysis across species. Nature Methods, in press, 2026.
    [BibTeX]

  • Di Wu, Natalie Maus, Anupama Jha, Kai Yang, Bailey D. Wales-McGrath, San Jewell, Aditya Tangiyan, Peter Choi, Jacob R Gardner, Yoseph Barash. Generative modeling for RNA splicing code predictions and design. eLife, 2025.
    [BibTeX]

  • Kai Yang, Nathanial Islas, San Jewell, Anupama Jha, Caleb M. Radens, John A. Pleiss, Kristen W. Lynch, Yoseph Barash, Peter S. Choi. Machine learning-optimized targeted detection of alternative splicing. Nucleic Acids Research, 2025.
    [BibTeX]

  • Anupama Jha*, Stephanie C Bohaczuk*, Yizi Mao, Jane Ranchalis, Benjamin J Mallory, Alan T Min, Morgan O Hamm, Elliott Swanson, David Dubocanin, Connor Finkbeiner, Tony Li, Dale Whittington, William Stafford Noble, Andrew B Stergachis, Mitchell R Vollger*. DNA-m6A calling and integrated long-read epigenetic and genetic analysis with fibertools. Genome Research, 2024.
    [BibTeX]

  • Tangqi Fang, Yifeng Liu, Addie Woicik, Minsi Lu, Anupama Jha, Xiao Wang, Gang Li, Borislav Hristov, Zixuan Liu, Hanwen Xu, William Stafford Noble, Sheng Wang. Enhancing Hi-C contact matrices for loop detection with Capricorn: a multiview diffusion model. Bioinformatics, 2024.
    [BibTeX]

  • Jorge Vaquero-Garcia, Joseph K Aicher, San Jewell, Matthew R Gazzara, Caleb M Radens*, Anupama Jha*, Scott S Norton, Nicholas F Lahens, Gregory R Grant, Yoseph Barash. RNA splicing analysis using heterogeneous and large RNA-seq datasets. Nature Communications, 2023.
    [BibTeX]

  • Anupama Jha*, Mathieu Quesnel-Vallières*, David Wang, Andrei Thomas-Tikhonenko, Kristen W Lynch, Yoseph Barash. Identifying common transcriptome signatures of cancer by interpreting deep learning models. Genome Biology, 2022.
    [BibTeX]

  • William P. Bone, Katherine M. Siewert, Anupama Jha, Derek Klarin, Scott M. Damrauer, the VA Million Veteran Project, Kyong-Mi Chang, Philip S. Tsao, Themistocles L. Assimes, Marylyn D. Ritchie, Benjamin F. Voight. Multi-trait association studies discover pleiotropic loci between Alzheimer’s disease and cardiometabolic traits. Alzheimer's research & therapy, 2021.
    [BibTeX]

  • Xinjun Ji, Anupama Jha, Jesse Humenik, Louis R Ghanem, Andrew Kromer, Christopher Duncan-Lewis, Elizabeth Traxler, Mitchell J Weiss, Yoseph Barash, Stephen A Liebhaber. RNA-binding proteins PCBP1 and PCBP2 are critical determinants of murine erythropoiesis. Molecular and Cellular Biology, 2021.
    [BibTeX]

  • Barry Slaff, Caleb Matthew Radens, Paul Jewell, Anupama Jha, Nicholas Lahens, Gregory R Grant, Andrei Thomas-Tikhonenko, Kristen W Lynch, Yoseph Barash. MOCCASIN: A method for correcting for known and unknown confounders in RNA splicing analysis. Nature Communications, 2021.
    [BibTeX]

  • Anupama Jha, Joseph K Aicher, Matthew R Gazzara, Deependra Singh, Yoseph Barash. Enhanced integrated gradients: improving interpretability of deep learning models using splicing codes as a case study. Genome Biology, 2020.
    [BibTeX]

  • Anupama Jha, Matthew R Gazzara, Yoseph Barash. Integrative deep models for alternative splicing. Bioinformatics, 2017.
    [BibTeX]

  • Matthew R Gazzara, Michael J Mallory, Renat Roytenberg, John P Lindberg, Anupama Jha, Kristen W Lynch, Yoseph Barash. Ancient antagonism between CELF and RBFOX families tunes mRNA splicing outcomes. Genome Research, 2017.
    [BibTeX]

PhD Thesis