THE PAI LAB

RNA Genomics @ UMass Chan Medical School
RNA Therapeutics Institute
Abstract Fluid Shape

Publishing since

2018

About

Learn About Us
About Image

The Efficiency of mRNA Biogenesis

We aim to understand how RNAs are regulated through the lifecycle of an RNA molecule. To do so, we develop and apply high-throughput approaches to study the kinetics, fidelity, and coupling of transcriptional and RNA processing mechanisms.

Our lab combines experimental & computational genomic tools to probe steps of RNA maturation. We are always looking for enthusastic researchers to join us!

Functional Genomics

genetics, genomics, molecular & cellular biology

Quantitative Analyses

bioinformatics, mathematical modeling, machine learning

What's New?

Jan 2026

We're excited to welcome Kelly Cochran as a new postdoc in the lab, joint with the Schärfen lab! She is joining us after completing her PhD with Anshul Kundaje at Stanford, working on machine learning methods to predict transcriptional mechanisms.

Dec 2025

Happy to have been able to contribute to a new paper from the Dekker lab at UMass Chan, now out in Nature Cell Biology!

Dec 2025

PREPRINT ALERT! As a companion to our recent study on cryptic splicing, the team developed a new approach called CRYPTID-exon to identify cryptic exons in short-read RNA-seq data.

Nov 2025

Sad to say goodbye to senior scientist, Rachel Daniels, who will not be going far! We wish her the best in her shift to being 100% in the Karlsson lab at UMass Chan and are happy we will still see and interact frequently!

Nov 2025

PREPRINT ALERT! As part of our efforts to develop RNA genomics methods, we present SPARK, a framework to simulate reads from various nascent RNA approaches. Excited to see this first output from our official collaboration with the Engelhardt lab at Gladstone.

Oct 2025

NOW OUT in Science, our study describing the PITA phenomenon of coordinated transcription and 3' end processing, the result of a long-standing with the Fiszbein lab at BU! Congrats to Ezequiel and Christine!

Oct 2025

Happy to have been able to contribute to a new paper from the Lee lab at UMass Chan, now out in Cell!.

Oct 2025

Congrats to Ezequiel Calvo-Roitberg, who won the 2025 Michael R. Green Award in Graduate Research from the UMass Chan Department of Molecular Cell and Cancer Biology for his contributions to the study of gene regulation. We're excited to attend his talk at the Michael Green Award Symposium!

Aug 2025

We're excited to welcome Ye Liu as a new postdoc in the lab! She is joining us after completing her PhD on biochemical mechanisms of splicing with Aaron Hoskins at UWisconsin.

Jul 2025

PREPRINT ALERT! A tour-de-force effort by Eraj and Kaitlyn on identifying and characterizing noisy splicing in human cells, as part of our long-standing collaboration with the Watts Lab at UMass Chan!

Feb-Jun 2025

Lots of exciting collaborative papers have been published this year! Thanks to the Lien, Khvorova, and Galloway labs for including us in their work and congrats to all the authors!

Apr 2025

Congrats to Eva and Joe for passing their qualifying exams! They're both back to lab with lots of excitement to dive into their thesis work!

Feb 2025

Congrats to Jesse Lehman for being awarded an NRSA Fellowship (F31) from NIAID to continue his work to understand the dynamics of transcription and splicing in the immune response. Read more about the news here!

Jan 2025

Welcome to GSBS rotation student Mary Likhite, who will be rotating with us through March and working to analyze nascent RNA long-read sequencing simulations and libraries.

Research

Brief Overview of our Research

Kinetics of Transcription & mRNA Processing

The mechanisms of enzymes are encoded in the kinetics of substrate turnover. We postulate that the mechanisms of transcriptome dynamics are similarly encoded in the aggregate kinetics of mRNA biogenesis and maturation rather than steady-state transcriptome profiles. Thus, we directly study the rates at which transcriptional and mRNA processing steps occur, with the central hypothesis is that the temporal progression of RNA maturation mechanisms encodes the molecular underpinnings of developmental trajectories, mis-regulation of disease pathogenesis, and rapid cellular responses to external stimuli.

However, measuring the timing of crucial mRNA processing steps across the genome remains challenging. Thus, a primary focus of our group is to first to develop genomics methods to measure the kinetics of transcription, mRNA splicing, and 3’ end cleavage at high resolution by integrating nascent RNA sequencing across molecular time with statistical modeling and machine learning techniques. We are now applying our novel approaches to relevant mammalian systems to understand the trajectories of mRNA isoform expression.

Fidelity of mRNA Splicing

RNA processing likely involves a critical balance between speed and accuracy – faster processing is likely accompanied by an increase in errors that lead to nonproductive transcripts. However, it is commonly believed that the spliceosome is a high-fidelity deterministic enzyme that rarely makes mistakes. We and others have begun to identify extensive non-canonical splicing that might be the result of stochastic splicing interactions. We have developed computational tools to analyze splicing intermediates in nascent RNA that are informative about splicing fidelity. Work is on-going to understand the regulatory factors and interactions that underlie nuclear quality control mechanisms and how certain RNA molecules evade such checkpoints.

Understanding the molecular basis for RNA processing infidelity is particularly germane to contemporary efforts to redirect inaccurate mRNA splicing in a therapeutic context. We are deploying our computational analyses of nascent RNA to identify splice sites and exons that are amenable to therapeutic targeting and testing the efficacy of such predictions.

Coupling of Co-Transcriptional Mechanisms

Despite an increasing appreciation that mRNA processing mechanisms do not act independently, we do not yet understand how they integrate to control steady-state or transient gene expression levels. Thus, a key open question in functional genomics remains: How do the dynamics of individual molecular processes combine to influence the cellular transcriptome?

The majority of mammalian mRNA isoform diversity is driven by alternative 5’ or 3’ untranslated regions (UTRs), rather than alternative splicing of coding exons. Since exons at the start or end of transcripts are challenging to identify, we first developed computational approaches to systematically identify and quantify terminal exons from short or long-read sequencing data based on their isoform-specific usage. Using these tools, we observed a coordinated relationship between transcription start and end sites that is dictated by the genomic order of alternative sites. We call this new regulatory phenomenon the "Positional Initiation-Termination Axis" (PITA) and show that this coupling is governed by variable rates of RNAPII elongation across a gene. Ongoing work is focused on mechanistic details of PITA and further connections to splice site choice that ultimately determine full isoform composition.

Team

Meet the Group

Athma Pai

Principal Investigator


Ann Latino

Administrative Coordinator


Ezequiel Calvo-Roitberg

Graduate Student


Kelly Cochran

Postdoctoral Researcher

joint with Schärfen lab

Nida Javeed

Research Associate


Eva Jazbec

Graduate Student

joint with Sontheimer lab

Eraj Khokhar

Postdoctoral Researcher


Marina Krykbaeva

Postdoctoral Researcher


Jesse Lehman

Graduate Student


Ye Liu

Postdoctoral Researcher


Joseph Paquette

Graduate Student


Valeria Sanabria

Graduate Student


Publications

Read Our Publications
  • Selected
  • Coupling
  • Fidelity
  • Kinetics
  • Collaborative
  • All Papers

Khokhar, Brokaw bioRxiv 2025

Calvo-Roitberg bioRxiv 2025

Khokhar, Brokaw bioRxiv 2025

Schooley Nat. Cell. Bio. 2025

Calvo-Roitberg, Carroll Science 2025

Harper Cell 2025

Davis NAR 2025

Peterman NAR 2025

Zhang PNAS 2025

Calvo-Roitberg Genome Res. 2024

Torres-Ulloa RNA 2024

Vierbuchen PNAS 2023

Wang NAR 2022

Valton NSMB 2022

MacMillan, Kong Sci Rep 2022

Fiszben Sci Adv 2022

Ibraheim Nat Comm 2021

Alterman Nat Biotech 2019

Pai & Luca WIREs RNA 2018

Didiot Cell Rep. 2018

Pai, Paggi PLOS Genetics 2018

Aguiar Nat Comm 2018

Pai eLife 2017

Software

Computational Resources

We often develop new computational approaches or pipelines and make them freely available to the community. NOTE: we are not software developers and most of these code repositories have not been actively maintained since publication.

CRYPTID-exon

identify cryptic exons


Khokhar, Brokaw et al. bioRxiv 2025

CRYPTID-SS

identify cryptic splice sites


Khokhar, Brokaw et al. bioRxiv 2025

HIT-index

identify first & last exons


Fiszbein et al. SciAdv 2022

SPARK

simulate nascent RNA reads


Calvo-Roitberg et al. bioRxiv 2025

Splicing Rates

rates of intron splicing


Pai et al. eLife 2017

Recursive Splicing

identify recursive splice sites


Pai, Paggi et al. PLOS Genetics 2018

3' Cleavage Rates

estimate rates of 3' end cleavage


Torres-Ulloa et al. RNA 2024

Photos

Life in Lab and Beyond

Contact

Let's Connect

Our Address

Albert Sherman Center

368 Plantation Street, AS5-2057

Worcester, MA 01605

Email Address

info@thepailab.org

Locations:

Wet Lab: ASC5-2003/2004

Dry Lab: ASC5-2053/2055

AAP Office: ASC5-2057

Get in Touch

Loading
Your message has been sent. Thank you!