Berger Lab at MIT CSAIL

Research

A full list of publications can be found here.

2024

Democratizing protein language models with parameter-efficient fine-tuning
Samuel Sledzieski, Meghana Kshirsagar, Minkyung Baek, Rahul Dodhia, Juan Lavista Ferres, Bonnie Berger
Proceedings of the National Academy of Sciences  ·  20 Jun 2024  ·  doi:10.1073/pnas.2405840121
Scanorama: integrating large and diverse single-cell transcriptomic datasets
Brian L. Hie, Soochi Kim, Thomas A. Rando, Bryan Bryson, Bonnie Berger
Nature Protocols  ·  06 Jun 2024  ·  doi:10.1038/s41596-024-00991-3
Causal gene regulatory analysis with RNA velocity reveals an interplay between slow and fast transcription factors
Rohit Singh, Alexander P. Wu, Anish Mudide, Bonnie Berger
Cell Systems  ·  01 May 2024  ·  doi:10.1016/j.cels.2024.04.005
AlphaFold Meets Flow Matching for Generating Protein Ensembles
Bowen Jing, Bonnie Berger, Tommi Jaakkola
arXiv  ·  01 Jan 2024  ·  doi:10.48550/arXiv.2402.04845
Secure Discovery of Genetic Relatives Across Large-Scale and Distributed Genomic Datasets
Matthew M. Hong, David Froelicher, Ricky Magner, Victoria Popic, Bonnie Berger, Hyunghoon Cho
Lecture Notes in Computer Science  ·  01 Jan 2024  ·  doi:10.1007/978-1-0716-3989-4_19

2023

TT3D: Leveraging precomputed protein 3D sequence models to predict protein–protein interactions
Samuel Sledzieski, Kapil Devkota, Rohit Singh, Lenore Cowen, Bonnie Berger
Bioinformatics  ·  28 Oct 2023  ·  doi:10.1093/bioinformatics/btad663
SCA: recovering single-cell heterogeneity through information-based dimensionality reduction
Benjamin DeMeo, Bonnie Berger
Genome Biology  ·  25 Aug 2023  ·  doi:10.1186/s13059-023-02998-7
Assessing transcriptomic reidentification risks using discriminative sequence models
Shuvom Sadhuka, Daniel Fridman, Bonnie Berger, Hyunghoon Cho
Genome Research  ·  04 Aug 2023  ·  doi:10.1101/gr.277699.123
Efficient mapping of accurate long reads in minimizer space with mapquik
Bariş Ekim, Kristoffer Sahlin, Paul Medvedev, Bonnie Berger, Rayan Chikhi
Genome Research  ·  30 Jun 2023  ·  doi:10.1101/gr.277679.123
Contrastive learning in protein language space predicts interactions between drugs and protein targets
Rohit Singh, Samuel Sledzieski, Bryan Bryson, Lenore Cowen, Bonnie Berger
Proceedings of the National Academy of Sciences  ·  08 Jun 2023  ·  doi:10.1073/pnas.2220778120
sfkit: a web-based toolkit for secure and federated genomic analysis
Simon Mendelsohn, David Froelicher, Denis Loginov, David Bernick, Bonnie Berger, Hyunghoon Cho
Nucleic Acids Research  ·  29 May 2023  ·  doi:10.1093/nar/gkad464
Scalable and Privacy-Preserving Federated Principal Component Analysis
David Froelicher, Hyunghoon Cho, Manaswitha Edupalli, Joao Sa Sousa, Jean-Philippe Bossuat, Apostolos Pyrgelis, Juan R. Troncoso-Pastoriza, Bonnie Berger, Jean-Pierre Hubaux
2023 IEEE Symposium on Security and Privacy (SP)  ·  01 May 2023  ·  doi:10.1109/SP46215.2023.10179350
Learning the Language of Antibody Hypervariability
Rohit Singh, Chiho Im, Yu Qiu, Brian Mackness, Abhinav Gupta, …, Lena Erlach, Maria Wendt, Yves Fomekong Nanfack, Bryan Bryson, Bonnie Berger
Cold Spring Harbor Laboratory  ·  28 Apr 2023  ·  doi:10.1101/2023.04.26.538476
Unveiling causal regulatory mechanisms through cell-state parallax
Alexander Po-Yen Wu, Rohit Singh, Christopher Walsh, Bonnie Berger
Cold Spring Harbor Laboratory  ·  03 Mar 2023  ·  doi:10.1101/2023.03.02.530529
Sequre: a high-performance framework for secure multiparty computation enables biomedical data sharing
Haris Smajlović, Ariya Shajii, Bonnie Berger, Hyunghoon Cho, Ibrahim Numanagić
Genome Biology  ·  11 Jan 2023  ·  doi:10.1186/s13059-022-02841-5
EigenFold: Generative Protein Structure Prediction with Diffusion Models
Bowen Jing, Ezra Erives, Peter Pao-Huang, Gabriele Corso, Bonnie Berger, Tommi Jaakkola
arXiv  ·  01 Jan 2023  ·  doi:10.48550/arXiv.2304.02198
Equivariant Scalar Fields for Molecular Docking with Fast Fourier Transforms
Bowen Jing, Tommi Jaakkola, Bonnie Berger
arXiv  ·  01 Jan 2023  ·  doi:10.48550/arXiv.2312.04323

2022

Learning the Drug-Target Interaction Lexicon
Rohit Singh, Samuel Sledzieski, Lenore Cowen, Bonnie Berger
Cold Spring Harbor Laboratory  ·  10 Dec 2022  ·  doi:10.1101/2022.12.06.519374
Navigating bottlenecks and trade-offs in genomic data analysis
Bonnie Berger, Yun William Yu
Nature Reviews Genetics  ·  07 Dec 2022  ·  doi:10.1038/s41576-022-00551-z
Secure and Federated Genome-Wide Association Studies for Biobank-Scale Datasets
Hyunghoon Cho, David Froelicher, Jeffrey Chen, Manaswitha Edupalli, Apostolos Pyrgelis, Juan R. Troncoso-Pastoriza, Jean-Pierre Hubaux, Bonnie Berger
Cold Spring Harbor Laboratory  ·  02 Dec 2022  ·  doi:10.1101/2022.11.30.518537
Uncovering structural ensembles from single-particle cryo-EM data using cryoDRGN
Laurel F. Kinman, Barrett M. Powell, Ellen D. Zhong, Bonnie Berger, Joseph H. Davis
Nature Protocols  ·  14 Nov 2022  ·  doi:10.1038/s41596-022-00763-x
Adapting protein language models for rapid DTI prediction
Samuel Sledzieski, Rohit Singh, Lenore Cowen, Bonnie Berger
Cold Spring Harbor Laboratory  ·  04 Nov 2022  ·  doi:10.1101/2022.11.03.515084
Contrasting drugs from decoys
Samuel Sledzieski, Rohit Singh, Lenore Cowen, Bonnie Berger
Cold Spring Harbor Laboratory  ·  04 Nov 2022  ·  doi:10.1101/2022.11.03.515086
CryoDRGN2: Ab Initio Neural Reconstruction of Dynamic Protein Complexes
Ellen D Zhong, Adam Lerer, Joseph H Davis, Bonnie Berger
Microscopy and Microanalysis  ·  01 Aug 2022  ·  doi:10.1017/S1431927622005062
Prioritizing transcription factor perturbations from single-cell transcriptomics
Rohit Singh, Joshua Shing Shun Li, Sudhir Gopal Tattikota, Yifang Liu, Jun Xu, Yanhui Hu, Norbert Perrimon, Bonnie Berger
Cold Spring Harbor Laboratory  ·  30 Jun 2022  ·  doi:10.1101/2022.06.27.497786
Genome-wide mapping of somatic mutation rates uncovers drivers of cancer
Maxwell A. Sherman, Adam U. Yaari, Oliver Priebe, Felix Dietlein, Po-Ru Loh, Bonnie Berger
Nature Biotechnology  ·  20 Jun 2022  ·  doi:10.1038/s41587-022-01353-8
Secure and federated linear mixed model association tests
Jeffrey Chen, Manaswitha Edupalli, Bonnie Berger, Hyunghoon Cho
Cold Spring Harbor Laboratory  ·  24 May 2022  ·  doi:10.1101/2022.05.20.492837
Cellular and transcriptional diversity over the course of human lactation
Sarah K. Nyquist, Patricia Gao, Tessa K. J. Haining, Michael R. Retchin, Yarden Golan, …, Nadav Ahituv, Micaela E. Martinez, Alex K. Shalek, Bonnie Berger, Brittany A. Goods
Proceedings of the National Academy of Sciences  ·  04 Apr 2022  ·  doi:10.1073/pnas.2121720119
Deep learning guided optimization of human antibody against SARS-CoV-2 variants with broad neutralization
Sisi Shan, Shitong Luo, Ziqing Yang, Junxian Hong, Yufeng Su, …, Xuanling Shi, Qi Zhang, Bonnie Berger, Linqi Zhang, Jian Peng
Proceedings of the National Academy of Sciences  ·  01 Mar 2022  ·  doi:10.1073/pnas.2122954119

2021

Deciphering the species-level structure of topologically associating domains
Rohit Singh, Bonnie Berger
Cold Spring Harbor Laboratory  ·  29 Oct 2021  ·  doi:10.1101/2021.10.28.466333
Scalable Multimer Structure Prediction using Diffusion Models
Peter Pao-Huang, Bowen Jing, Bonnie Berger
NeurIPS 2023 AI for Science Workshop  ·  28 Oct 2021  ·  [no id info]
Minimizer-space de Bruijn graphs: Whole-genome assembly of long reads in minutes on a personal computer
Barış Ekim, Bonnie Berger, Rayan Chikhi
Cell Systems  ·  01 Oct 2021  ·  doi:10.1016/j.cels.2021.08.009
D-SCRIPT translates genome to phenome with sequence-based, structure-aware, genome-scale predictions of protein-protein interactions
Samuel Sledzieski, Rohit Singh, Lenore Cowen, Bonnie Berger
Cell Systems  ·  01 Oct 2021  ·  doi:10.1016/j.cels.2021.08.010
A Python-based programming language for high-performance computational genomics
Ariya Shajii, Ibrahim Numanagić, Alexander T. Leighton, Haley Greenyer, Saman Amarasinghe, Bonnie Berger
Nature Biotechnology  ·  19 Jul 2021  ·  doi:10.1038/s41587-021-00985-6
Bayesian information sharing enhances detection of regulatory associations in rare cell types
Alexander P Wu, Jian Peng, Bonnie Berger, Hyunghoon Cho
Bioinformatics  ·  01 Jul 2021  ·  doi:10.1093/bioinformatics/btab269
Levenshtein Distance, Sequence Comparison and Biological Database Search
Bonnie Berger, Michael S. Waterman, Yun William Yu
IEEE Transactions on Information Theory  ·  01 Jun 2021  ·  doi:10.1109/TIT.2020.2996543
Learning the protein language: Evolution, structure, and function
Tristan Bepler, Bonnie Berger
Cell Systems  ·  01 Jun 2021  ·  doi:10.1016/j.cels.2021.05.017
Schema: metric learning enables interpretable synthesis of heterogeneous single-cell modalities
Rohit Singh, Brian L. Hie, Ashwin Narayan, Bonnie Berger
Genome Biology  ·  03 May 2021  ·  doi:10.1186/s13059-021-02313-2
CryoDRGN: reconstruction of heterogeneous cryo-EM structures using neural networks
Ellen D. Zhong, Tristan Bepler, Bonnie Berger, Joseph H. Davis
Nature Methods  ·  01 Feb 2021  ·  doi:10.1038/s41592-020-01049-4
Assessing single-cell transcriptomic variability through density-preserving data visualization
Ashwin Narayan, Bonnie Berger, Hyunghoon Cho
Nature Biotechnology  ·  18 Jan 2021  ·  doi:10.1038/s41587-020-00801-7
Learning the language of viral evolution and escape
Brian Hie, Ellen D. Zhong, Bonnie Berger, Bryan Bryson
Science  ·  15 Jan 2021  ·  doi:10.1126/science.abd7331
Exploring generative atomic models in cryo-EM reconstruction
Ellen D. Zhong, Adam Lerer, Joseph H. Davis, Bonnie Berger
arXiv  ·  01 Jan 2021  ·  doi:10.48550/arXiv.2107.01331

2020

Learning mutational semantics
Brian Hie, Ellen Zhong, Bryan Bryson, Bonnie Berger
Advances in Neural Information Processing Systems  ·  12 Dec 2020  ·  [no id info]
Leveraging Uncertainty in Machine Learning Accelerates Biological Discovery and Design
Brian Hie, Bryan D. Bryson, Bonnie Berger
Cell Systems  ·  01 Nov 2020  ·  doi:10.1016/j.cels.2020.09.007
Topaz-Denoise: general deep denoising models for cryoEM and cryoET
Tristan Bepler, Kotaro Kelley, Alex J. Noble, Bonnie Berger
Nature Communications  ·  15 Oct 2020  ·  doi:10.1038/s41467-020-18952-1
Improved haplotype inference by exploiting long-range linking and allelic imbalance in RNA-seq datasets
Emily Berger, Deniz Yorukoglu, Lillian Zhang, Sarah K. Nyquist, Alex K. Shalek, Manolis Kellis, Ibrahim Numanagić, Bonnie Berger
Nature Communications  ·  16 Sep 2020  ·  doi:10.1038/s41467-020-18320-z
Computational Methods for Single-Cell RNA Sequencing
Brian Hie, Joshua Peters, Sarah K. Nyquist, Alex K. Shalek, Bonnie Berger, Bryan D. Bryson
Annual Review of Biomedical Data Science  ·  20 Jul 2020  ·  doi:10.1146/annurev-biodatasci-012220-100601
Hopper: a mathematically optimal algorithm for sketching biological data
Benjamin DeMeo, Bonnie Berger
Bioinformatics  ·  01 Jul 2020  ·  doi:10.1093/bioinformatics/btaa408
Privacy-Preserving Biomedical Database Queries with Optimal Privacy-Utility Trade-Offs
Hyunghoon Cho, Sean Simmons, Ryan Kim, Bonnie Berger
Cell Systems  ·  01 May 2020  ·  doi:10.1016/j.cels.2020.03.006
Carnelian uncovers hidden functional patterns across diverse study populations from whole metagenome sequencing reads
Sumaiya Nazeen, Yun William Yu, Bonnie Berger
Genome Biology  ·  24 Feb 2020  ·  doi:10.1186/s13059-020-1933-7
A Randomized Parallel Algorithm for Efficiently Finding Near-Optimal Universal Hitting Sets
Barış Ekim, Bonnie Berger, Yaron Orenstein
Lecture Notes in Computer Science  ·  01 Jan 2020  ·  doi:10.1007/978-3-030-45257-5_3

2019

Meta-analysis of Caenorhabditis elegans single-cell developmental data reveals multi-frequency oscillation in gene activation
Luke A D Hutchison, Bonnie Berger, Isaac S Kohane
Bioinformatics  ·  20 Dec 2019  ·  doi:10.1093/bioinformatics/btz864
Positive-unlabeled convolutional neural networks for particle picking in cryo-electron micrographs
Tristan Bepler, Andrew Morin, Micah Rapp, Julia Brasch, Lawrence Shapiro, Alex J. Noble, Bonnie Berger
Nature Methods  ·  07 Oct 2019  ·  doi:10.1038/s41592-019-0575-8
Emerging technologies towards enhancing privacy in genomic data sharing
Bonnie Berger, Hyunghoon Cho
Genome Biology  ·  02 Jul 2019  ·  doi:10.1186/s13059-019-1741-0