- Improving Risk Stratification for Patients With Type 2 Myocardial Infarction. Journal of the American College of Cardiology, 2023
- Latent Crohn’s Disease Subgroups are Identified by Longitudinal Faecal Calprotectin Profiles. medRxiv, Aug 2022
Background: High faecal calprotectin is associated with poor outcomes in Crohn’s disease. Monitoring of faecal calprotectin trajectories could characterise disease progression before severe complications occur. Aims: We undertook an unbiased assessment of a retrospective incident Crohn’s disease cohort to assess inter-individual variability in faecal calprotectin levels over time. We aimed to explore whether latent classes of such profiles are associated with a composite endpoint consisting of surgery, hospitalisation, or Montreal behaviour progression, and with other clinical information. Methods: Latent class mixed models were used to model faecal calprotectin trajectories within five years of diagnosis. The Akaike information criterion, Bayesian information criterion, alluvial plots, and class-specific trajectories were used to decide the optimal number of classes. Log-rank tests of Kaplan-Meier estimators were used to test for associations between class membership and outcomes. Results: Our study cohort comprised 365 subjects and 2856 faecal calprotectin measurements (median 7 per subject). Four latent classes were found, broadly described as one class with consistently high faecal calprotectin and three classes characterised by downward calprotectin trends. Class membership was significantly associated with the composite endpoint and, separately, with hospitalisation and Montreal disease progression, but not surgery. Early biologic therapy was strongly associated with class membership. Conclusions: Our analysis provides a novel stratification approach for Crohn’s disease patients based on faecal calprotectin trajectories. Characterising this heterogeneity helps to better understand different patterns of disease progression and to identify those with a higher risk of worse outcomes. Ultimately, this information will assist the design of more targeted interventions.
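The class-versus-outcome comparisons above rely on the standard two-sample log-rank test of Kaplan-Meier estimators. As a self-contained sketch of that test — a textbook construction in Python, not the authors' own code, with illustrative variable names:

```python
import math

def logrank_test(times1, events1, times2, events2):
    """Two-sample log-rank test (standard textbook construction).

    times*: follow-up times; events*: 1 if the event occurred, 0 if censored.
    Returns the chi-square statistic (1 df) and its p-value.
    """
    event_times = sorted({t for t, e in zip(times1 + times2, events1 + events2) if e})
    obs1 = exp1 = var = 0.0
    for t in event_times:
        n1 = sum(1 for x in times1 if x >= t)               # at risk, group 1
        n2 = sum(1 for x in times2 if x >= t)               # at risk, group 2
        d1 = sum(1 for x, e in zip(times1, events1) if x == t and e)
        d2 = sum(1 for x, e in zip(times2, events2) if x == t and e)
        n, d = n1 + n2, d1 + d2
        if n < 2:
            continue                                        # variance undefined
        obs1 += d1
        exp1 += d * n1 / n
        var += d * (n1 / n) * (n2 / n) * (n - d) / (n - 1)
    chi2 = (obs1 - exp1) ** 2 / var
    p = math.erfc(math.sqrt(chi2 / 2))                      # chi-square(1) tail
    return chi2, p
```

In practice one would use an established survival package; the point here is only that the statistic compares observed versus expected event counts per group across pooled event times.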
- A review on competing risks methods for survival analysis. arXiv, Dec 2022
When modelling competing risks survival data, several techniques have been proposed in both the statistical and machine learning literature. State-of-the-art methods have extended classical approaches with more flexible assumptions that can improve predictive performance, allow high dimensional data and missing values, among others. Despite this, modern approaches have not been widely employed in applied settings. This article aims to aid the uptake of such methods by providing a condensed compendium of competing risks survival methods with a unified notation and interpretation across approaches. We highlight available software and, when possible, demonstrate their usage via reproducible R vignettes. Moreover, we discuss two major concerns that can affect benchmark studies in this context: the choice of performance metrics and reproducibility.
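Among the classical estimators such a review covers is the cumulative incidence function via the Aalen-Johansen approach. A minimal Python sketch, assuming status codes 0 = censored and 1, 2, … = failure causes (a simplified illustration, not drawn from the article's vignettes):

```python
def cumulative_incidence(times, status, cause, horizon):
    """Aalen-Johansen cumulative incidence for one competing cause.

    status: 0 = censored, otherwise the cause (1, 2, ...) of failure.
    Returns the estimated probability of failing from `cause` by `horizon`.
    """
    data = sorted(zip(times, status))
    n = len(data)
    surv = 1.0      # overall Kaplan-Meier survival just before the current time
    cif = 0.0
    i = 0
    while i < n and data[i][0] <= horizon:
        t = data[i][0]
        n_t = sum(1 for tt, _ in data if tt >= t)            # at risk at t
        d_cause = sum(1 for tt, s in data if tt == t and s == cause)
        d_all = sum(1 for tt, s in data if tt == t and s != 0)
        cif += surv * d_cause / n_t
        surv *= 1 - d_all / n_t
        while i < n and data[i][0] == t:                     # skip ties at t
            i += 1
    return cif

# with no censoring and every subject failing, the cause-specific
# incidences sum to one at the last event time
c1 = cumulative_incidence([1, 2, 3, 4], [1, 2, 1, 2], cause=1, horizon=4)
c2 = cumulative_incidence([1, 2, 3, 4], [1, 2, 1, 2], cause=2, horizon=4)
```

Naively applying one-minus-Kaplan-Meier per cause would overestimate each incidence; weighting by the overall survival probability is what makes the competing-risks estimator consistent.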
- scMET: Bayesian modeling of DNA methylation heterogeneity at single-cell resolution. Genome Biology, Dec 2021
High-throughput single-cell measurements of DNA methylomes can quantify methylation heterogeneity and uncover its role in gene regulation. However, technical limitations and sparse coverage can preclude this task. scMET is a hierarchical Bayesian model which overcomes sparsity, sharing information across cells and genomic features to robustly quantify genuine biological heterogeneity. scMET can identify highly variable features that drive epigenetic heterogeneity, and perform differential methylation and variability analyses. We illustrate how scMET facilitates the characterization of epigenetically distinct cell populations and how it enables the formulation of novel hypotheses on the epigenetic regulation of gene expression. scMET is available at https://github.com/andreaskapou/scMET.
- DNA Methylation scores augment 10-year risk prediction of diabetes. medRxiv, Dec 2021
Type 2 diabetes mellitus (T2D) is one of the most prevalent diseases in the world and presents a major health and economic burden, a notable proportion of which could be alleviated with improved early prediction and intervention. While standard risk factors including age, obesity, and hypertension have shown good predictive performance, we show that the use of CpG DNA methylation information leads to a significant improvement in the prediction of 10-year T2D incidence risk. Whilst previous studies have been largely constrained by linear assumptions and the use of CpGs one at a time, we have adopted a more flexible approach based on a range of linear and tree-ensemble models for classification and time-to-event prediction. Using the Generation Scotland cohort (n=9,537) our best performing model (Area Under the Curve (AUC)=0.880, Precision Recall AUC (PRAUC)=0.539, McFadden’s R2=0.316) used a LASSO Cox proportional-hazards predictor and showed notable improvement in onset prediction, above and beyond standard risk factors (AUC=0.860, PRAUC=0.444, R2=0.261). Replication of the main finding was observed in an external test dataset (the German-based KORA study, p=3.7×10⁻⁴). Tree-ensemble methods provided comparable performance and future improvements to these models are discussed. Finally, we introduce MethylPipeR, an R package with accompanying user interface, for systematic and reproducible development of complex trait and incident disease predictors. While MethylPipeR was applied to incident T2D prediction with DNA methylation in our experiments, the package is designed for generalised development of predictive models and is applicable to a wide range of omics data and target traits.
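The best-performing predictor above is a LASSO-penalised Cox model. The authors' pipeline is the R package MethylPipeR, but the underlying idea can be sketched with a toy proximal-gradient (ISTA) fit of the L1-penalised Breslow partial likelihood. All names, data, and tuning values below are illustrative, not from the paper:

```python
import math, random

def lasso_cox(times, events, X, lam=0.1, lr=0.01, iters=400):
    """Toy L1-penalised Cox fit via proximal gradient (ISTA).

    Uses the Breslow partial likelihood without tie handling; a sketch of
    the idea behind penalised Cox predictors, not a production fit.
    """
    n, p = len(times), len(X[0])
    beta = [0.0] * p
    for _ in range(iters):
        grad = [0.0] * p
        for i in range(n):
            if not events[i]:
                continue
            risk = [j for j in range(n) if times[j] >= times[i]]
            w = [math.exp(sum(b * x for b, x in zip(beta, X[j]))) for j in risk]
            tot = sum(w)
            for k in range(p):
                xbar = sum(wj * X[j][k] for wj, j in zip(w, risk)) / tot
                grad[k] += xbar - X[i][k]   # gradient of neg log partial lik.
        for k in range(p):                  # gradient step + soft-thresholding
            b = beta[k] - lr * grad[k]
            beta[k] = math.copysign(max(abs(b) - lr * lam, 0.0), b)
    return beta

# simulated data: feature 0 triples the hazard, feature 1 is pure noise
rng = random.Random(42)
times, events, X = [], [], []
for i in range(60):
    x0 = float(i % 2)
    times.append(rng.expovariate(3.0 if x0 else 1.0))
    events.append(1)
    X.append([x0, rng.uniform(-1, 1)])
beta = lasso_cox(times, events, X)
```

For real methylation-scale data one would use an optimised solver (e.g. glmnet in R); the soft-thresholding step is what produces the sparse coefficient vector that makes LASSO Cox tractable with hundreds of thousands of CpGs.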
- Model updating after interventions paradoxically introduces bias. arXiv, Dec 2021
Machine learning is increasingly being used to generate prediction models for use in a number of real-world settings, from credit risk assessment to clinical decision support. Recent discussions have highlighted potential problems in the updating of a predictive score for a binary outcome when an existing predictive score forms part of the standard workflow, driving interventions. In this setting, the existing score induces an additional causative pathway which leads to miscalibration when the original score is replaced. We propose a general causal framework to describe and address this problem, and demonstrate an equivalent formulation as a partially observed Markov decision process. We use this model to demonstrate the impact of such ‘naive updating’ when performed repeatedly. Namely, we show that successive predictive scores may converge to a point where they predict their own effect, or may eventually tend toward a stable oscillation between two values, and we argue that neither outcome is desirable. Furthermore, we demonstrate that even if model-fitting procedures improve, actual performance may worsen. We complement these findings with a discussion of several potential routes to overcome these issues.
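The oscillation described above can be reproduced in a deliberately simplified scalar simulation: a score triggers an intervention whenever it exceeds a threshold, and each "naive" refit sets the new score to the event rate observed under the previous score. Numbers and names here are illustrative, not taken from the paper:

```python
def naive_updates(base_rate, effect, threshold, steps):
    """Scalar caricature of repeated 'naive' model updating.

    A score above `threshold` triggers an intervention that multiplies the
    event rate by `effect`; each refit sets the score to the event rate
    observed under the previous score, ignoring the intervention.
    """
    score = base_rate                 # initial fit on pre-intervention data
    history = [score]
    for _ in range(steps):
        observed = base_rate * effect if score > threshold else base_rate
        score = observed              # the 'naive' update
        history.append(score)
    return history

h = naive_updates(base_rate=0.5, effect=0.4, threshold=0.3, steps=6)
# with these numbers the score never settles: it alternates between 0.5
# (intervention off, so the full risk is observed) and 0.2 (intervention on)
```

The high score suppresses the outcome it predicts, the refit then learns the suppressed rate, the intervention switches off, and the cycle repeats — the stable oscillation the paper warns about.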
- Single-nucleus RNA-seq2 reveals functional crosstalk between liver zonation and ploidy. Nature Communications, Dec 2021
Single-cell RNA-seq reveals the role of pathogenic cell populations in development and progression of chronic diseases. In order to expand our knowledge on cellular heterogeneity, we have developed a single-nucleus RNA-seq2 method tailored for the comprehensive analysis of the nuclear transcriptome from frozen tissues, allowing the dissection of all cell types present in the liver, regardless of cell size or cellular fragility. We use this approach to characterize the transcriptional profile of individual hepatocytes with different levels of ploidy, and have discovered that ploidy states are associated with different metabolic potential, and gene expression in tetraploid mononucleated hepatocytes is conditioned by their position within the hepatic lobule. Our work reveals a remarkable crosstalk between gene dosage and spatial distribution of hepatocytes.
- Development and assessment of a machine learning tool for predicting emergency admission in Scotland. medRxiv, Dec 2021
Avoiding emergency hospital admission (EA) is advantageous to individual health and the healthcare system. We develop a statistical model estimating risk of EA for most of the Scottish population (> 4.8M individuals) using electronic health records, such as hospital episodes and prescribing activity. We demonstrate good predictive accuracy (AUROC 0.80), calibration and temporal stability. We find strong prediction of respiratory and metabolic EA, show a substantial risk contribution from socioeconomic decile, and highlight an important problem in model updating. Our work constitutes a rare example of a population-scale machine learning score to be deployed in a healthcare setting. Analysis code is available at https://github.com/jamesliley/SPARRAv4.
- SCRaPL: hierarchical Bayesian modelling of associations in single cell multi-omics data. bioRxiv, Dec 2021
Single-cell multi-omics assays offer unprecedented opportunities to explore gene regulation at cellular level. However, high levels of technical noise and data sparsity frequently lead to a lack of statistical power in correlative analyses, identifying very few, if any, significant associations between different molecular layers. Here we propose SCRaPL, a novel computational tool that increases power by carefully modelling noise in the experimental systems. We show on real and simulated multi-omics single-cell data sets that SCRaPL achieves higher sensitivity and better robustness in identifying correlations, while maintaining a similar level of false positives as standard analyses based on Pearson correlation.
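The motivating problem — technical noise attenuating correlations between molecular layers — can be shown with a small simulation. This is a toy demonstration of attenuation, not the SCRaPL model itself, and all names are illustrative:

```python
import math, random

def pearson(x, y):
    """Plain Pearson correlation coefficient."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sx = math.sqrt(sum((v - mx) ** 2 for v in x))
    sy = math.sqrt(sum((v - my) ** 2 for v in y))
    return sum((a - mx) * (b - my) for a, b in zip(x, y)) / (sx * sy)

rng = random.Random(1)
n = 2000
latent = [rng.gauss(0, 1) for _ in range(n)]        # shared biological signal
# two noisy 'layers' measuring the same signal, with heavy technical noise
layer1 = [z + rng.gauss(0, 2) for z in latent]
layer2 = [z + rng.gauss(0, 2) for z in latent]
r = pearson(layer1, layer2)
# the latent correlation is 1, but with noise variance 4 per layer the
# observable correlation is attenuated towards 1 / (1 + 4) = 0.2
```

A noise-aware hierarchical model recovers power precisely because it estimates the latent correlation rather than the attenuated observed one.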
- Eleven grand challenges in single-cell data science. Genome Biology, Dec 2020
The recent boom in microfluidics and combinatorial indexing strategies, combined with low sequencing costs, has empowered single-cell sequencing technology. Thousands—or even millions—of cells analyzed in a single experiment amount to a data revolution in single-cell biology and pose unique data science problems. Here, we outline eleven challenges that will be central to bringing this emerging field of single-cell data science forward. For each challenge, we highlight motivating research questions, review prior work, and formulate open problems. This compendium is for established researchers, newcomers, and students alike, highlighting interesting and rewarding problems for the coming years.
- High-Sensitivity Cardiac Troponin and the Universal Definition of Myocardial Infarction. Circulation, Dec 2020
Background: The introduction of more sensitive cardiac troponin assays has led to increased recognition of myocardial injury in acute illnesses other than acute coronary syndrome. The Universal Definition of Myocardial Infarction recommends high-sensitivity cardiac troponin testing and classification of patients with myocardial injury based on pathogenesis, but the clinical implications of implementing this guideline are not well understood. Methods: In a stepped-wedge cluster randomized, controlled trial, we implemented a high-sensitivity cardiac troponin assay and the recommendations of the Universal Definition in 48 282 consecutive patients with suspected acute coronary syndrome. In a prespecified secondary analysis, we compared the primary outcome of myocardial infarction or cardiovascular death and secondary outcome of noncardiovascular death at 1 year across diagnostic categories. Results: Implementation increased the diagnosis of type 1 myocardial infarction by 11% (510/4471), type 2 myocardial infarction by 22% (205/916), and acute and chronic myocardial injury by 36% (443/1233) and 43% (389/898), respectively. Compared with those without myocardial injury, the rate of the primary outcome was highest in those with type 1 myocardial infarction (cause-specific hazard ratio [HR] 5.64 [95% CI, 5.12–6.22]), but was similar across diagnostic categories, whereas noncardiovascular deaths were highest in those with acute myocardial injury (cause specific HR 2.65 [95% CI, 2.33–3.01]). Despite modest increases in antiplatelet therapy and coronary revascularization after implementation in patients with type 1 myocardial infarction, the primary outcome was unchanged (cause specific HR 1.00 [95% CI, 0.82–1.21]). Increased recognition of type 2 myocardial infarction and myocardial injury did not lead to changes in investigation, treatment or outcomes. 
Conclusions: Implementation of high-sensitivity cardiac troponin assays and the recommendations of the Universal Definition of Myocardial Infarction identified patients at high risk of cardiovascular and noncardiovascular events but was not associated with consistent increases in treatment or improved outcomes. Trials of secondary prevention are urgently required to determine whether this risk is modifiable in patients without type 1 myocardial infarction.
- Correcting the Mean-Variance Dependency for Differential Variability Testing Using Single-Cell RNA Sequencing Data. Cell Systems, Dec 2018
Cell-to-cell transcriptional variability in otherwise homogeneous cell populations plays an important role in tissue function and development. Single-cell RNA sequencing can characterize this variability in a transcriptome-wide manner. However, technical variation and the confounding between variability and mean expression estimates hinder meaningful comparison of expression variability between cell populations. To address this problem, we introduce an analysis approach that extends the BASiCS statistical framework to derive a residual measure of variability that is not confounded by mean expression. This includes a robust procedure for quantifying technical noise in experiments where technical spike-in molecules are not available. We illustrate how our method provides biological insight into the dynamics of cell-to-cell expression variability, highlighting a synchronization of biosynthetic machinery components in immune cells upon activation. In contrast to the uniform up-regulation of the biosynthetic machinery, CD4+ T cells show heterogeneous up-regulation of immune-related and lineage-defining genes during activation and differentiation.
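The key idea — a measure of variability not confounded by mean expression — can be caricatured with simple moments: fit the log-variance versus log-mean trend across genes and keep the residuals. BASiCS does this within a full Bayesian hierarchical model; the sketch below is only a rough analogue with illustrative toy data:

```python
import math

def residual_variability(counts):
    """Residual of log-variance after a linear fit on log-mean.

    A moment-based caricature of mean-corrected variability: genes with a
    large residual are more variable than their mean alone predicts.
    `counts` is a list of per-gene expression vectors across cells.
    """
    logm, logv = [], []
    for gene in counts:
        n = len(gene)
        m = sum(gene) / n
        v = sum((c - m) ** 2 for c in gene) / (n - 1)
        logm.append(math.log(m))
        logv.append(math.log(v))
    N = len(logm)                               # ordinary least squares fit
    mx, my = sum(logm) / N, sum(logv) / N
    slope = (sum((x - mx) * (y - my) for x, y in zip(logm, logv))
             / sum((x - mx) ** 2 for x in logm))
    intercept = my - slope * mx
    return [y - (intercept + slope * x) for x, y in zip(logm, logv)]

genes = [[1, 2, 3], [10, 20, 30], [5, 5, 6], [100, 110, 90]]  # toy data
res = residual_variability(genes)
```

Comparing raw variances across populations conflates variability with mean shifts; comparing residuals removes the trend, which is the spirit of the residual measure the paper introduces.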
- High-sensitivity troponin in the evaluation of patients with suspected acute coronary syndrome: a stepped-wedge, cluster-randomised controlled trial. The Lancet, Dec 2018
Background: High-sensitivity cardiac troponin assays permit use of lower thresholds for the diagnosis of myocardial infarction, but whether this improves clinical outcomes is unknown. We aimed to determine whether the introduction of a high-sensitivity cardiac troponin I (hs-cTnI) assay with a sex-specific 99th centile diagnostic threshold would reduce subsequent myocardial infarction or cardiovascular death in patients with suspected acute coronary syndrome. Methods: In this stepped-wedge, cluster-randomised controlled trial across ten secondary or tertiary care hospitals in Scotland, we evaluated the implementation of an hs-cTnI assay in consecutive patients who had been admitted to the hospitals’ emergency departments with suspected acute coronary syndrome. Patients were eligible for inclusion if they presented with suspected acute coronary syndrome and had paired cardiac troponin measurements from the standard care and trial assays. During a validation phase of 6–12 months, results from the hs-cTnI assay were concealed from the attending clinician, and a contemporary cardiac troponin I (cTnI) assay was used to guide care. Hospitals were randomly allocated to early (n=5 hospitals) or late (n=5 hospitals) implementation, in which the high-sensitivity assay and sex-specific 99th centile diagnostic threshold was introduced immediately after the 6-month validation phase or was deferred for a further 6 months. Patients reclassified by the high-sensitivity assay were defined as those with an increased hs-cTnI concentration in whom cTnI concentrations were below the diagnostic threshold on the contemporary assay. The primary outcome was subsequent myocardial infarction or death from cardiovascular causes at 1 year after initial presentation. Outcomes were compared in patients reclassified by the high-sensitivity assay before and after its implementation by use of an adjusted generalised linear mixed model. 
This trial is registered with ClinicalTrials.gov, number NCT01852123. Findings: Between June 10, 2013, and March 3, 2016, we enrolled 48 282 consecutive patients (61 [SD 17] years, 47% women) of whom 10 360 (21%) patients had cTnI concentrations greater than those of the 99th centile of the normal range of values, who were identified by the contemporary assay or the high-sensitivity assay. The high-sensitivity assay reclassified 1771 (17%) of 10 360 patients with myocardial injury or infarction who were not identified by the contemporary assay. In those reclassified, subsequent myocardial infarction or cardiovascular death within 1 year occurred in 105 (15%) of 720 patients in the validation phase and 131 (12%) of 1051 patients in the implementation phase (adjusted odds ratio for implementation vs validation phase 1·10, 95% CI 0·75 to 1·61; p=0·620). Interpretation: Use of a high-sensitivity assay prompted reclassification of 1771 (17%) of 10 360 patients with myocardial injury or infarction, but was not associated with a lower subsequent incidence of myocardial infarction or cardiovascular death at 1 year. Our findings question whether the diagnostic threshold for myocardial infarction should be based on the 99th centile derived from a normal reference population.
- Normalizing single-cell RNA sequencing data: challenges and opportunities. Nature Methods, Dec 2017
Single-cell transcriptomics is becoming an important component of the molecular biologist’s toolkit. A critical step when analyzing data generated using this technology is normalization. However, normalization is typically performed using methods developed for bulk RNA sequencing or even microarray data, and the suitability of these methods for single-cell transcriptomics has not been assessed. We here discuss commonly used normalization approaches and illustrate how these can produce misleading results. Finally, we present alternative approaches and provide recommendations for single-cell RNA sequencing users.
- Incorporating unobserved heterogeneity in Weibull survival models: A Bayesian approach. Econometrics and Statistics, Dec 2017
Outlying observations and other forms of unobserved heterogeneity can distort inference for survival datasets. The family of Rate Mixtures of Weibull distributions includes subject-level frailty terms as a solution to this issue. With a parametric mixing distribution assigned to the frailties, this family generates flexible hazard functions. Covariates are introduced via an Accelerated Failure Time specification for which the interpretation of the regression coefficients does not depend on the choice of mixing distribution. A weakly informative prior is proposed by combining the structure of the Jeffreys prior with a proper prior on some model parameters. This improper prior is shown to lead to a proper posterior distribution under easily satisfied conditions. By eliciting the proper component of the prior through the coefficient of variation of the survival times, prior information is matched for different mixing distributions. Posterior inference on subject-level frailty terms is exploited as a tool for outlier detection. Finally, the proposed methodology is illustrated using two real datasets, one concerning bone marrow transplants and another on cerebral palsy.
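A member of a rate-mixture-of-Weibulls family can be simulated directly: draw a subject-level gamma frailty with mean one and scale the Weibull rate by it. A hedged Python sketch — the parameterisation is chosen for clarity and does not claim to match the paper's notation:

```python
import math, random

def rate_mixture_weibull(shape, frailty_shape, n, seed=0):
    """Simulate survival times from a Weibull whose rate is scaled by a
    subject-level gamma frailty with mean one (illustrative member of a
    rate-mixture family)."""
    rng = random.Random(seed)
    draws = []
    for _ in range(n):
        # gamma frailty with E[frailty] = 1; smaller frailty_shape means
        # more unobserved heterogeneity between subjects
        frailty = rng.gammavariate(frailty_shape, 1.0 / frailty_shape)
        u = 1.0 - rng.random()                 # uniform on (0, 1]
        # invert the conditional survival S(t | frailty) = exp(-frailty * t**shape)
        draws.append((-math.log(u) / frailty) ** (1.0 / shape))
    return draws

sample = rate_mixture_weibull(shape=1.5, frailty_shape=2.0, n=100, seed=1)
```

Subjects who draw a small frailty survive much longer than a plain Weibull would predict, which is how the mixture produces the heavier tails and flexible hazards the abstract describes.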
- Aging increases cell-to-cell transcriptional variability upon immune stimulation. Science, Dec 2017
Single-cell sequencing of mouse immune cells reveals how aging destabilizes a conserved transcriptional activation program. How and why the immune system becomes less effective with age are not well understood. Martinez-Jimenez et al. performed single-cell sequencing of CD4+ T cells in old and young mice of two species. In young mice, the gene expression program of early immune activation was tightly regulated and conserved between species. However, as mice aged, the expression of genes involved in pathways responding to immune cell stimulation was not as robust and exhibited increased cell-to-cell variability. Science, this issue p. 1433. Aging is characterized by progressive loss of physiological and cellular functions, but the molecular basis of this decline remains unclear. We explored how aging affects transcriptional dynamics using single-cell RNA sequencing of unstimulated and stimulated naïve and effector memory CD4+ T cells from young and old mice from two divergent species. In young animals, immunological activation drives a conserved transcriptomic switch, resulting in tightly controlled gene expression characterized by a strong up-regulation of a core activation program, coupled with a decrease in cell-to-cell variability. Aging perturbed the activation of this core program and increased expression heterogeneity across populations of cells in both species. These discoveries suggest that increased cell-to-cell transcriptional variability will be a hallmark feature of aging across most, if not all, mammalian tissues.
- Bayesian survival modelling of university outcomes. Journal of the Royal Statistical Society: Series A (Statistics in Society), Jul 2016
Dropouts and delayed graduations are critical issues in higher education systems worldwide. A key task in this context is to identify risk factors associated with these events, providing potential targets for mitigating policies. For this, we employ a discrete time competing risks survival model, dealing simultaneously with university outcomes and their associated temporal component. We define survival times as the duration of the student’s enrolment at university and possible outcomes as graduation or two types of dropout (voluntary and involuntary), exploring the information recorded at admission time (e.g. educational level of the parents) as potential predictors. Although similar strategies have been previously implemented, we extend the previous methods by handling covariate selection within a Bayesian variable selection framework, where model uncertainty is formally addressed through Bayesian model averaging. Our methodology is general; however, here we focus on undergraduate students enrolled in three selected degree programmes of the Pontificia Universidad Católica de Chile during the period 2000–2011. Our analysis reveals interesting insights, highlighting the main covariates that influence students’ risk of dropout and delayed graduation.
- Beyond comparisons of means: understanding changes in gene expression at the single-cell level. Genome Biology, Jul 2016
Traditional differential expression tools are limited to detecting changes in overall expression, and fail to uncover the rich information provided by single-cell level data sets. We present a Bayesian hierarchical model that builds upon BASiCS to study changes that lie beyond comparisons of means, incorporating built-in normalization and quantifying technical artifacts by borrowing information from spike-in genes. Using a probabilistic approach, we highlight genes undergoing changes in cell-to-cell heterogeneity but whose overall expression remains unchanged. Control experiments validate our method’s performance and a case study suggests that novel biological insights can be revealed. Our method is implemented in R and available at https://github.com/catavallejos/BASiCS.
- BASiCS: Bayesian Analysis of Single-Cell Sequencing Data. PLOS Computational Biology, Jun 2015
Single-cell mRNA sequencing can uncover novel cell-to-cell heterogeneity in gene expression levels in seemingly homogeneous populations of cells. However, these experiments are prone to high levels of unexplained technical noise, creating new challenges for identifying genes that show genuine heterogeneous expression within the population of cells under study. BASiCS (Bayesian Analysis of Single-Cell Sequencing data) is an integrated Bayesian hierarchical model where: (i) cell-specific normalisation constants are estimated as part of the model parameters, (ii) technical variability is quantified based on spike-in genes that are artificially introduced to each analysed cell’s lysate and (iii) the total variability of the expression counts is decomposed into technical and biological components. BASiCS also provides an intuitive detection criterion for highly (or lowly) variable genes within the population of cells under study. This is formalised by means of tail posterior probabilities associated with high (or low) biological cell-to-cell variance contributions, quantities that can be easily interpreted by users. We demonstrate our method using gene expression measurements from mouse Embryonic Stem Cells. Cross-validation and meaningful enrichment of gene ontology categories within genes classified as highly (or lowly) variable support the efficacy of our approach.
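The decomposition of total variability into technical and biological components can be caricatured with simple moments: estimate a technical floor for the squared coefficient of variation (CV²) from the spike-ins, then treat each gene's excess as biological. BASiCS does this jointly within a Bayesian hierarchical model; the sketch below, with illustrative toy data, is only a moment-based analogue:

```python
def decompose_cv2(gene_counts, spike_counts):
    """Toy split of squared coefficient of variation (CV^2) into a
    technical floor (average spike-in CV^2) and a biological excess.

    Returns a (total_cv2, biological_cv2) pair per gene.
    """
    def cv2(xs):
        n = len(xs)
        m = sum(xs) / n
        v = sum((x - m) ** 2 for x in xs) / (n - 1)
        return v / m ** 2
    # spike-ins are added at known quantities, so their variability across
    # cells is (to first order) purely technical
    technical = sum(cv2(s) for s in spike_counts) / len(spike_counts)
    return [(cv2(g), max(cv2(g) - technical, 0.0)) for g in gene_counts]

spikes = [[9, 10, 11, 10], [19, 21, 20, 20]]   # mildly noisy spike-ins
genes = [[1, 2, 3, 4], [5, 5, 5, 5]]           # one variable, one constant gene
result = decompose_cv2(genes, spikes)
```

Ranking genes by the biological component rather than total variability avoids flagging genes whose apparent heterogeneity is mostly measurement noise.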
- Objective Bayesian Survival Analysis Using Shape Mixtures of Log-Normal Distributions. Journal of the American Statistical Association, Jun 2015
Survival models such as the Weibull or log-normal lead to inference that is not robust to the presence of outliers. They also assume that all heterogeneity between individuals can be modeled through covariates. This article considers the use of infinite mixtures of lifetime distributions as a solution for these two issues. This can be interpreted as the introduction of a random effect in the survival distribution. We introduce the family of shape mixtures of log-normal distributions, which covers a wide range of density and hazard functions. Bayesian inference under nonsubjective priors based on the Jeffreys’ rule is examined and conditions for posterior propriety are established. The existence of the posterior distribution on the basis of a sample of point observations is not always guaranteed and a solution through set observations is implemented. In addition, we propose a method for outlier detection based on the mixture structure. A simulation study illustrates the performance of our methods under different scenarios and an application to a real dataset is provided. Supplementary materials for the article, which include R code, are available online.