Tools — gatac.tl#
The gatac.tl namespace provides downstream analysis tools: dimensionality
reduction, peak calling, marker detection, motif scanning, chromVAR deviation
scoring, and topic modelling.
Dimensionality reduction#
Compute a spectral decomposition of the cell × feature matrix — the standard
entry point for UMAP and clustering in ATAC-seq workflows (spectral) — or
model topics over the peak-accessibility matrix with GPU-accelerated
mini-batch Online Variational Bayes (lda, MiniBatchLDA).
GPU-accelerated spectral embedding via Laplacian Eigenmaps. |
|
Learn topics from a binarized peak matrix using GPU-accelerated mini-batch LDA. |
|
GPU-accelerated Mini-batch LDA via Online Variational Bayes. |
Peak calling & marker peaks#
Call ATAC peaks per cell-type group using the MACS3 algorithm under the hood, merge them into a non-overlapping set, count fragments over peaks, and identify differentially accessible peaks between groups using a GPU- accelerated binomial test with Benjamini–Hochberg correction.
GPU-accelerated peak calling per cluster using gmacs algorithm. |
|
Merge peaks from different groups into fixed-width, non-overlapping peaks. |
|
Generate cell by peak count matrix. |
|
GPU-accelerated marker peak detection using binomial test. |
|
Extract marker peak names from stored results. |
Motif analysis#
Read motifs from MEME-format files, test for over-representation in peak sets, and run GSEA on motif rankings.
Read motifs from a MEME format file. |
|
Parse MEME format content into motifs. |
|
DNA motif represented as a position weight matrix (PWM). |
|
Sample background peaks whose GC-content distribution matches target peaks. |
|
Identify enriched transcription factor motifs using GPU acceleration. |
|
Run preranked GSEA to identify enriched TF motifs from a LogFC-ranked peak list. |
chromVAR#
Compute transcription-factor activity deviation scores following the chromVAR algorithm. All compute-intensive steps are executed on GPU.
Run the full chromVAR pipeline in a single call. |
|
Compute peak biases (GC content and/or CpG density) for background sampling. |
|
Sample background peaks for chromVAR analysis. |
|
Scan peaks for motif matches and create a sparse motif match matrix. |
|
Compute chromVAR TF deviation scores. |
To run the four steps individually, see the docstring of
compute_deviations (which lists them end-to-end).