This guide provides a comprehensive overview of the models supported by SAMtools, the ubiquitous suite of utilities for interacting with high-throughput sequencing data.
Because "model" can refer to two different things in bioinformatics—sequencing technologies (how data is generated) or genomic feature formats (how data is described)—this guide covers both contexts to ensure you have the complete picture.
SAMtool (Segment Anything Model tool) provides a unified interface for various promptable segmentation models. Below is the complete list of officially supported model architectures and specific weight variants as of the latest release (v1.2.0).
The Samtool development team has announced support for the following models in the v2.5 release (Q3 2025):
Additionally, Samtool will introduce automatic operator substitution for unsupported ops using MLIR dialects. samtool supported models
In the rapidly evolving landscape of computer vision, the release of Meta AI’s Segment Anything Model (SAM) was a watershed moment. However, deploying SAM effectively often requires more than just the base model; it requires robust tooling. Enter SAMtool—a collection of utilities, wrappers, and extensions designed to streamline segmentation workflows.
But one question dominates the technical forums: Which models actually work with SAMtool? The answer is more nuanced than a simple list. Here is your comprehensive guide to the architectures, checkpoints, and custom variants supported by the modern SAMtool ecosystem.
For somatic mutation calling, SAMtools provides samtools mpileup with the -B (disable BAQ) flag for tumor samples to avoid over-filtering, followed by BCFtools with the -c (consensus caller) or somatic-specific priors.
Critical addition: samtools calmd (Calculate MD tags) is used post-alignment to realign reads based on the reference, reducing false positives in homopolymer regions. This guide provides a comprehensive overview of the
A SAMtools-supported model treats each command as a node in a directed acyclic graph (DAG):
| Node | Function | Input | Output | I/O Model |
| :--- | :--- | :--- | :--- | :--- |
| samtools view | Conversion/filtering | SAM/BAM/CRAM | BAM | Stream |
| samtools sort | Coordinate sorting | Unsorted BAM | Sorted BAM | Disk-heavy |
| samtools index | BAI indexing | Sorted BAM | .bai file | Random access |
| samtools mpileup | Genotype likelihood | Multiple BAMs | BCF/VCF | Streaming + math |
| samtools depth | Coverage histogram | BAM | Text table | Streaming |
Problem: In circulating tumor DNA (ctDNA) with 1% allele frequency (AF), standard SAMtools models produce high false positives due to sequencing errors.
Solution: A SAMtools-supported model with hard filtering and strand bias correction. Stable Diffusion 3 – Multi-modal text-to-image with DiT
Workflow:
samtools view -q 30 -F 0x04 in.bam (Keep only reads with MAPQ >= 30, not unmapped).samtools mpileup -d 1000 -Q 20 -f ref.fa in.bam > pileup.txtbcftools filter -e 'INFO/DP<20 || INFO/AF<0.01 || INFO/STB>0.05'Result: Sensitivity dropped from 95% to 89%, but precision improved from 0.45 to 0.82 (validated by ddPCR).
A lightweight model for copy number variation:
samtools depth -a -b regions.bed in.bam | \
awk 'sum+=$3 END print sum/NR' # Mean depth per region
This model bypasses complex segmentation algorithms and uses SAMtools' native -d (max depth) flag to normalize GC bias.