Mamba - Selective State Space Models Quick start Mamba is a state-space model architecture achieving O(n) linear complexity for sequence modeling. Installation : Prerequisites : Linux, NVIDIA GPU, PyTorch 1.12+, CUDA 11.6+ Basic usage (Mamba block): Common workflows Workflow 1: Language model with Mamba-2 Complete LM with generation : Workflow 2: Use pretrained Mamba models Load from HuggingFace : Available models : - - - - - Workflow 3: Mamba-1 vs Mamba-2 Mamba-1 (smaller state): Mamba-2 (multi-head, larger state): Key differences : - State size : Mamba-1 (d state=16) vs Mamba-2 (d state=128…