Scanpy is a Python toolkit for scalable analysis of single-cell gene expression data. It handles datasets of more than one million cells and builds on the AnnData data structure, provided by the anndata package, for efficient data handling. It covers preprocessing, visualization, clustering, trajectory inference, and differential expression testing.