Model Size Reduction
Shrinks Sentence Transformer models by up to 50x, yielding static models of roughly 8 MB to 30 MB on disk.
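To see where sizes in that range come from, note that a static model is essentially a vocab_size x dim lookup table of token vectors, so its size is easy to estimate. This is a back-of-the-envelope sketch; the vocabulary size and dimensionality below are illustrative assumptions, not the exact configuration of any published model.

```python
def static_model_size_mb(vocab_size: int, dim: int, bytes_per_value: int = 4) -> float:
    """Approximate on-disk size of a token-embedding table in megabytes."""
    return vocab_size * dim * bytes_per_value / (1024 ** 2)

# e.g. a 30k-token vocabulary with 256-dimensional float32 vectors:
size = static_model_size_mb(30_000, 256)
print(f"{size:.1f} MB")  # ~29.3 MB, in the stated 8-30 MB range
```

Quantizing the table (e.g. float16 or int8 instead of float32) shrinks it proportionally, which is how the smaller end of the range is reached.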
Accelerated CPU Inference
Speeds up CPU inference by up to 500x by replacing full transformer forward passes with precomputed static token vectors and simple mean pooling.
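The lookup-and-average inference step can be sketched in a few lines of NumPy. The vocabulary and vectors below are toy assumptions; a real model ships a trained embedding table and its own tokenizer.

```python
import numpy as np

# Toy vocabulary and a random stand-in for the trained token-vector table.
vocab = {"the": 0, "cat": 1, "sat": 2}
token_vectors = np.random.default_rng(0).normal(size=(len(vocab), 4)).astype(np.float32)

def encode(tokens: list[str]) -> np.ndarray:
    """Embed a token sequence by mean-pooling its static token vectors."""
    ids = [vocab[t] for t in tokens if t in vocab]
    return token_vectors[ids].mean(axis=0)

emb = encode(["the", "cat", "sat"])
print(emb.shape)  # (4,)
```

Because encoding is just a table lookup plus a mean, there is no attention or matrix multiplication per layer, which is where the CPU speedup comes from.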
Integration with Popular Tools
Integrates directly with Milvus, Weaviate, Spice.ai, Sentence Transformers, and LangChain for embedding generation and vector search.
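Independent of which store is used, the vector-search step those integrations perform boils down to ranking documents by cosine similarity to a query embedding. This is a generic NumPy sketch of that step, with random vectors standing in for static-model embeddings; it is not the API of any of the listed tools.

```python
import numpy as np

# Toy document embeddings standing in for static-model output.
rng = np.random.default_rng(42)
doc_vectors = rng.normal(size=(5, 8)).astype(np.float32)

def cosine_search(query: np.ndarray, docs: np.ndarray, top_k: int = 3) -> list[int]:
    """Return indices of the top_k documents most similar to the query."""
    docs_n = docs / np.linalg.norm(docs, axis=1, keepdims=True)
    q_n = query / np.linalg.norm(query)
    scores = docs_n @ q_n
    return np.argsort(-scores)[:top_k].tolist()

hits = cosine_search(doc_vectors[0], doc_vectors)
print(hits[0])  # the query document ranks itself first -> 0
```

Vector databases like Milvus and Weaviate implement this ranking at scale with approximate-nearest-neighbor indexes rather than a brute-force matrix product.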
Fine-tuning Support
Supports fine-tuning classifiers on static embeddings using PyTorch, Lightning, or scikit-learn for various classification tasks.
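A minimal version of the scikit-learn route looks like the following: freeze the embeddings and fit a lightweight classifier head on top. The embeddings here are synthetic stand-ins; in practice they would come from encoding your texts with the static model.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

# Synthetic stand-in for static embeddings of two classes of texts,
# drawn from two well-separated Gaussians so the task is learnable.
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0.0, 1.0, size=(50, 16)),
               rng.normal(3.0, 1.0, size=(50, 16))])
y = np.array([0] * 50 + [1] * 50)

# "Fine-tuning" here means fitting only this head; the embedding
# table itself stays frozen.
clf = LogisticRegression(max_iter=1000).fit(X, y)
print(clf.score(X, y))
```

The same pattern works with a small PyTorch or Lightning module in place of `LogisticRegression` when you want a nonlinear head or batched GPU training.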
Open-source and Hugging Face Compatibility
Models can be loaded from the Hugging Face Hub or from local paths, making them easy to access and deploy.
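Conceptually, loading a static model from a local path amounts to reading a vocabulary plus a matrix of token vectors from a directory. The file names and layout below are illustrative assumptions, not the actual on-disk format; for real models, Model2Vec's `StaticModel.from_pretrained` is the documented entry point and accepts either a Hub id or a local directory.

```python
import json
from pathlib import Path

import numpy as np

def save_static_model(path: Path, vocab: dict[str, int], vectors: np.ndarray) -> None:
    """Write a toy static model (vocab + vector table) to a directory."""
    path.mkdir(parents=True, exist_ok=True)
    (path / "vocab.json").write_text(json.dumps(vocab))
    np.save(path / "vectors.npy", vectors)

def load_static_model(path: Path) -> tuple[dict[str, int], np.ndarray]:
    """Read the toy static model back from a local directory."""
    vocab = json.loads((path / "vocab.json").read_text())
    vectors = np.load(path / "vectors.npy")
    return vocab, vectors

model_dir = Path("toy_static_model")
save_static_model(model_dir, {"hello": 0, "world": 1}, np.eye(2, dtype=np.float32))
vocab, vectors = load_static_model(model_dir)
print(vectors.shape)  # (2, 2)
```

Hub loading follows the same shape: the files are fetched into a local cache directory first, then read exactly as above.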