Multi-Token Prediction

Multi-Token Prediction trains a model to predict several future tokens at each position simultaneously, rather than only the next token, with the aim of improving training efficiency and model performance.
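
As a rough illustration of the idea, the sketch below builds the targets for several future tokens per position and averages a cross-entropy loss over all of them. It is a minimal pure-Python sketch under stated assumptions, not a reference implementation: the names `multi_token_targets`, `multi_token_loss`, and `n_future`, the one-head-per-offset layout, and the unweighted average across heads are all hypothetical choices made for the example.

```python
import math


def multi_token_targets(tokens, n_future):
    """For each position t, collect the next n_future tokens as targets.

    Positions too close to the end to have n_future successors are dropped.
    """
    last = len(tokens) - n_future
    return [tokens[t + 1 : t + 1 + n_future] for t in range(last)]


def cross_entropy(logits, target):
    """Negative log-softmax probability of `target` under `logits`."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]


def multi_token_loss(head_logits, targets):
    """Average cross-entropy over all positions and all prediction heads.

    Assumption: head_logits[t][i] is the logit vector that a hypothetical
    head i emits at position t for the token at offset i + 1.
    """
    total, count = 0.0, 0
    for pos_logits, pos_targets in zip(head_logits, targets):
        for logits, target in zip(pos_logits, pos_targets):
            total += cross_entropy(logits, target)
            count += 1
    return total / count


tokens = [3, 1, 4, 1, 5]
targets = multi_token_targets(tokens, n_future=2)
# targets == [[1, 4], [4, 1], [1, 5]]
```

With `n_future=1` this reduces to ordinary next-token prediction, so the sketch differs from the standard objective only in the extra targets and heads per position.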
