🚀
Multi-Token Prediction
Multi-Token Prediction enables simultaneous prediction of multiple future tokens per position to improve training efficiency and model performance.
Code & Development
Multi-Token Prediction enables simultaneous prediction of multiple future tokens per position to improve training efficiency and model performance.