inferwire
/

Category

AI

AI

LoopMDM: Boosting AI Efficiency via Layer Looping

LoopMDM introduces a recursive transformer architecture for masked diffusion models, improving training speed and performance by looping early-middle layers to achieve deeper reasoning with fewer parameters.

AI

HDET: Optimizing AI Training via GPU Divergence

Hyperparameter-Divergent Ensemble Training (HDET) repurposes idle GPU replicas to explore learning rates in real-time, significantly improving training efficiency for large neural networks.