Run-time Optimization
6 articles
Pandas vs. FireDucks Performance Comparison
20x faster Pandas by changing one line of code.
All-Reduce and Ring-Reduce for Model Synchronization in Multi-GPU Training
Two synchronization algorithms for intermediate-ML models.
Mixed Precision Training
Train large deep learning models efficiently.