Engineering Best Practices
A collection of 19 posts
5 Must-Know Ways to Test ML Models in Production (Implementation Included)
A beginner-friendly guide to model testing.
Spark DataFrames and Big Data ML using PySpark on Databricks
A completely hands-on and beginner-friendly deep dive on PySpark using Databricks.
A Practical Guide to Scaling ML Model Training
GPUs - GPU Clusters - Distributed Training.
Quantization: Optimize ML Models to Run Them on Tiny Hardware
A must-know skill for ML engineers to reduce model footprint and inference time.
How to Structure Your Code for Machine Learning Development
A highly overlooked yet critical skill for data scientists.
Develop an Elegant Testing Framework For Data Science Projects Using Pytest
A Comprehensive Guide to Pytest for data scientists.
A Beginner-friendly Guide to Multi-GPU Model Training
Models are becoming bigger and bigger. Learn how to scale models using distributed training.
Object-Oriented Programming with Python for Data Scientists
A beginner to advanced guide for Python OOP.
11 Powerful Techniques To Supercharge Your ML Models
Take your ML models to the next level with 11 lesser-known techniques.
Implementing Parallelized CUDA Programs From Scratch Using CUDA Programming
A beginner-friendly guide for curious minds who don't know the internal workings of model.cuda().
8 Fatal (Yet Non-obvious) Pitfalls and Cautionary Measures in Data Science
8 data science lesson I wish I had known earlier.
Model Compression: A Critical Step Towards Efficient Machine Learning
Four critical ways to reduce model footprint and inference time.
Deploy, Version Control, and Manage ML Models Right From Your Jupyter Notebook with Modelbit
Deployment has possibly never been so simple.
PyTorch Models Are Not Deployment-Friendly! Supercharge Them With TorchScript.
Eliminating the dependence of PyTorch models on Python.
Optimize Your ML Development and Operations with MLflow
The guide that every data scientist must read to manage ML experiments like a pro.
How to Streamline Your Machine Learning Workflow With DVC
The guide that every data scientist must read to manage ML experiments like a pro.
You Cannot Build Large Data Projects Until You Learn Data Version Control!
The underappreciated, yet critical, skill that most data scientists overlook.
Sklearn Models are Not Deployment Friendly! Supercharge Them With Tensor Computations.
Speed up sklearn model inference up to 50x with GPU support.