31 min read
Model Compression: A Step Towards Efficient Machine Learning
Model accuracy alone (or an equivalent performance metric) rarely determines which model will be deployed. Much of the engineering effort goes into making the model production-friendly. Because typically, the model that gets shipped is NEVER solely determined by performance — a misconception that many have. Instead, we also consider several operational