Techniques for Optimizing PyTorch Models

Forward Automatic Differentiation -- Efficient when the output dimension $N_k$ is larger than the input dimension $N_0$ in the computational graph
Quantization -- Technique that uses lower-precision weights and activations to reduce computational load and speed up inference
Automatic Mixed Precision -- Matches an appropriate datatype to each operation to reduce runtime
Knowledge Distillation -- Method to transfer knowledge from a larger network to a smaller one
Profiling -- Measures the time and memory consumption of a model's operations
Pruning -- Method to reduce the number of model parameters without much loss in performance
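
Three of the techniques above can be sketched in a few lines each. The snippet below is a minimal illustration, not code taken from the notebooks: the toy function `f`, the layer sizes, the 50% pruning amount, the temperature of 2.0, and the helper name `distillation_loss` are all assumptions chosen for the example.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
import torch.autograd.forward_ad as fwAD
from torch.nn.utils import prune

torch.manual_seed(0)

# Forward-mode AD: one Jacobian-vector product in a single forward pass.
W = torch.randn(3, 2)  # hypothetical map from 2 inputs to 3 outputs

def f(x):
    return torch.tanh(W @ x)

x = torch.randn(2)
v = torch.tensor([1.0, 0.0])  # direction: derivative with respect to x[0]

with fwAD.dual_level():
    dual_x = fwAD.make_dual(x, v)              # attach tangent v to the input
    jvp = fwAD.unpack_dual(f(dual_x)).tangent  # J(x) @ v, shape (3,)

# Pruning: zero out the 50% of weights with the smallest absolute value.
layer = nn.Linear(8, 4)
prune.l1_unstructured(layer, name="weight", amount=0.5)
sparsity = (layer.weight == 0).float().mean().item()

# Knowledge distillation: KL divergence between temperature-softened outputs.
def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    log_p_student = F.log_softmax(student_logits / temperature, dim=-1)
    p_teacher = F.softmax(teacher_logits / temperature, dim=-1)
    # Scaling by T^2 keeps gradient magnitudes comparable across temperatures.
    return F.kl_div(log_p_student, p_teacher, reduction="batchmean") * temperature ** 2

student_logits = torch.randn(4, 10)
teacher_logits = torch.randn(4, 10)
kd_loss = distillation_loss(student_logits, teacher_logits)
```

In practice the distillation term is usually combined with an ordinary cross-entropy loss on the true labels, and a pruned layer can be made permanent with `prune.remove(layer, "weight")` once the mask is final.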

