PDP: Parameter-free Differentiable Pruning is All You Need Apple Machine Learning Research
DNN pruning is a popular way to reduce the size of a model, improve the inference latency, and minimize the power consumption on DNN accelerators. However, existing approaches might be too complex, expensive or ineffective to apply to a variety of vision/language tasks, DNN architectures… Read More »PDP: Parameter-free Differentiable Pruning is All You Need Apple Machine Learning Research