Abstract: This paper focuses on developing an improved arithmetic optimization algorithm to achieve better convergence during exploration and exploitation phases. The proposed algorithm has been ...
Abstract: Fixed-point quantization techniques have attracted considerable attention in deep neural network (DNN) inference acceleration. Nevertheless, they often require time-consuming fine-tuning or ...
The setup for testing and evaluating our code is based on the framework provided by the pqm4 project.