Cost-informed Model Choice for On-device AI Applications
Keywords:
On-Device AI, Neural Network Quantization, Mobile AI Optimization, Energy-Efficient Inference, Edge Computing ConstraintsAbstract
On-device AI deployment presents a set of unique challenges that are fundamentally different from thoseof cloud-based systems. These challenges necessitate specialized optimization approaches. Thedeployment must strike a balance between user experience and stringent resource constraints, including privacy preservation, energy
References
Andrew G. Howard et al., "MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications," ResearchGate, April 2017.
https://www.researchgate.net/publication/316184205_MobileNets_Efficient_Convolutional_Neural_Net works_for_Mobile_Vision_Applications


