Vol. 2 No. 2 (2022): Journal of AI-Assisted Scientific Discovery
Articles

Model Compression for Efficient Deployment: Analyzing model compression techniques to reduce the size of machine learning models for efficient deployment on resource-constrained devices

Dr. Fatima Hassan
Professor of AI-driven Healthcare Analytics, University of Cape Town, South Africa
Cover

Published 31-12-2022

Keywords

  • Model Compression,
  • Machine Learning,
  • Quantization,
  • Pruning,
  • Knowledge Distillation

How to Cite

[1]
D. F. Hassan, “Model Compression for Efficient Deployment: Analyzing model compression techniques to reduce the size of machine learning models for efficient deployment on resource-constrained devices”, Journal of AI-Assisted Scientific Discovery, vol. 2, no. 2, pp. 1–10, Dec. 2022, Accessed: Sep. 19, 2024. [Online]. Available: https://scienceacadpress.com/index.php/jaasd/article/view/13

Abstract

Model compression techniques play a crucial role in deploying machine learning models on resource-constrained devices such as smartphones, IoT devices, and edge devices. These techniques aim to reduce the size of the models while maintaining their performance. This paper provides an overview of various model compression techniques, including quantization, pruning, knowledge distillation, and compact architectures. We analyze the effectiveness of these techniques in terms of model size reduction, inference speedup, and memory footprint reduction. We also discuss the trade-offs between model size reduction and performance degradation. Additionally, we examine the challenges and future directions of model compression for efficient deployment.

Downloads

Download data is not yet available.

References

  1. Sasidharan Pillai, Aravind. “Utilizing Deep Learning in Medical Image Analysis for Enhanced Diagnostic Accuracy and Patient Care: Challenges, Opportunities, and Ethical Implications”. Journal of Deep Learning in Genomic Data Analysis 1.1 (2021): 1-17.
  2. Pulimamidi, Rahul. "Emerging Technological Trends for Enhancing Healthcare Access in Remote Areas." Journal of Science & Technology 2.4 (2021): 53-62.
  3. Pulimamidi, Rahul. "Leveraging IoT Devices for Improved Healthcare Accessibility in Remote Areas: An Exploration of Emerging Trends." Internet of Things and Edge Computing Journal 2.1 (2022): 20-30.
  4. Reddy, Surendranadha Reddy Byrapu. "Predictive Analytics in Customer Relationship Management: Utilizing Big Data and AI to Drive Personalized Marketing Strategies." Australian Journal of Machine Learning Research & Applications 1.1 (2021): 1-12.
  5. Raparthi, Mohan, et al. "Data Science in Healthcare Leveraging AI for Predictive Analytics and Personalized Patient Care." Journal of AI in Healthcare and Medicine 2.2 (2022): 1-11.
  6. Pillai, Aravind Sasidharan. "A Natural Language Processing Approach to Grouping Students by Shared Interests." Journal of Empirical Social Science Studies 6.1 (2022): 1-16.