Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/611432
Title: Neural Network Model Compression for Edge AI Through Weight Approximation Exponent Sharing and Retraining
Researcher: Kashikar, Prachi
Guide(s): Sinha, Sharad
Keywords: Computer Science
Computer Science Artificial Intelligence
Engineering and Technology
University: Indian Institute of Technology Goa
Completed Date: 2024
Abstract: The rise of edge AI (Artificial Intelligence) applications has brought exciting possibilities for tasks like computer vision and natural language processing on resource-constrained devices. These devices, often with limited memory and battery power, struggle to run large traditional neural networks. To address this challenge, model compression techniques have become a major focus of research. Existing methods like quantization, weight sharing, and pruning can achieve significant size reduction, but often at the cost of some accuracy loss. This trade-o! between size and accuracy becomes a critical bottleneck for deploying these powerful models on edge devices. newline
Pagination: 
URI: http://hdl.handle.net/10603/611432
Appears in Departments:School of Mathematics and Computer Science

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File436.18 kBAdobe PDFView/Open
02_prelim pages.pdf1.01 MBAdobe PDFView/Open
03_content.pdf241.05 kBAdobe PDFView/Open
04_abstract.pdf141.65 kBAdobe PDFView/Open
05_chapter 1.pdf563.95 kBAdobe PDFView/Open
06_chapter 2.pdf318.03 kBAdobe PDFView/Open
07_chapter 3.pdf702.45 kBAdobe PDFView/Open
08_chapter 4.pdf1.45 MBAdobe PDFView/Open
09_chapter 5.pdf1.22 MBAdobe PDFView/Open
10_chapter 6.pdf174.7 kBAdobe PDFView/Open
11_annexures.pdf260.5 kBAdobe PDFView/Open
80_recommendation.pdf597.21 kBAdobe PDFView/Open
90_plagiarism_report.pdf9.53 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial 4.0 International (CC BY-NC 4.0).

Altmetric Badge: