Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/545753
Title: Design of an algorithm to minimize speculative parallelism overhead in simultaneous multithreading on multicore chip architecture
Researcher: Sudhakar Kumar
Guide(s): Singh, Sunil K. and Aggarwal, Naveen
Keywords: Automatic Parallelization
Hardware Transactional Memory
LLVM Compiler Infrastructure
Speculative Parallelization
University: Panjab University
Completed Date: 2023
Abstract: Efficient software leveraging parallel architectures is crucial for optimal execution across multiple cores. Achieving this efficiency is challenging with automatic parallelization. Speculative parallelism offers an alternative by dividing sequential workloads into implicit threads for separate core execution, assuming no dependencies. Mis-speculation requires corrective action. Despite its benefits, this technique comes with various overheads that must be tackled. This research proposes two architectural frameworks, Efficient Speculative Parallelism Architectural Framework (ESPAF) and Novel Dependency Resolution Aware Framework (NDRAF), along with their corresponding algorithms, which aim to tackle overheads associated with speculative parallelization implementation. The ESPAF converts sequential workload into multi-threaded code efficiently, but it has a limitation in handling all types of loop dependencies. The NDRAF is specifically designed to address this issue and outperforms ESPAF in terms of performance gain. Hardware Transactional Memory (HTM) is utilized for hardware architectural support to handle data violations. To evaluate compatibility and scalability, four different versions of HTMs were integrated with ESPAF and NDRAF. Both architectural frameworks were implemented using the LLVM compiler infrastructure and evaluated on the Intel(R) Core(TM) i5-7400 CPU, using the SPEC2006 and SPEC2017 benchmark suites. The study found that NDRAF outperforms ESPAF, with a 1.81x computational speedup and 2.11x performance gain with NDRAF with hardware support. The Eager/Lazy/Timestamp (EL_T) HTM exhibits the best performance as the thread count rises. The results indicated that the proposed architectures, and algorithms with their hardware supports effectively minimized overhead, resulting in better performance compared to sequential algorithms. newline
Pagination: xviii, 175p.
URI: http://hdl.handle.net/10603/545753
Appears in Departments:University Institute of Engineering and Technology

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File59.08 kBAdobe PDFView/Open
02_prelim pages.pdf1.7 MBAdobe PDFView/Open
03_chapter 1.pdf1.1 MBAdobe PDFView/Open
04_chapter 2.pdf1.99 MBAdobe PDFView/Open
05_chapter 3.pdf1.83 MBAdobe PDFView/Open
06_chapter 4.pdf650.29 kBAdobe PDFView/Open
07_chapter 5.pdf469.12 kBAdobe PDFView/Open
08_chapter 6.pdf1.05 MBAdobe PDFView/Open
09_chapter 7.pdf177.06 kBAdobe PDFView/Open
10_annexures.pdf272.76 kBAdobe PDFView/Open
80_recommendation.pdf233.3 kBAdobe PDFView/Open
Show full item record


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: