Please use this identifier to cite or link to this item:
http://hdl.handle.net/10603/545753
Title: | Design of an algorithm to minimize speculative parallelism overhead in simultaneous multithreading on multicore chip architecture |
Researcher: | Sudhakar Kumar |
Guide(s): | Singh, Sunil K. and Aggarwal, Naveen |
Keywords: | Automatic Parallelization Hardware Transactional Memory LLVM Compiler Infrastructure Speculative Parallelization |
University: | Panjab University |
Completed Date: | 2023 |
Abstract: | Efficient software leveraging parallel architectures is crucial for optimal execution across multiple cores. Achieving this efficiency is challenging with automatic parallelization. Speculative parallelism offers an alternative by dividing sequential workloads into implicit threads for separate core execution, assuming no dependencies. Mis-speculation requires corrective action. Despite its benefits, this technique comes with various overheads that must be tackled. This research proposes two architectural frameworks, Efficient Speculative Parallelism Architectural Framework (ESPAF) and Novel Dependency Resolution Aware Framework (NDRAF), along with their corresponding algorithms, which aim to tackle overheads associated with speculative parallelization implementation. The ESPAF converts sequential workload into multi-threaded code efficiently, but it has a limitation in handling all types of loop dependencies. The NDRAF is specifically designed to address this issue and outperforms ESPAF in terms of performance gain. Hardware Transactional Memory (HTM) is utilized for hardware architectural support to handle data violations. To evaluate compatibility and scalability, four different versions of HTMs were integrated with ESPAF and NDRAF. Both architectural frameworks were implemented using the LLVM compiler infrastructure and evaluated on the Intel(R) Core(TM) i5-7400 CPU, using the SPEC2006 and SPEC2017 benchmark suites. The study found that NDRAF outperforms ESPAF, with a 1.81x computational speedup and 2.11x performance gain with NDRAF with hardware support. The Eager/Lazy/Timestamp (EL_T) HTM exhibits the best performance as the thread count rises. The results indicated that the proposed architectures, and algorithms with their hardware supports effectively minimized overhead, resulting in better performance compared to sequential algorithms. newline |
Pagination: | xviii, 175p. |
URI: | http://hdl.handle.net/10603/545753 |
Appears in Departments: | University Institute of Engineering and Technology |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
01_title.pdf | Attached File | 59.08 kB | Adobe PDF | View/Open |
02_prelim pages.pdf | 1.7 MB | Adobe PDF | View/Open | |
03_chapter 1.pdf | 1.1 MB | Adobe PDF | View/Open | |
04_chapter 2.pdf | 1.99 MB | Adobe PDF | View/Open | |
05_chapter 3.pdf | 1.83 MB | Adobe PDF | View/Open | |
06_chapter 4.pdf | 650.29 kB | Adobe PDF | View/Open | |
07_chapter 5.pdf | 469.12 kB | Adobe PDF | View/Open | |
08_chapter 6.pdf | 1.05 MB | Adobe PDF | View/Open | |
09_chapter 7.pdf | 177.06 kB | Adobe PDF | View/Open | |
10_annexures.pdf | 272.76 kB | Adobe PDF | View/Open | |
80_recommendation.pdf | 233.3 kB | Adobe PDF | View/Open |
Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
Altmetric Badge: