Automatically Selecting Profitable Thread Block Sizes Using Machine Learning

Connors, Tiffany A.

Automatically Selecting Profitable Thread Block Sizes Using Machine Learning

Files

Connors,TiffanyFinal.pdf (641 KB)

Date

2017-05

Authors

Connors, Tiffany A.

Abstract

Graphics processing units (GPUs) provide high performance at low power consumption as long as resources are well utilized. Thread block size is one factor in determining a kernel's occupancy, which is a metric for measuring GPU utilization. A general guideline is to find the block size that leads to the highest occupancy. However, many combinations of block and grid sizes can provide highest occupancy, but performance can vary significantly between different configurations. This is because variation in thread structure yields different utilization of hardware resources. Thus, optimizing for occupancy alone is insufficient and thread structure must also be considered. It is the programmer's responsibility to set block size, but selecting the right size is not always intuitive. In this paper, we propose using machine learning to automatically select profitable block sizes. Additionally, we show that machine learning techniques coupled with performance counters can provide insight into the underlying reasons for performance variance between different configurations.

Keywords

machine learning, optimization, performance tuning, auto-tuning, GPU, performance heuristics, supervised machine learning, Honors College

Citation

Connors, T. A. (2017). Automatically selecting profitable thread block sizes using machine learning (Unpublished thesis). Texas State University, San Marcos, Texas.

URI

https://hdl.handle.net/10877/6731

Collections

Honors College Capstones

Full item page

Automatically Selecting Profitable Thread Block Sizes Using Machine Learning

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

Rights

Rights Holder

Rights License

Rights URI

URI

Collections