Accelerating Data Transfer for Throughput Processors.

Jamshidi, Davoud

Accelerating Data Transfer for Throughput Processors.

dc.contributor.author	Jamshidi, Davoud
dc.date.accessioned	2017-01-26T22:22:38Z
dc.date.available	2017-01-26T22:22:38Z
dc.date.issued	2016
dc.date.submitted	2016
dc.identifier.uri	https://hdl.handle.net/2027.42/135935
dc.description.abstract	Graphics processing units (GPUs) have become prevalent in modern computing systems. While their highly parallel architectures are traditionally used as accelerators for rendering graphics, GPUs are also adept at handling data parallel workloads when provided large blocks of data for processing. Extracting performance from a GPU requires the programmer to provide enough work to keep the device fully utilized. Unlike CPUs, which are highly optimized to reduce memory access latency, GPUs are optimized for throughput and tend to have high access latency. The naive approach to obtaining performance is to provide a GPU with hundreds to thousands of threads so that some threads will be able to perform computation while others are waiting for data to arrive. This approach, however, cannot guarantee that there will always be enough computation that can hide the long latency of off-chip memory access. Common memory access patterns on GPUs further complicate code optimization. These patterns include streaming data that is only used once, tiling data in scratchpad memories to preserve locality and share data among many threads, and irregular accesses where neighboring threads access divergent memory locations. Limitations posed by the microarchitecture of modern GPU cores can hinder the GPUs ability to effectively hide memory access latency. This in turn limits GPU throughput and slows down execution of code on GPUs. This thesis proposes architectural modifications to GPUs that address the issues and inefficiencies posed by these access patterns through the decoupling of memory requests from threads, the execution pipeline, and limited memory system resources. For streaming accesses, instead of threads requesting their own data, data is delivered to threads in a manner that better utilizes available memory bandwidth. Tiled accesses are offloaded to specialized hardware that implements direct memory access for GPUs, freeing computation resources from generating tile addresses and improving tile transfer times. The cost of divergence in irregular patterns is ameliorated using a scatter-gather mechanism distributed across the memory subsystem, which reduces traffic across the on-chip interconnect. The proposed modifications effectively improve throughput, boosting kernel performance on average by 1.23x, 1.36x, and 1.29x, for streaming, tiled, and irregular accesses respectively.
dc.language.iso	en_US
dc.subject	computer architecture
dc.subject	Graphics Processing Unit (GPU) architecture
dc.title	Accelerating Data Transfer for Throughput Processors.
dc.type	Thesis	en_US
dc.description.thesisdegreename	PhD	en_US
dc.description.thesisdegreediscipline	Computer Science & Engineering
dc.description.thesisdegreegrantor	University of Michigan, Horace H. Rackham School of Graduate Studies
dc.contributor.committeemember	Mahlke, Scott
dc.contributor.committeemember	Dick, Robert
dc.contributor.committeemember	Austin, Todd M
dc.contributor.committeemember	Dreslinski Jr, Ronald
dc.subject.hlbsecondlevel	Computer Science
dc.subject.hlbtoplevel	Engineering
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/135935/1/ajamshid_1.pdf
dc.owningcollname	Dissertations and Theses (Ph.D. and Master's)

Files in this item

Name:: ajamshid_1.pdf
Size:: 3.767MB
Format:: PDF
Description:: Access Restricted to UM users only.

View/Open

Dissertations and Theses (Ph.D. and Master's)

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.