Advances in GPU Research and Practice - 1st Edition - ISBN: 9780128037386, 9780128037881

Advances in GPU Research and Practice

1st Edition

Authors: Hamid Sarbazi-Azad
eBook ISBN: 9780128037881
Paperback ISBN: 9780128037386
Imprint: Morgan Kaufmann
Published Date: 6th September 2016
Page Count: 774
Tax/VAT will be calculated at check-out Price includes VAT (GST)
30% off
30% off
30% off
30% off
30% off
20% off
20% off
30% off
30% off
30% off
30% off
30% off
20% off
20% off
30% off
30% off
30% off
30% off
30% off
20% off
20% off
30% off
30% off
30% off
30% off
30% off
20% off
20% off
79.00
55.30
55.30
55.30
55.30
55.30
63.20
63.20
92.95
65.06
65.06
65.06
65.06
65.06
74.36
74.36
130.00
91.00
91.00
91.00
91.00
91.00
104.00
104.00
139.94
97.96
97.96
97.96
97.96
97.96
111.95
111.95
Unavailable
Price includes VAT (GST)
× DRM-Free

Easy - Download and start reading immediately. There’s no activation process to access eBooks; all eBooks are fully searchable, and enabled for copying, pasting, and printing.

Flexible - Read on multiple operating systems and devices. Easily read eBooks on smart phones, computers, or any eBook readers, including Kindle.

Open - Buy once, receive and download all available eBook formats, including PDF, EPUB, and Mobi (for Kindle).

Institutional Access

Secure Checkout

Personal information is secured with SSL technology.

Free Shipping

Free global shipping
No minimum order.

Table of Contents

  • Dedication
  • List of Contributors
  • Preface
  • Acknowledgments
  • Part 1: Programming and Tools
    • Chapter 1: Formal analysis techniques for reliable GPU programming: current solutions and call to action
      • Abstract
      • Acknowledgments
      • 1 GPUs in Support of Parallel Computing
      • 2 A quick introduction to GPUs
      • 3 Correctness issues in GPU programming
      • 4 The need for effective tools
      • 5 Call to Action
    • Chapter 2: SnuCL: A unified OpenCL framework for heterogeneous clusters
      • Abstract
      • Acknowledgments
      • 1 Introduction
      • 2 OpenCL
      • 3 Overview of SnuCL framework
      • 4 Memory management in SnuCL Cluster
      • 5 SnuCL extensions to OpenCL
      • 6 Performance evaluation
      • 7 Conclusions
    • Chapter 3: Thread communication and synchronization on massively parallel GPUs
      • Abstract
      • 1 Introduction
      • 2 Coarse-Grained Communication and Synchronization
      • 3 Built-In Atomic Functions on Regular Variables
      • 4 Fine-Grained Communication and Synchronization
      • 5 Conclusion and Future Research Direction
    • Chapter 4: Software-level task scheduling on GPUs
      • Abstract
      • Acknowledgments
      • 1 Introduction, Problem Statement, and Context
      • 2 Nondeterministic behaviors caused by the hardware
      • 3 SM-centric transformation
      • 4 Scheduling-enabled optimizations
      • 5 Other scheduling work on GPUs
      • 6 Conclusion and future work
    • Chapter 5: Data placement on GPUs
      • Abstract
      • 1 Introduction
      • 2 Overview
      • 3 Memory specification through MSL
      • 4 Compiler support
      • 5 Runtime support
      • 6 Results
      • 7 Related work
      • 8 Summary
  • Part 2: Algorithms and Applications
    • Chapter 6: Biological sequence analysis on GPU
      • Abstract
      • 1 Introduction
      • 2 Pairwise Sequence Comparison and Sequence-Profile Comparison
      • 3 Design aspects of GPU solutions for biological sequence analysis
      • 4 GPU Solutions for Pairwise Sequence Comparison
      • 5 GPU Solutions for Sequence-Profile Comparison
      • 6 Conclusion and perspectives
    • Chapter 7: Graph algorithms on GPUs
      • Abstract
      • 1 Graph representation for GPUs
      • 2 Graph traversal algorithms: the breadth first search (BFS)
      • 3 The single-source shortest path (SSSP) problem
      • 4 The APSP problem
      • 5 Load Balancing and Memory Accesses: Issues and Management Techniques
    • Chapter 8: GPU alignment of two and three sequences
      • Abstract
      • 1 Introduction
      • 2 GPU architecture
      • 3 Pairwise alignment
      • 4 Alignment of three sequences
      • 5 Conclusion
    • Chapter 9: Augmented Block Cimmino Distributed Algorithm for solving tridiagonal systems on GPU
      • Abstract
      • 1 Introduction
      • 2 ABCD Solver for tridiagonal systems
      • 3 GPU implementation and optimization
      • 4 Performance evaluation
      • 5 Conclusion and future work
    • Chapter 10: GPU computing applied to linear and mixed-integer programming
      • Abstract
      • Acknowledgments
      • 1 Introduction
      • 2 Operations Research in Practice
      • 3 Exact Optimization Algorithms
      • 4 Metaheuristics
      • 5 Conclusions
      • Conflicts of Interest
    • Chapter 11: GPU-accelerated shortest paths computations for planar graphs
      • Abstract
      • 1 Introduction
      • 2 Related work
      • 3 Partitioned Approaches
      • 4 Computational Complexity Analysis
      • 5 Experiments and results
      • About the Authors
    • Chapter 12: GPU sorting algorithms
      • Abstract
      • 1 Introduction
      • 2 Generic Programming Strategies for GPU
      • 3 Sorting algorithms
    • Chapter 13: MPC: An effective floating-point compression algorithm for GPUs
      • Abstract
      • Acknowledgments
      • 1 Introduction
      • 2 Methodology
      • 3 Experimental results
      • 4 Summary and Conclusions
    • Chapter 14: Adaptive sparse matrix representation for efficient matrix-vector multiplication
      • Abstract
      • 1 Introduction
      • 2 Sparse matrix-vector multiplication
      • 3 GPU architecture and programming model
      • 4 Optimization principles for SpMV
      • 5 Platform (Adaptive Runtime System)
      • 6 Results and analysis
      • 7 Summary
  • Part 3: Architecture and Performance
    • Chapter 15: A framework for accelerating bottlenecks in GPU execution with assist warps
      • Abstract
      • Acknowledgments
      • 1 Introduction
      • 2 Background
      • 3 Motivation
      • 4 The CABA Framework
      • 5 A Case for CABA: Data Compression
      • 6 Methodology
      • 7 Results
      • 8 Other Uses of the CABA Framework
      • 9 Related Work
      • 10 Conclusion
    • Chapter 16: Accelerating GPU accelerators through neural algorithmic transformation
      • Abstract
      • 1 Introduction
      • 2 Neural transformation for GPUs
      • 3 Instruction-set-architecture design
      • 4 Neural accelerator: design and integration
      • 5 Controlling quality trade-offs
      • 6 Evaluation
      • 7 Related work
      • 8 Conclusion
    • Chapter 17: The need for heterogeneous network-on-chip architectures with GPGPUs: A case study with photonic interconnects
      • Abstract
      • 1 Introduction
      • 2 Background
      • 3 The Need for Heterogeneous Interconnections
      • 4 Characterization of GPGPU Performance
      • 5 Conclusion
    • Chapter 18: Accurately modeling GPGPU frequency scaling with the CRISP performance model
      • Abstract
      • Acknowledgments
      • 1 Introduction
      • 2 Motivation and related work
      • 3 GPGPU DVFS performance model
      • 4 Methodology
      • 5 Results
      • 6 Conclusion
  • Part 4: Power and Reliability
    • Chapter 19: Energy and power considerations of GPUs
      • Abstract
      • 1 Introduction
      • 2 Evaluation methodology
      • 3 Power profiling of regular and irregular programs
      • 4 Affecting power and energy on GPUs
      • 5 Summary
      • Appendix
      • About the authors
    • Chapter 20: Architecting the last-level cache for GPUs using STT-MRAM nonvolatile memory
      • Abstract
      • 1 Introduction
      • 2 Background
      • 3 Related Work
      • 4 Two-Part L2 Cache Architecture
      • 5 Dynamic Write Threshold Detection Mechanism
      • 6 Implementation
      • 7 Evaluation Result
      • 8 Conclusion
    • Chapter 21: Power management of mobile GPUs
      • Abstract
      • Acknowledgments
      • 1 Introduction
      • 2 GPU Power Management for Mobile Games
      • 3 GPU Power Management for GPGPU Applications
      • 4 Future Outlook
      • 5 Conclusions
    • Chapter 22: Advances in GPU reliability research
      • Abstract
      • 1 Introduction
      • 2 Evaluating GPU Reliability
      • 3 Hardware Reliability Enhancements
      • 4 Software Reliability Enhancements
      • 5 Summary
    • Chapter 23: Addressing hardware reliability challenges in general-purpose GPUs
      • Abstract
      • 1 Introduction
      • 2 GPGPUs Architecture
      • 3 Modeling and Characterizing GPGPUs Reliability in the Presence of Soft Errors [25]
      • 4 RISE: Improving the Streaming Processors’ Reliability Against Soft Errors in GPGPUs [36]
      • 5 Mitigating the Susceptibility of GPGPUs to PVs [43]
  • Author Index
  • Subject Index

Description

Advances in GPU Research and Practice focuses on research and practices in GPU based systems. The topics treated cover a range of issues, ranging from hardware and architectural issues, to high level issues, such as application systems, parallel programming, middleware, and power and energy issues.

Divided into six parts, this edited volume provides the latest research on GPU computing. Part I: Architectural Solutions focuses on the architectural topics that improve on performance of GPUs, Part II: System Software discusses OS, compilers, libraries, programming environment, languages, and paradigms that are proposed and analyzed to help and support GPU programmers. Part III: Power and Reliability Issues covers different aspects of energy, power, and reliability concerns in GPUs. Part IV: Performance Analysis illustrates mathematical and analytical techniques to predict different performance metrics in GPUs. Part V: Algorithms presents how to design efficient algorithms and analyze their complexity for GPUs. Part VI: Applications and Related Topics provides use cases and examples of how GPUs are used across many sectors.

Key Features

  • Discusses how to maximize power and obtain peak reliability when designing, building, and using GPUs
  • Covers system software (OS, compilers), programming environments, languages, and paradigms proposed to help and support GPU programmers
  • Explains how to use mathematical and analytical techniques to predict different performance metrics in GPUs
  • Illustrates the design of efficient GPU algorithms in areas such as bioinformatics, complex systems, social networks, and cryptography
  • Provides applications and use case scenarios in several different verticals, including medicine, social sciences, image processing, and telecommunications

Readership

Researchers in academia and graduate students in Computer Science as well as Practitioners and engineers in industry


Details

No. of pages:
774
Language:
English
Copyright:
© Morgan Kaufmann 2017
Published:
Imprint:
Morgan Kaufmann
eBook ISBN:
9780128037881
Paperback ISBN:
9780128037386

About the Authors

Hamid Sarbazi-Azad Author

Professor Hamid Sarbazi-Azad, Ph.D. is currently a professor at the department of computer engineering of Sharif University of Technology, and heads the IPM School of Computer Science, Tehran, Iran. His research interests include high-performance computer architectures, NoCs and SoCs, multicore systems, GPUs, and memory architectures. He is an associate editor/editorial board member of several journals including IEEE Transactions on Computers and ACM Computing Surveys. He received his PhD in computing science from the University of Glasgow, Glasgow, UK, in 2002

Affiliations and Expertise

Sharif University of Technology, Tehran, Iran