Euro-Par 2017: Parallel Processing

Overview of attention for book

Table of Contents

  1. Book Overview
  2. Chapter 1 Computing Just What You Need: Online Data Analysis and Reduction at Extreme Scales
  3. Chapter 2 Scaling Energy Adaptive Applications for Sustainable Profitability
  4. Chapter 3 Off-Road Performance Modeling – How to Deal with Segmented Data
  5. Chapter 4 Online Dynamic Monitoring of MPI Communications
  6. Chapter 5 Micro-benchmarking MPI Neighborhood Collective Operations
  7. Chapter 6 Performance Characterization of De Novo Genome Assembly on Leading Parallel Systems
  8. Chapter 7 NVIDIA Jetson Platform Characterization
  9. Chapter 8 Following the Blind Seer – Creating Better Performance Models Using Less Information
  10. Chapter 9 An Accurate Simulator of Cache-Line Conflicts to Exploit the Underlying Cache Performance
  11. Chapter 10 Shutdown Policies with Power Capping for Large Scale Computing Systems
  12. Chapter 11 Partitioning Strategy Selection for In-Memory Graph Pattern Matching on Multiprocessor Systems
  13. Chapter 12 Efficient Dynamic Pinning of Parallelized Applications by Reinforcement Learning with Applications
  14. Chapter 13 Accelerating by Idling: How Speculative Delays Improve Performance of Message-Oriented Systems
  15. Chapter 14 Using Simulation to Evaluate and Tune the Performance of Dynamic Load Balancing of an Over-Decomposed Geophysics Application
  16. Chapter 15 Optimizing Egalitarian Performance in the Side-Effects Model of Colocation for Data Center Resource Management
  17. Chapter 16 Generic Algorithms for Scheduling Applications on Hybrid Multi-core Machines
  18. Chapter 17 Low-Cost Approximation Algorithms for Scheduling Independent Tasks on Hybrid Platforms
  19. Chapter 18 Runtime-Assisted Shared Cache Insertion Policies Based on Re-reference Intervals
  20. Chapter 19 Rewriting System for Profile-Guided Data Layout Transformations on Binaries
  21. Chapter 20 Hardware Support for Scratchpad Memory Transactions on GPU Architectures
  22. Chapter 21 Execution of Recursive Queries in Apache Spark
  23. Chapter 22 Replica-Aware Partitioning Design in Parallel Database Systems
  24. Chapter 23 A Simplified Model for Simulating the Execution of a Workflow in Cloud
  25. Chapter 24 Dealing with Performance Unpredictability in an Asymmetric Multicore Processor Cloud
  26. Chapter 25 Deadline-Aware Deployment for Time Critical Applications in Clouds
  27. Chapter 26 More Sharing, More Benefits? A Study of Library Sharing in Container-Based Infrastructures
  28. Chapter 27 An Efficient Communication Aware Heuristic for Multiple Cloud Application Placement
  29. Chapter 28 Energy-Driven Straggler Mitigation in MapReduce
  30. Chapter 29 Leveraging Cloud Heterogeneity for Cost-Efficient Execution of Parallel Applications
  31. Chapter 30 A Consensus-Based Fault-Tolerant Event Logger for High Performance Applications
  32. Chapter 31 Families of Graph Algorithms: SSSP Case Study
  33. Chapter 32 SEMem: Deployment of MPI-Based In-Memory Storage for Hadoop on Supercomputers
  34. Chapter 33 Supporting the Xeon Phi Coprocessor in a Heterogeneous Programming Model
  35. Chapter 34 GLT: A Unified API for Lightweight Thread Libraries
  36. Chapter 35 PASCAL: A Parallel Algorithmic SCALable Framework for N-body Problems
  37. Chapter 36 GASPI/GPI In-memory Checkpointing Library
  38. Chapter 37 Optimized Batched Linear Algebra for Modern Architectures
  39. Chapter 38 New Efficient General Sparse Matrix Formats for Parallel SpMV Operations
  40. Chapter 39 Lazy Parallel Kronecker Algebra-Operations on Heterogeneous Multicores
  41. Chapter 40 Performance Evaluation of Computation and Communication Kernels of the Fast Multipole Method on Intel Manycore Architecture
  42. Chapter 41 Efficient Non-blocking Radix Trees
  43. Chapter 42 A Concurrency-Optimal Binary Search Tree
  44. Chapter 43 Scalable Fine-Grained Metric-Based Remeshing Algorithm for Manycore/NUMA Architectures
  45. Chapter 44 Performance Evaluation of Thread-Level Speculation in Off-the-Shelf Hardware Transactional Memories
  46. Chapter 45 Addressing Volume and Latency Overheads in 1D-parallel Sparse Matrix-Vector Multiplication
  47. Chapter 46 Improving the Network of Search Engine Services Through Application-Driven Routing
  48. Chapter 47 Accelerating the Tucker Decomposition with Compressed Sparse Tensors
  49. Chapter 48 Shared Memory Pipelined Parareal
  50. Chapter 49 Nonintrusive AMR Asynchrony for Communication Optimization
  51. Chapter 50 Balanced CSR Sparse Matrix-Vector Product on Graphics Processors
  52. Chapter 51 To Distribute or Not to Distribute: The Question of Load Balancing for Performance or Energy
Attention for Chapter 37: Optimized Batched Linear Algebra for Modern Architectures
Mentioned by

11 X users

Citations

4 Dimensions

Readers on

7 Mendeley
Chapter title
Optimized Batched Linear Algebra for Modern Architectures
Chapter number 37
Book title
Euro-Par 2017: Parallel Processing
Published by
Springer, Cham, August 2017
DOI 10.1007/978-3-319-64203-1_37
Book ISBNs
978-3-31-964202-4, 978-3-31-964203-1
Authors

Jack Dongarra, Sven Hammarling, Nicholas J. Higham, Samuel D. Relton, Mawussi Zounon

X Demographics

The data were collected from the profiles of 11 X users who shared this research output.
Mendeley readers

The data shown below were compiled from readership statistics for 7 Mendeley readers of this research output.

Geographical breakdown

Country    Count    As %
Unknown    7        100%

Demographic breakdown

Readers by professional status    Count    As %
Researcher                        3        43%
Student > Bachelor                1        14%
Student > Doctoral Student        1        14%
Student > Master                  1        14%
Unknown                           1        14%

Readers by discipline             Count    As %
Computer Science                  3        43%
Materials Science                 1        14%
Medicine and Dentistry            1        14%
Engineering                       1        14%
Unknown                           1        14%