Report for: Euro-Par 2017: Parallel Processing

=> Thread: Batched BLAS "Optimized Batched Linear Algebra for Modern Architectures", Jack Dongarra, et al, Euro-Par 2017 --- "Fast Batched Matrix Multiplication for Small Sizes using Half Precision Arithmetic on GPUs", .., Jack Dongarra, IPDPS 2019 htt

05 Jul 2019

@lucasawilson @HPC_Guru @rightrelevance @suhaibkhan @IoanHadade @labriOfficial @Inria_Bordeaux https://t.co/N1wxfY9AZ2 , https://t.co/ro5MiHzQ7n and https://t.co/3W9yy54WMb

17 Apr 2018

=> [Webinar] Speed Up Small-Matrix Multiplication using New Intel Math Kernel Library Capabilities, Oct 18 2017 https://t.co/IadenlkB3v Compact/Batch DGEMM & SGEMM MKL Performance Benchmarks https://t.co/ouNR108ZYq https://t.co/TEPbwX3nr5 Batched BL

06 Mar 2018

RT @ogawa_tter: "Optimized Batched Linear Algebra for Modern Architectures", Jack Dongarra, et al, Euro-Par 2017 (Aug 1 2017) https://t.co/…

20 Dec 2017
