StarPU

Publications List

Publications of year 2016

Thesis

  1. Marc Sergent. Scalability of a task-based runtime system for dense linear algebra applications. PhD thesis, Université de Bordeaux, December 2016. [WWW] [PDF] Keyword(s): On MPI Support, High performance computing, Run-time systems, Distributed computing, Task-based programming, Parallel programming models, Calcul haute performance, Supports d'exécution, Calcul distribué, Programmation par tâches, Modèles de programmation parallèle.
    @phdthesis{sergent:tel-01483666,
    TITLE = {{Scalability of a task-based runtime system for dense linear algebra applications}},
    AUTHOR = {Sergent, Marc},
    URL = {https://tel.archives-ouvertes.fr/tel-01483666},
    NUMBER = {2016BORD0372},
    SCHOOL = {{Universit{\'e} de Bordeaux}},
    YEAR = {2016},
    MONTH = Dec,
    KEYWORDS = {On MPI Support ; High performance computing ; Run-time systems ; Distributed computing ; Task-based programming ; Parallel programming models ; Calcul haute performance ; Supports d'ex{\'e}cution ; Calcul distribu{\'e} ; Programmation par t{\^a}ches ; Mod{\`e}les de programmation parall{\`e}le},
    PDF = {https://tel.archives-ouvertes.fr/tel-01483666/file/SERGENT_MARC_2016.pdf},
    HAL_ID = {tel-01483666},
    HAL_VERSION = {v1},
    
    }
    


Conference articles

  1. Emmanuel Agullo, Olivier Beaumont, Lionel Eyraud-Dubois, and Suraj Kumar. Are Static Schedules so Bad ? A Case Study on Cholesky Factorization. In IPDPS'16, Proceedings of the 30th IEEE International Parallel & Distributed Processing Symposium, IPDPS'16, Chicago, IL, United States, May 2016. IEEE. [WWW] [PDF] Keyword(s): On Scheduling, Cholesky Factorization, Accelerators, Heterogeneous Systems, Runtime Systems, Scheduling, Unrelated Machines.
    @inproceedings{agullo:hal-01223573,
    TITLE = {{Are Static Schedules so Bad ? A Case Study on Cholesky Factorization}},
    AUTHOR = {Agullo, Emmanuel and Beaumont, Olivier and Eyraud-Dubois, Lionel and Kumar, Suraj},
    URL = {https://hal.inria.fr/hal-01223573},
    BOOKTITLE = {{IPDPS'16}},
    ADDRESS = {Chicago, IL, United States},
    PUBLISHER = {{IEEE}},
    SERIES = {Proceedings of the 30th IEEE International Parallel \& Distributed Processing Symposium, IPDPS'16},
    YEAR = {2016},
    MONTH = May,
    keywords = {On Scheduling; Cholesky Factorization ; Accelerators ; Heterogeneous Systems ; Runtime Systems; Scheduling ; Unrelated Machines},
    PDF = {https://hal.inria.fr/hal-01223573/file/heteroprioCameraReady-ieeeCompatiable.pdf},
    HAL_ID = {hal-01223573},
    HAL_VERSION = {v2},
    
    }
    


  2. Olivier Beaumont, Terry Cojean, Lionel Eyraud-Dubois, Abdou Guermouche, and Suraj Kumar. Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources. In International Conference on High Performance Computing, Data, and Analytics (HiPC), Hyderabad, India, December 2016. [WWW] [PDF] Keyword(s): On Scheduling, STARPU, Scheduling, Linear Algebra, Heterogeneous Platforms, Task-based Scheduling, Cholesky Factorization, Simulation, Resource Aggregation.
    @inproceedings{beaumont:hal-01361992,
    TITLE = {{Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources}},
    AUTHOR = {Beaumont, Olivier and Cojean, Terry and Eyraud-Dubois, Lionel and Guermouche, Abdou and Kumar, Suraj},
    URL = {https://hal.inria.fr/hal-01361992},
    BOOKTITLE = {{International Conference on High Performance Computing, Data, and Analytics (HiPC)}},
    ADDRESS = {Hyderabad, India},
    YEAR = {2016},
    MONTH = Dec,
    KEYWORDS = {On Scheduling; STARPU ; Scheduling ; Linear Algebra ; Heterogeneous Platforms ; Task-based Scheduling ; Cholesky Factorization ; Simulation ; Resource Aggregation},
    PDF = {https://hal.inria.fr/hal-01361992v2/document},
    HAL_ID = {hal-01361992},
    HAL_VERSION = {v1},
    
    }
    


  3. Terry Cojean, Abdou Guermouche, Andra Hugo, Raymond Namyst, and Pierre-André Wacrenier. Resource aggregation for task-based Cholesky Factorization on top of heterogeneous machines. In HeteroPar'2016 workshop of Euro-Par, Grenoble, France, August 2016. [WWW] [PDF] Keyword(s): On Scheduling, dense linear algebra, Cholesky, Multicore, accelerator, GPU, heterogeneous computing, task DAG, runtime system.
    @inproceedings{cojean:hal-01181135,
    TITLE = {{Resource aggregation for task-based Cholesky Factorization on top of heterogeneous machines}},
    AUTHOR = {Cojean, Terry and Guermouche, Abdou and Hugo, Andra and Namyst, Raymond and Wacrenier, Pierre-Andr{\'e}},
    URL = {https://hal.inria.fr/hal-01181135},
    BOOKTITLE = {{HeteroPar'2016 workshop of Euro-Par}},
    ADDRESS = {Grenoble, France},
    YEAR = {2016},
    MONTH = Aug,
    KEYWORDS = {On Scheduling ; dense linear algebra ; Cholesky ; Multicore ; accelerator ; GPU ; heterogeneous computing ; task DAG ; runtime system},
    PDF = {https://hal.inria.fr/hal-01181135/file/papier%20%281%29.pdf},
    HAL_ID = {hal-01181135},
    HAL_VERSION = {v3},
    
    }
    


  4. Vinicius Garcia Pinto, Luka Stanisic, Arnaud Legrand, Lucas Mello Schnorr, Samuel Thibault, and Vincent Danjean. Analyzing Dynamic Task-Based Applications on Hybrid Platforms: An Agile Scripting Approach. In 3rd Workshop on Visual Performance Analysis (VPA), Salt Lake City, United States, November 2016. Note: Held in conjunction with SC16. [WWW] [PDF] Keyword(s): On Scheduling, STARPU.
    @inproceedings{garciapinto:hal-01353962,
    TITLE = {{Analyzing Dynamic Task-Based Applications on Hybrid Platforms: An Agile Scripting Approach}},
    AUTHOR = {Garcia Pinto, Vinicius and Stanisic, Luka and Legrand, Arnaud and Mello Schnorr, Lucas and Thibault, Samuel and Danjean, Vincent},
    URL = {https://hal.inria.fr/hal-01353962},
    NOTE = {Held in conjunction with SC16},
    BOOKTITLE = {{3rd Workshop on Visual Performance Analysis (VPA)}},
    ADDRESS = {Salt Lake City, United States},
    YEAR = {2016},
    MONTH = Nov,
    KEYWORDS = {On Scheduling; STARPU},
    PDF = {https://hal.inria.fr/hal-01353962/file/VPA_2016_paper_3.pdf},
    HAL_ID = {hal-01353962},
    HAL_VERSION = {v1},
    
    }
    


  5. Johan Janzén, David Black-Schaffer, and Andra Hugo. Partitioning GPUs for Improved Scalability. In IEEE 28th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), October 2016. [WWW] [doi:10.1109/SBAC-PAD.2016.14] Keyword(s): On Scheduling.
    @InProceedings{JaBlHU2016a,
    author = {Johan Janz{\'e}n and David Black-Schaffer and Andra Hugo},
    title = {{Partitioning GPUs for Improved Scalability}},
    booktitle = {IEEE 28th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)},
    year = 2016,
    KEYWORDS = {On Scheduling},
    DOI = {10.1109/SBAC-PAD.2016.14},
    URL = {http://ieeexplore.ieee.org/abstract/document/7789322/},
    month = Oct
    }
    


  6. Marc Sergent, David Goudin, Samuel Thibault, and Olivier Aumage. Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System. In 21st International Workshop on High-Level Parallel Programming Models and Supportive Environments, Chicago, United States, May 2016. [WWW] [PDF] Keyword(s): On Memory Control, memory control, task-based run-time systems, compressed linear algebra, distributed computing.
    @inproceedings{sergent:hal-01284004,
    TITLE = {{Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System}},
    AUTHOR = {Sergent, Marc and Goudin, David and Thibault, Samuel and Aumage, Olivier},
    URL = {https://hal.inria.fr/hal-01284004},
    BOOKTITLE = {{21st International Workshop on High-Level Parallel Programming Models and Supportive Environments}},
    ADDRESS = {Chicago, United States},
    YEAR = {2016},
    MONTH = May,
    keywords = {On Memory Control; memory control ; task-based run-time systems ; compressed linear algebra ; distributed computing},
    PDF = {https://hal.inria.fr/hal-01284004/file/PID4127657.pdf},
    HAL_ID = {hal-01284004},
    HAL_VERSION = {v1},
    
    }
    


Internal reports

  1. Emmanuel Agullo, Olivier Aumage, Berenger Bramas, Olivier Coulaud, and Samuel Pitoiset. Bridging the gap between OpenMP 4.0 and native runtime systems for the fast multipole method. Research Report RR-8953, Inria, March 2016. [WWW] [PDF] Keyword(s): On OpenMP Support on top of StarPU, STARPU, runtime system, parallel programming model, compiler, priority, commutativity, multicore architecture, moteur d'exécution, modèle de programmation parallèle, compilateur, OpenMP 4.0, OpenMP 4.X, priorité, commutativité, architecture multicore.
    @techreport{agullo:hal-01372022,
    TITLE = {{Bridging the gap between OpenMP 4.0 and native runtime systems for the fast multipole method}},
    AUTHOR = {Agullo, Emmanuel and Aumage, Olivier and Bramas, Berenger and Coulaud, Olivier and Pitoiset, Samuel},
    URL = {https://hal.inria.fr/hal-01372022},
    TYPE = {Research Report},
    NUMBER = {RR-8953},
    PAGES = {49},
    INSTITUTION = {{Inria}},
    YEAR = {2016},
    MONTH = Mar,
    KEYWORDS = {On OpenMP Support on top of StarPU; STARPU ; runtime system ; parallel programming model ; compiler ; priority ; commutativity ; multicore architecture ; moteur d'ex{\'e}cution ; mod{\`e}le de programmation parall{\`e}le ; compilateur ; OpenMP 4.0 ; OpenMP 4.X ; priorit{\'e} ; commutativit{\'e} ; architecture multicore},
    PDF = {https://hal.inria.fr/hal-01372022/file/RR-8953.pdf},
    HAL_ID = {hal-01372022},
    HAL_VERSION = {v1},
    
    }
    


  2. Emmanuel Agullo, Bérenger Bramas, Olivier Coulaud, Martin Khannouz, and Luka Stanisic. Task-based fast multipole method for clusters of multicore processors. Research Report RR-8970, Inria Bordeaux Sud-Ouest, October 2016. [WWW] [PDF] Keyword(s): On Applications, STARPU, multicore processor, runtime system, FMM, cluster, high performance computing (HPC), fast multipole method, hybrid parallelization, task-based programming, MPI, OpenMP.
    @techreport{agullo:hal-01387482,
    TITLE = {{Task-based fast multipole method for clusters of multicore processors}},
    AUTHOR = {Agullo, Emmanuel and Bramas, B{\'e}renger and Coulaud, Olivier and Khannouz, Martin and Stanisic, Luka},
    URL = {https://hal.inria.fr/hal-01387482},
    TYPE = {Research Report},
    NUMBER = {RR-8970},
    PAGES = {15},
    INSTITUTION = {{Inria Bordeaux Sud-Ouest}},
    YEAR = {2016},
    MONTH = Oct,
    KEYWORDS = {On Applications; STARPU ; multicore processor ; runtime system ; FMM ; cluster ; high performance computing (HPC) ; fast multipole method ; hybrid parallelization ; task-based programming ; MPI ; OpenMP},
    PDF = {https://hal.inria.fr/hal-01387482/file/report-8970.pdf},
    HAL_ID = {hal-01387482},
    HAL_VERSION = {v1},
    
    }
    


  3. E Agullo, L Giraud, A Guermouche, S Nakov, and Jean Roman. Task-based Conjugate Gradient: from multi-GPU towards heterogeneous architectures. Research Report 8912, Inria Bordeaux Sud-Ouest, May 2016. [WWW] [PDF] Keyword(s): High Performance Computing (HPC), multi-GPUs, heterogeneous architectures, task-based model, runtime system, sparse linear systems, Conjugate Gradient, On Applications, StarPU, scheduling.
    @techreport{agullo:hal-01316982,
    TITLE = {{Task-based Conjugate Gradient: from multi-GPU towards heterogeneous architectures}},
    AUTHOR = {Agullo, E and Giraud, L and Guermouche, A and Nakov, S and Roman, Jean},
    URL = {https://hal.inria.fr/hal-01316982},
    TYPE = {Research Report},
    NUMBER = {8912},
    INSTITUTION = {{Inria Bordeaux Sud-Ouest}},
    YEAR = {2016},
    MONTH = May,
    KEYWORDS = {High Performance Computing (HPC) ; multi-GPUs ; heterogeneous architectures ; task-based model ; runtime system ; sparse linear systems ; Conjugate Gradient ; On Applications ; StarPU ; scheduling},
    PDF = {https://hal.inria.fr/hal-01316982/file/RR-8912.pdf},
    HAL_ID = {hal-01316982},
    HAL_VERSION = {v1},
    }
    


Miscellaneous

  1. Terry Cojean, Abdou Guermouche, Andra Hugo, Raymond Namyst, and Pierre-André Wacrenier. Resource aggregation for task-based Cholesky Factorization on top of modern architectures. Note: This paper is submitted for review to the Parallel Computing special issue for HCW and HeteroPar 16 workshops, November 2016. [WWW] [PDF] Keyword(s): Intel Xeon-Phi KNL, heterogeneous computing, GPU, accelerator, Multicore, dense linear algebra, task DAG, Cholesky factorization, runtime system, On Scheduling.
    @unpublished{cojean:hal-01409965,
    TITLE = {{Resource aggregation for task-based Cholesky Factorization on top of modern architectures}},
    AUTHOR = {Cojean, Terry and Guermouche, Abdou and Hugo, Andra and Namyst, Raymond and Wacrenier, Pierre-Andr{\'e}},
    URL = {https://hal.inria.fr/hal-01409965},
    NOTE = {This paper is submitted for review to the Parallel Computing special issue for HCW and HeteroPar 16 workshops},
    YEAR = {2016},
    MONTH = Nov,
    KEYWORDS = {Intel Xeon-Phi KNL ; heterogeneous computing ; GPU ; accelerator ; Multicore ; dense linear algebra ; task DAG ; Cholesky factorization ; runtime system ; On Scheduling},
    PDF = {https://hal.inria.fr/hal-01409965/file/submission.pdf},
    HAL_ID = {hal-01409965},
    HAL_VERSION = {v1},
    
    }
    








Disclaimer:

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.





Last modified: Sun Sep 9 17:51:28 2018
Author: samy.

