StarPU

Publications List

All publications sorted by year

2018

  1. Terry Cojean. Programmation of heterogeneous architectures using moldable tasks. Theses, Université de Bordeaux, March 2018. [WWW] [PDF] Keyword(s): On Parallel Tasks, High Performance Computing, Runtime systems, Parallel tasks programming, Applied linear algebra, Calcul Haute Performance, Supports d'exécution, Programmation à l'aide de tâches parallèles, Algèbre linéaire appliquée. [bibtex-key = cojean:tel-01816341]
    @phdthesis{cojean:tel-01816341,
    TITLE = {{Programmation of heterogeneous architectures using moldable tasks}},
    AUTHOR = {Cojean, Terry},
    URL = {https://tel.archives-ouvertes.fr/tel-01816341},
    NUMBER = {2018BORD0041},
    SCHOOL = {{Universit{\'e} de Bordeaux}},
    YEAR = {2018},
    MONTH = Mar,
    KEYWORDS = {On Parallel Tasks; High Performance Computing ; Runtime systems ; Parallel tasks programming ; Applied linear algebra ; Calcul Haute Performance ; Supports d'ex{\'e}cution ; Programmation {\`a} l'aide de t{\^a}ches parall{\`e}les ; Alg{\`e}bre lin{\'e}aire appliqu{\'e}e},
    TYPE = {Theses},
    PDF = {https://tel.archives-ouvertes.fr/tel-01816341/file/COJEAN_TERRY_2018.pdf},
    HAL_ID = {tel-01816341},
    HAL_VERSION = {v1},
    
    }
    


  2. Vinicius Garcia Pinto, Lucas Mello Schnorr, Luka Stanisic, Arnaud Legrand, Samuel Thibault, and Vincent Danjean. A Visual Performance Analysis Framework for Task-based Parallel Applications running on Hybrid Clusters. Concurrency and Computation: Practice and Experience, April 2018. [WWW] [PDF] [doi:10.1002/cpe.4472] Keyword(s): On Scheduling, Heterogeneous platforms, Cholesky, High-Performance Computing, Trace Visualization, Task-based applications. [bibtex-key = garciapinto:hal-01616632]
    @article{garciapinto:hal-01616632,
    TITLE = {{A Visual Performance Analysis Framework for Task-based Parallel Applications running on Hybrid Clusters}},
    AUTHOR = {Garcia Pinto, Vinicius and Schnorr, Lucas Mello and Stanisic, Luka and Legrand, Arnaud and Thibault, Samuel and Danjean, Vincent},
    URL = {https://hal.inria.fr/hal-01616632},
    JOURNAL = {{Concurrency and Computation: Practice and Experience}},
    PUBLISHER = {{Wiley}},
    YEAR = {2018},
    MONTH = Apr,
    DOI = {10.1002/cpe.4472},
    KEYWORDS = {On Scheduling; Heterogeneous platforms ; Cholesky ; High-Performance Computing ; Trace Visualization ; Task-based applications},
    PDF = {https://hal.inria.fr/hal-01616632/file/CCPE_article_submitted_2018_02_06.pdf},
    HAL_ID = {hal-01616632},
    HAL_VERSION = {v2},
    
    }
    


  3. Vinicius Garcia Pinto, Lucas Mello Schnorr, Arnaud Legrand, Samuel Thibault, Luka Stanisic, and Vincent Danjean. Detecção de Anomalias de Desempenho em Aplicações de Alto Desempenho baseadas em Tarefas em Clusters Híbridos. In 17º Workshop em Desempenho de Sistemas Computacionais e de Comunicação (WPerformance), Natal, Brazil, July 2018. [WWW] [PDF] Keyword(s): On Scheduling. [bibtex-key = pinto:hal-01842038]
    @inproceedings{pinto:hal-01842038,
    TITLE = {{Detec{\c c}{\~ a}o de Anomalias de Desempenho em Aplica{\c c}{\~ o}es de Alto Desempenho baseadas em Tarefas em Clusters H{\'i}bridos}},
    AUTHOR = {Pinto, Vinicius Garcia and Mello Schnorr, Lucas and Legrand, Arnaud and Thibault, Samuel and Stanisic, Luka and Danjean, Vincent},
    URL = {https://hal.inria.fr/hal-01842038},
    BOOKTITLE = {{17º Workshop em Desempenho de Sistemas Computacionais e de Comunica{\c c}{\~ a}o (WPerformance)}},
    ADDRESS = {Natal, Brazil},
    YEAR = {2018},
    MONTH = Jul,
    KEYWORDS = {On Scheduling},
    PDF = {https://hal.inria.fr/hal-01842038/file/181587_1.pdf},
    HAL_ID = {hal-01842038},
    HAL_VERSION = {v1},
    
    }
    


2017

  1. Suraj Kumar. Scheduling of Dense Linear Algebra Kernels on Heterogeneous Resources. PhD thesis, Université de Bordeaux, April 2017. [WWW] [PDF] Keyword(s): On Scheduling, STARPU, Runtime Systems, Heterogeneous Platforms, Task-based Scheduling, Dynamic Schedulers, Dense Linear Algebra, Systèmes d'ordonnancement dynamiques, Plates-formes hétérogènes, Algèbre linéaire dense, Ordonnancement dynamique, Ordonnancement à base de graphe de tâches. [bibtex-key = kumar:tel-01538516]
    @phdthesis{kumar:tel-01538516,
    TITLE = {{Scheduling of Dense Linear Algebra Kernels on Heterogeneous Resources}},
    AUTHOR = {Kumar, Suraj},
    URL = {https://tel.archives-ouvertes.fr/tel-01538516},
    NUMBER = {2017BORD0572},
    SCHOOL = {{Universit{\'e} de Bordeaux}},
    YEAR = {2017},
    MONTH = Apr,
    KEYWORDS = {On Scheduling; STARPU ; Runtime Systems ; Heterogeneous Platforms ; Task-based Scheduling ; Dynamic Schedulers ; Dense Linear Algebra ; Syst{\`e}mes d'ordonnancement dynamiques ; Plates-formes h{\'e}t{\'e}rog{\`e}nes ; Alg{\`e}bre lin{\'e}aire dense ; Ordonnancement dynamique ; Ordonnancement {\`a} base de graphe de t{\^a}ches},
    PDF = {https://tel.archives-ouvertes.fr/tel-01538516/file/KUMAR_SURAL_2017.pdf},
    HAL_ID = {tel-01538516},
    HAL_VERSION = {v1},
    
    }
    


  2. Emmanuel Agullo, Olivier Aumage, Berenger Bramas, Olivier Coulaud, and Samuel Pitoiset. Bridging the gap between OpenMP and task-based runtime systems for the fast multipole method. IEEE Transactions on Parallel and Distributed Systems, April 2017. [WWW] [PDF] [doi:10.1109/TPDS.2017.2697857] Keyword(s): On OpenMP Support on top of StarPU, STARPU, multicore architecture, commutativity, priority, high performance computing, fast multipole method, runtime system, OpenMP, compiler, parallel programming model, StarPU, KStar, ScalFMM. [bibtex-key = agullo:hal-01517153]
    @article{agullo:hal-01517153,
    TITLE = {{Bridging the gap between OpenMP and task-based runtime systems for the fast multipole method}},
    AUTHOR = {Agullo, Emmanuel and Aumage, Olivier and Bramas, Berenger and Coulaud, Olivier and Pitoiset, Samuel},
    URL = {https://hal.inria.fr/hal-01517153},
    JOURNAL = {{IEEE Transactions on Parallel and Distributed Systems}},
    YEAR = {2017},
    MONTH = Apr,
    DOI = {10.1109/TPDS.2017.2697857},
    KEYWORDS = {On OpenMP Support on top of StarPU; STARPU ; multicore architecture ; commutativity ; priority ; high performance computing ; fast multipole method ; runtime system ; OpenMP ; compiler ; parallel programming model ; StarPU ; KStar ; ScalFMM},
    PDF = {https://hal.inria.fr/hal-01517153/file/tpds_kstar_scalfmm_print.pdf},
    HAL_ID = {hal-01517153},
    HAL_VERSION = {v1},
    
    }
    


  3. Emmanuel Agullo, Olivier Aumage, Mathieu Faverge, Nathalie Furmento, Florent Pruvost, Marc Sergent, and Samuel Thibault. Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model. IEEE Transactions on Parallel and Distributed Systems, 2017. [WWW] [PDF] Keyword(s): On MPI Support, runtime system, sequential task flow, task-based programming, heterogeneous computing, distributed computing, multicore, GPU, Cholesky factorization. [bibtex-key = agullo:hal-01618526]
    @article{agullo:hal-01618526,
    TITLE = {{Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model}},
    AUTHOR = {Agullo, Emmanuel and Aumage, Olivier and Faverge, Mathieu and Furmento, Nathalie and Pruvost, Florent and Sergent, Marc and Thibault, Samuel},
    URL = {https://hal.inria.fr/hal-01618526},
    JOURNAL = {{IEEE Transactions on Parallel and Distributed Systems}},
    PUBLISHER = {{Institute of Electrical and Electronics Engineers}},
    YEAR = {2017},
    KEYWORDS = {On MPI Support ; runtime system ; sequential task flow ; task-based programming ; heterogeneous computing ; distributed computing ; multicore ; GPU ; Cholesky factorization},
    PDF = {https://hal.inria.fr/hal-01618526/file/tpds14.pdf},
    HAL_ID = {hal-01618526},
    HAL_VERSION = {v1},
    
    }
    


  4. Jean Marie Couteyen Carpaye, Jean Roman, and Pierre Brenner. Design and Analysis of a Task-based Parallelization over a Runtime System of an Explicit Finite-Volume CFD Code with Adaptive Time Stepping. International Journal of Computational Science and Engineering, pp 1 - 22, 2017. [WWW] [PDF] [doi:10.1016/j.jocs.2017.03.008] Keyword(s): On Applications, HPC, CFD, runtime, task-based. [bibtex-key = couteyencarpaye:hal-01507613]
    @article{couteyencarpaye:hal-01507613,
    TITLE = {{Design and Analysis of a Task-based Parallelization over a Runtime System of an Explicit Finite-Volume CFD Code with Adaptive Time Stepping}},
    AUTHOR = {Couteyen Carpaye, Jean Marie and Roman, Jean and Brenner, Pierre},
    URL = {https://hal.inria.fr/hal-01507613},
    JOURNAL = {{International Journal of Computational Science and Engineering}},
    PUBLISHER = {{Inderscience}},
    PAGES = {1 - 22},
    YEAR = {2017},
    DOI = {10.1016/j.jocs.2017.03.008},
    KEYWORDS = {On Applications; HPC ; CFD ; runtime ; task-based},
    PDF = {https://hal.inria.fr/hal-01507613/file/flusepa-task-hal-inria-preprint.pdf},
    HAL_ID = {hal-01507613},
    HAL_VERSION = {v1},
    
    }
    


  5. O. Beaumont, L. Eyraud-Dubois, and S. Kumar. Approximation Proofs of a Fast and Efficient List Scheduling Algorithm for Task-Based Runtime Systems on Multicores and GPUs. In 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pages 768-777, May 2017. [WWW] [PDF] [doi:10.1109/IPDPS.2017.71] Keyword(s): On Scheduling, STARPU, List scheduling, Approximation proofs, Runtime systems, Heterogeneous scheduling, Dense linear algebra. [bibtex-key = beaumont:hal-01386174]
    @INPROCEEDINGS{beaumont:hal-01386174,
    author={O. Beaumont and L. Eyraud-Dubois and S. Kumar},
    booktitle={2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS)},
    title={Approximation Proofs of a Fast and Efficient List Scheduling Algorithm for Task-Based Runtime Systems on Multicores and GPUs},
    year={2017},
    volume={},
    number={},
    pages={768-777},
    doi={10.1109/IPDPS.2017.71},
    URL = {https://hal.inria.fr/hal-01386174},
    PDF = {https://hal.inria.fr/hal-01386174/file/heteroPrioApproxProofsRR.pdf},
    ISSN={},
    month={May},
    KEYWORDS = {On Scheduling; STARPU ; List scheduling ; Approximation proofs ; Runtime systems ; Heterogeneous scheduling ; Dense linear algebra},
    HAL_ID = {hal-01386174},
    HAL_VERSION = {v1},
    
    }
    


  6. Emmanuel Agullo, Bérenger Bramas, Olivier Coulaud, Luka Stanisic, and Samuel Thibault. Modeling Irregular Kernels of Task-based codes: Illustration with the Fast Multipole Method. Research Report RR-9036, INRIA Bordeaux, February 2017. [WWW] [PDF] Keyword(s): On Performance Model Tuning, Mathematical Software, Modeling and simulation, Parallel computing methodologies, fast multipole method, runtime system, task-based programming. [bibtex-key = agullo:hal-01474556]
    @techreport{agullo:hal-01474556,
    TITLE = {{Modeling Irregular Kernels of Task-based codes: Illustration with the Fast Multipole Method}},
    AUTHOR = {Agullo, Emmanuel and Bramas, B{\'e}renger and Coulaud, Olivier and Stanisic, Luka and Thibault, Samuel},
    URL = {https://hal.inria.fr/hal-01474556},
    TYPE = {Research Report},
    NUMBER = {RR-9036},
    PAGES = {35},
    INSTITUTION = {{INRIA Bordeaux}},
    YEAR = {2017},
    MONTH = Feb,
    KEYWORDS = {On Performance Model Tuning; Mathematical Software ; Modeling and simulation ; Parallel computing methodologies ; fast multipole method ; runtime system ; task-based programming},
    PDF = {https://hal.inria.fr/hal-01474556/file/rapport.pdf},
    HAL_ID = {hal-01474556},
    HAL_VERSION = {v1},
    
    }
    


  7. Emmanuel Agullo, Alfredo Buttari, Mikko Byckling, Abdou Guermouche, and Ian Masliah. Achieving high-performance with a sparse direct solver on Intel KNL. Research Report RR-9035, Inria Bordeaux Sud-Ouest ; CNRS-IRIT ; Intel corporation ; Université Bordeaux, February 2017. [WWW] [PDF] Keyword(s): On Applications, runtime system, sparse direct solver, energy efficiency, high-performance computing, portability, Intel KNL, manycore parallelism. [bibtex-key = agullo:hal-01473475]
    @techreport{agullo:hal-01473475,
    TITLE = {{Achieving high-performance with a sparse direct solver on Intel KNL}},
    AUTHOR = {Agullo, Emmanuel and Buttari, Alfredo and Byckling, Mikko and Guermouche, Abdou and Masliah, Ian},
    URL = {https://hal.inria.fr/hal-01473475},
    TYPE = {Research Report},
    NUMBER = {RR-9035},
    PAGES = {15},
    INSTITUTION = {{Inria Bordeaux Sud-Ouest ; CNRS-IRIT ; Intel corporation ; Universit{\'e} Bordeaux}},
    YEAR = {2017},
    MONTH = Feb,
    KEYWORDS = {On Applications; runtime system ; sparse direct solver ; energy efficiency ; high-performance computing ; portability ; Intel KNL ; manycore parallelism},
    PDF = {https://hal.inria.fr/hal-01473475/file/RR-9035.pdf},
    HAL_ID = {hal-01473475},
    HAL_VERSION = {v1},
    
    }
    


  8. Arthur Chevalier. Critical resources management and scheduling under StarPU. Master's thesis, Université de Bordeaux, September 2017. [WWW] [PDF] Keyword(s): On Memory Control, StarPU. [bibtex-key = chevalier:hal-01718280]
    @mastersthesis{chevalier:hal-01718280,
    TITLE = {{Critical resources management and scheduling under StarPU}},
    AUTHOR = {Chevalier, Arthur},
    URL = {https://hal.inria.fr/hal-01718280},
    SCHOOL = {{Universit{\'e} de Bordeaux}},
    YEAR = {2017},
    MONTH = Sep,
    KEYWORDS = {On Memory Control; StarPU},
    PDF = {https://hal.inria.fr/hal-01718280/file/Memoire.pdf},
    HAL_ID = {hal-01718280},
    HAL_VERSION = {v1},
    
    }
    


2016

  1. Marc Sergent. Scalability of a task-based runtime system for dense linear algebra applications. PhD thesis, Université de Bordeaux, December 2016. [WWW] [PDF] Keyword(s): On MPI Support, High performance computing, Run-time systems, Distributed computing, Task-based programming, Parallel programming models, Calcul haute performance, Supports d'exécution, Calcul distribué, Programmation par tâches, Modèles de programmation parallèle. [bibtex-key = sergent:tel-01483666]
    @phdthesis{sergent:tel-01483666,
    TITLE = {{Scalability of a task-based runtime system for dense linear algebra applications}},
    AUTHOR = {Sergent, Marc},
    URL = {https://tel.archives-ouvertes.fr/tel-01483666},
    NUMBER = {2016BORD0372},
    SCHOOL = {{Universit{\'e} de Bordeaux}},
    YEAR = {2016},
    MONTH = Dec,
    KEYWORDS = {On MPI Support ; High performance computing ; Run-time systems ; Distributed computing ; Task-based programming ; Parallel programming models ; Calcul haute performance ; Supports d'ex{\'e}cution ; Calcul distribu{\'e} ; Programmation par t{\^a}ches ; Mod{\`e}les de programmation parall{\`e}le},
    PDF = {https://tel.archives-ouvertes.fr/tel-01483666/file/SERGENT_MARC_2016.pdf},
    HAL_ID = {tel-01483666},
    HAL_VERSION = {v1},
    
    }
    


  2. Emmanuel Agullo, Olivier Beaumont, Lionel Eyraud-Dubois, and Suraj Kumar. Are Static Schedules so Bad ? A Case Study on Cholesky Factorization. In IPDPS'16, Proceedings of the 30th IEEE International Parallel & Distributed Processing Symposium, IPDPS'16, Chicago, IL, United States, May 2016. IEEE. [WWW] [PDF] Keyword(s): On Scheduling, Cholesky Factorization, Accelerators, Heterogeneous Systems, Runtime Systems, Scheduling, Unrelated Machines. [bibtex-key = agullo:hal-01223573]
    @inproceedings{agullo:hal-01223573,
    TITLE = {{Are Static Schedules so Bad ? A Case Study on Cholesky Factorization}},
    AUTHOR = {Agullo, Emmanuel and Beaumont, Olivier and Eyraud-Dubois, Lionel and Kumar, Suraj},
    URL = {https://hal.inria.fr/hal-01223573},
    BOOKTITLE = {{IPDPS'16}},
    ADDRESS = {Chicago, IL, United States},
    PUBLISHER = {{IEEE}},
    SERIES = {Proceedings of the 30th IEEE International Parallel \& Distributed Processing Symposium, IPDPS'16},
    YEAR = {2016},
    MONTH = May,
    keywords = {On Scheduling; Cholesky Factorization ; Accelerators ; Heterogeneous Systems ; Runtime Systems; Scheduling ; Unrelated Machines},
    PDF = {https://hal.inria.fr/hal-01223573/file/heteroprioCameraReady-ieeeCompatiable.pdf},
    HAL_ID = {hal-01223573},
    HAL_VERSION = {v2},
    
    }
    


  3. Olivier Beaumont, Terry Cojean, Lionel Eyraud-Dubois, Abdou Guermouche, and Suraj Kumar. Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources. In International Conference on High Performance Computing, Data, and Analytics (HiPC), Hyderabad, India, December 2016. [WWW] [PDF] Keyword(s): On Scheduling, STARPU, Scheduling, Linear Algebra, Heterogeneous Platforms, Task-based Scheduling, Cholesky Factorization, Simulation, Resource Aggregation. [bibtex-key = beaumont:hal-01361992]
    @inproceedings{beaumont:hal-01361992,
    TITLE = {{Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources}},
    AUTHOR = {Beaumont, Olivier and Cojean, Terry and Eyraud-Dubois, Lionel and Guermouche, Abdou and Kumar, Suraj},
    URL = {https://hal.inria.fr/hal-01361992},
    BOOKTITLE = {{International Conference on High Performance Computing, Data, and Analytics (HiPC)}},
    ADDRESS = {Hyderabad, India},
    YEAR = {2016},
    MONTH = Dec,
    KEYWORDS = {On Scheduling; STARPU ; Scheduling ; Linear Algebra ; Heterogeneous Platforms ; Task-based Scheduling ; Cholesky Factorization ; Simulation ; Resource Aggregation},
    PDF = {https://hal.inria.fr/hal-01361992v2/document},
    HAL_ID = {hal-01361992},
    HAL_VERSION = {v1},
    
    }
    


  4. Terry Cojean, Abdou Guermouche, Andra Hugo, Raymond Namyst, and Pierre-André Wacrenier. Resource aggregation for task-based Cholesky Factorization on top of heterogeneous machines. In HeteroPar'2016 workshop of Euro-Par, Grenoble, France, August 2016. [WWW] [PDF] Keyword(s): On Scheduling, dense linear algebra, Cholesky, Multicore, accelerator, GPU, heterogeneous computing, task DAG, runtime system. [bibtex-key = cojean:hal-01181135]
    @inproceedings{cojean:hal-01181135,
    TITLE = {{Resource aggregation for task-based Cholesky Factorization on top of heterogeneous machines}},
    AUTHOR = {Cojean, Terry and Guermouche, Abdou and Hugo, Andra and Namyst, Raymond and Wacrenier, Pierre-Andr{\'e}},
    URL = {https://hal.inria.fr/hal-01181135},
    BOOKTITLE = {{HeteroPar'2016 workshop of Euro-Par}},
    ADDRESS = {Grenoble, France},
    YEAR = {2016},
    MONTH = Aug,
    KEYWORDS = {On Scheduling; dense linear algebra ; Cholesky ; Multicore ; accelerator ; GPU ; heterogeneous computing ; task DAG ; runtime system},
    PDF = {https://hal.inria.fr/hal-01181135/file/papier%20%281%29.pdf},
    HAL_ID = {hal-01181135},
    HAL_VERSION = {v3},
    
    }
    


  5. Vinicius Garcia Pinto, Luka Stanisic, Arnaud Legrand, Lucas Mello Schnorr, Samuel Thibault, and Vincent Danjean. Analyzing Dynamic Task-Based Applications on Hybrid Platforms: An Agile Scripting Approach. In 3rd Workshop on Visual Performance Analysis (VPA), Salt Lake City, United States, November 2016. Note: Held in conjunction with SC16. [WWW] [PDF] Keyword(s): On Scheduling, STARPU. [bibtex-key = garciapinto:hal-01353962]
    @inproceedings{garciapinto:hal-01353962,
    TITLE = {{Analyzing Dynamic Task-Based Applications on Hybrid Platforms: An Agile Scripting Approach}},
    AUTHOR = {Garcia Pinto, Vinicius and Stanisic, Luka and Legrand, Arnaud and Mello Schnorr, Lucas and Thibault, Samuel and Danjean, Vincent},
    URL = {https://hal.inria.fr/hal-01353962},
    NOTE = {Held in conjunction with SC16},
    BOOKTITLE = {{3rd Workshop on Visual Performance Analysis (VPA)}},
    ADDRESS = {Salt Lake City, United States},
    YEAR = {2016},
    MONTH = Nov,
    KEYWORDS = {On Scheduling; STARPU},
    PDF = {https://hal.inria.fr/hal-01353962/file/VPA_2016_paper_3.pdf},
    HAL_ID = {hal-01353962},
    HAL_VERSION = {v1},
    
    }
    


  6. Johan Janzén, David Black-Schaffer, and Andra Hugo. Partitioning GPUs for Improved Scalability. In IEEE 28th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), October 2016. [WWW] [doi:10.1109/SBAC-PAD.2016.14] Keyword(s): On Scheduling. [bibtex-key = JaBlHU2016a]
    @InProceedings{JaBlHU2016a,
    author = {Johan Janz{\'e}n and David Black-Schaffer and Andra Hugo},
    title = {{Partitioning GPUs for Improved Scalability}},
    booktitle = {IEEE 28th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)},
    year = 2016,
    KEYWORDS = {On Scheduling},
    DOI = {10.1109/SBAC-PAD.2016.14},
    URL = {http://ieeexplore.ieee.org/abstract/document/7789322/},
    month = Oct
    }
    


  7. Marc Sergent, David Goudin, Samuel Thibault, and Olivier Aumage. Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System. In 21st International Workshop on High-Level Parallel Programming Models and Supportive Environments, Chicago, United States, May 2016. [WWW] [PDF] Keyword(s): On Memory Control, memory control, task-based run-time systems, compressed linear algebra, distributed computing. [bibtex-key = sergent:hal-01284004]
    @inproceedings{sergent:hal-01284004,
    TITLE = {{Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System}},
    AUTHOR = {Sergent, Marc and Goudin, David and Thibault, Samuel and Aumage, Olivier},
    URL = {https://hal.inria.fr/hal-01284004},
    BOOKTITLE = {{21st International Workshop on High-Level Parallel Programming Models and Supportive Environments}},
    ADDRESS = {Chicago, United States},
    YEAR = {2016},
    MONTH = May,
    keywords = {On Memory Control; memory control ; task-based run-time systems ; compressed linear algebra ; distributed computing},
    PDF = {https://hal.inria.fr/hal-01284004/file/PID4127657.pdf},
    HAL_ID = {hal-01284004},
    HAL_VERSION = {v1},
    
    }
    


  8. Emmanuel Agullo, Olivier Aumage, Berenger Bramas, Olivier Coulaud, and Samuel Pitoiset. Bridging the gap between OpenMP 4.0 and native runtime systems for the fast multipole method. Research Report RR-8953, Inria, March 2016. [WWW] [PDF] Keyword(s): On OpenMP Support on top of StarPU, STARPU, runtime system, parallel programming model, compiler, priority, commutativity, multicore architecture, moteur d'exécution, modèle de programmation parallèle, compilateur, OpenMP 4.0, OpenMP 4.X, priorité, commutativité, architecture multicore. [bibtex-key = agullo:hal-01372022]
    @techreport{agullo:hal-01372022,
    TITLE = {{Bridging the gap between OpenMP 4.0 and native runtime systems for the fast multipole method}},
    AUTHOR = {Agullo, Emmanuel and Aumage, Olivier and Bramas, Berenger and Coulaud, Olivier and Pitoiset, Samuel},
    URL = {https://hal.inria.fr/hal-01372022},
    TYPE = {Research Report},
    NUMBER = {RR-8953},
    PAGES = {49},
    INSTITUTION = {{Inria}},
    YEAR = {2016},
    MONTH = Mar,
    KEYWORDS = {On OpenMP Support on top of StarPU; STARPU ; runtime system ; parallel programming model ; compiler ; priority ; commutativity ; multicore architecture ; moteur d'ex{\'e}cution ; mod{\`e}le de programmation parall{\`e}le ; compilateur ; OpenMP 4.0 ; OpenMP 4.X ; priorit{\'e} ; commutativit{\'e} ; architecture multicore},
    PDF = {https://hal.inria.fr/hal-01372022/file/RR-8953.pdf},
    HAL_ID = {hal-01372022},
    HAL_VERSION = {v1},
    
    }
    


  9. Emmanuel Agullo, Bérenger Bramas, Olivier Coulaud, Martin Khannouz, and Luka Stanisic. Task-based fast multipole method for clusters of multicore processors. Research Report RR-8970, Inria Bordeaux Sud-Ouest, October 2016. [WWW] [PDF] Keyword(s): On Applications, STARPU, multicore processor, runtime system, FMM, cluster, high performance computing (HPC), fast multipole method, hybrid parallelization, task-based programming, MPI, OpenMP. [bibtex-key = agullo:hal-01387482]
    @techreport{agullo:hal-01387482,
    TITLE = {{Task-based fast multipole method for clusters of multicore processors}},
    AUTHOR = {Agullo, Emmanuel and Bramas, B{\'e}renger and Coulaud, Olivier and Khannouz, Martin and Stanisic, Luka},
    URL = {https://hal.inria.fr/hal-01387482},
    TYPE = {Research Report},
    NUMBER = {RR-8970},
    PAGES = {15},
    INSTITUTION = {{Inria Bordeaux Sud-Ouest}},
    YEAR = {2016},
    MONTH = Oct,
    KEYWORDS = {On Applications; STARPU ; multicore processor ; runtime system ; FMM ; cluster ; high performance computing (HPC) ; fast multipole method ; hybrid parallelization ; task-based programming ; MPI ; OpenMP},
    PDF = {https://hal.inria.fr/hal-01387482/file/report-8970.pdf},
    HAL_ID = {hal-01387482},
    HAL_VERSION = {v1},
    
    }
    


  10. E Agullo, L Giraud, A Guermouche, S Nakov, and Jean Roman. Task-based Conjugate Gradient: from multi-GPU towards heterogeneous architectures. Research Report 8912, Inria Bordeaux Sud-Ouest, May 2016. [WWW] [PDF] Keyword(s): High Performance Computing (HPC), multi-GPUs, heterogeneous architectures, task-based model, runtime system, sparse linear systems, Conjugate Gradient, On Applications, StarPU, scheduling. [bibtex-key = agullo:hal-01316982]
    @techreport{agullo:hal-01316982,
    TITLE = {{Task-based Conjugate Gradient: from multi-GPU towards heterogeneous architectures}},
    AUTHOR = {Agullo, E and Giraud, L and Guermouche, A and Nakov, S and Roman, Jean},
    URL = {https://hal.inria.fr/hal-01316982},
    TYPE = {Research Report},
    NUMBER = {8912},
    INSTITUTION = {{Inria Bordeaux Sud-Ouest}},
    YEAR = {2016},
    MONTH = May,
    KEYWORDS = {High Performance Computing (HPC) ; multi-GPUs ; heterogeneous architectures ; task-based model ; runtime system ; sparse linear systems ; Conjugate Gradient ; On Applications ; StarPU ; scheduling},
    PDF = {https://hal.inria.fr/hal-01316982/file/RR-8912.pdf},
    HAL_ID = {hal-01316982},
    HAL_VERSION = {v1},
    }
    


  11. Terry Cojean, Abdou Guermouche, Andra Hugo, Raymond Namyst, and Pierre-André Wacrenier. Resource aggregation for task-based Cholesky Factorization on top of modern architectures. Note: This paper is submitted for review to the Parallel Computing special issue for HCW and HeteroPar 16 workshops, November 2016. [WWW] [PDF] Keyword(s): Intel Xeon-Phi KNL, heterogeneous computing, GPU, accelerator, Multicore, dense linear algebra, task DAG, Cholesky factorization, runtime system, On Scheduling. [bibtex-key = cojean:hal-01409965]
    @unpublished{cojean:hal-01409965,
    TITLE = {{Resource aggregation for task-based Cholesky Factorization on top of modern architectures}},
    AUTHOR = {Cojean, Terry and Guermouche, Abdou and Hugo, Andra and Namyst, Raymond and Wacrenier, Pierre-Andr{\'e}},
    URL = {https://hal.inria.fr/hal-01409965},
    NOTE = {This paper is submitted for review to the Parallel Computing special issue for HCW and HeteroPar 16 workshops},
    YEAR = {2016},
    MONTH = Nov,
    KEYWORDS = {Intel Xeon-Phi KNL ; heterogeneous computing ; GPU ; accelerator ; Multicore ; dense linear algebra ; task DAG ; Cholesky factorization ; runtime system ; On Scheduling},
    PDF = {https://hal.inria.fr/hal-01409965/file/submission.pdf},
    HAL_ID = {hal-01409965},
    HAL_VERSION = {v1},
    
    }
    


2015

  1. Corentin Rossignon. A fine grain model programming for parallelization of sparse linear solver. PhD thesis, Université de Bordeaux, July 2015. [WWW] [PDF] Keyword(s): On Applications, Sparse linear algebra, Multicore, Runtime Systems, Task-based programming, Parallelism, Algèbre linéaire creuse, Multi-coeurs, NUMA, Parallélisme, Graphe de tâches, Supports d'exécution. [bibtex-key = rossignon:tel-01230876]
    @phdthesis{rossignon:tel-01230876,
    TITLE = {{A fine grain model programming for parallelization of sparse linear solver}},
    AUTHOR = {Rossignon, Corentin},
    URL = {https://tel.archives-ouvertes.fr/tel-01230876},
    NUMBER = {2015BORD0094},
    SCHOOL = {{Universit{\'e} de Bordeaux}},
    YEAR = {2015},
    MONTH = Jul,
    keywords = {On Applications; Sparse linear algebra ; Multicore ; Runtime Systems ; Task-based programming ; Parallelism ; Alg{\`e}bre lin{\'e}aire creuse ; Multi-coeurs ; NUMA ; Parall{\'e}lisme ; Graphe de t{\^a}ches ; Supports d'ex{\'e}cution},
    PDF = {https://tel.archives-ouvertes.fr/tel-01230876/file/ROSSIGNON_CORENTIN_2015.pdf},
    HAL_ID = {tel-01230876},
    HAL_VERSION = {v1},
    
    }
    


  2. Luka Stanisic, Samuel Thibault, Arnaud Legrand, Brice Videau, and Jean-François Méhaut. Faithful Performance Prediction of a Dynamic Task-Based Runtime System for Heterogeneous Multi-Core Architectures. Concurrency and Computation: Practice and Experience, pp 16, May 2015. [WWW] [PDF] [doi:10.1002/cpe] Keyword(s): On The Simulation Support through SimGrid, runtime systems, simulation, simgrid, HPC, Starpu-simgrid. [bibtex-key = stanisic:hal-01147997]
    @article{stanisic:hal-01147997,
    TITLE = {{Faithful Performance Prediction of a Dynamic Task-Based Runtime System for Heterogeneous Multi-Core Architectures}},
    AUTHOR = {Stanisic, Luka and Thibault, Samuel and Legrand, Arnaud and Videau, Brice and M{\'e}haut, Jean-Fran{\c c}ois},
    URL = {https://hal.inria.fr/hal-01147997},
    JOURNAL = {{Concurrency and Computation: Practice and Experience}},
    PUBLISHER = {{John Wiley and Sons}},
    PAGES = {16},
    YEAR = {2015},
    MONTH = May,
    DOI = {10.1002/cpe},
    PDF = {https://hal.inria.fr/hal-01147997/file/CCPE14_article.pdf},
    HAL_ID = {hal-01147997},
    HAL_VERSION = {v1},
    KEYWORDS = {On The Simulation Support through SimGrid; runtime systems ; simulation ; simgrid ; HPC ; Starpu-simgrid} 
    }
    


  3. Emmanuel Agullo, Olivier Beaumont, Lionel Eyraud-Dubois, Julien Herrmann, Suraj Kumar, Loris Marchal, and Samuel Thibault. Bridging the Gap between Performance and Bounds of Cholesky Factorization on Heterogeneous Platforms. In Heterogeneity in Computing Workshop 2015, Hyderabad, India, May 2015. [WWW] [PDF] Keyword(s): On Scheduling, StarPU, Simgrid, Dynamic Schedulers, Resource Allocation, Scheduling, Heterogeneous Resources, Dense Linear Algebra, Simulation, Cholesky Factorization. [bibtex-key = agullo:hal-01120507]
    @inproceedings{agullo:hal-01120507,
    TITLE = {{Bridging the Gap between Performance and Bounds of Cholesky Factorization on Heterogeneous Platforms}},
    AUTHOR = {Agullo, Emmanuel and Beaumont, Olivier and Eyraud-Dubois, Lionel and Herrmann, Julien and Kumar, Suraj and Marchal, Loris and Thibault, Samuel},
    URL = {https://hal.inria.fr/hal-01120507},
    PDF = {https://hal.inria.fr/hal-01120507/document},
    BOOKTITLE = {{Heterogeneity in Computing Workshop 2015}},
    ADDRESS = {Hyderabad, India},
    YEAR = {2015},
    MONTH = May,
    HAL_ID = {hal-01120507},
    HAL_VERSION = {v1},
    KEYWORDS = {On Scheduling; StarPU; Simgrid; Dynamic Schedulers; Resource Allocation; Scheduling; Heterogeneous Resources; Dense Linear Algebra; Simulation; Cholesky Factorization}
    }
    


  4. Víctor Martínez, David Michéa, Fabrice Dupros, Olivier Aumage, Samuel Thibault, Hideo Aochi, and Philippe Olivier Alexandre Navaux. Towards seismic wave modeling on heterogeneous many-core architectures using task-based runtime system. In 27th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Florianopolis, Brazil, October 2015. [WWW] [PDF] Keyword(s): On Applications, StarPU, scheduling. [bibtex-key = MaMiDuAuThiAoNa15]
    @inproceedings{MaMiDuAuThiAoNa15,
    TITLE = {{Towards seismic wave modeling on heterogeneous many-core architectures using task-based runtime system}},
    AUTHOR = {Mart{\'i}nez, V{\'i}ctor and Mich{\'e}a, David and Dupros, Fabrice and Aumage, Olivier and Thibault, Samuel and Aochi, Hideo and Navaux, Philippe Olivier Alexandre},
    URL = {https://hal.inria.fr/hal-01182746},
    PDF = {https://hal.inria.fr/hal-01182746/file/sbac2015_soumission.pdf},
    BOOKTITLE = {{27th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD)}},
    ADDRESS = {Florianopolis, Brazil},
    YEAR = {2015},
    MONTH = Oct,
    HAL_ID = {hal-01182746},
    HAL_VERSION = {v1},
    KEYWORDS = {On Applications; StarPU; scheduling} 
    }
    


  5. Luka Stanisic, Emmanuel Agullo, Alfredo Buttari, Abdou Guermouche, Arnaud Legrand, Florent Lopez, and Brice Videau. Fast and Accurate Simulation of Multithreaded Sparse Linear Algebra Solvers. In The 21st IEEE International Conference on Parallel and Distributed Systems, Melbourne, Australia, December 2015. [WWW] [PDF] Keyword(s): On The Simulation Support through SimGrid, Sparse Linear Algebra, Mumps, Starpu-simgrid, HPC, Simgrid, Runtime. [bibtex-key = stanisic:hal-01180272]
    @inproceedings{stanisic:hal-01180272,
    TITLE = {{Fast and Accurate Simulation of Multithreaded Sparse Linear Algebra Solvers}},
    AUTHOR = {Stanisic, Luka and Agullo, Emmanuel and Buttari, Alfredo and Guermouche, Abdou and Legrand, Arnaud and Lopez, Florent and Videau, Brice},
    URL = {https://hal.inria.fr/hal-01180272},
    BOOKTITLE = {{The 21st IEEE International Conference on Parallel and Distributed Systems}},
    ADDRESS = {Melbourne, Australia},
    YEAR = {2015},
    MONTH = Dec,
    KEYWORDS = {On The Simulation Support through SimGrid; Sparse Linear Algebra ; Mumps ; Starpu-simgrid ; HPC ; Simgrid ; Runtime},
    PDF = {https://hal.inria.fr/hal-01180272/file/QRMSTARSG_article.pdf},
    HAL_ID = {hal-01180272},
    HAL_VERSION = {v2},
    
    }
    


2014

  1. Andra-Ecaterina Hugo. Composability of parallel codes on heterogeneous architectures. Theses, Université de Bordeaux, December 2014. [WWW] [PDF] Keyword(s): Runtime, Composability, Hypervisor, Support d'exécution, Composition, On Composability. [bibtex-key = hugo:tel-01162975]
    @phdthesis{hugo:tel-01162975,
    TITLE = {{Composability of parallel codes on heterogeneous architectures}},
    AUTHOR = {Hugo, Andra-Ecaterina},
    URL = {https://tel.archives-ouvertes.fr/tel-01162975},
    NUMBER = {2014BORD0373},
    SCHOOL = {{Universit{\'e} de Bordeaux}},
    YEAR = {2014},
    MONTH = Dec,
    KEYWORDS = {Runtime ; Composability ; Hypervisor ; Support d'ex{\'e}cution ; Composition ; On Composability},
    TYPE = {Theses},
    PDF = {https://tel.archives-ouvertes.fr/tel-01162975/file/HUGO_ANDRA_2014.pdf},
    HAL_ID = {tel-01162975},
    HAL_VERSION = {v1},
    
    }
    


  2. Emmanuel Agullo, Bérenger Bramas, Olivier Coulaud, Eric Darve, Matthias Messner, and Toru Takahashi. Task-Based FMM for Multicore Architectures. SIAM Journal on Scientific Computing, 36(1):66-93, 2014. [WWW] [PDF] [doi:10.1137/130915662] Keyword(s): On Applications, fast multipole method, multicore architectures, shared memory paradigm, runtime system, pipeline. [bibtex-key = agullo:hal-00911856]
    @article{agullo:hal-00911856,
    TITLE = {{Task-Based FMM for Multicore Architectures}},
    AUTHOR = {Agullo, Emmanuel and Bramas, B{\'e}renger and Coulaud, Olivier and Darve, Eric and Messner, Matthias and Takahashi, Toru},
    URL = {https://hal.inria.fr/hal-00911856},
    JOURNAL = {{SIAM Journal on Scientific Computing}},
    PUBLISHER = {{Society for Industrial and Applied Mathematics}},
    VOLUME = {36},
    NUMBER = {1},
    PAGES = {66-93},
    YEAR = {2014},
    DOI = {10.1137/130915662},
    KEYWORDS = {On Applications; fast multipole method ; multicore architectures ; shared memory paradigm ; runtime system ; pipeline},
    PDF = {https://hal.inria.fr/hal-00911856/file/sisc-cpu.pdf},
    HAL_ID = {hal-00911856},
    HAL_VERSION = {v1},
    
    }
    


  3. Emmanuel Agullo, Olivier Aumage, Mathieu Faverge, Nathalie Furmento, Florent Pruvost, Marc Sergent, and Samuel Thibault. Harnessing clusters of hybrid nodes with a sequential task-based programming model. In 8th International Workshop on Parallel Matrix Algorithms and Applications, July 2014. [WWW] [PDF] Keyword(s): On MPI Support. [bibtex-key = agullo:hal-01283949]
    @inproceedings{agullo:hal-01283949,
    TITLE = {{Harnessing clusters of hybrid nodes with a sequential task-based programming model}},
    AUTHOR = {Agullo, Emmanuel and Aumage, Olivier and Faverge, Mathieu and Furmento, Nathalie and Pruvost, Florent and Sergent, Marc and Thibault, Samuel},
    URL = {https://hal.inria.fr/hal-01283949},
    BOOKTITLE = {{8th International Workshop on Parallel Matrix Algorithms and Applications}},
    YEAR = {2014},
    MONTH = Jul,
    PDF = {https://hal.inria.fr/hal-01283949/file/pmaa14.pdf},
    HAL_ID = {hal-01283949},
    HAL_VERSION = {v1},
    keywords = {On MPI Support} 
    }
    


  4. Sylvain Henry, Alexandre Denis, Denis Barthou, Marie-Christine Counilh, and Raymond Namyst. Toward OpenCL Automatic Multi-Device Support. In Fernando Silva, Ines Dutra, and Vitor Santos Costa, editors, Euro-Par 2014, Porto, Portugal, August 2014. Springer. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = sylvain:hal-01005765]
    @inproceedings{sylvain:hal-01005765,
    hal_id = {hal-01005765},
    url = {http://hal.inria.fr/hal-01005765},
    title = {{Toward OpenCL Automatic Multi-Device Support}},
    author = {Henry, Sylvain and Denis, Alexandre and Barthou, Denis and Counilh, Marie-Christine and Namyst, Raymond},
    language = {Anglais},
    affiliation = {Exascale Computing Research Laboratory , Laboratoire Bordelais de Recherche en Informatique - LaBRI , RUNTIME - INRIA Bordeaux - Sud-Ouest},
    booktitle = {{Euro-Par 2014}},
    publisher = {Springer},
    address = {Porto, Portugal},
    editor = {Fernando Silva and Ines Dutra and Vitor Santos Costa },
    audience = {internationale },
    year = {2014},
    month = Aug,
    pdf = {http://hal.inria.fr/hal-01005765/PDF/final.pdf},
    KEYWORDS = {On Applications;StarPU} 
    }
    


  5. Xavier Lacoste, Mathieu Faverge, Pierre Ramet, Samuel Thibault, and George Bosilca. Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes. In HCW'2014 workshop of IPDPS, Phoenix, États-Unis, May 2014. IEEE. Note: RR-8446. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = lacoste:hal-00987094]
    @inproceedings{lacoste:hal-00987094,
    hal_id = {hal-00987094},
    url = {http://hal.inria.fr/hal-00987094},
    title = {{Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes}},
    author = {Lacoste, Xavier and Faverge, Mathieu and Ramet, Pierre and Thibault, Samuel and Bosilca, George},
    language = {Anglais},
    affiliation = {HiePACS - INRIA Bordeaux - Sud-Ouest , Laboratoire Bordelais de Recherche en Informatique - LaBRI , RUNTIME - INRIA Bordeaux - Sud-Ouest , Innovative Computing Laboratory - ICL},
    booktitle = {{HCW'2014 workshop of IPDPS}},
    publisher = {IEEE},
    address = {Phoenix, {\'E}tats-Unis},
    note = {RR-8446},
    audience = {internationale },
    year = {2014},
    month = May,
    pdf = {http://hal.inria.fr/hal-00987094/PDF/sparsegpus.pdf},
    KEYWORDS = {On Applications;StarPU} 
    }
    


  6. Marc Sergent and Simon Archipoff. Modulariser les ordonnanceurs de tâches : une approche structurelle. In Compas'2014, Neuchâtel, Suisse, April 2014. [WWW] [PDF] Keyword(s): On Scheduling, StarPU. [bibtex-key = sergent:hal-00978364]
    @inproceedings{sergent:hal-00978364,
    hal_id = {hal-00978364},
    url = {http://hal.inria.fr/hal-00978364},
    title = {{Modulariser les ordonnanceurs de t{\^a}ches : une approche structurelle}},
    author = {Sergent, Marc and Archipoff, Simon},
    language = {Fran{\c c}ais},
    affiliation = {RUNTIME - INRIA Bordeaux - Sud-Ouest},
    booktitle = {{Compas'2014}},
    address = {Neuch{\^a}tel, Suisse},
    audience = {nationale },
    year = {2014},
    month = Apr,
    pdf = {http://hal.inria.fr/hal-00978364/PDF/ordonnanceurs\_modulaires.pdf},
    KEYWORDS = {On Scheduling;StarPU} 
    }
    


  7. Luka Stanisic, Samuel Thibault, Arnaud Legrand, Brice Videau, and Jean-François Méhaut. Modeling and Simulation of a Dynamic Task-Based Runtime System for Heterogeneous Multi-Core Architectures. In Euro-par - 20th International Conference on Parallel Processing, Porto, Portugal, August 2014. Springer-Verlag. [WWW] [PDF] Keyword(s): On The Simulation Support through SimGrid, StarPU. [bibtex-key = stanisic:hal-01011633]
    @inproceedings{stanisic:hal-01011633,
    hal_id = {hal-01011633},
    url = {http://hal.inria.fr/hal-01011633},
    title = {{Modeling and Simulation of a Dynamic Task-Based Runtime System for Heterogeneous Multi-Core Architectures}},
    author = {Stanisic, Luka and Thibault, Samuel and Legrand, Arnaud and Videau, Brice and M{\'e}haut, Jean-Fran{\c c}ois},
    language = {Anglais},
    affiliation = {MESCAL - INRIA Grenoble Rh{\^o}ne-Alpes / LIG laboratoire d'Informatique de Grenoble , Laboratoire d'Informatique de Grenoble - LIG , Laboratoire Bordelais de Recherche en Informatique - LaBRI , RUNTIME - INRIA Bordeaux - Sud-Ouest},
    booktitle = {{Euro-par - 20th International Conference on Parallel Processing}},
    publisher = {Springer-Verlag},
    address = {Porto, Portugal},
    audience = {internationale },
    year = {2014},
    month = Aug,
    pdf = {http://hal.inria.fr/hal-01011633/PDF/StarPUSG\_article.pdf},
    KEYWORDS = {On The Simulation Support through SimGrid;StarPU} 
    }
    


  8. Philippe Virouleau, Pierrick Brunet, François Broquedis, Nathalie Furmento, Samuel Thibault, Olivier Aumage, and Thierry Gautier. Evaluation of OpenMP Dependent Tasks with the KASTORS Benchmark Suite. In 10th International Workshop on OpenMP, IWOMP2014, Salvador, Brazil, pages 16 - 29, September 2014. Springer. [WWW] [PDF] [doi:10.1007/978-3-319-11454-5_2] Keyword(s): On OpenMP Support on top of StarPU, OpenMP, task dependencies, benchmarks, runtime systems, KASTORS, StarPU. [bibtex-key = virouleau:hal-01081974]
    @inproceedings{virouleau:hal-01081974,
    TITLE = {{Evaluation of OpenMP Dependent Tasks with the KASTORS Benchmark Suite}},
    AUTHOR = {Virouleau, Philippe and Brunet, Pierrick and Broquedis, Fran{\c c}ois and Furmento, Nathalie and Thibault, Samuel and Aumage, Olivier and Gautier, Thierry},
    URL = {https://hal.inria.fr/hal-01081974},
    PDF = {https://hal.inria.fr/hal-01081974/document},
    BOOKTITLE = {{10th International Workshop on OpenMP, IWOMP2014}},
    ADDRESS = {Salvador, Brazil},
    PUBLISHER = {{Springer}},
    PAGES = {16 - 29},
    YEAR = {2014},
    MONTH = Sep,
    DOI = {10.1007/978-3-319-11454-5\_2},
    HAL_ID = {hal-01081974},
    HAL_VERSION = {v1},
    KEYWORDS = {On OpenMP Support on top of StarPU;OpenMP; task dependencies; benchmarks; runtime systems; KASTORS; StarPU} 
    }
    


  9. Emmanuel Agullo, Bérenger Bramas, Olivier Coulaud, Eric Darve, Matthias Messner, and Toru Takahashi. Task-based FMM for heterogeneous architectures. Research Report RR-8513, Inria Bordeaux - Sud-Ouest, April 2014. [WWW] [PDF] Keyword(s): On Applications, heterogeneous architectures, graphics processing unit, Fast multipole methods, pipeline, scheduling, runtime system. [bibtex-key = agullo:hal-00974674]
    @techreport{agullo:hal-00974674,
    TITLE = {{Task-based FMM for heterogeneous architectures}},
    AUTHOR = {Agullo, Emmanuel and Bramas, B{\'e}renger and Coulaud, Olivier and Darve, Eric and Messner, Matthias and Takahashi, Toru},
    URL = {https://hal.inria.fr/hal-00974674},
    TYPE = {Research Report},
    NUMBER = {RR-8513},
    PAGES = {29},
    INSTITUTION = {{Inria Bordeaux - Sud-Ouest}},
    YEAR = {2014},
    MONTH = Apr,
    KEYWORDS = {On Applications; heterogeneous architectures ; graphics processing unit ; Fast multipole methods ; pipeline ; scheduling ; runtime system},
    PDF = {https://hal.inria.fr/hal-00974674/file/RR-8513.pdf},
    HAL_ID = {hal-00974674},
    HAL_VERSION = {v1},
    
    }
    


  10. Cédric Augonnet, Olivier Aumage, Nathalie Furmento, Samuel Thibault, and Raymond Namyst. StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators. Rapport de recherche RR-8538, INRIA, May 2014. [WWW] [PDF] Keyword(s): On MPI Support, StarPU. [bibtex-key = augonnet:hal-00992208]
    @techreport{augonnet:hal-00992208,
    hal_id = {hal-00992208},
    url = {http://hal.inria.fr/hal-00992208},
    title = {{StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators}},
    author = {Augonnet, C{\'e}dric and Aumage, Olivier and Furmento, Nathalie and Thibault, Samuel and Namyst, Raymond},
    language = {Anglais},
    affiliation = {RUNTIME - INRIA Bordeaux - Sud-Ouest , Laboratoire Bordelais de Recherche en Informatique - LaBRI},
    type = {Rapport de recherche},
    institution = {INRIA},
    number = {RR-8538},
    year = {2014},
    month = May,
    pdf = {http://hal.inria.fr/hal-00992208/PDF/RR-8538.pdf},
    KEYWORDS = {On MPI Support;StarPU} 
    }
    


  11. Xavier Lacoste, Mathieu Faverge, Pierre Ramet, Samuel Thibault, and George Bosilca. Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes. Rapport de recherche RR-8446, INRIA, January 2014. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = lacoste:hal-00925017]
    @techreport{lacoste:hal-00925017,
    hal_id = {hal-00925017},
    url = {http://hal.inria.fr/hal-00925017},
    title = {{Taking advantage of hybrid systems for sparse direct solvers via task-based runtimes}},
    author = {Xavier Lacoste and Mathieu Faverge and Pierre Ramet and Samuel Thibault and George Bosilca},
    keywords = {On Applications; StarPU},
    language = {Anglais},
    affiliation = {HiePACS - INRIA Bordeaux - Sud-Ouest , Laboratoire Bordelais de Recherche en Informatique - LaBRI , RUNTIME - INRIA Bordeaux - Sud-Ouest , Innovative Computing Laboratory - ICL},
    pages = {25},
    type = {Rapport de recherche},
    institution = {INRIA},
    number = {RR-8446},
    year = {2014},
    month = Jan,
    pdf = {http://hal.inria.fr/hal-00925017/PDF/RR-8446.pdf} 
    }
    


  12. Emmanuel Agullo, Olivier Aumage, Mathieu Faverge, Nathalie Furmento, Florent Pruvost, Marc Sergent, and Samuel Thibault. Overview of Distributed Linear Algebra on Hybrid Nodes over the StarPU Runtime. SIAM Conference on Parallel Processing for Scientific Computing, February 2014. [WWW] [PDF] Keyword(s): On Applications. [bibtex-key = sergent:hal-00978602]
    @misc{sergent:hal-00978602,
    hal_id = {hal-00978602},
    url = {http://hal.inria.fr/hal-00978602},
    title = {{Overview of Distributed Linear Algebra on Hybrid Nodes over the StarPU Runtime}},
    author = {Agullo, Emmanuel and Aumage, Olivier and Faverge, Mathieu and Furmento, Nathalie and Pruvost, Florent and Sergent, Marc and Thibault, Samuel},
    language = {Anglais},
    affiliation = {RUNTIME - INRIA Bordeaux - Sud-Ouest , Laboratoire Bordelais de Recherche en Informatique - LaBRI , HiePACS - INRIA Bordeaux - Sud-Ouest},
    howpublished = {{SIAM Conference on Parallel Processing for Scientific Computing}},
    address = {Portland, Oregon, {\'E}tats-Unis},
    audience = {internationale },
    year = {2014},
    month = Feb,
    pdf = {http://hal.inria.fr/hal-00978602/PDF/siampp14.pdf},
    keywords = {On Applications} 
    }
    


2013

  1. Cyril Bordage. Ordonnancement dynamique, adapté aux architectures hétérogènes, de la méthode multipôle pour les équations de Maxwell, en électromagnétisme. PhD thesis, Université Bordeaux 1, 351 cours de la Libération --- 33405 TALENCE cedex, December 2013. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = Bor13Thesis]
    @PhDThesis{ Bor13Thesis,
    author = {Cyril Bordage},
    title = {{Ordonnancement dynamique, adapt{\'e} aux architectures h{\'e}t{\'e}rog{\`e}nes, de la m{\'e}thode multip{\^o}le pour les {\'e}quations de Maxwell, en {\'e}lectromagn{\'e}tisme}},
    school = {{Universit{\'e} Bordeaux 1}},
    address = {351 cours de la Lib{\'e}ration --- 33405 TALENCE cedex},
    URL = {https://tel.archives-ouvertes.fr/tel-00958494},
    PDF = {https://tel.archives-ouvertes.fr/tel-00958494/document},
    year = 2013,
    month = DEC,
    KEYWORDS = {On Applications;StarPU} 
    }
    


  2. Sylvain Henry. Modèles de programmation et supports exécutifs pour architectures hétérogènes. PhD thesis, Université Bordeaux 1, 351 cours de la Libération --- 33405 TALENCE cedex, November 2013. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = Hen13Thesis]
    @PhDThesis{Hen13Thesis,
    author = {Sylvain Henry},
    title = {Mod{\`e}les de programmation et supports ex{\'e}cutifs pour architectures h{\'e}t{\'e}rog{\`e}nes},
    school = {{Universit{\'e} Bordeaux 1}},
    address = {351 cours de la Lib{\'e}ration --- 33405 TALENCE cedex},
    year = 2013,
    month = NOV,
    URL = {http://tel.archives-ouvertes.fr/tel-00948309},
    PDF = {http://tel.archives-ouvertes.fr/tel-00948309/document},
    KEYWORDS = {On Applications;StarPU} 
    }
    


  3. Sylvain Henry. ViperVM: a Runtime System for Parallel Functional High-Performance Computing on Heterogeneous Architectures. In 2nd Workshop on Functional High-Performance Computing (FHPC'13), Boston, États-Unis, September 2013. [WWW] [PDF] Keyword(s): On Applications, Parallel Functional Programming, High-Performance Computing, Heterogeneous Architectures. [bibtex-key = hen13fhpc]
    @inproceedings{hen13fhpc,
    hal_id = {hal-00851122},
    url = {http://hal.inria.fr/hal-00851122},
    title = {{ViperVM: a Runtime System for Parallel Functional High-Performance Computing on Heterogeneous Architectures}},
    author = {Henry, Sylvain},
    keywords = {On Applications; Parallel Functional Programming; High-Performance Computing; Heterogeneous Architectures},
    language = {Anglais},
    affiliation = {Laboratoire Bordelais de Recherche en Informatique - LaBRI , RUNTIME - INRIA Bordeaux - Sud-Ouest},
    booktitle = {{2nd Workshop on Functional High-Performance Computing (FHPC'13)}},
    address = {Boston, {\'E}tats-Unis},
    audience = {internationale },
    year = {2013},
    month = Sep,
    pdf = {http://hal.inria.fr/hal-00851122/PDF/fhpc13.pdf},
    
    }
    


  4. Andra Hugo. Le problème de la composition parallèle : une approche supervisée. In 21èmes Rencontres Francophones du Parallélisme (RenPar'21), Grenoble, France, January 2013. [WWW] [PDF] Keyword(s): On Composability, Composition, Hypervisor, StarPU. [bibtex-key = AH13Renpar]
    @InProceedings{AH13Renpar,
    author = {Andra Hugo},
    title = {{Le probl{\`e}me de la composition parall{\`e}le : une approche supervis{\'e}e}},
    booktitle = {21{\`e}mes Rencontres Francophones du Parall{\'e}lisme (RenPar'21)},
    year = 2013,
    address = {Grenoble, France},
    month = JAN,
    url = {http://hal.inria.fr/hal-00773610},
    pdf = {http://hal.inria.fr/hal-00773610/document},
    KEYWORDS = {On Composability;Composition; Hypervisor; StarPU} 
    }
    


  5. Andra Hugo, Abdou Guermouche, Raymond Namyst, and Pierre-André Wacrenier. Composing multiple StarPU applications over heterogeneous machines: a supervised approach. In Third International Workshop on Accelerators and Hybrid Exascale Systems, Boston, USA, May 2013. [WWW] [PDF] Keyword(s): On Composability. [bibtex-key = hugo:hal-00824514]
    @inproceedings{hugo:hal-00824514,
    title = {{Composing multiple StarPU applications over heterogeneous machines: a supervised approach}},
    author = {Andra Hugo and Abdou Guermouche and Raymond Namyst and Pierre-Andr{\'e} Wacrenier},
    affiliation = {RUNTIME - INRIA Bordeaux - Sud-Ouest , Laboratoire Bordelais de Recherche en Informatique - LaBRI , HiePACS - INRIA Bordeaux - Sud-Ouest},
    booktitle = {{Third International Workshop on Accelerators and Hybrid Exascale Systems}},
    address = {Boston, USA},
    year = {2013},
    month = May,
    url = {http://hal.inria.fr/hal-00824514},
    pdf = {http://hal.inria.fr/hal-00824514/document},
    KEYWORDS = {On Composability} 
    }
    


  6. Tetsuya Odajima, Taisuke Boku, Mitsuhisa Sato, Toshihiro Hanawa, Yuetsu Kodama, Raymond Namyst, Samuel Thibault, and Olivier Aumage. Adaptive Task Size Control on High Level Programming for GPU/CPU Work Sharing. In The 2013 International Symposium on Advances of Distributed and Parallel Computing (ADPC 2013), Vietri sul Mare, Italie, December 2013. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = odajima:hal-00920915]
    @inproceedings{odajima:hal-00920915,
    hal_id = {hal-00920915},
    url = {http://hal.inria.fr/hal-00920915},
    title = {{Adaptive Task Size Control on High Level Programming for GPU/CPU Work Sharing}},
    author = {Tetsuya Odajima and Taisuke Boku and Mitsuhisa Sato and Toshihiro Hanawa and Yuetsu Kodama and Raymond Namyst and Samuel Thibault and Olivier Aumage},
    language = {Anglais},
    affiliation = {Graduate School of Systems and Information Engineering [Tsukuba] , Center for Computational Sciences [Tsukuba] - CCS , Graduate School for Systems and Information Engineering [Tsukuba] , Laboratoire Bordelais de Recherche en Informatique - LaBRI , RUNTIME - INRIA Bordeaux - Sud-Ouest},
    booktitle = {{The 2013 International Symposium on Advances of Distributed and Parallel Computing (ADPC 2013)}},
    address = {Vietri sul Mare, Italie},
    audience = {internationale },
    year = {2013},
    month = Dec,
    pdf = {http://hal.inria.fr/hal-00920915/PDF/ADPC2013-117.pdf},
    KEYWORDS = {On Applications;StarPU} 
    }
    


  7. Satoshi Ohshima, Satoshi Katagiri, Kengo Nakajima, Samuel Thibault, and Raymond Namyst. Implementation of FEM Application on GPU with StarPU. In SIAM CSE13 - SIAM Conference on Computational Science and Engineering 2013, Boston, États-Unis, February 2013. SIAM. [WWW] Keyword(s): On Applications, StarPU. [bibtex-key = ohshima:hal-00926144]
    @inproceedings{ohshima:hal-00926144,
    hal_id = {hal-00926144},
    url = {http://hal.inria.fr/hal-00926144},
    title = {{Implementation of FEM Application on GPU with StarPU}},
    author = {Satoshi Ohshima and Satoshi Katagiri and Kengo Nakajima and Samuel Thibault and Raymond Namyst},
    language = {Anglais},
    affiliation = {Computer Science Department - CST , Laboratoire Bordelais de Recherche en Informatique - LaBRI , RUNTIME - INRIA Bordeaux - Sud-Ouest},
    booktitle = {{SIAM CSE13 - SIAM Conference on Computational Science and Engineering 2013}},
    address = {Boston, {\'E}tats-Unis},
    organization = {SIAM},
    audience = {internationale },
    year = {2013},
    month = Feb,
    KEYWORDS = {On Applications;StarPU} 
    }
    


  8. Corentin Rossignon. Optimisation du produit matrice-vecteur creux sur architecture GPU pour un simulateur de réservoir. In 21èmes Rencontres Francophones du Parallélisme (RenPar'21), Grenoble, France, January 2013. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = Ros13Renpar]
    @InProceedings{Ros13Renpar,
    author = {Corentin Rossignon},
    title = {{O}ptimisation du produit matrice-vecteur creux sur architecture GPU pour un simulateur de r{\'e}servoir},
    booktitle = {21{\`e}mes Rencontres Francophones du Parall{\'e}lisme (RenPar'21)},
    year = 2013,
    address = {Grenoble, France},
    month = JAN,
    url = {http://hal.inria.fr/hal-00773571},
    pdf = {http://hal.inria.fr/hal-00773571/document},
    KEYWORDS = {On Applications;StarPU} 
    }
    


  9. Corentin Rossignon, Pascal Hénon, Olivier Aumage, and Samuel Thibault. A NUMA-aware fine grain parallelization framework for multi-core architecture. In PDSEC - 14th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing - 2013, Boston, États-Unis, May 2013. [WWW] [PDF] Keyword(s): On Applications, StarPU.
    Abstract:
    In this paper, we present some solutions to handle two problems commonly encountered when dealing with fine grain parallelization on multi-core architectures: expressing algorithms using a task grain size suitable for the hardware and minimizing the time penalty due to Non Uniform Memory Accesses. To evaluate the benefit of our work, we present some experiments on the fine grain parallelization of an iterative solver for sparse linear systems, with some comparisons with the Intel TBB approach.
    [bibtex-key = rossignon:hal-00858350]
    @inproceedings{rossignon:hal-00858350,
    hal_id = {hal-00858350},
    url = {http://hal.inria.fr/hal-00858350},
    title = {{A NUMA-aware fine grain parallelization framework for multi-core architecture}},
    author = {Corentin Rossignon and Pascal H{\'e}non and Olivier Aumage and Samuel Thibault},
    abstract = {{In this paper, we present some solutions to handle two problems commonly encountered when dealing with fine grain parallelization on multi-core architectures: expressing algorithms using a task grain size suitable for the hardware and minimizing the time penalty due to Non Uniform Memory Accesses. To evaluate the benefit of our work, we present some experiments on the fine grain parallelization of an iterative solver for sparse linear systems, with some comparisons with the Intel TBB approach.}},
    language = {Anglais},
    affiliation = {TOTAL-Scientific and Technical Center Jean F{\'e}ger - CSTJF , Laboratoire Bordelais de Recherche en Informatique - LaBRI , RUNTIME - INRIA Bordeaux - Sud-Ouest},
    booktitle = {{PDSEC - 14th IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing - 2013}},
    address = {Boston, {\'E}tats-Unis},
    audience = {internationale },
    year = {2013},
    month = May,
    pdf = {http://hal.inria.fr/hal-00858350/PDF/taggre\_pdsec\_2013.pdf},
    KEYWORDS = {On Applications;StarPU} 
    }
    


  10. Ludovic Courtès. C Language Extensions for Hybrid CPU/GPU Programming with StarPU. Research Report RR-8278, INRIA, April 2013. [WWW] [PDF] Keyword(s): On The C Extensions, parallel programming, GPU, scheduling, programming language support, StarPU. [bibtex-key = LC13Report]
    @techreport{LC13Report,
    hal_id = {hal-00807033},
    url = {http://hal.inria.fr/hal-00807033},
    title = {{C Language Extensions for Hybrid CPU/GPU Programming with StarPU}},
    author = {Court{\`e}s, Ludovic},
    pages = {25},
    type = {Research Report},
    institution = {INRIA},
    number = {RR-8278},
    year = {2013},
    month = Apr,
    pdf = {http://hal.inria.fr/hal-00807033/PDF/RR-8278.pdf},
    KEYWORDS = {On The C Extensions;parallel programming; GPU; scheduling; programming language support; StarPU} 
    }
    


2012

  1. Sylvain Henry, Alexandre Denis, and Denis Barthou. Programmation unifiée multi-accélérateur OpenCL. Technique et Science Informatiques, (8-9-10):1233-1249, 2012. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = HenDenBar2012TSI]
    @article{HenDenBar2012TSI,
    title = {{Programmation unifi{\'e}e multi-acc{\'e}l{\'e}rateur OpenCL}},
    author = {Sylvain Henry and Alexandre Denis and Denis Barthou},
    publisher = {Lavoisier},
    pages = {1233-1249},
    journal = {Technique et Science Informatiques},
    number = {8-9-10},
    year = {2012},
    url = {http://hal.inria.fr/hal-00772742},
    pdf = {http://hal.inria.fr/hal-00772742/document},
    KEYWORDS = {On Applications;StarPU} 
    }
    


  2. Sidi Ahmed Mahmoudi, Pierre Manneback, Cédric Augonnet, and Samuel Thibault. Traitements d'Images sur Architectures Parallèles et Hétérogènes. Technique et Science Informatiques, 2012. [WWW] Keyword(s): On Applications, StarPU. [bibtex-key = MahManAugThi12TSI]
    @Article{MahManAugThi12TSI,
    author = {Sidi Ahmed Mahmoudi and Pierre Manneback and C{\'e}dric Augonnet and Samuel Thibault},
    title = {Traitements d'Images sur Architectures Parall{\`e}les et H{\'e}t{\'e}rog{\`e}nes},
    journal = {{Technique et Science Informatiques}},
    publisher = {Lavoisier},
    year = 2012,
    url = {http://hal.inria.fr/hal-00714858/},
    KEYWORDS = {On Applications;StarPU} 
    }
    


  3. Cédric Augonnet, Olivier Aumage, Nathalie Furmento, Raymond Namyst, and Samuel Thibault. StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators. In Jesper Larsson Träff, Siegfried Benkner, and Jack Dongarra, editors, EuroMPI 2012, volume 7490 of LNCS, September 2012. Springer. Note: Poster Session. [WWW] [PDF] Keyword(s): On MPI Support, StarPU. [bibtex-key = AugAumFurNamThi2012EuroMPI]
    @InProceedings{AugAumFurNamThi2012EuroMPI,
    author = {C\'edric Augonnet and Olivier Aumage and Nathalie Furmento and Raymond Namyst and Samuel Thibault},
    title = {{StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators}},
    booktitle = {EuroMPI 2012},
    year = 2012,
    editor = {Jesper Larsson Tr{\"a}ff and Siegfried Benkner and Jack Dongarra},
    volume = {7490},
    series = {LNCS},
    month = SEP,
    note = {Poster Session},
    publisher = {Springer},
    url = {http://hal.inria.fr/hal-00725477},
    pdf = {http://hal.inria.fr/hal-00725477/document},
    KEYWORDS = {On MPI Support;StarPU} 
    }
    


  4. Siegfried Benkner, Enes Bajrovic, Erich Marth, Martin Sandrieser, Raymond Namyst, and Samuel Thibault. High-Level Support for Pipeline Parallelism on Many-Core Architectures. In Europar - International European Conference on Parallel and Distributed Computing - 2012, Rhodes Island, Grèce, August 2012. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = BenkBajMarSanNamThiEuroPar2012]
    @inproceedings{ BenkBajMarSanNamThiEuroPar2012,
    hal_id = {hal-00697020},
    url = {http://hal.inria.fr/hal-00697020},
    title = {{High-Level Support for Pipeline Parallelism on Many-Core Architectures}},
    author = {Benkner, Siegfried and Bajrovic, Enes and Marth, Erich and Sandrieser, Martin and Namyst, Raymond and Thibault, Samuel},
    booktitle = {{Europar - International European Conference on Parallel and Distributed Computing - 2012}},
    address = {Rhodes Island, Gr{\`e}ce},
    audience = {internationale },
    year = {2012},
    month = AUG,
    pdf = {http://hal.inria.fr/hal-00697020/PDF/europar2012-submitted.pdf},
    keywords = {On Applications; StarPU} 
    }
    


  5. Christoph Kessler, Usman Dastgeer, Samuel Thibault, Raymond Namyst, Andrew Richards, Uwe Dolinsky, Siegfried Benkner, Jesper Larsson Träff, and Sabri Pllana. Programmability and Performance Portability Aspects of Heterogeneous Multi-/Manycore Systems. In Design, Automation and Test in Europe (DATE), Dresden, Allemagne, March 2012. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = kessler:hal-00776610]
    @inproceedings{kessler:hal-00776610,
    hal_id = {hal-00776610},
    url = {http://hal.inria.fr/hal-00776610},
    title = {{Programmability and Performance Portability Aspects of Heterogeneous Multi-/Manycore Systems}},
    author = {Kessler, Christoph and Dastgeer, Usman and Thibault, Samuel and Namyst, Raymond and Richards, Andrew and Dolinsky, Uwe and Benkner, Siegfried and Tr{\"a}ff, Jesper Larsson and Pllana, Sabri},
    language = {Anglais},
    affiliation = {PELAB - PELAB , Laboratoire Bordelais de Recherche en Informatique - LaBRI , RUNTIME - INRIA Bordeaux - Sud-Ouest , Codeplay Software , University of Vienna , Technical University of Vienna - TU WIEN},
    booktitle = {{Design, Automation and Test in Europe (DATE)}},
    address = {Dresden, Allemagne},
    audience = {internationale },
    year = {2012},
    month = Mar,
    pdf = {http://hal.inria.fr/hal-00776610/PDF/date12-paper.pdf},
    keywords = {On Applications; StarPU} 
    }
    


2011

  1. Cédric Augonnet. Scheduling Tasks over Multicore machines enhanced with Accelerators: a Runtime System's Perspective. PhD thesis, Université Bordeaux 1, 351 cours de la Libération --- 33405 TALENCE cedex, December 2011. [WWW] [PDF] Keyword(s): General Presentations, StarPU. [bibtex-key = Aug11Thesis]
    @PhDThesis{ Aug11Thesis,
    author = {C\'edric Augonnet},
    title = {{Scheduling Tasks over Multicore machines enhanced with Accelerators: a Runtime System's Perspective}},
    school = {{Universit{\'e} Bordeaux 1}},
    address = {351 cours de la Lib{\'e}ration --- 33405 TALENCE cedex},
    year = 2011,
    month = DEC,
    url = {http://tel.archives-ouvertes.fr/tel-00777154},
    pdf = {http://tel.archives-ouvertes.fr/tel-00777154/document},
    KEYWORDS = {General Presentations;StarPU} 
    }
    


  2. Cédric Augonnet, Samuel Thibault, Raymond Namyst, and Pierre-André Wacrenier. StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures. Concurrency and Computation: Practice and Experience, Special Issue: Euro-Par 2009, 23:187-198, February 2011. [WWW] [PDF] [doi:10.1002/cpe.1631] Keyword(s): General Presentations, StarPU. [bibtex-key = AugThiNamWac11CCPE]
    @Article{ AugThiNamWac11CCPE,
    author = {C{\'e}dric Augonnet and Samuel Thibault and Raymond Namyst and Pierre-Andr{\'e} Wacrenier},
    title = {{StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures}},
    journal = {Concurrency and Computation: Practice and Experience, Special Issue: Euro-Par 2009},
    volume = 23,
    number = 2,
    pages = {187--198},
    year = 2011,
    month = FEB,
    publisher = {John Wiley \& Sons, Ltd.},
    doi = {10.1002/cpe.1631},
    url = {http://hal.inria.fr/inria-00550877},
    pdf = {http://hal.inria.fr/inria-00550877/document},
    KEYWORDS = {General Presentations;StarPU} 
    }
    


  3. Siegfried Benkner, Sabri Pllana, Jesper Larsson Träff, Philippas Tsigas, Uwe Dolinsky, Cédric Augonnet, Beverly Bachmayer, Christoph Kessler, David Moloney, and Vitaly Osipov. PEPPHER: Efficient and Productive Usage of Hybrid Computing Systems. IEEE Micro, 31(5):28-41, September 2011. ISSN: 0272-1732. [WWW] [PDF] [doi:10.1109/MM.2011.67] Keyword(s): On Applications, StarPU. [bibtex-key = BenPllTraTsiDolAugBacKesMolOsi11IEEEMicro]
    @article{ BenPllTraTsiDolAugBacKesMolOsi11IEEEMicro,
    author = {Siegfried Benkner and Sabri Pllana and Jesper Larsson Tr{\"a}ff and Philippas Tsigas and Uwe Dolinsky and C{\'e}dric Augonnet and Beverly Bachmayer and Christoph Kessler and David Moloney and Vitaly Osipov},
    title = {{PEPPHER: Efficient and Productive Usage of Hybrid Computing Systems}},
    journal ={IEEE Micro},
    volume = {31},
    number = {5},
    issn = {0272-1732},
    year = {2011},
    pages = {28--41},
    doi = {10.1109/MM.2011.67},
    publisher = {IEEE Computer Society},
    address = {Los Alamitos, CA, USA},
    month = SEP,
    url = {http://hal.inria.fr/hal-00648480},
    pdf = {http://hal.inria.fr/hal-00648480/document},
    KEYWORDS = {On Applications;StarPU} 
    }
    


  4. Emmanuel Agullo, Cédric Augonnet, Jack Dongarra, Mathieu Faverge, Julien Langou, Hatem Ltaief, and Stanimire Tomov. LU factorization for accelerator-based systems. In 9th ACS/IEEE International Conference on Computer Systems and Applications (AICCSA 11), Sharm El-Sheikh, Egypt, June 2011. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = AguAugDonFavLanLtaTomAICCSA11]
    @InProceedings{AguAugDonFavLanLtaTomAICCSA11,
    author = {Emmanuel Agullo and C{\'e}dric Augonnet and Jack Dongarra and Mathieu Faverge and Julien Langou and Hatem Ltaief and Stanimire Tomov},
    title = {{LU} factorization for accelerator-based systems},
    booktitle = {9th ACS/IEEE International Conference on Computer Systems and Applications (AICCSA 11)},
    year = 2011,
    month = JUN,
    address = {Sharm El-Sheikh, Egypt},
    url = {http://hal.inria.fr/hal-00654193},
    pdf = {http://hal.inria.fr/hal-00654193/document},
    keywords = {On Applications; StarPU} 
    }
    


  5. Emmanuel Agullo, Cédric Augonnet, Jack Dongarra, Mathieu Faverge, Hatem Ltaief, Samuel Thibault, and Stanimire Tomov. QR Factorization on a Multicore Node Enhanced with Multiple GPU Accelerators. In 25th IEEE International Parallel & Distributed Processing Symposium (IEEE IPDPS 2011), Anchorage, Alaska, USA, May 2011. [WWW] [PDF] [doi:10.1109/IPDPS.2011.90] Keyword(s): On Applications, StarPU. [bibtex-key = AguAugDonFavLtaThiTom11IPDPS]
    @InProceedings{AguAugDonFavLtaThiTom11IPDPS,
    HAL_ID = {inria-00547614},
    author={Emmanuel Agullo and C{\'{e}}dric Augonnet and Jack Dongarra and Mathieu Faverge and Hatem Ltaief and Samuel Thibault and Stanimire Tomov},
    title = {{QR Factorization on a Multicore Node Enhanced with Multiple GPU Accelerators}},
    booktitle = {{25th IEEE International Parallel \& Distributed Processing Symposium (IEEE IPDPS 2011)}},
    ADDRESS={Anchorage, Alaska, USA},
    language = {{A}nglais},
    audience = {internationale },
    DAY=16,
    MONTH=MAY,
    YEAR=2011,
    doi = {10.1109/IPDPS.2011.90},
    URL = {http://hal.inria.fr/inria-00547614},
    pdf = {http://hal.inria.fr/inria-00547614/document},
    KEYWORDS = {On Applications;StarPU} 
    }
    


  6. Usman Dastgeer, Christoph Kessler, and Samuel Thibault. Flexible runtime support for efficient skeleton programming on hybrid systems. In Proceedings of the International Conference on Parallel Computing (ParCo), Applications, Tools and Techniques on the Road to Exascale Computing, volume 22 of Advances in Parallel Computing, Gent, Belgium, pages 159-166, August 2011. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = DasKesThi11ParCo]
    @InProceedings{ DasKesThi11ParCo,
    author = {Usman Dastgeer and Christoph Kessler and Samuel Thibault},
    title = {Flexible runtime support for efficient skeleton programming on hybrid systems},
    booktitle = {Proceedings of the International Conference on Parallel Computing (ParCo), Applications, Tools and Techniques on the Road to Exascale Computing},
    year = 2011,
    month = AUG,
    address = {Gent, Belgium},
    series = {Advances in Parallel Computing},
    pages = {159--166},
    volume = {22},
    url = {http://hal.inria.fr/inria-00606200/},
    pdf = {http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.699.6216&rep=rep1&type=pdf},
    KEYWORDS = {On Applications;StarPU} 
    }
    


  7. Sylvain Henry. Programmation multi-accélérateurs unifiée en OpenCL. In 20èmes Rencontres Francophones du Parallélisme (RenPar'20), Saint Malo, France, May 2011. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = Hen11Renpar]
    @InProceedings{Hen11Renpar,
    author = {Sylvain Henry},
    title = {Programmation multi-acc{\'e}l{\'e}rateurs unifi{\'e}e en {OpenCL}},
    booktitle = {20{\`e}mes Rencontres Francophones du Parall{\'e}lisme (RenPar'20)},
    year = 2011,
    address = {Saint Malo, France},
    month = MAY,
    url = {http://hal.archives-ouvertes.fr/hal-00643257},
    pdf = {http://hal.archives-ouvertes.fr/hal-00643257/document},
    KEYWORDS = {On Applications;StarPU} 
    }
    


  8. Sidi Ahmed Mahmoudi, Pierre Manneback, Cédric Augonnet, and Samuel Thibault. Détection optimale des coins et contours dans des bases d'images volumineuses sur architectures multicoeurs hétérogènes. In 20èmes Rencontres Francophones du Parallélisme, Saint-Malo / France, May 2011. [WWW] Keyword(s): On Applications, StarPU. [bibtex-key = MahManAugThi11Renpar20]
    @InProceedings{MahManAugThi11Renpar20,
    author = {Sidi Ahmed Mahmoudi and Pierre Manneback and C{\'e}dric Augonnet and Samuel Thibault},
    title = {D{\'e}tection optimale des coins et contours dans des bases d'images volumineuses sur architectures multicoeurs h{\'e}t{\'e}rog{\`e}nes},
    booktitle = {20{\`e}mes Rencontres Francophones du Parall{\'e}lisme},
    year = 2011,
    month = MAY,
    address = {Saint-Malo / France},
    url = {http://hal.inria.fr/inria-00606195},
    KEYWORDS = {On Applications;StarPU} 
    }
    


  9. Andra Hugo. Composabilité de codes parallèles sur architectures hétérogènes. Mémoire de Master, Université Bordeaux 1, June 2011. [WWW] [PDF] Keyword(s): On Composability, StarPU. [bibtex-key = AH11Master]
    @MastersThesis{AH11Master,
    author = {Andra Hugo},
    title = {{Composabilit{\'e} de codes parall{\`e}les sur architectures h{\'e}t{\'e}rog{\`e}nes}},
    school = {Universit{\'e} Bordeaux 1},
    year = {2011},
    type = {M{\'e}moire de Master},
    month = JUN,
    url = {http://hal.inria.fr/inria-00619654/en/},
    pdf = {http://hal.inria.fr/inria-00619654/document},
    KEYWORDS = {On Composability;StarPU} 
    }
    


2010

  1. Emmanuel Agullo, Cédric Augonnet, Jack Dongarra, Hatem Ltaief, Raymond Namyst, Samuel Thibault, and Stanimire Tomov. A Hybridization Methodology for High-Performance Linear Algebra Software for GPUs. In Wen-mei W. Hwu, editor, GPU Computing Gems, volume 2. Morgan Kaufmann, September 2010. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = AguAugDonLtaNamThiTomGPUgems]
    @incollection{AguAugDonLtaNamThiTomGPUgems,
    HAL_ID = {inria-00547847},
    URL = {http://hal.inria.fr/inria-00547847/en/},
    title = {A {H}ybridization {M}ethodology for {H}igh-{P}erformance {L}inear {A}lgebra {S}oftware for {GPU}s},
    author = {{A}gullo, {E}mmanuel and {A}ugonnet, {C}{\'e}dric and {D}ongarra, {J}ack and {L}taief, {H}atem and {N}amyst, {R}aymond and {T}hibault, {S}amuel and {T}omov, {S}tanimire},
    language = {{A}nglais},
    booktitle = {{GPU} {C}omputing {G}ems},
    publisher = {{M}organ {K}aufmann},
    audience = {internationale },
    volume = {2},
    editor = {{W}en-mei {W}. {H}wu},
    month = SEP,
    year = {2010},
    keywords = {On Applications; StarPU},
    pdf = {http://hal.inria.fr/inria-00547847/document},
    
    }
    


  2. Emmanuel Agullo, Cédric Augonnet, Jack Dongarra, Hatem Ltaief, Raymond Namyst, Jean Roman, Samuel Thibault, and Stanimire Tomov. Dynamically scheduled Cholesky factorization on multicore architectures with GPU accelerators. In Symposium on Application Accelerators in High Performance Computing (SAAHPC), Knoxville, USA, July 2010. [WWW] [PDF] Keyword(s): On Applications, StarPU. [bibtex-key = AguAugDonLtaNamRomThiTom10SAAHPC]
    @inproceedings{AguAugDonLtaNamRomThiTom10SAAHPC,
    HAL_ID = {inria-00547616},
    URL = {http://hal.inria.fr/inria-00547616/en/},
    title = {{D}ynamically scheduled {C}holesky factorization on multicore architectures with {GPU} accelerators},
    author = {{A}gullo, {E}mmanuel and {A}ugonnet, {C}{\'e}dric and {D}ongarra, {J}ack and {L}taief, {H}atem and {N}amyst, {R}aymond and {R}oman, {J}ean and {T}hibault, {S}amuel and {T}omov, {S}tanimire},
    language = {{A}nglais},
    booktitle = {{S}ymposium on {A}pplication {A}ccelerators in {H}igh {P}erformance {C}omputing ({SAAHPC})},
    address = {{K}noxville, USA},
    audience = {internationale },
    month = JUL,
    year = {2010},
    pdf = {http://hal.inria.fr/inria-00547616/document},
    keywords = {On Applications; StarPU},
    
    }
    


  3. Cédric Augonnet, Jérôme Clet-Ortega, Samuel Thibault, and Raymond Namyst. Data-Aware Task Scheduling on Multi-Accelerator based Platforms. In The 16th International Conference on Parallel and Distributed Systems (ICPADS), Shanghai, China, December 2010. [WWW] [PDF] [doi:10.1109/ICPADS.2010.129] Keyword(s): On Data Transfer Management, StarPU. [bibtex-key = AugCleThiNam10ICPADS]
    @InProceedings{AugCleThiNam10ICPADS,
    author = {C{\'e}dric Augonnet and J{\'e}r{\^o}me Clet-Ortega and Samuel Thibault and Raymond Namyst},
    title = {{Data-Aware Task Scheduling on Multi-Accelerator based Platforms}},
    booktitle = {The 16th International Conference on Parallel and Distributed Systems (ICPADS)},
    year = {2010},
    address = {Shanghai, China},
    month = DEC,
    doi = {10.1109/ICPADS.2010.129},
    url = {http://hal.inria.fr/inria-00523937},
    pdf = {http://hal.inria.fr/inria-00523937/document},
    KEYWORDS = {On Data Transfer Management;StarPU} 
    }
    


  4. Cédric Augonnet, Samuel Thibault, and Raymond Namyst. StarPU: a Runtime System for Scheduling Tasks over Accelerator-Based Multicore Machines. Technical Report 7240, INRIA, March 2010. [WWW] [PDF] Keyword(s): General Presentations, StarPU. [bibtex-key = AugThiNamWac10RR7240]
    @TechReport{AugThiNamWac10RR7240,
    author = {C{\'e}dric Augonnet and Samuel Thibault and Raymond Namyst},
    title = {{StarPU: a Runtime System for Scheduling Tasks over Accelerator-Based Multicore Machines}},
    institution = {INRIA},
    year = 2010,
    type = {Technical Report},
    number = 7240,
    month = MAR,
    url = {http://hal.inria.fr/inria-00467677},
    pdf = {http://hal.inria.fr/inria-00467677/document},
    keywords = {General Presentations;StarPU} 
    }
    


2009

  1. Cédric Augonnet. StarPU: un support exécutif unifié pour les architectures multicoeurs hétérogènes. In 19èmes Rencontres Francophones du Parallélisme, Toulouse / France, September 2009. Note: Best Paper Award. [WWW] [PDF] Keyword(s): General Presentations, StarPU. [bibtex-key = Aug09Renpar19]
    @InProceedings{Aug09Renpar19,
    author = {C{\'e}dric Augonnet},
    title = {{StarPU: un support ex{\'e}cutif unifi{\'e} pour les architectures multic\oe{}urs h{\'e}t{\'e}rog{\`e}nes}},
    booktitle = {19{\`e}mes Rencontres Francophones du Parall{\'e}lisme},
    year = 2009,
    month = SEP,
    address = {Toulouse / France},
    note = {Best Paper Award},
    url = {http://hal.inria.fr/inria-00411581},
    pdf = {http://hal.inria.fr/inria-00411581/document},
    KEYWORDS = {General Presentations;StarPU} 
    }
    


  2. Cédric Augonnet, Samuel Thibault, and Raymond Namyst. Automatic Calibration of Performance Models on Heterogeneous Multicore Architectures. In Proceedings of the International Euro-Par Workshops 2009, HPPC'09, volume 6043 of Lecture Notes in Computer Science, Delft, The Netherlands, pages 56-65, August 2009. Springer. [WWW] [PDF] [doi:10.1007/978-3-642-14122-5_9] Keyword(s): On Performance Model Tuning, StarPU. [bibtex-key = AugThiNam09HPPC]
    @Inproceedings{AugThiNam09HPPC,
    author = {C{\'e}dric Augonnet and Samuel Thibault and Raymond Namyst},
    title = {{Automatic Calibration of Performance Models on Heterogeneous Multicore Architectures}},
    booktitle = {Proceedings of the International Euro-Par Workshops 2009, HPPC'09},
    address = {Delft, The Netherlands},
    month = AUG,
    year = 2009,
    publisher = {Springer},
    series = {Lecture Notes in Computer Science},
    volume = {6043},
    pages = {56--65},
    doi = {10.1007/978-3-642-14122-5_9},
    url = {http://hal.inria.fr/inria-00421333},
    pdf = {http://hal.inria.fr/inria-00421333/document},
    KEYWORDS = {On Performance Model Tuning;StarPU} 
    }
    


  3. Cédric Augonnet, Samuel Thibault, Raymond Namyst, and Maik Nijhuis. Exploiting the Cell/BE architecture with the StarPU unified runtime system. In SAMOS Workshop - International Workshop on Systems, Architectures, Modeling, and Simulation, volume 5657 of Lecture Notes in Computer Science, Samos, Greece, July 2009. [WWW] [PDF] [doi:10.1007/978-3-642-03138-0_36] Keyword(s): On The Cell Support, StarPU. [bibtex-key = AugThiNamNij09Samos]
    @InProceedings{AugThiNamNij09Samos,
    author = {C{\'e}dric Augonnet and Samuel Thibault and Raymond Namyst and Maik Nijhuis},
    title = {Exploiting the {Cell/BE} architecture with the {StarPU} unified runtime system},
    booktitle = {SAMOS Workshop - International Workshop on {S}ystems, {A}rchitectures, {M}odeling, and {S}imulation},
    year = {2009},
    month = JUL,
    volume = {5657},
    series = {Lecture Notes in Computer Science},
    address = {Samos, Greece},
    doi = {10.1007/978-3-642-03138-0_36},
    url = {http://hal.inria.fr/inria-00378705},
    pdf = {http://hal.inria.fr/inria-00378705/document},
    KEYWORDS = {On The Cell Support;StarPU} 
    }
    


  4. Cédric Augonnet, Samuel Thibault, Raymond Namyst, and Pierre-André Wacrenier. StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures. In Proceedings of the 15th International Euro-Par Conference, volume 5704 of Lecture Notes in Computer Science, Delft, The Netherlands, pages 863-874, August 2009. Springer. [WWW] [PDF] [doi:10.1007/978-3-642-03869-3_80] Keyword(s): General Presentations, StarPU. [bibtex-key = AugThiNamWac09Europar]
    @inproceedings{AugThiNamWac09Europar,
    author = {C{\'e}dric Augonnet and Samuel Thibault and Raymond Namyst and Pierre-Andr{\'e} Wacrenier},
    title = {{StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures}},
    booktitle = {Proceedings of the 15th International Euro-Par Conference},
    year = 2009,
    publisher = {Springer},
    series = {Lecture Notes in Computer Science},
    volume = 5704,
    pages = {863--874},
    address = {Delft, The Netherlands},
    month = AUG,
    doi = {10.1007/978-3-642-03869-3_80},
    url = {http://hal.inria.fr/inria-00384363},
    pdf = {http://hal.inria.fr/inria-00384363/document},
    KEYWORDS = {General Presentations;StarPU} 
    }
    


2008

  1. Cédric Augonnet and Raymond Namyst. A unified runtime system for heterogeneous multicore architectures. In Proceedings of the International Euro-Par Workshops 2008, HPPC'08, volume 5415 of Lecture Notes in Computer Science, Las Palmas de Gran Canaria, Spain, pages 174-183, August 2008. Springer. ISBN: 978-3-642-00954-9. [WWW] [PDF] [doi:10.1007/978-3-642-00955-6_22] Keyword(s): General Presentations, StarPU. [bibtex-key = AugNam08HPPC]
    @Inproceedings{AugNam08HPPC,
    author = {C{\'e}dric Augonnet and Raymond Namyst},
    title = {{A unified runtime system for heterogeneous multicore architectures}},
    booktitle = {Proceedings of the International Euro-Par Workshops 2008, HPPC'08},
    address = {Las Palmas de Gran Canaria, Spain},
    publisher = {Springer},
    series = {Lecture Notes in Computer Science},
    volume = 5415,
    pages = {174--183},
    doi = {10.1007/978-3-642-00955-6_22},
    isbn = {978-3-642-00954-9},
    month = AUG,
    year = 2008,
    url = {http://hal.inria.fr/inria-00326917},
    pdf = {http://hal.inria.fr/inria-00326917/document},
    KEYWORDS = {General Presentations;StarPU} 
    }
    


  2. Cédric Augonnet. Vers des supports d'exécution capables d'exploiter les machines multicoeurs hétérogènes. Mémoire de DEA, Université Bordeaux 1, June 2008. [WWW] [PDF] Keyword(s): General Presentations, StarPU. [bibtex-key = Aug08Master]
    @MastersThesis{Aug08Master,
    author = {C{\'e}dric Augonnet},
    title = {{Vers des supports d'ex{\'e}cution capables d'exploiter les machines multicoeurs h{\'e}t{\'e}rog{\`e}nes}},
    school = {Universit{\'e} Bordeaux 1},
    year = {2008},
    type = {M{\'e}moire de DEA},
    month = JUN,
    url = {http://hal.inria.fr/inria-00289361},
    pdf = {http://hal.inria.fr/inria-00289361/document},
    keywords = {General Presentations; StarPU} 
    }
    








Disclaimer:

This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author's copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.





Last modified: Sun Sep 9 17:51:28 2018
Author: samy.


This document was translated from BibTeX by bibtex2html