An asymmetric distributed shared memory model for heterogeneous parallel systems I Gelado, JE Stone, J Cabezas, S Patel, N Navarro, WW Hwu Proceedings of the fifteenth International Conference on Architectural …, 2010 | 269 | 2010 |
Enabling preemptive multiprogramming on GPUs I Tanasic, I Gelado, J Cabezas, A Ramirez, N Navarro, M Valero ACM SIGARCH Computer Architecture News 42 (3), 193-204, 2014 | 266 | 2014 |
Supercomputing with commodity CPUs: Are mobile SoCs ready for HPC? N Rajovic, PM Carpenter, I Gelado, N Puzovic, A Ramirez, M Valero Proceedings of the International Conference on High Performance Computing …, 2013 | 214 | 2013 |
Predictive runtime code scheduling for heterogeneous architectures VJ Jiménez, L Vilanova, I Gelado, M Gil, G Fursin, N Navarro International Conference on High-Performance Embedded Architectures and …, 2009 | 176 | 2009 |
Accelerating reduction and scan using tensor core units A Dakkak, C Li, J Xiong, I Gelado, W Hwu Proceedings of the ACM International Conference on Supercomputing, 46-57, 2019 | 106 | 2019 |
Implicitly parallel programming models for thousand-core microprocessors W Hwu, S Ryoo, SZ Ueng, JH Kelm, I Gelado, SS Stone, RE Kidd, ... Proceedings of the 44th annual Design Automation Conference, 754-759, 2007 | 98 | 2007 |
Assessing accelerator-based HPC reverse time migration M Araya-Polo, J Cabezas, M Hanzich, M Pericas, F Rubio, I Gelado, ... IEEE Transactions on Parallel and Distributed Systems 22 (1), 147-162, 2010 | 96 | 2010 |
Energy efficient hpc on embedded socs: Optimization techniques for mali gpu I Grasso, P Radojkovic, N Rajovic, I Gelado, A Ramirez 2014 IEEE 28th International parallel and distributed processing symposium …, 2014 | 70 | 2014 |
Efficient performance evaluation of memory hierarchy for highly multithreaded graphics processors SS Baghsorkhi, I Gelado, M Delahaye, WW Hwu ACM SIGPLAN Notices 47 (8), 23-34, 2012 | 69 | 2012 |
Experiences with mobile processors for energy efficient HPC N Rajovic, A Rico, J Vipond, I Gelado, N Puzovic, A Ramirez 2013 Design, Automation & Test in Europe Conference & Exhibition (DATE), 464-468, 2013 | 64 | 2013 |
Throughput-oriented GPU memory allocation I Gelado, M Garland Proceedings of the 24th symposium on principles and practice of parallel …, 2019 | 55 | 2019 |
CUBA: an architecture for efficient cpu/co-processor data communication I Gelado, JH Kelm, S Ryoo, SS Lumetta, N Navarro, WW Hwu Proceedings of the 22nd annual international conference on Supercomputing …, 2008 | 49 | 2008 |
Automatic parallelization of kernels in shared-memory multi-gpu nodes J Cabezas, L Vilanova, I Gelado, TB Jablin, N Navarro, WW Hwu Proceedings of the 29th ACM on International Conference on Supercomputing, 3-13, 2015 | 34 | 2015 |
Comparison based sorting for systems with multiple GPUs I Tanasic, L Vilanova, M Jordà, J Cabezas, I Gelado, N Navarro, W Hwu Proceedings of the 6th Workshop on General Purpose Processor Using Graphics …, 2013 | 26 | 2013 |
High-performance reverse time migration on GPU J Cabezas, M Araya-Polo, I Gelado, N Navarro, E Morancho, JM Cela 2009 International Conference of the Chilean Computer Science Society, 77-86, 2009 | 24 | 2009 |
GPU-initiated on-demand high-throughput storage access in the BaM system architecture Z Qureshi, VS Mailthody, I Gelado, S Min, A Masood, J Park, J Xiong, ... Proceedings of the 28th ACM International Conference on Architectural …, 2023 | 22 | 2023 |
Efficient exception handling support for GPUs I Tanasic, I Gelado, M Jorda, E Ayguade, N Navarro Proceedings of the 50th Annual IEEE/ACM International Symposium on …, 2017 | 15 | 2017 |
CIGAR: Application partitioning for a CPU/coprocessor architecture JH Kelm, I Gelado, MJ Murphy, N Navarro, S Lumetta, W Hwu 16th International Conference on Parallel Architecture and Compilation …, 2007 | 15 | 2007 |
Runtime and architecture support for efficient data exchange in multi-accelerator applications J Cabezas, I Gelado, JE Stone, N Navarro, DB Kirk, W Hwu IEEE Transactions on Parallel and Distributed Systems 26 (5), 1405-1418, 2014 | 14 | 2014 |
Parallelizing general histogram application for cuda architectures U Milic, I Gelado, N Puzovic, A Ramirez, M Tomasevic 2013 International Conference on Embedded Computer Systems: Architectures …, 2013 | 14 | 2013 |