FCUDA: Enabling efficient compilation of CUDA kernels onto FPGAs A Papakonstantinou, K Gururaj, JA Stratton, D Chen, J Cong, WMW Hwu 2009 IEEE 7th Symposium on Application Specific Processors, 35-42, 2009 | 245 | 2009 |
Accelerator-rich architectures: Opportunities and progresses J Cong, MA Ghodrat, M Gill, B Grigorian, K Gururaj, G Reinman Proceedings of the 51st annual design automation conference, 1-6, 2014 | 125 | 2014 |
An energy-efficient adaptive hybrid cache J Cong, K Gururaj, H Huang, C Liu, G Reinman, Y Zou IEEE/ACM International Symposium on Low Power Electronics and Design, 67-72, 2011 | 72 | 2011 |
Assuring application-level correctness against soft errors J Cong, K Gururaj 2011 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 150-157, 2011 | 65 | 2011 |
Multilevel granularity parallelism synthesis on FPGAs A Papakonstantinou, Y Liang, JA Stratton, K Gururaj, D Chen, WMW Hwu, ... 2011 IEEE 19th Annual International Symposium on Field-Programmable Custom …, 2011 | 63 | 2011 |
Energy efficient multiprocessor task scheduling under input-dependent variation J Cong, K Gururaj 2009 Design, Automation & Test in Europe Conference & Exhibition, 411-416, 2009 | 60 | 2009 |
MC-Sim: An efficient simulation tool for MPSoC designs J Cong, K Gururaj, G Han, A Kaplan, M Naik, G Reinman 2008 IEEE/ACM International Conference on Computer-Aided Design, 364-371, 2008 | 46 | 2008 |
Evaluation of static analysis techniques for fixed-point precision optimization J Cong, K Gururaj, B Liu, C Liu, Z Zhang, S Zhou, Y Zou 2009 17th IEEE Symposium on Field Programmable Custom Computing Machines …, 2009 | 39 | 2009 |
Efficient compilation of CUDA kernels for high-performance computing on FPGAs A Papakonstantinou, K Gururaj, JA Stratton, D Chen, J Cong, WMW Hwu ACM Transactions on Embedded Computing Systems (TECS) 13 (2), 1-26, 2013 | 30 | 2013 |
Synthesis of reconfigurable high-performance multicore systems J Cong, K Gururaj, G Han Proceedings of the ACM/SIGDA international symposium on Field programmable …, 2009 | 25 | 2009 |
High-performance CUDA kernel execution on FPGAs A Papakonstantinou, K Gururaj, JA Stratton, D Chen, J Cong, WMW Hwu Proceedings of the 23rd international conference on Supercomputing, 515-516, 2009 | 19 | 2009 |
Domain-specific processor with 3d integration for medical image processing J Cong, K Guruaj, M Huang, S Li, B Xiao, Y Zou ASAP 2011-22nd IEEE International conference on application-specific systems …, 2011 | 18 | 2011 |
Accelerating monte carlo based SSTA using FPGA J Cong, K Gururaj, W Jiang, B Liu, K Minkovich, B Yuan, Y Zou Proceedings of the 18th annual ACM/SIGDA international symposium on Field …, 2010 | 18 | 2010 |
Scalable low-latency persistent neural machine translation on CPU server with multiple FPGAs E Nurvitadhi, A Boutros, P Budhkar, A Jafari, D Kwon, D Sheffield, ... 2019 International Conference on Field-Programmable Technology (ICFPT), 307-310, 2019 | 12 | 2019 |
Synthesis algorithm for application-specific homogeneous processor networks J Cong, K Gururaj, G Han, W Jiang IEEE transactions on very large scale integration (VLSI) systems 17 (9 …, 2009 | 10 | 2009 |
Controllability-driven power virus generation for digital circuits K Najeeb, K Gururaj, V Kamakoti, VM Vedula 20th International Conference on VLSI Design held jointly with 6th …, 2007 | 10 | 2007 |
Architecture support for custom instructions with memory operations J Cong, K Gururaj Proceedings of the ACM/SIGDA international symposium on Field programmable …, 2013 | 4 | 2013 |
Accelerate Genomics Research with the Broad-Intel Genomics Stack P Foley, A Prabhakaran, K Gururaj, M Naik, S Gopalan, A Shargorodskiy, ... | 2 | 2017 |
Communication bottleneck in hardware-software partitioning M Moazeni, A Vahdatpour, K Gururaj, M Sarrafzadeh Proceedings of the 16th international ACM/SIGDA symposium on Field …, 2008 | 2 | 2008 |
Controllability-driven peak dynamic power estimation for VLSI circuits K Najeeb, K Gururaj, V Kamakoti, VM Vedula Journal of Low Power Electronics 3 (3), 280-292, 2007 | 1 | 2007 |