Juan Gómez Luna
Juan Gómez Luna
Verified email at ethz.ch
Title
Cited by
Cited by
Year
Processing data where it makes sense: Enabling in-memory computation
O Mutlu, S Ghose, J Gómez-Luna, R Ausavarungnirun
Microprocessors and Microsystems 67, 28-41, 2019
842019
Chai: Collaborative heterogeneous applications for integrated-architectures
J Gómez-Luna, I El Hajj, LW Chang, V Garcıa-Flores, SG de Gonzalo, ...
ISPASS, 2017
782017
Mqsim: A framework for enabling realistic studies of modern multi-queue {SSD} devices
A Tavakkol, J Gómez-Luna, M Sadrosadati, S Ghose, O Mutlu
16th {USENIX} Conference on File and Storage Technologies ({FAST} 18), 49-66, 2018
752018
An optimized approach to histogram computation on GPU
J Gómez-Luna, JM González-Linares, JI Benavides, N Guil
Machine Vision and Applications 24 (5), 899-908, 2013
492013
Processing-in-memory: A workload-driven perspective
S Ghose, A Boroumand, JS Kim, J Gómez-Luna, O Mutlu
IBM Journal of Research and Development 63 (6), 3: 1-3: 19, 2019
482019
FLIN: Enabling fairness and enhancing performance in modern NVMe solid state drives
A Tavakkol, M Sadrosadati, S Ghose, J Kim, Y Luo, Y Wang, NM Ghiasi, ...
2018 ACM/IEEE 45th Annual International Symposium on Computer Architecture …, 2018
482018
Napel: Near-memory computing application performance prediction via ensemble learning
G Singh, J Gómez-Luna, G Mariani, GF Oliveira, S Corda, S Stuijk, ...
2019 56th ACM/IEEE Design Automation Conference (DAC), 1-6, 2019
422019
Smash: Co-designing software compression and hardware-accelerated indexing for efficient sparse matrix operations
K Kanellopoulos, N Vijaykumar, C Giannoula, R Azizi, S Koppula, ...
Proceedings of the 52nd Annual IEEE/ACM International Symposium on …, 2019
372019
Evaluating the effect of last-level cache sharing on integrated GPU-CPU systems with heterogeneous applications
V Garcıa, J Gomez-Luna, T Grass, A Rico, E Ayguade, AJ Pena
2016 IEEE International Symposium on Workload Characterization (IISWC), 1-10, 2016
332016
Performance modeling of atomic additions on GPU scratchpad memory
J Gómez-Luna, J González-Linares, J Benavides Benítez, N Guil
IEEE Transactions on Parallel and Distributed Systems 24 (11), 2273-2282, 2013
332013
A modern primer on processing in memory
O Mutlu, S Ghose, J Gómez-Luna, R Ausavarungnirun
arXiv preprint arXiv:2012.03112, 2020
322020
Performance models for asynchronous data transfers on consumer Graphics Processing Units
J Gómez-Luna, JM González-Linares, JI Benavides, N Guil
Journal of Parallel and Distributed Computing 72 (9), 1117-1126, 2012
322012
KLAP: Kernel launch aggregation and promotion for optimizing dynamic parallelism
I El Hajj, J Gómez-Luna, C Li, LW Chang, D Milojicic, W Hwu
2016 49th Annual IEEE/ACM International Symposium on Microarchitecture …, 2016
312016
Genasm: A high-performance, low-power approximate string matching acceleration framework for genome sequence analysis
DS Cali, GS Kalsi, Z Bingöl, C Firtina, L Subramanian, JS Kim, ...
2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020
302020
Enabling practical processing in and near memory for data-intensive computing
O Mutlu, S Ghose, J Gómez-Luna, R Ausavarungnirun
Proceedings of the 56th Annual Design Automation Conference 2019, 1-4, 2019
282019
In-place transposition of rectangular matrices on accelerators
IJ Sung, J Gómez-Luna, JM González-Linares, N Guil, WMW Hwu
ACM SIGPLAN Notices 49 (8), 207-218, 2014
262014
FIGARO: Improving system performance via fine-grained In-DRAM data relocation and caching
Y Wang, L Orosa, X Peng, Y Guo, S Ghose, M Patel, JS Kim, JG Luna, ...
2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture …, 2020
252020
FPGA implementation of the generalized Hough transform
SR Geninatti, JIB Benítez, MH Calviño, NG Mata, JG Luna
2009 International Conference on Reconfigurable Computing and FPGAs, 172-177, 2009
252009
Natsa: A near-data processing accelerator for time series analysis
I Fernandez, R Quislant, E Gutiérrez, O Plata, C Giannoula, M Alser, ...
2020 IEEE 38th International Conference on Computer Design (ICCD), 120-129, 2020
232020
Parallelization of a video segmentation algorithm on CUDA–enabled graphics processing units
J Gómez-Luna, JM González-Linares, JI Benavides, N Guil
European Conference on Parallel Processing, 924-935, 2009
222009
The system can't perform the operation now. Try again later.
Articles 1–20