Fengguang Song
TitleCited byYear
Enabling and Scaling Matrix Computations on Heterogeneous Multi-Core and Multi-GPU Systems
F Song, S Tomov, J Dongarra
ICS 2012, 2012
118*2012
Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems
F Song, A YarKhan, J Dongarra
High Performance Computing, Networking, Storage and Analysis (SC), 2009 …, 2009
1152009
An algebra for cross-experiment performance analysis
F Song, F Wolf, N Bhatia, J Dongarra, S Moore
International Conference on Parallel Processing, 2004. ICPP 2004., 63-72, 2004
672004
A Scalable Framework for Heterogeneous GPU-Based Clusters
F Song, J Dongarra
SPAA 2012, 2012
572012
Scalable tile communication-avoiding QR factorization on multicore cluster systems
F Song, H Ltaief, B Hadri, J Dongarra
Proceedings of the 2010 ACM/IEEE International Conference for High …, 2010
372010
L2 cache modeling for scientific applications on chip multi-processors
F Song, S Moore, J Dongarra
2007 International Conference on Parallel Processing (ICPP 2007), 51-51, 2007
332007
Analytical modeling and optimization for affinity based thread scheduling on multicore systems
F Song, S Moore, J Dongarra
IEEE Cluster Computing, 2009., 1-10, 2009
31*2009
Feedback-directed thread scheduling with memory considerations
F Song, S Moore, J Dongarra
HPDC 2007, 97-106, 2007
262007
Experiments with strassen’s algorithm: from sequential to parallel
F Song, J Dongarra, S Moore
Parallel and Distributed Computing and Systems 2 (3), 2006
212006
Automatic experimental analysis of communication patterns in virtual topologies
N Bhatia, F Song, F Wolf, J Dongarra, B Mohr, S Moore
2005 International Conference on Parallel Processing (ICPP'05), 465-472, 2005
152005
Performance instrumentation and compiler optimizations for MPI/OpenMP applications
O Hernandez, F Song, B Chapman, J Dongarra, B Mohr, S Moore, F Wolf
International Workshop on OpenMP, 267-278, 2005
152005
Correcting Soft Errors Online in Fast Fourier Transform
X Liang, J Chen, D Tao, S Li, P Wu, H Li, K Ouyang, Y Liu, F Song, ...
SC'17, 2017
112017
A scalable approach to solving dense linear algebra problems on hybrid CPU‐GPU systems
F Song, J Dongarra
Concurrency and Computation: Practice and Experience 27 (14), 3702-3723, 2015
102015
KV-Cache: A Scalable High-Performance Web-Object Cache for Manycore
D Waddington, J Colmenares, J Kuang, F Song
The 6th ACM/IEEE International Conference on Utility and Cloud Computings …, 2013
102013
Automating the Large-Scale Collection and Analysis of Performance Data on Linux Clusters
P Mucci, J Dongarra, S Moore, F Song, F Wolf, R Kufrin
Proceedings of the 5th LCI International Conference on Linux Clusters: The …, 2004
102004
CUBE User Manual
F Song, F Wolf
University of Tennessee, Innovative Computing Laboratory, 2004
92004
Scaling Up Matrix Computations on Shared-Memory Manycore Systems with 1000 CPU Cores
F Song, J Dongarra
The 28th ACM International Conference on Supercomputing (ICS'14), 2014
82014
Implementing a High-Performance Recommendation System Using Phoenix++
C Cao, F Song, D Waddington
The 8th IEEE International Conference for Internet Technology and Secured …, 2013
82013
Static and dynamic scheduling for effective use of multicore systems
F Song
Ph.D. Dissertation, 2009
82009
LBM-IB: A parallel library to solve 3D fluid-structure interaction problems on manycore systems
P Nagar, F Song, L Zhu, L Lin
2015 44th International Conference on Parallel Processing, 51-60, 2015
62015
The system can't perform the operation now. Try again later.
Articles 1–20