Yet another accelerated SGD: ResNet-50 training on ImageNet in 74.7 seconds M Yamazaki, A Kasagi, A Tabuchi, T Honda, M Miwa, N Fukumoto, ... arXiv preprint arXiv:1903.12650, 2019 | 113 | 2019 |
Understanding storage traffic characteristics on enterprise virtual desktop infrastructure C Lee, T Kumano, T Matsuki, H Endo, N Fukumoto, M Sugawara Proceedings of the 10th ACM International Systems and Storage Conference, 1-11, 2017 | 112 | 2017 |
MLPerf™ HPC: A holistic benchmark suite for scientific machine learning on HPC systems S Farrell, M Emani, J Balma, L Drescher, A Drozd, A Fink, G Fox, D Kanter, ... 2021 IEEE/ACM Workshop on Machine Learning in High Performance Computing …, 2021 | 24 | 2021 |
Optimizing power-performance trade-off for parallel applications through dynamic core and frequency scaling S Imamura, H Sasaki, N Fukumoto, K Inoue, K Murakami Proceedings of the RESoLVE 12, 2012 | 12 | 2012 |
mpiQulacs: A distributed quantum computer simulator for A64FX-based cluster systems S Imamura, M Yamazaki, T Honda, A Kasagi, A Tabuchi, H Nakao, ... arXiv preprint arXiv:2203.16044, 2022 | 8 | 2022 |
Analyzing the impact of data prefetching on Chip MultiProcessors N Fukumoto, T Mihara, K Inoue, K Murakami 2008 13th Asia-Pacific Computer Systems Architecture Conference, 1-8, 2008 | 8 | 2008 |
3D implemented SRAM/DRAM hybrid cache architecture for high-performance and low power consumption K Inoue, S Hashiguchi, S Ueno, N Fukumoto, K Murakami 2011 IEEE 54th International Midwest Symposium on Circuits and Systems …, 2011 | 7 | 2011 |
A study of methods for improving the efficiency of performance data collection and transfer in server systems C Iiyama, A Hirai, M Yamaoka, N Fukumoto, M Oguchi Proceedings of the Multimedia, Distributed, Cooperative, and Mobile Symposium 2022, 1696-1701, 2022 | 3 | 2022 |
The 16,384-node parallelism of 3D-CNN training on an Arm CPU based supercomputer A Tabuchi, K Shirahata, M Yamazaki, A Kasagi, T Honda, K Kurihara, ... 2021 IEEE 28th International Conference on High Performance Computing, Data …, 2021 | 3 | 2021 |
Towards straggler-tolerant and accuracy-aware distributed DNN training in clouds S Okuno, M Miwa, N Fukumoto 2021 IEEE/ACM 21st International Symposium on Cluster, Cloud and Internet …, 2021 | 3 | 2021 |
A traffic-aware memory-cube network using bypassing Y Shikama, R Kawano, H Matsutani, H Amano, Y Nagasaka, N Fukumoto, ... Microprocessors and Microsystems 90, 104471, 2022 | 2 | 2022 |
Preliminary performance analysis of distributed DNN training with relaxed synchronization K Shirahata, A Haderbache, N Fukumoto, K Nakashima IEICE Transactions on Electronics 104 (6), 257-260, 2021 | 2 | 2021 |
A proposal for a runtime operation-mode decision method for SRAM/DRAM hybrid caches S Hashiguchi, N Fukumoto, K Inoue, K Murakami Research Reports on Computer Architecture (ARC) 2011 (9), 1-6, 2011 | 2 | 2011 |
Performance balancing: software-based on-chip memory management for effective CMP executions N Fukumoto, K Imazato, K Inoue, K Murakami Proceedings of the 10th workshop on MEmory performance: DEaling with …, 2009 | 2 | 2009 |
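Compute/memory performance balancing for multicore processors based on adaptive helper-thread execution K Imazato, N Fukumoto, K Inoue, K Murakami Research Reports on System Software and Operating Systems (OS) 2009 (16), 1-8, 2009 | 2 | 2009 |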
A Binary Translator to Accelerate Development of Deep Learning Processing Library for AArch64 CPU K Kawakami, K Kurihara, M Yamazaki, T Honda, N Fukumoto IEICE Transactions on Electronics 105 (6), 222-231, 2022 | 1 | 2022 |
Performance analysis of multi-containerized MD simulations for low-level resource allocation S Okuno, A Hirai, N Fukumoto 2022 IEEE International Parallel and Distributed Processing Symposium …, 2022 | 1 | 2022 |
Efficient collision-free MTTKRP algorithm for multi-core CPUs with less memory usage Y Nagasaka, N Fukumoto 2022 22nd IEEE International Symposium on Cluster, Cloud and Internet …, 2022 | 1 | 2022 |
An examination of improvement proposals for more efficient performance data collection and transfer in server systems C Iiyama, A Hirai, M Yamaoka, N Fukumoto, M Oguchi Proceedings of the 84th National Convention 2022 (1), 127-128, 2022 | 1 | 2022 |
Low-latency low-energy memory-cube networks using dual-voltage datapaths Y Shikama, R Kawano, H Matsutani, H Amano, Y Nagasaka, N Fukumoto, ... 2021 29th Euromicro International Conference on Parallel, Distributed and …, 2021 | 1 | 2021 |