MXNet: A flexible and efficient machine learning library for heterogeneous distributed systems T Chen, M Li, Y Li, M Lin, N Wang, M Wang, T Xiao, B Xu, C Zhang, ... arXiv preprint arXiv:1512.01274, 2015 | 1667 | 2015 |
Empirical evaluation of rectified activations in convolutional network B Xu, N Wang, T Chen, M Li arXiv preprint arXiv:1505.00853, 2015 | 1555 | 2015 |
Scaling distributed machine learning with the parameter server M Li, DG Andersen, JW Park, AJ Smola, A Ahmed, V Josifovski, J Long, ... 11th USENIX Symposium on Operating Systems Design and Implementation …, 2014 | 1176 | 2014 |
Efficient mini-batch training for stochastic optimization M Li, T Zhang, Y Chen, AJ Smola Proceedings of the 20th ACM SIGKDD international conference on Knowledge …, 2014 | 526 | 2014 |
Communication Efficient Distributed Machine Learning with the Parameter Server. M Li, DG Andersen, AJ Smola, K Yu NIPS 2, 1.4-2.2, 2014 | 396 | 2014 |
Emotion classification based on gamma-band EEG M Li, BL Lu 2009 Annual International Conference of the IEEE Engineering in Medicine and …, 2009 | 335 | 2009 |
Bag of tricks for image classification with convolutional neural networks T He, Z Zhang, H Zhang, Z Zhang, J Xie, M Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 299 | 2019 |
Parameter Server for Distributed Machine Learning M Li, L Zhou, Z Yang, A Li, F Xia, DG Andersen, A Smola | 145 | 2013 |
Making large-scale Nyström approximation possible M Li, JTY Kwok, B Lü ICML 2010-Proceedings, 27th International Conference on Machine Learning, 631, 2010 | 126 | 2010 |
Dive into deep learning JM Czum Journal of the American College of Radiology: JACR 17 (5), 637-638, 2020 | 119* | 2020 |
ResNeSt: Split-attention networks H Zhang, C Wu, Z Zhang, Y Zhu, Z Zhang, H Lin, Y Sun, T He, J Mueller, ... arXiv preprint arXiv:2004.08955, 2020 | 106 | 2020 |
Large-scale Nyström kernel matrix approximation using randomized SVD M Li, W Bi, JT Kwok, BL Lu IEEE transactions on neural networks and learning systems 26 (1), 152-164, 2014 | 88 | 2014 |
Iterative row sampling M Li, GL Miller, R Peng 2013 IEEE 54th Annual Symposium on Foundations of Computer Science, 127-136, 2013 | 80 | 2013 |
Time and space efficient spectral clustering via column sampling M Li, XC Lian, JT Kwok, BL Lu CVPR 2011, 2297-2304, 2011 | 67 | 2011 |
DiFacto: Distributed factorization machines M Li, Z Liu, AJ Smola, YX Wang Proceedings of the Ninth ACM International Conference on Web Search and Data …, 2016 | 53 | 2016 |
Bag of freebies for training object detection neural networks Z Zhang, T He, H Zhang, Z Zhang, J Xie, M Li arXiv preprint arXiv:1902.04103, 2019 | 51 | 2019 |
Distributed delayed proximal gradient methods M Li, DG Andersen, A Smola NIPS Workshop on Optimization for Machine Learning 3, 3, 2013 | 51 | 2013 |
GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing. J Guo, H He, T He, L Lausen, M Li, H Lin, X Shi, C Wang, J Xie, S Zha, ... Journal of Machine Learning Research 21 (23), 1-7, 2020 | 48 | 2020 |
Revise saturated activation functions B Xu, R Huang, M Li arXiv preprint arXiv:1602.05980, 2016 | 48 | 2016 |
xgboost: Extreme Gradient Boosting (2017) T Chen, T He, M Benesty, V Khotilovich, Y Tang, H Cho, K Chen, ... R package version 0.6-4, 2015 | 48 | 2015 |