|
[1] Yaqi Sun, Shijing Si, Jianzong Wang, Yuhan Dong, Zhi Bo Zhu, and Jing Xiao. A fair federated learning framework with reinforcement learning. In Proceedings of the International Joint Conference on Neural Networks (IJCNN), 2022.
[2] Nang Hung Nguyen, Phi-Le Nguyen, Duc Long Nguyen, Trung Thanh Nguyen, Thuy Dung Nguyen, Huy-Hieu Pham, and Truong Thao Nguyen. FedDRL: Deep reinforcement learning-based adaptive aggregation for non-IID data in federated learning. In Proceedings of the International Conference on Parallel Processing (ICPP), 2022.
[3] Dilith Jayakody. Deep Q-networks (DQN) – a quick introduction (with code). https://dilithjay.com/blog/deep-q-networks-dqn-a-quick-introduction-with-code/.
[4] Alex Krizhevsky. CIFAR-10 and CIFAR-100 datasets. https://www.cs.toronto.edu/~kriz/cifar.html.
[5] H. B. McMahan, Eider Moore, Daniel Ramage, Seth Hampson, and Blaise Agüera y Arcas. Communication-efficient learning of deep networks from decentralized data. In Proceedings of the International Conference on Artificial Intelligence and Statistics (AISTATS), 2016.
[6] Wei Yang Bryan Lim, Nguyen Cong Luong, Dinh Thai Hoang, Yutao Jiao, Ying-Chang Liang, Qiang Yang, Dusit Niyato, and Chunyan Miao. Federated learning in mobile edge networks: A comprehensive survey. IEEE Communications Surveys & Tutorials, 22(3):2031–2063, 2020.
[7] Robin C. Geyer, Tassilo Klein, and Moin Nabi. Differentially private federated learning: A client level perspective. https://arxiv.org/abs/1712.07557, 2018.
[8] Aaron Segal, Antonio Marcedone, Benjamin Kreuter, Daniel Ramage, H. Brendan McMahan, Karn Seth, K. A. Bonawitz, Sarvar Patel, and Vladimir Ivanov. Practical secure aggregation for privacy-preserving machine learning. In Proceedings of the 2017 ACM SIGSAC Conference on Computer and Communications Security, pages 1175–1191, New York, NY, USA, 2017. Association for Computing Machinery.
[9] Joost Verbraeken, Matthijs Wolting, Jonathan Katzy, Jeroen Kloppenburg, Tim Verbelen, and Jan S. Rellermeyer. A survey on distributed machine learning. ACM Computing Surveys (CSUR), 53(2), 2020.
[10] Hui-Rou Zhoug. Learning-based client selections considering client similarity in federated learning. Master’s thesis, Yuan Ze University, 2023.
[11] S. A. Gokte. Most popular distance metrics used in kNN and when to use them. KDnuggets, 2023.
[12] Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 2nd edition, 2018.
[13] Jens Kober, J. Andrew Bagnell, and Jan Peters. Reinforcement learning in robotics: A survey. The International Journal of Robotics Research, 32(11):1238–1274, 2013.
[14] J. Moody and M. Saffell. Learning to trade via direct reinforcement. IEEE Transactions on Neural Networks, 12(4):875–889, 2001.
[15] H. B. McMahan, Eider Moore, Daniel Ramage, and Blaise Agüera y Arcas. Federated learning of deep networks using model averaging. arXiv preprint arXiv:1602.05629, 2016.
[16] Jian Li, Tongbao Chen, and Shaohua Teng. A comprehensive survey on client selection strategies in federated learning. Computer Networks, 251:110663, 2024.
[17] Zhipeng Cheng, Xuwei Fan, Ning Chen, Minghui Liwang, Lianfen Huang, and Xianbin Wang. Learning-based client selection for multiple federated learning services with constrained monetary budgets. ICT Express, 9(6):1059–1064, 2023.
[18] Chiyuan Zhang, Samy Bengio, Moritz Hardt, Benjamin Recht, and Oriol Vinyals. Understanding deep learning requires rethinking generalization. https://arxiv.org/abs/1611.03530, 2017.
[19] Shai Shalev-Shwartz and Shai Ben-David. Understanding Machine Learning: From Theory to Algorithms. Cambridge University Press, Cambridge, UK, 2014.
[20] Yue Zhao, Meng Li, Liangzhen Lai, Naveen Suda, Damon Civin, and Vikas Chandra. Federated learning with non-IID data. https://arxiv.org/abs/1806.00582, 2018.
[21] Daniel J Beutel, Taner Topal, Akhil Mathur, Xinchi Qiu, Javier Fernandez-Marques, Yan Gao, Lorenzo Sani, Hei Li Kwing, Titouan Parcollet, Pedro PB de Gusmão, and Nicholas D Lane. Flower: A friendly federated learning research framework. arXiv preprint arXiv:2007.14390, 2020.
|