臺灣博碩士論文加值系統 (National Digital Library of Theses and Dissertations in Taiwan)

詳目顯示 (Detailed Record)

Author: 阮毓欽 (Yu-Chin Juan)
Title (Chinese): 在共享記憶體系統的快速平行隨機梯度下降法矩陣分解
Title (English): A Fast Parallel Stochastic Gradient Method for Matrix Factorization in Shared Memory Systems
Advisor: 林智仁 (Chih-Jen Lin)
Committee members: 林軒田 (Hsuan-Tien Lin), 李育杰 (Yuh-Jye Lee)
Oral defense date: 2014-07-11
Degree: Master's
Institution: National Taiwan University
Department: Graduate Institute of Computer Science and Information Engineering
Discipline: Engineering
Field of study: Electrical and computer engineering
Thesis type: Academic thesis
Year of publication: 2014
Graduation academic year: 102 (2013–2014)
Language: English
Pages: 43
Keywords (Chinese): 推薦系統、矩陣分解、隨機梯度下降法、平行計算、共享記憶體演算法
Keywords (English): Recommender system, Matrix factorization, Stochastic gradient descent, Parallel computing, Shared memory algorithm
Record statistics:
  • Cited by: 0
  • Views: 461
  • Rating:
  • Downloads: 0
  • Bookmarked: 0
Abstract (Chinese, translated):
Matrix factorization is a highly effective technique for recommender systems, and stochastic gradient descent is an efficient algorithm for the matrix factorization problem. However, this algorithm is not easy to parallelize. In this thesis, we develop a new parallel algorithm, FPSG, for shared-memory systems. By resolving the load-imbalance and cache-miss problems, FPSG is more efficient than existing parallel algorithms.
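
Both abstracts refer to matrix factorization trained by stochastic gradient descent; for concreteness, a standard formulation of that problem and its per-rating SGD update follows. The notation and the two regularization coefficients are the usual convention for this line of work, not quoted from the thesis itself:

```latex
% The rating matrix R is approximated by P^T Q; SGD samples one observed
% rating (u, v) at a time from the regularized squared-error objective
\min_{P,\,Q}\ \sum_{(u,v)\in R}
  \Bigl( \bigl(r_{u,v} - \mathbf{p}_u^{\top}\mathbf{q}_v\bigr)^2
       + \lambda_P \lVert \mathbf{p}_u \rVert^2
       + \lambda_Q \lVert \mathbf{q}_v \rVert^2 \Bigr)
% and, with learning rate \gamma, updates only the two factor vectors involved:
e_{u,v} \gets r_{u,v} - \mathbf{p}_u^{\top}\mathbf{q}_v, \qquad
\mathbf{p}_u \gets \mathbf{p}_u + \gamma\,(e_{u,v}\,\mathbf{q}_v - \lambda_P\,\mathbf{p}_u), \qquad
\mathbf{q}_v \gets \mathbf{q}_v + \gamma\,(e_{u,v}\,\mathbf{p}_u - \lambda_Q\,\mathbf{q}_v)
```

Because each update touches only one row of P and one row of Q, two ratings sharing neither a user nor an item can be updated concurrently without conflict; this is the property every parallel SG method in the thesis exploits.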

Abstract (English):
Matrix factorization is known to be an effective method for recommender systems that are given only the ratings from users to items. Currently, the stochastic gradient (SG) method is one of the most popular algorithms for matrix factorization. However, as a sequential approach, SG is difficult to parallelize for handling web-scale problems. In this thesis, we develop a fast parallel SG method, FPSG, for shared memory systems. By dramatically reducing the cache-miss rate and carefully addressing the load balance of threads, FPSG is more efficient than state-of-the-art parallel algorithms for matrix factorization.
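
The scheduling idea behind these claims can be sketched. FPSG grids the rating matrix into blocks and lets a lock-free scheduler hand non-conflicting blocks to threads. The sketch below simplifies that to a static diagonal schedule (as in DSGD) over a fixed T×T grid, with synthetic data and hypothetical parameter values; it illustrates why block-level partitioning removes locks, rather than reproducing FPSG itself:

```cpp
// Block-parallel SGD for matrix factorization: a minimal sketch, not the
// thesis code. Ratings are pre-grouped into a T x T grid of blocks; in each
// round every thread updates one block of a "diagonal", so no two threads
// ever touch the same rows of P or Q and no locking is needed.
#include <cmath>
#include <cstdio>
#include <random>
#include <thread>
#include <vector>

struct Rating { int u, v; float r; };

int main() {
    const int T = 4;                      // grid dimension = number of threads
    const int users = 400, items = 400, k = 16;
    const float gamma = 0.01f, lambda = 0.05f;

    // Synthetic ratings, bucketed into blocks by user/item range.
    std::vector<std::vector<Rating>> block(T * T);   // block[bu * T + bv]
    std::mt19937 gen(0);
    std::uniform_real_distribution<float> rating(1.0f, 5.0f);
    for (int n = 0; n < 40000; ++n) {
        int u = static_cast<int>(gen() % users);
        int v = static_cast<int>(gen() % items);
        block[(u * T / users) * T + (v * T / items)].push_back({u, v, rating(gen)});
    }

    // Factor matrices P (users x k) and Q (items x k), small random init.
    std::vector<float> P(users * k), Q(items * k);
    std::uniform_real_distribution<float> init(-0.05f, 0.05f);
    for (auto& x : P) x = init(gen);
    for (auto& x : Q) x = init(gen);

    // Plain sequential SGD over the ratings of one block.
    auto sgd_block = [&](int b) {
        for (const Rating& x : block[b]) {
            float* p = &P[x.u * k];
            float* q = &Q[x.v * k];
            float e = x.r;
            for (int d = 0; d < k; ++d) e -= p[d] * q[d];
            for (int d = 0; d < k; ++d) {
                float pd = p[d];
                p[d] += gamma * (e * q[d] - lambda * pd);
                q[d] += gamma * (e * pd - lambda * q[d]);
            }
        }
    };

    // Diagonal schedule: in round s, thread t gets block (t, (t + s) mod T).
    // Blocks on one diagonal share no users and no items, so the threads
    // write disjoint rows of P and Q; no locks are required.
    for (int iter = 0; iter < 20; ++iter) {
        for (int s = 0; s < T; ++s) {
            std::vector<std::thread> pool;
            for (int t = 0; t < T; ++t)
                pool.emplace_back(sgd_block, t * T + (t + s) % T);
            for (auto& th : pool) th.join();  // barrier between diagonals
        }
    }

    // Training RMSE as a quick sanity check.
    double se = 0.0; std::size_t cnt = 0;
    for (const auto& blk : block)
        for (const Rating& x : blk) {
            float e = x.r;
            for (int d = 0; d < k; ++d) e -= P[x.u * k + d] * Q[x.v * k + d];
            se += e * e; ++cnt;
        }
    std::printf("training RMSE = %.4f\n", std::sqrt(se / cnt));
    return 0;
}
```

Compiles with `g++ -std=c++11 -pthread`. FPSG departs from this static schedule in the two ways the abstract alludes to: its scheduler dynamically hands a free, non-conflicting block to whichever thread finishes first, which fixes the load imbalance a rigid diagonal suffers when block densities differ, and (per the chapter titles) its "partial random method" randomizes which block is processed next while keeping an ordered sweep inside each block so consecutive updates touch nearby memory.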

TABLE OF CONTENTS
Oral Examination Committee Certification
Abstract (Chinese)
ABSTRACT
LIST OF FIGURES
LIST OF TABLES
CHAPTER
I. Introduction
II. Existing Parallelized Stochastic Gradient Descent Algorithms and Coordinate Descent Methods
    2.1 HogWild
    2.2 DSGD
    2.3 CCD++
III. Problems in Parallel SG Methods for Matrix Factorization
    3.1 Locking Problem
    3.2 Memory Discontinuity
IV. Our Approaches
    4.1 Lock-Free Scheduling
    4.2 Partial Random Method
    4.3 Overview of FPSG
    4.4 Implementation Issues
V. Experiments
    5.1 Settings
    5.2 Comparison of Methods on Training Time versus RMSE
        5.2.1 The effectiveness of addressing the locking problem
        5.2.2 The effectiveness of having better memory locality
        5.2.3 Comparison with the state-of-the-art methods
        5.2.4 Comparison with CCD++ for Non-negative Matrix Factorization
    5.3 Speedup of FPSG
VI. Discussion
    6.1 Data Locality and the Update Order
    6.2 Number of Blocks
VII. Conclusions and Future Works
BIBLIOGRAPHY
APPENDICES

BIBLIOGRAPHY
R. M. Bell and Y. Koren. Lessons from the Netflix prize challenge. ACM SIGKDD Explorations Newsletter, 9(2):75–79, 2007.
K.-W. Chang, C.-J. Hsieh, and C.-J. Lin. Coordinate descent method for large-scale L2-loss linear SVM. Journal of Machine Learning Research, 9:1369–1398, 2008. URL http://www.csie.ntu.edu.tw/~cjlin/papers/cdl2.pdf.
G. Dror, N. Koenigstein, Y. Koren, and M. Weimer. The Yahoo! music dataset and KDD-Cup 11. In JMLR Workshop and Conference Proceedings: Proceedings of KDD Cup 2011, volume 18, pages 3–18, 2012.
R. Gemulla, E. Nijkamp, P. J. Haas, and Y. Sismanis. Large-scale matrix factorization with distributed stochastic gradient descent. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 69–77, 2011.
K. B. Hall, S. Gilpin, and G. Mann. MapReduce/Bigtable for distributed optimization. In Neural Information Processing Systems Workshop on Learning on Cores, Clusters, and Clouds, 2010.
C.-J. Hsieh and I. S. Dhillon. Fast coordinate descent methods with variable selection for non-negative matrix factorization. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2011.
J. Kiefer and J. Wolfowitz. Stochastic estimation of the maximum of a regression function. The Annals of Mathematical Statistics, 23(3):462–466, 1952.
Y. Koren, R. M. Bell, and C. Volinsky. Matrix factorization techniques for recommender systems. Computer, 42(8):30–37, 2009.
A. Kyrola, G. Blelloch, and C. Guestrin. GraphChi: Large-scale graph computation on just a PC. In Proceedings of the 10th USENIX Symposium on Operating Systems Design and Implementation (OSDI '12), Hollywood, October 2012.
G. Mann, R. McDonald, M. Mohri, N. Silberman, and D. Walker. Efficient large-scale distributed training of conditional maximum entropy models. In Y. Bengio, D. Schuurmans, J. Lafferty, C. K. I. Williams, and A. Culotta, editors, Advances in Neural Information Processing Systems 22, pages 1231–1239, 2009.
R. McDonald, K. Hall, and G. Mann. Distributed training strategies for the structured perceptron. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics (ACL), pages 456–464, 2010.
F. Niu, B. Recht, C. Ré, and S. J. Wright. HOGWILD!: A lock-free approach to parallelizing stochastic gradient descent. In J. Shawe-Taylor, R. Zemel, P. Bartlett, F. Pereira, and K. Weinberger, editors, Advances in Neural Information Processing Systems 24, pages 693–701, 2011.
I. Pilászy, D. Zibriczky, and D. Tikk. Fast ALS-based matrix factorization for explicit and implicit feedback datasets. In Proceedings of the Fourth ACM Conference on Recommender Systems, pages 71–78, 2010.
H. Robbins and S. Monro. A stochastic approximation method. The Annals of Mathematical Statistics, 22(3):400–407, 1951.
H.-F. Yu, C.-J. Hsieh, S. Si, and I. S. Dhillon. Scalable coordinate descent approaches to parallel matrix factorization for recommender systems. In Proceedings of the IEEE International Conference on Data Mining, pages 765–774, 2012.
H. Yun, H.-F. Yu, C.-J. Hsieh, S. Vishwanathan, and I. S. Dhillon. NOMAD: Non-locking, stochastic multi-machine algorithm for asynchronous and decentralized matrix completion. In International Conference on Very Large Data Bases (VLDB), 2014.
Y. Zhou, D. Wilkinson, R. Schreiber, and R. Pan. Large-scale parallel collaborative filtering for the Netflix prize. In Proceedings of the Fourth International Conference on Algorithmic Aspects in Information and Management, pages 337–348, 2008.
Y. Zhuang, W.-S. Chin, Y.-C. Juan, and C.-J. Lin. A fast parallel SGD for matrix factorization in shared memory systems. In Proceedings of the ACM Conference on Recommender Systems, 2013. URL http://www.csie.ntu.edu.tw/~cjlin/papers/libmf.pdf.
M. Zinkevich, M. Weimer, A. Smola, and L. Li. Parallelized stochastic gradient descent. In J. Lafferty, C. K. I. Williams, J. Shawe-Taylor, R. Zemel, and A. Culotta, editors, Advances in Neural Information Processing Systems 23, pages 2595–2603, 2010.
