National Digital Library of Theses and Dissertations in Taiwan (臺灣博碩士論文加值系統)

Author: Hsing-Kai Wang (王星凱)

Title: An Efficient Distributed Association Rules Mining System (有效率的分散式關聯規則探勘系統)

Advisor: Jau-Shien Chang (張昭憲)

Degree: Master's

Institution: Tamkang University (淡江大學)

Department: Department of Information Management

Discipline: Computer Science

Sub-discipline: General Computer Science

Thesis type: Academic thesis

Year of publication: 2003

Language: Chinese

Keywords: Association Rules Mining, Distributed System, Data Mining, Database

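Association rule mining extracts concise rules of the form "A -> B" (customers who buy A also buy B) from a transaction database. With this technique, an enterprise can characterize its customers' purchasing habits and develop suitable marketing strategies. As transaction databases keep growing, however, the use of multiple computers for distributed mining to speed up the process has attracted wide attention from researchers.
This study develops EDAMS (an Efficient Distributed Association rules Mining System) for association rule mining over large transaction databases. Because the performance bottleneck of distributed mining usually lies in integrating the mining results across nodes, we abandon the traditional point-to-point data exchange [3][7][9] and turn a designated node into a data server that only integrates and distributes data and does no mining. This reduces the number of messages from O(n^2) to O(n), where n is the number of nodes. In addition, we adopt DHP [2] as the base algorithm and exploit its strong pruning of candidate 2-itemsets to further reduce the total volume of data transmitted. To validate the system, we used eight computers to mine between 100,000 and 700,000 transactions. The experimental data show that the overall speedup ratio rises steadily as the number of transactions grows. Moreover, for the same number of transactions and the same support threshold, the speedup ratio of EDAMS exceeds that of earlier work [7][9], which confirms that sacrificing one node as a data server to reduce the message count is feasible.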
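The message-count argument can be illustrated with a small calculation. The sketch below is not taken from the thesis: it assumes that in a point-to-point scheme every one of n nodes sends its local results to every other node once per exchange round, and that in the data-server scheme one of the n machines is the server while each remaining node uploads once and receives the merged result once. The function names are hypothetical.

```python
def point_to_point_messages(n: int) -> int:
    """Messages per exchange round when each of n mining nodes sends its
    local results directly to every other node (assumed all-to-all scheme)."""
    return n * (n - 1)


def data_server_messages(n: int) -> int:
    """Messages per exchange round when one of the n machines is a data
    server: each of the n - 1 mining nodes uploads once and then receives
    the merged result once (assumed scheme, not the thesis's exact protocol)."""
    return 2 * (n - 1)


# Compare the two schemes for a few cluster sizes.
for n in (2, 4, 8, 16):
    print(n, point_to_point_messages(n), data_server_messages(n))
```

Under these assumptions, eight machines need 56 messages per round with all-to-all exchange and 14 with a data server, which matches the quadratic versus linear growth described in the abstract. The trade-off is that the server machine contributes no mining work.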

Association rule mining helps enterprises capture consumer behavior and develop effective marketing strategies. However, transaction databases grow every day, and obtaining timely mining results has become a serious problem. In this thesis, we propose an Efficient Distributed Association rules Mining System, EDAMS, to cope with this problem. Unlike other distributed mining systems, EDAMS uses a dedicated node as a data server that collects the data exchanged among nodes. Point-to-point broadcasts are thus avoided, and the number of messages exchanged is reduced from O(n^2) to O(n). In addition, to reduce the total message volume, the DHP algorithm [2] serves as the base algorithm because it reduces the number of candidate 2-itemsets. In our experiments, EDAMS achieves a steadily increasing speedup ratio as the data grows from 100,000 to 700,000 transactions. Its speedup ratio is also superior to that of previous work [7][9], which demonstrates the effectiveness of the system.
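As a rough illustration of why a hash-based algorithm such as DHP [2] shrinks the set of candidate 2-itemsets, the sketch below counts single items and hashes every 2-item subset of each transaction into a small bucket table during the first pass. A pair of frequent items is kept as a candidate only if its bucket count also reaches the minimum support. The hash function, bucket count, item encoding (integers), and sample transactions are illustrative assumptions, not details from the thesis, and the sketch is single-machine only.

```python
from collections import Counter
from itertools import combinations


def bucket_of(pair, num_buckets):
    """Illustrative hash for a sorted pair of integer item ids (an assumption)."""
    a, b = pair
    return (a * 10 + b) % num_buckets


def dhp_candidate_pairs(transactions, min_support, num_buckets=7):
    """Return (pairs of frequent items, pairs that also pass the bucket filter)."""
    item_counts = Counter()
    bucket_counts = [0] * num_buckets
    # First pass: count single items and hash every 2-subset into a bucket.
    for transaction in transactions:
        items = sorted(set(transaction))
        item_counts.update(items)
        for pair in combinations(items, 2):
            bucket_counts[bucket_of(pair, num_buckets)] += 1
    frequent_items = sorted(i for i, c in item_counts.items() if c >= min_support)
    # Without the hash filter, every pair of frequent items is a candidate.
    unfiltered = list(combinations(frequent_items, 2))
    # With the filter, a pair survives only if its bucket is heavy enough.
    filtered = [p for p in unfiltered
                if bucket_counts[bucket_of(p, num_buckets)] >= min_support]
    return unfiltered, filtered


transactions = [[1, 3, 4], [2, 3, 5], [1, 2, 3, 5], [2, 5]]
unfiltered, filtered = dhp_candidate_pairs(transactions, min_support=2)
print(len(unfiltered), "candidate pairs without the hash filter")
print(len(filtered), "candidate pairs with the hash filter:", filtered)
```

A bucket count is an upper bound on the support of every pair that hashes into it, so the filter never discards a truly frequent pair. Fewer candidates mean fewer counts to exchange between nodes, which is the reduction in transmitted data that the abstract attributes to DHP.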

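Chapter 1 Introduction
Section 1 Research Background and Motivation
Section 2 Organization of the Thesis
Chapter 2 Review and Analysis of Related Work
Section 1 Single-Machine Data Mining Algorithms
Section 2 Distributed Data Mining Algorithms
Section 3 Evaluation of the Algorithms
Chapter 3 The EDAMS System
Section 1 System Modules
Section 2 System Operation Model
Section 3 Number of Messages and Data Volume Between Nodes
Section 4 The EDAMS Algorithm
Chapter 4 Experimental Results
Section 1 Building the Distributed Experimental Environment
Section 2 Experimental Data
Chapter 5 Conclusion
References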

[1] R. Agrawal and R. Srikant, "Fast Algorithms for Mining Association Rules," Proceedings of the 20th International Conference on Very Large Data Bases, 1994.
[2] Jong Soo Park, Ming-Syan Chen, and Philip S. Yu, "An Effective Hash-Based Algorithm for Mining Association Rules," Proceedings of the 1995 ACM SIGMOD International Conference on Management of Data, May 1995, pp. 175-186.
[3] D. W. Cheung, V. T. Ng, A. W. Fu, and Yongjian Fu, "Efficient Mining of Association Rules in Distributed Databases," IEEE Transactions on Knowledge and Data Engineering, Vol. 8, No. 6, Dec. 1996.
[4] G. Adomavicius and A. Tuzhilin, "Using Data Mining Methods to Build Customer Profiles," IEEE Computer, Vol. 34, No. 2, Feb. 2001, pp. 74-82.
[5] C. C. Aggarwal and P. S. Yu, "A New Approach to Online Generation of Association Rules," IEEE Transactions on Knowledge and Data Engineering, Vol. 13, No. 4, Jul./Aug. 2001, pp. 527-540.
[6] M. J. Zaki, "Parallel and Distributed Association Mining: A Survey," IEEE Concurrency, Vol. 7, No. 4, Oct.-Dec. 1999, pp. 14-25.
[7] R. Agrawal and J. C. Shafer, "Parallel Mining of Association Rules: Design, Implementation, and Experience," IBM Research Report RJ1004, 1996.
[8] Amitabha Das, Wee-Keong Ng, and Yew-Kwong Woon, "Rapid Association Rule Mining," Proceedings of the Tenth International Conference on Information and Knowledge Management, Oct. 2001, pp. 474-481.
[9] D. W. Cheung, J. Han, V. T. Ng, A. W. Fu, and Y. Fu, "A Fast Distributed Algorithm for Mining Association Rules," Proceedings of IEEE 4th International Conference on Parallel and Distributed Information Systems, Dec. 1996, pp. 31-42.
[10] Z. Chen, Data Mining and Uncertain Reasoning, John Wiley & Sons, Inc., 2001.
[11] R. Srikant and R. Agrawal, "Mining Quantitative Association Rules in Large Relational Tables," Proceedings of the 1996 ACM SIGMOD International Conference on Management of Data, 1996, pp. 1-12.
[12] S. Mitra et al., "Data Mining in Soft Computing Framework: A Survey," IEEE Transactions on Neural Networks, Vol. 13, No. 1, Jan. 2002, pp. 3-14.
[13] S. Pal et al., "Web Mining in Soft Computing Framework: Relevance, State of the Art and Future Directions," IEEE Transactions on Neural Networks, Vol. 13, No. 5, Sep. 2002, pp. 1163-1177.
[14] D. W. Cheung, S. D. Lee, and Y. Xiao, "Effect of Data Skewness and Workload Balance in Parallel Data Mining," IEEE Transactions on Knowledge and Data Engineering, Vol. 14, No. 3, May/June 2002.
[15] http://www.remoteanything.com
