跳到主要內容

臺灣博碩士論文加值系統

(216.73.216.81) 您好!臺灣時間:2025/10/04 03:40
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

: 
twitterline
研究生:何承道
研究生(外文):Cheng-Tao Ho
論文名稱:頻繁同構圖形探勘策略之研究
論文名稱(外文):HybirdGMiner:The Mining Strategy on Frequent Isomorphism Graph Structure
指導教授:張嘉惠張嘉惠引用關係
指導教授(外文):Chia-Hui Chang
學位類別:碩士
校院名稱:國立中央大學
系所名稱:資訊工程研究所
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2006
畢業學年度:94
語文別:中文
論文頁數:38
中文關鍵詞:圖形探勘型樣探勘圖形同構
外文關鍵詞:pattern mininggraph isomorphismgraph structuresgraph mining
相關次數:
  • 被引用被引用:1
  • 點閱點閱:338
  • 評分評分:
  • 下載下載:15
  • 收藏至我的研究室書目清單書目收藏:0
由於在頻繁項目集合(Frequent Itemsets)和序列型樣(Sequential Patterns)的探勘技術日趨成熟,很自然的,我們會想再進一步探討另一種包涵更廣泛資料關聯性的型樣探勘(Pattern Mining)- 圖形探勘(Graph Mining)。圖形探勘的應用非常廣泛,較著名的應用領域像是化學(Chemistry)、生物學(Biology)和電腦網路方面(Computer Network),以及其它所有可以對應成圖形型樣(Graph Pattern)的實際資料,在這些領域都會需要圖形型樣的探勘技術來支援其資料的分析與預測。圖形探勘的主要挑戰在於如何解決子/圖形同構(Subgraph/ Graph Isomorphism)問題,在本篇論文中我們提出一個結合圖形標準型態(Canonical Form)和資料內嵌結構的演算法,針對圖形資料庫(Graph Databases)進行高效率探勘。其主要概念為利用圖形標準型態解決重覆列舉問題,以及有技巧的記錄圖形型樣在資料庫中的位置(Embedding List),完全避免子圖形同構的檢查問題。實驗顯示我們所提出的演算法無論在合成資料與實際資料中,探勘效率都會勝過gSpan。
As the mining of frequent itemsets and sequential patterns became more mature, it is very natural that we would want to explore other patterns such as graph structures. Graph mining has very wide applications, such as chemistry, biology and computer networks. The main challenge in graph mining is how to solve the graph/ subgraph isomorphism problems. Thus, we propose an algorithm that combined previous pattern mining skills and some graph mining techniques to mine all frequent subgraph patterns efficiently. Our algorithm adopts canonical form to avoid the duplicate enumeration, and used an effective embedding list structure to avert the subgraph isomorphism checking completely. Our empirical study on synthetic and real datasets demonstrates that HybridGMiner achieves a substantial performance gain over the algorithm gSpan.
第一章 緒論 1
1.1. 研究動機與目的 1
1.2. 論文架構 2
第二章 問題定義 3
第三章 相關研究 6
3.1. 以廣度優先搜尋(BFS)之演算法 6
3.1.1. AGM演算法 7
3.1.2. FSG演算法 7
3.2. 以深度優先搜尋(DFS)之演算法 9
3.2.1. gSpan演算法 9
3.2.2. MoFa演算法 10
3.2.3. FFSM演算法 11
3.2.4. Gaston演算法 12
3.3. 演算法之整體比較表 13
第四章 HybridGMiner演算法 15
4.1. 圖形探勘的挑戰 15
4.2. HybridGMiner演算法架構 15
4.2.1. 型樣列舉方法(Enumeration) 16
4.2.2. 搜尋空間刪減技術(Pruning) 18
4.2.3. 圖形型樣成長機制(Extension) 22
4.3. 虛擬碼(Pseudo Code) 25
第五章 實驗結果 28
5.1. 合成資料集(Synthetic Data) 28
5.1.1. 資料產生器 28
5.1.2. 實驗結果與分析 29
5.2. 實際資料(Real World Data) 32
第六章 結論 35
參考文獻 36
[1]R. Agrawal and R. Srikant. Fast algorithms for mining association rules. In Proc. 1994 International Conference. Very Large Data Bases (VLDB’94), pages 487-499, Santiago, Chile, Sept.1994.
[2]C. Borgelt. On Canonical Forms for Frequent Graph Mining. Workshop on Mining Graphs, Trees, and Sequences (MGTS'05 at PKDD'05, Porto, Portugal), 1-12. ECML/PKDD'05 Organization Committee, Porto, Portugal 2005.
[3]C. Borgelt, M.R. Berthold. Mining Molecular Fragments: Finding Relevant Substructures of Molecules. In Proceedings of the International Conference on Data Mining (ICDM), pages 51-58, 2002.
[4]L. Dehaspe, H. Toivonen, and R.D. King. Finding frequent substructures in chemical compounds. Proc. of the 4th International Conference on Knowledge Discovery and Data Mining, pages 30-36. AAAI Press. August 1998.
[5]A. Deutsch, M. F. Fernandez, D. Suciu. Storing semistructured data with STORED. International Conference on Management of Data Proceedings of the 1999 ACM SIGMOD international conference on Management of data, Pages: 431 – 442, 1999.
[6]R. Goldman, J. Widom. Dataguides: Enabling query formulation and optimization in semistructured databases. VLDB'97, Proceedings of 23rd International Conference on Very Large Data Bases, pages: 436-445, 1997.
[7]L. B. Holder, D. J. Cook, and S. Djoko. Substructure discovery in the subdue system. In Proceedings of the AAAI Workshop on Knowledge Discovery in Databases, pages 169-180, 1994.
[8]K. Y. Huang, C. H. Chang and K. Z. Lin, PROWL: An efficient frequent continuity mining algorithm on event sequences. In Proc. of 6th International Conference on Data Warehousing and Knowledge Discovery (DaWak), 2004.
[9]J. Huan, W. Wang, J. Prins. "Efficient Mining of Frequent Subgraphs in the Presence of Isomorphism", in Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM’03), 2003.
[10]J. Huan, W. Wang, J. Prins, J. Yang. SPIN: Mining Maximal Frequent Subgrsphs from Graph Databases. In Proceedings of the tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD’04), pages 581-586, 2004.
[11]A. Inokuchi, T. Washio, H. Motoda. An Apriori-based Algorithm for Mining Frequent Substructures from Graph Data. the 4th European Conference on Principles and Practice of Knowledge Discovery in Data Mining (PKDD2000), pp.13-23, 2000.
[12]M. Kuramochi, G. Karypis. Frequent Subgraph Discovery. Proceedings of the 2001 IEEE International Conference on Data Mining (ICDM’02), pages 721-724, 2002.
[13]B. D. McKay. Practical graph isomorphism. 10th. Manitoba Conference on Numerical Mathematics and Computing (Winnipeg, 1980); Congressus Numerantium, 30 (1981) 45-87.
[14]Alípio M. Jorge, Luís Torgo, Pavel B. Brazdil, Rui Camacho, João Gama. A Quantitative Comparison of the Subgraph Miners MoFa, gSpan, FFSM, and Gaston. the 9th European Conference on Principles and Practice of Knowledge Discovery in Data Mining (PKDD2005), pages 392-403, 2005.

[15]S. Nijssen, J.N. Kok. Frequent Graph Mining and its Application to Molecular Databases. In Proceedings of the IEEE International Conference on Systems, Man and Cybernetics, SMC 2004, Den Haag, Netherlands, October 10-13, 2004. IEEE Press, 2004.
[16]J. Pei, J. Han, B. Mortazavi-Asl, H. Pinto, Q. Chen, U. Dayal and M-C. Hsu. PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-Projected Pattern Growth. In. Proc. 2001 International Conference Data Engineering (ICDE'01), pages 215-224, Heidelberg, Germany, April 2001.
[17]K. Shearer, H. bunks, S. Venkatesh. Video Indexing and Similarity Retrieval by Largest Common Subgraph Detection using Decision Trees. Pattern Recognition 34 (2001) 1075—1091.
[18]X. Yan, J. Han. gSpan: gSpan: Graph-based Substructure Pattern Mining. In Proc. 2002 International Conference Data Engineering (ICDM’02), pages 721, 2002.
[19]X. Yan, J. Han. gSpan: gSpan: Graph-based Substructure Pattern Mining Technical Report UIUCDCS-R-2002-2296, Department of Computer Science, University of Illinois at Urbana-Champaign, 2002.
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top
1. 張翠娥(1999):幼兒園融合教育方案實施探究。樹德科技學報,1,15-33。
2. 孫瑋英(2000):談重度/多重障礙者在融合教育上的教學調整。國小特殊教育,30,88-92。
3. 胡致芬(1997):重度障礙者之統合教育。特殊教育季刊,62,6-21。
4. 林麗容(1999):融合教育的實施方式及其相關的配合措施。國教輔導,38(3),19-22。
5. 何東墀(2001):融合教育理念的流變與困境。特教園丁,16(4),56-60。
6. 吳淑美(1995):完全包含(full inclusion)模式可行嗎?。特教新知通訊,3(3),1-2。
7. 吳昆壽(1998):融合教育的省思。特教新知通訊,5(7),1-2。
8. 吳武典、張正芬、盧台華、蔡崇建(1991):殘障學生對「無障礙的校園環境」之需求評估研究。特殊教育研究學刊,7,23-41。
9. 邱上真(2001):普通班教師對特殊需求學生之因應措施、所面對之困境以及所需之支持系統。特殊教育研究學刊,21,1-26。
10. 毛連塭、許素彬(1995):當前特殊教育的兩個重要理念。特教新知通訊,2(3),1-2。
11. 鄒啟蓉(1998):台北市啟智幼兒班回歸主流實施現況與相關問題研究。特殊教育研究學刊,16,151-169。
12. 蔡明富(1998):融合教育及其對班級經營的啟示。特殊教育與復健學報,6,349-380。
13. 蔡明富 (1998):談融合教育下教師與家長所面臨之問題及其啟示。教師之友,39(2),62-69。
14. 蔡昆瀛(2000):談學校融合教育之相關法規與配套措施。國教新知,47(2),12-17。
15. 鄭麗月(1999):從特殊兒童的融合教育談學校行政的配合。特教新知通訊,6(1),1-4。