跳到主要內容

臺灣博碩士論文加值系統

(216.73.216.106) 您好!臺灣時間:2026/04/01 17:11
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

我願授權國圖
: 
twitterline
研究生:王怡書
研究生(外文):I-Shu Wang
論文名稱:混合式分類器於多類別資料之建構
論文名稱(外文):The construction of multiple-class data based on hybrid classifiers
指導教授:黃美玲黃美玲引用關係
指導教授(外文):Mei-Ling Huang
學位類別:碩士
校院名稱:國立勤益科技大學
系所名稱:工業工程與管理系
學門:工程學門
學類:工業工程學類
論文種類:學術論文
論文出版年:2010
畢業學年度:98
語文別:中文
論文頁數:44
中文關鍵詞:特徵選取C5決策樹ANN類神經網路
外文關鍵詞:Feature SelectionC5 decision treeArtificial Neural Network
相關次數:
  • 被引用被引用:1
  • 點閱點閱:244
  • 評分評分:
  • 下載下載:0
  • 收藏至我的研究室書目清單書目收藏:0
本研究針對多類別的大腸桿菌資料庫利用特徵選取的方式來縮減變數。建構三種混合式分類器分別為決策樹結合類神經網路、支援向量機混合決策樹、貝氏網路混合決策樹。特徵選取中,本研究以Information Gain萃取重要的特徵變數。實驗結果可知,本研究所建構三種混合式分類器準確率都有不錯表現,最高準確率為94.05%。為了驗證本研究方法的適用性,不侷限於某一特定領域,再利用帕金森氏症資料庫來驗證其最高準確率為95.38%。
This paper utilizes Information Gain to extract important features, and compares the performances among combined decision tree (C5) and artificial neural network(ANN)、hybrid support vector machine(SVM) and C5、hybrid bayesian network(BN) and C5. Three hybrid classifiers are developed in this study and the highest accuracy is 94.05% in the multiple–class E.coli dataset. In order to explain this research is widely applicable and is not limited to particular field, we again verify this research architecture on Parkinson dataset and the accuracy is up to 95.38%.
摘要 I
Abstract II
致謝 III
表目錄 VI
圖目錄 VII
第一章 緒論 1
1.1 研究背景與動機 1
1.2 研究目的 1
1.3 研究範圍與對象 2
1.4 研究流程 2
1.5 研究架構 3
第二章 文獻探討 4
2.1 資料探勘 4
2.2 決策樹 5
2.3 Information Gain 6
2.4 類神經網路 7
2.5 倒傳遞類神經網路 8
2.6 交叉驗證 10
2.7 支援向量機 10
2.7.1 線性支援向量機 10
2.7.2 非線性支援向量機 13
2.8 貝氏網路 14
2.9 混合式分類器相關文獻 15
第三章 研究方法 17
3.1 資料說明 18
3.2 資料前處理 20
3.3 倒傳遞類神經網之建構 21
3.4 建構C5決策樹結合ANN倒傳遞類神經網路分類器 22
3.5 建構 SVM支援向量機混合C5決策樹分類器 24
3.6 建構BN貝氏網路混合C5決策樹分類器 25
3.7 田口品質工程之參數設計 26
3.7.1 大腸桿菌資料庫SVM-參數設計 26
3.7.2 帕金森氏症資料庫SVM-參數設計 29
第四章 研究結果與討論 32
4.1 基本統計分析 32
4.2 Information Gain於大腸桿菌與帕金森氏症資料庫特徵選取能力的比較 35
4.3 混合式分類器於大腸桿菌及帕金森氏症資料庫準確率 35
第五章 結論與未來研究方向 40
參考文獻 41


1.維基百科,http://zh.wikipedia.org/zh-tw。
2.行政院衛生署,2007,中華民國九十五年美國進口波菜遭O157:H7型大腸桿菌污染,行政院衛生署網站。
3.尹相志,2006, SQLServer2005 資料採礦聖經,學貫行銷股份有限公司。
4.曾憲雄、蔡秀滿、蘇東興、曾秋蓉、王慶堯著,資料探勘Data Ming,2006,旗標出版股份有限公司,台北市。
5.丁一賢及陳牧言著,2005,資料探勘Data Minig,滄海書局,台中市。
6.洪昌億,2001,決策樹應用在中西醫腦中風診斷之研究,長庚大學,碩士論文,。
7.葉怡成,2003,類神經網路模式應用與實作,儒林圖書有限公司,台北市。
8.丁一賢、陳牧言,2006,資料探勘,滄海圖書出版股份有限公司。
9.葉怡成,2000,類神經網路模式應用與實作,第一版,儒林圖書。
10.廖家雄,2000,以貝氏網路為基礎的知式處理機制,東海大學,碩士論文。
11.A. Berson and S. J. Smith, 1997, Data Mining Techniques for Marketing Sales and Customer Support, Wiley Computer Publishing.
12.Abram, Mike, April 2005, “Watercourse warning over spring herbicide”, Farmers Weekly, Vol. 142 , Iss. 17, pp. 41.
13.Amir F. Atiya and Ahmed Al-Ani, 2009, “A penalized likelihood based pattern classification algorithm”, Pattern Recognition, Vol. 42, pp. 2684-2694.
14.Chen Xiang, Png Chin Yong, Lim Swee Meng, 2008, “Design of multiple-level hybrid classifier for intrusion detection system using Bayesian clustering and decision trees” , Pattern Recognition Letters, Vol. 29 Issue 11, pp. 918-924.
15.Chun-Lang Chang and Chih-Hao Chen, 2009, “Applying decision tree and neural network to increase quality of dermatologic diagnosis”, Expert Systems with Applications, Vol. 36, pp. 4035-4041.
16.D. O. Hebb, 1949, The Organization of Behavior, John Wiley, New York.
17.David Heckerman, 1996, A Tutorial on Learning With Bayesian Networks, Technical Report MSR-TR-95-06, Microsoft Corporation, Redmond, USA.
18.Fei-Long Chen, Feng-Chia Li, 2010, “Combination of feature selection approaches with SVM in credit scoring”, Expert Systems with Applications, Vol. 37, pp. 4902-4909.
19.G. H. Grupe and M. M. Owrang, 1995, “Database Mining Discovering New Knowledge and Cooperative Advantage”, Information Systems Management, Vol. 12, pp. 26-31.
20.Gaurav Marwah Lois, Lois Boggess, 2002, “Artificial immune systems for classification: some issues”, http://citeseerx.ist.psu.edu/viewdoc/summary? doi=10.1.1.2.7259.
21.Gerhard Arminger, Daniel Enache, Thorsten Bonne, 1997, “Analyzing Credit Risk Data: A Comparison of Logistic Discrimination Classification Tree Analysis and Feedforward Networks” , Computational Statistics, Vol. 12, pp. 293-310.
22.Halife Kodaz, Seral Ozsen, Ahmet Arslan, Salih Gunes, 2009, “Medical application of information gain based artificial immune recognition system(AIRS):Diagnosis of thyoid disease”, Exper Systems with Applications, Vol. 36, pp. 3086-3092.
23.Huajie Zhang and Charles X. Ling, 2001, “An improved learning algorithm for augmented naïve bayes”, Lecture Notes in Computer Science, Vol. 2035, pp. 581-586.
24.J. Han and M. Kamber, 2000, Data Mining: Concepts and Techniques, Morgan Kaufmann, San Francisco, CA, USA..
25.J. Han and M. Kamber, 2001, Data Mining: Concepts and Techniques, Morgan Kaufmann, San Francisco, CA, USA.
26.J. Ross Quinlan 1993, C4.5 Programs for Machine Learning, Morgan Kaufmann Publishers Inc. San Francisco, CA, USA
27.Jingnian Chen, Houkuan Huang, Fengzhan Tian, Shengfeng Tian, 2008, “A selective bayes classifier for classifying in complete data based on gain ratio”, Knowledge-Based System, Vol. 21, pp. 530-540.
28.Joon Hur and Jong Woo Kim, 2008, “A hybrid classification method using error pattern modeling”, Expert Systems with Applications, Vol. 34, pp. 231-241.
29.Kidong Lee, David Booth, and Pervaiz Alam, 2005, “A comparison of supervised and unsupervised neural network in predicting bankruptcy of Korean firms’’, Expert Systems with Applications, Vol. 29, pp. 1-16.
30.Kuan-Yu Lin, 2004, Intelligent Multi Layer Selective Model and Application of Investment Strategy, Master Thesis, Graduate Institute of Finance, Ling Tung University.
31.Max A Little, Patrick E. McSharry, Eric J. Hunter, Jennifer Spielman, Lorraine O. Ramig, 2009, “Suitability of dysphonia measurements for telemonitoring of Parkinson’s disease”, IEEE, Vol.56, no.4, pp. 1015-1022.
32.P. A. Devijver and J. Kittler, 1982, Pattern Recognition: A Statistical Approach, Prentice-Hall, London.
33.Paul Horton and Kenta Nakai, 1996, “A probabilistic classification system for predicting the cellular localization sites of proteins”, AAAI, pp.109-115.
34.Paul Horton and Kenta Nakai, 1997,“Better prediction of protein cellular localization sites with the k neast neighbors classifier”, ISMB, vol. 5, pp.147-152.
35.Philip K. Chen, Wei Fan, Andreas L. Prodromidis, Salvatore J. Stolfo, 1999, “Distributed Data Mining in Credit Card Fraud Detection” , IEEE Intelligent Systems, Vol. 14, pp. 67-74.
36.Qinbao Song, Martin Shepperd, Xiangru Chen, Jun Liu, 2008, “Can k-nn imputation improve the performance of C4.5 with small software project data set? A comparative evaluation”, The Journal of Systems and Software, Vol. 81, pp. 2361-2370.
37.Resul Das, 2009, “A comparison of multiple classification methods for diagnosis of Parkinson disease”, Expert Systems with Applications, Vol. 37, pp. 1568-1572.
38.Sadik Kara, Bekir Hakan Aksebaeci, Halife Kodaz, Salih Gunes, Esma Kaya, Hatice Ozbilge, 2009, “medical application of information gain-based artificial immune recognition system(IG-AIRS): classification of microorganism species”, Expert Systems Applications, Vol. 36, pp. 5168-5172.
39.Sandhya Peddabachigari, Ajith Abraham, Crina Grosan, Johnson Thomas, 2007, “Modeling intrusion detection system using hybrid intelligent systems”, Journal of network and computer applications, Vol. 30, pp. 114-132.
40.Shahar Cohen, Lior Rokach, Oded Maimon, 2007, “Decision-tree instance-space decomposition with grouped gain-ratio”, Information sciences, Vol. 177, pp. 3592-3612.
41.Shi-Ming Huang, Chih-Fong Tsai, David C. Yen, Yin-Lin Cheng, 2008, “A hybrid financial analysis model for business failure prediction”, Expert Systems with Applications, vol. 35, pp. 1034-1040.
42.Shigeru Takasaki, 2008, “Efficient prediction methods for selecting effective siRNA sequences”, Computers in Biology and Medicine, Vol. 32, pp. 332-337.
43.Shyi-Ming Chen, Jen-Da Shie, 2009, “Fuzzy classification systems based on fuzzy information gain measures”, Expert Systems with Applications, Vol. 36, pp. 4517-4522.
44.T.A. Arentze and H.J.P. Timmermans, 2005, “Information Gain, novely seeking and travel:a model of dynamic activity-travel behavior under conditions of uncertainty”, Transportation research part a, Vol. 39,pp. 125-145.
45.Usama Fayyad, Gregory Piatetsky-Shapiro, Terry Widener, 1996 , “The KDD process for extracting useful knowledge from volumes of data”, Communications of the ACM, Vol. 39, No. 11, pp.27-34.
46.V. Sugumaran, G.R. Sabareesh, K.I Ramachandra, 2008, “Fault diagnostics of roller bearing using kernel based neighborhood score multi-class support vector machine”, Expert Systems with Applications, Vol. 34, pp. 3090-3098.
47.Van der Werff, T. J., 2001, “10 Emerging Technologies That Will Change the Word”, http://www.globalfuture.com/mit-trends2001.htm.
48.W. J. Frawley, Gregory Piatesky-Shapiro, Christopher J. Matheus, 1992, “Knowledge Discovery in Databases: An Overview”, AI Magazine, Vol.13, No3, pp.57-70.
49.W. J. Frawley, Gregory Piatesky-Shapiro, Christopher J. Matheus, C. J., 1992, “Knowledge discovery in databases: An overview”, AI Magazine, pp.57-70.
50.Warren S. McCulloch and Walter Pitts, 1943, “A Logical Calculus of Ideas Immanent in Nervous Activity”, Bulletin of Mathematical Biophysics, Vol. 5, No. 1, pp. 115-133.
51.Y. Yang and Q. Zhang, 1998, “The Application of Neural Networks to Rock Engineering System(RES)”, International Journal Rock Mechanics and Mining Sciences, Vol. 35,pp.727-745.

連結至畢業學校之論文網頁點我開啟連結
註: 此連結為研究生畢業學校所提供,不一定有電子全文可供下載,若連結有誤,請點選上方之〝勘誤回報〞功能,我們會盡快修正,謝謝!
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top