
National Digital Library of Theses and Dissertations in Taiwan (臺灣博碩士論文加值系統)

Author: Hung-Min Hsu (許宏敏)
Thesis title: Applying Machine Learning for Recommendation System in Content Website
Thesis title (Chinese): 應用機器學習於內容網站推薦系統研究
Advisor: 張瑞益
Oral defense committee: 何建明, 林宣華, 林正偉, 楊凱翔, 王家輝, 張恆華, 林書宇
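Oral defense date: 2017-05-24
Degree: Doctoral
Institution: National Taiwan University (國立臺灣大學)
Department: Graduate Institute of Engineering Science and Ocean Engineering (工程科學及海洋工程學研究所)
Discipline: Engineering
Field: General engineering
Thesis type: Academic thesis
Year of publication: 2017
Academic year of graduation: 105 (ROC calendar)
Language: English
Number of pages: 87
Keywords (Chinese): 主題關鍵字擷取技術; 行動導向型錄推薦系統; 詢問式學習基因演算法
Keywords (English): Subject-keyphrase extraction; Mobile-oriented catalog based recommendation system; Query-based-learning genetic algorithm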
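Chinese abstract (translated): Using machine learning to build recommendation systems for content websites is increasingly common, and a wide variety of recommendation systems have been developed for different website content and service requirements. The quality of a recommendation system directly affects user stickiness. Earlier content-based recommendation systems compare the similarity of content and recommend related content according to the similarity scores. However, this approach cannot handle content that uses the same words but expresses a different main meaning, and it does not consider what information the user is looking for. In addition, recommendation systems on website home pages and on mobile phones are subject to space constraints: because of the limited layout size, they cannot present a large amount of content at once the way a computer can. If users cannot browse a content website's strongest content within a short time, they become less willing to return. In this dissertation, we propose two recommendation systems to solve these problems: a subject-keyphrase-based recommendation system and a mobile-oriented catalog recommendation system. The subject-keyphrase concept we propose places more emphasis on user intention than ordinary keywords do: we expect that an article found by querying a subject-keyphrase in a search engine describes that subject-keyphrase completely. We then developed a subject-keyphrase extraction technique using particle swarm optimization and features related to the definition-use chain. A definition-use chain is a data structure composed of a definition unit and use units; it contains the definition of a variable and all uses of that variable. Applied here, the subject-keyphrase plays the role of the defined variable, and it is described by other words, clauses, or phrases. For the mobile-oriented catalog recommendation system, we propose a query-based-learning genetic algorithm to construct mobile-oriented catalogs, so that the catalogs attracting the most users can be built when the number of catalogs is limited. The query-based-learning genetic algorithm has three types of oracle: the preference model, product vectors, and transaction vectors. Through the oracles, highly attractive products can be added to the mobile-oriented catalogs repeatedly. According to the experimental results, the subject-keyphrase technique effectively finds the core concept of an article and thereby improves the quality of similar-resource recommendation. For the mobile-oriented catalog recommendation system, the catalogs built by the query-based-learning genetic algorithm attract users better than the current best method. Finally, we discuss how to apply these two recommendation techniques on a real content website, the Taiwan Open Platform for Educational Resources (教育大市集), and the results show that they do increase content usage.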
Abstract: Machine learning is increasingly used to build recommendation systems for content websites, and many kinds of recommendation systems have been developed to suit different website content and service demands. The quality of a recommendation system directly affects user stickiness. Earlier content-based recommendation systems recommend related content according to content similarity. However, this approach accounts neither for the fact that the same words can carry different meanings in different content nor for what information users are trying to find. In addition, recommendation systems for website home pages and mobile phones are subject to space constraints: limited page space prevents them from displaying as much content at once as a PC can. If users cannot browse a content website's strongest content in a short time, they become less willing to revisit it. In this dissertation, we propose two recommendation systems to solve these problems: a subject-keyphrase-based system and a Mobile-Oriented Catalog (MOC) based system. A subject-keyphrase focuses more on user intention than a general keyword does. We expect that when users search for a subject-keyphrase in a search engine, the retrieved articles describe that subject-keyphrase fully. We then developed a subject-keyphrase extraction technique based on Particle Swarm Optimization (PSO) and the Definition-Use Chain (DU Chain). A DU Chain is a data structure that consists of a definition (D-component) of a variable and all the uses (U-components) reachable from that definition. Applied to text, the subject-keyphrase plays the role of the variable definition, and the other words, clauses, or phrases that describe it are the U-components.
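As a rough illustration of the DU Chain idea described in the abstract, the following Python sketch groups describing phrases under the candidate keyphrase they describe. It is not the extraction method from the dissertation: the `DUChain` class, the `build_du_chains` function, and the `(candidate, describing_text)` input format are all hypothetical names chosen for this example, and the sketch does not model how such pairs would be obtained from raw text.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List, Tuple


@dataclass
class DUChain:
    """One definition (D-component) and the uses (U-components) that describe it."""
    definition: str                                  # candidate subject-keyphrase
    uses: List[str] = field(default_factory=list)    # words, clauses, or phrases


def build_du_chains(descriptions: Iterable[Tuple[str, str]]) -> Dict[str, DUChain]:
    """Group (candidate, describing_text) pairs into one DUChain per candidate.

    The input format is an assumption made for this sketch only.
    """
    chains: Dict[str, DUChain] = {}
    for candidate, describing_text in descriptions:
        chain = chains.setdefault(candidate, DUChain(candidate))
        chain.uses.append(describing_text)
    return chains


# Hypothetical example data, not taken from the dissertation.
pairs = [
    ("genetic algorithm", "is a population-based search method"),
    ("genetic algorithm", "uses crossover and mutation"),
    ("catalog", "lists products"),
]
chains = build_du_chains(pairs)
```

Under this reading, a candidate with many U-components is one that the article describes at length, which is the kind of signal a DU-Chain-based feature could capture.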
For the MOC based recommendation system, we propose the Query-Based-Learning Genetic Algorithm (QBLGA), which constructs MOCs that attract the most users when the number of catalogs is limited. QBLGA has three main types of oracle: preference modeling, Product2Vec, and Transaction2Vec. The oracles actively and repeatedly add highly attractive products to the MOCs so that more customers are covered. The experimental results show that the subject-keyphrase technique effectively identifies the core concept of an article, which improves the quality of similar-resource recommendations. For the MOC based recommendation system, the catalogs built by QBLGA prove more attractive to users than those of the state-of-the-art method. Finally, we show how these two recommendation techniques are applied to a real content website, the Taiwan Open Platform for Educational Resources (TOPER); the results show that they improve content usage.
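The abstract does not state the exact objective that QBLGA optimizes. As a minimal sketch, assuming a coverage-style objective in which a customer counts as covered when at least `min_hits` of the products they want appear in a single catalog, a fitness value for a set of catalogs could be computed as below. The function name, the `min_hits` parameter, and the example data are assumptions made for illustration only.

```python
from typing import Iterable, Set


def covered_customers(catalogs: Iterable[Set[str]],
                      transactions: Iterable[Set[str]],
                      min_hits: int = 1) -> int:
    """Count customers with at least `min_hits` wanted products in one catalog."""
    catalog_list = list(catalogs)
    covered = 0
    for wanted in transactions:
        # A customer is covered if any single catalog overlaps enough with their products.
        if any(len(wanted & catalog) >= min_hits for catalog in catalog_list):
            covered += 1
    return covered


# Hypothetical data: each transaction is the set of products one customer wants.
transactions = [{"a", "b"}, {"b", "c"}, {"d"}, {"e", "f"}]
catalogs = [{"a", "b"}, {"c", "d"}]

loose = covered_customers(catalogs, transactions, min_hits=1)   # 3 customers covered
strict = covered_customers(catalogs, transactions, min_hits=2)  # 1 customer covered
```

A genetic algorithm could use such a count as its fitness function, with each individual encoding one candidate set of catalogs of limited size.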
Acknowledgements
Chinese Abstract
ABSTRACT
CONTENTS
LIST OF FIGURES
LIST OF TABLES
Chapter 1 Introduction
Chapter 2 Related Works
2.1 Keyphrase Extraction
2.2 Background and Concept of Mobile-Oriented Catalog
Chapter 3 Proposed Methods
3.1 Subject-Keyphrase Extraction based on Definition-Use Chain
3.1.1 Concept of Subject-Keyphrase
3.1.2 Definition-Use Chain
3.1.3 Subject-Keyphrase Extraction (SKE)
3.2 Query-Based-Learning Genetic Algorithm to Construct Mobile-Oriented Catalogs
3.2.1 Mobile-Oriented Catalog Definition and Formulation
3.2.2 Query-Based-Learning Genetic Algorithm (QBLGA)
3.2.3 Oracles of QBLGA
Chapter 4 The Information Technology Needs of Taiwan Open Platform for Educational Resources
4.1 User Experience Analytics
4.2 User Experience Design
4.3 Big Data Framework
4.4 Search Engine Optimization
4.5 Mobile Learning
Chapter 5 Experimental Results and Analysis
5.1 Evaluation Results of Subject-Keyphrase Extraction
5.2 Evaluation Results of MOC Construction
5.3 Feasibility Analysis for Applying SKE and MOC in TOPER
Chapter 6 Conclusion and Future Works
6.1 Conclusion
6.2 Future Works
REFERENCES