
National Digital Library of Theses and Dissertations in Taiwan (臺灣博碩士論文加值系統)


Detailed Record

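Author: 林佳勳 (Jia-Syun Lin)
Thesis title (Chinese): 以謠言與謊言理論為基礎之真假評論識別研究
Thesis title (English): Identifying Deceptive review comments with rumor and lie theories
Advisors: 許秉瑜, 沈國基
Degree: Master's
Institution: National Central University (國立中央大學)
Department: Department of Business Administration
Discipline: Business and Management
Field: Business Administration
Thesis type: Academic thesis
Publication year: 2016
Graduation academic year: 104 (Republic of China calendar)
Language: Chinese
Pages: 52
Keywords: rumor; lie; deceptive reviews; negative reviews; text mining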
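Abstract (translated from the Chinese):
In today's era of widespread internet access, people have become accustomed to interacting frequently online. They not only actively share their own real experiences but also act as recipients of information. The large volume of reviews produced this way has become a reference for most people before they buy goods or services. Survey data also show that consumers' trust in online reviews is growing year by year, and that negative reviews in particular affect consumer decisions. Unfortunately, because online reviews spread quickly and carry great influence, many organizations have begun to deliberately exaggerate their own products or to fabricate negative reviews that attack competitors, hoping to profit from doing so. Such abuse of online reviews harms both individual consumers and businesses. Research shows that purchase intention is most strongly affected by online reviews in the tourism and hotel industries. These reviews record practical travel information and personal experience, and they are an important reference for travelers before they set out for unfamiliar destinations.
This study therefore examines negative truthful reviews and deceptive reviews of the top twenty well-known hotels in Chicago, comprising truthful reviews from six well-known travel review sites and deceptive reviews collected through the crowdsourcing platform Amazon Mechanical Turk. Building on rumor and lie theories, it derives six attributes: important hotel attribute words, vague words, first-person pronouns, negative words, simplified-thinking pronouns, and redundant words. Text mining techniques are combined with classification algorithms to train classifiers, and the predictions of several single classifiers are then combined in a classifier ensemble to build a deceptive review identification model that is both accurate and efficient. The results show that reducing the data dimensionality with the six attributes improves computational efficiency while keeping reasonable accuracy, so truthful and deceptive reviews are identified effectively. After the classifiers are combined, the four indicators (precision, recall, accuracy, and F-measure) also exceed those of the single classifiers, and the ensemble avoids the risk that a single classifier may not suit other datasets.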
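The six-attribute dimensionality reduction described in the abstract can be pictured as mapping each review to six lexicon-based ratios instead of a full vocabulary vector. The following is a minimal sketch under stated assumptions, not the thesis's implementation: the word lists in `LEXICONS`, the attribute names, and the function `extract_features` are all hypothetical, since this record does not give the actual lexicons or the feature weighting used.

```python
import re
from collections import Counter
from typing import Dict

# Hypothetical mini-lexicons, one per attribute named in the abstract.
# The thesis's real word lists are not part of this record.
LEXICONS = {
    "hotel_attribute": {"room", "staff", "service", "location", "bed", "bathroom"},
    "vague": {"maybe", "somewhat", "probably", "seems", "perhaps"},
    "first_person": {"i", "me", "my", "mine", "we", "us", "our"},
    "negative": {"dirty", "rude", "terrible", "worst", "bad", "noisy"},
    "simplifying_pronoun": {"everything", "nothing", "everyone", "anything"},
    "redundant": {"very", "really", "absolutely", "totally", "just"},
}


def extract_features(review: str) -> Dict[str, float]:
    """Return, for each lexicon, the share of tokens in `review` that belong to it."""
    tokens = re.findall(r"[a-z']+", review.lower())
    counts = Counter(tokens)
    total = len(tokens) or 1  # avoid division by zero on empty text
    return {
        name: sum(counts[word] for word in words) / total
        for name, words in LEXICONS.items()
    }


features = extract_features(
    "I think the room was really dirty and everything was maybe bad"
)
for name, value in features.items():
    print(f"{name}: {value:.3f}")
```

Each review becomes a six-dimensional vector regardless of vocabulary size, which is where the efficiency gain reported in the abstract would come from; any classifier can then be trained on these vectors.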

Abstract (English):
In the internet era, people have become accustomed to interacting frequently on the web. They not only actively share their real-life experiences but also act as recipients of messages, and the large number of comments generated this way has become a reference for most people buying products or services. Survey data also show that consumers' trust in online reviews is growing year by year, and that negative reviews have an especially strong influence on consumer decisions. Unfortunately, because online reviews spread rapidly and carry enormous influence, many organizations have begun to deliberately exaggerate their own products or to fabricate negative comments that attack competitors in order to gain an advantage. This abuse of online reviews damages both individual consumers and commercial organizations. Studies have shown that the tourism and hotel industries are the ones in which online reviews influence purchase intention most strongly. These comments record useful tourist information and personal experience, which are important references for travelers before they depart for unfamiliar places.
This study examines negative truthful reviews and deceptive reviews of the top twenty well-known hotels in Chicago, including truthful reviews taken from six well-known review sites and a comparison group of deceptive reviews collected on Amazon Mechanical Turk. On the basis of rumor and lie theories, the method derives six attributes: hotel attribute keywords, vague words, first-person pronouns, negative words, simplified-thinking pronouns, and pleonasms. Text mining is combined with classification algorithms to train classifiers and build the model. The results show that the model not only computes more efficiently but also keeps reasonable accuracy, so it distinguishes truthful from deceptive reviews well. After the classifiers are integrated, the four indicators (precision, recall, accuracy, and F-measure) are better than those of any single classifier, and the ensemble also avoids the risk that a single classifier is unsuitable for other datasets.
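To make the classifier integration and the four indicators concrete, here is a minimal sketch that assumes simple majority voting over three classifiers. The record does not state which combination rule the thesis uses, so `majority_vote` and `evaluate` are hypothetical helpers, and the labels below are made-up toy data, not results from the thesis.

```python
from collections import Counter
from typing import Dict, List


def majority_vote(predictions: List[List[str]]) -> List[str]:
    """Combine per-classifier label lists by taking the most common label per review.

    Assumes every list has the same length. An odd number of classifiers
    avoids ties when there are only two labels.
    """
    return [Counter(votes).most_common(1)[0][0] for votes in zip(*predictions)]


def evaluate(y_true: List[str], y_pred: List[str],
             positive: str = "deceptive") -> Dict[str, float]:
    """Compute precision, recall, accuracy, and F-measure for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = len(pairs) - tp - fp - fn
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    f_measure = (2 * precision * recall / (precision + recall)
                 if precision + recall else 0.0)
    return {"precision": precision, "recall": recall,
            "accuracy": accuracy, "f_measure": f_measure}


# Toy labels for illustration only: "d" = deceptive, "t" = truthful.
y_true = ["d", "d", "t", "t", "d", "t"]
clf_a = ["d", "t", "t", "t", "d", "d"]
clf_b = ["d", "d", "d", "t", "t", "t"]
clf_c = ["t", "d", "t", "t", "d", "t"]

ensemble = majority_vote([clf_a, clf_b, clf_c])
print(evaluate(y_true, clf_a, positive="d"))
print(evaluate(y_true, ensemble, positive="d"))
```

In this toy case each classifier errs on different reviews, so the vote corrects every individual mistake. That is the general intuition behind ensembles, although real gains depend on how diverse and how accurate the base classifiers are.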

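Chinese Abstract
Abstract
Table of Contents
List of Tables
List of Figures
Chapter 1 Introduction
1-1 Research Background
1-2 Research Motivation
1-3 Research Objectives
1-4 Research Structure
Chapter 2 Literature Review
2-1 Rumor Theories
2-1-1 Definition of Rumor
2-1-2 Motives and Causes of Rumors
2-2 Characteristics of Lies (Fabricated Stories)
2-3 Negative Deceptive Reviews
2-4 Keyword Extraction Methods
2-5 Text Mining
2-6 Classifier Ensembles
2-6-1 Reasons and Methods for Combining Classifiers
2-6-2 Related Work on Classifier Ensembles
2-7 Related Work on Identifying Truthful and Deceptive Reviews
Chapter 3 Research Method
3-1 Research Design
3-2 TF-IDF-Based Classification Model
3-3 Dimensionality Reduction
3-4 Building the Evaluation Model
3-5 Ensemble Classifier
3-6 Evaluation Criteria for Experimental Results
Chapter 4 Implementation
4-1 Dataset
4-2 Results
4-2-1 Single Classifiers: Performance Comparison
4-2-2 Single Classifiers: Efficiency Comparison
4-2-3 Ensemble Classifier Experimental Results
Chapter 5 Conclusions and Suggestions
5-1 Conclusions
5-2 Limitations and Suggestions for Future Research
References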

