Graduate Student (English): Chun-Fu Chen
Thesis Title (English): Predicting Taiwan Stock Market Using Social Moods
Advisor (English): Chin-Laung Lei
Keywords (English): Taiwan Stock Market; Sentiment Analysis; Stock Market Prediction; PTT
In recent years, mining social media data to forecast future events has become a popular research topic. Stock market behavior and investor emotions are closely linked. With the growth of social media, people, including investors, are willing to share their feelings online.
In this study, we select the PTT stock board, a forum where investors share their opinions, as our platform and crawl its data. We calculate emotion scores using the NTUSD and DUTIR sentiment dictionaries and predict two representative stock market indices: the Taiwan Futures Index and the Taiwan Capitalization Weighted Stock Index. This thesis adopts a fixed-size rolling window and a fixed feature size. The rationale is that, if emotions drive stock market variation, the dominant causal factors may differ from one time span to another. The rolling window size and feature size used in our prediction model are the ones that yield the lowest Root Mean Square Error.
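The window-size selection described above can be sketched as follows. This is a minimal illustration, not the thesis's implementation: the one-variable least-squares predictor is a stand-in for the actual model, the function names (`rmse`, `fit_line`, `rolling_rmse`, `select_window`) are hypothetical, and it assumes `scores[t]` is the emotion score already aligned with the index value `targets[t]` it is meant to predict.

```python
import math

def rmse(actual, predicted):
    """Root Mean Square Error between two equal-length sequences."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))

def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x (stand-in model, an assumption)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:                      # constant feature: fall back to the mean
        return mean_y, 0.0
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sxx
    return mean_y - slope * mean_x, slope

def rolling_rmse(scores, targets, window, start):
    """Refit on the previous `window` days, predict day t, then slide one day."""
    actual, predicted = [], []
    for t in range(start, len(targets)):
        a, b = fit_line(scores[t - window:t], targets[t - window:t])
        predicted.append(a + b * scores[t])
        actual.append(targets[t])
    return rmse(actual, predicted)

def select_window(scores, targets, candidates):
    """Return the candidate window size with the lowest rolling RMSE."""
    start = max(candidates)           # score every candidate on the same days
    if len(targets) <= start:
        raise ValueError("need more observations than the largest window")
    errors = {w: rolling_rmse(scores, targets, w, start) for w in candidates}
    best = min(errors, key=errors.get)
    return best, errors
```

Starting every candidate at the same day keeps the comparison fair, since each window size is then evaluated on an identical set of test days. The same loop could select a feature size by iterating over candidate feature counts instead of window sizes.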
Four values are recorded each day: the opening value, the intra-day high, the intra-day low, and the closing value. We classify these four values into three groups using the K-means clustering algorithm and then conduct prediction.
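One way to picture the three-group discretization is the sketch below. The abstract does not say exactly what is clustered, so this assumes each daily series (here, closing values) is converted to day-over-day changes and those changes are grouped by one-dimensional K-means. The function names `daily_changes` and `kmeans_1d` and the deterministic initialization are illustrative choices, not taken from the thesis.

```python
def daily_changes(values):
    """Day-over-day differences of one recorded series (an assumed input)."""
    return [today - prev for prev, today in zip(values, values[1:])]

def _nearest(value, centroids):
    """Index of the centroid closest to `value`."""
    return min(range(len(centroids)), key=lambda j: abs(value - centroids[j]))

def kmeans_1d(values, k=3, max_iter=100):
    """Lloyd's algorithm on scalars; returns (ascending centroids, labels)."""
    if len(values) < k:
        raise ValueError("need at least k values")
    ordered = sorted(values)
    # Deterministic start: pick points spread evenly through the sorted data.
    centroids = [ordered[(2 * i + 1) * len(ordered) // (2 * k)] for i in range(k)]
    for _ in range(max_iter):
        labels = [_nearest(v, centroids) for v in values]
        updated = []
        for j in range(k):
            members = [v for v, lab in zip(values, labels) if lab == j]
            # An empty cluster keeps its previous centroid.
            updated.append(sum(members) / len(members) if members else centroids[j])
        if updated == centroids:      # assignments have stopped changing
            break
        centroids = updated
    centroids.sort()                  # label 0 = lowest group, k-1 = highest
    labels = [_nearest(v, centroids) for v in values]
    return centroids, labels
```

For example, `kmeans_1d(daily_changes(closes), k=3)` would label each trading day as belonging to the lowest, middle, or highest change group, which can then serve as the class to predict.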

Acknowledgements
Chinese Abstract
Abstract
Contents
List of Figures
List of Tables
Chapter 1 Introduction
Chapter 2 Related Work
Chapter 3 Background
3.1 PTT Stock Board
3.2 Sentiment Dictionary
3.2.1 DUTIR
3.2.2 NTUSD
3.3 Jieba
3.4 Scikit-learn
Chapter 4 Datasets
4.1 Stock Market Data
4.2 Online Emotions
Chapter 5 Methodology
5.1 Rolling Window
5.2 Feature Ranking
5.2.1 Random Forest Based Feature Ranking
5.3 Model Selection
Chapter 6 Case Study: Taiwan Stock Market
6.1 Discretization of Stock Market
6.2 Prediction in Taiwan Futures (TX)
6.3 Prediction in Taiwan Capitalization Weighted Stock Index
Chapter 7 Conclusion
Bibliography

