Thesis Title (English): Enhanced Feature Learning with Its Applications to Link Prediction and Defense in Heterogeneous Information Networks
Advisor (English): Cheng-Te Li
Keywords (English): Social Networks Analysis; Heterogeneous Information Network; Feature Learning
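The Internet contains a wealth of information that can be modeled as a heterogeneous information network. Because such a network may also contain much unneeded information, how to mine the truly useful information from it is a question worth investigating. We aim to design a method that extracts the important information from the network in order to understand the correlations between nodes. Our tasks are to predict social links between users (UU-LP) and links between users and items (UI-LP); if both kinds of relationship can be predicted precisely, the method is correctly capturing the correlations between nodes and can in turn be used for applications such as product recommendation. We propose two methods that enhance feature representation learning in heterogeneous information networks: metamotif2vec and diversewalk2vec. metamotif2vec designs a structural random walk that can consider the relationships among a larger number of node types at the same time. diversewalk2vec instead designs a diversified random walk that does not require the form of the walk to be defined in advance; by letting paths pass through many types of nodes, it captures the correlations automatically, and a parameter can be set so that the walk tends to proceed in either the homogeneous or the heterogeneous network. We conduct experiments on two datasets, Twitter check-in records and Douban Book. Compared with metapath2vec, the current state-of-the-art heterogeneous network representation learning method, our diversewalk2vec and metamotif2vec achieve average precision improvements of 7.1% and 5.2%, respectively, on the UU-LP and UI-LP tasks. Although the Internet makes life more convenient, it also creates privacy risks. We therefore design a defense mechanism that perturbs the data and run link prediction experiments to evaluate its effectiveness. The results show that it does lower prediction precision and can therefore reduce the likelihood that users' personal privacy is leaked.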
Since heterogeneous information networks contain rich information, it is worth investigating how to extract the useful information from them. We aim to design a method that preserves both the structure of the heterogeneous network and the correlations between nodes. Our tasks are to predict users' social relationships (UU-LP) and the links between users and items (UI-LP). If the relationships between users, or between users and items, can be predicted precisely, the method is capturing the correlations between nodes. We propose two methods, metamotif2vec and diversewalk2vec, that learn a low-dimensional feature representation for each node in a heterogeneous information network. The metamotif2vec model formalizes a structural random walk that can consider the relationships among many more types of nodes at the same time. The diversewalk2vec model, on the other hand, designs a diversified random walk that captures the correlations automatically, without defining the form of the random walk in advance. Experiments conducted on a large-scale Twitter check-in dataset and a Douban Book dataset show that metamotif2vec and diversewalk2vec achieve average improvements of 7.1% and 5.2% over the state-of-the-art heterogeneous network representation learning method metapath2vec on the UU-LP and UI-LP tasks, respectively. While the Internet makes human life more convenient, it also raises privacy risks. We therefore propose defense mechanisms that perturb the data, and we conduct link prediction experiments to evaluate their effectiveness. The results show that the defense mechanisms can reduce the likelihood that users' personal privacy is leaked.
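To make the idea of a random walk that can lean toward same-type or different-type neighbors more concrete, the following is a minimal illustrative sketch in Python. It is not the thesis's diversewalk2vec algorithm, whose details are not given in this abstract. The toy graph, the function name `type_biased_walk`, and the parameter `hetero_bias` are all assumptions introduced here for illustration only.

```python
import random
from collections import defaultdict

# Hypothetical toy heterogeneous network: three users and two items (books).
node_type = {"u1": "user", "u2": "user", "u3": "user",
             "i1": "item", "i2": "item"}
edges = [("u1", "u2"), ("u2", "u3"),                 # user-user (social) links
         ("u1", "i1"), ("u2", "i1"), ("u3", "i2")]   # user-item links

# Undirected adjacency list.
adj = defaultdict(list)
for a, b in edges:
    adj[a].append(b)
    adj[b].append(a)


def type_biased_walk(adj, node_type, start, length, hetero_bias, rng):
    """Return a walk of up to `length` nodes starting at `start`.

    `hetero_bias` (an assumed parameter, in [0, 1]) is the probability of
    stepping to a neighbor whose type differs from the current node's type
    when both same-type and different-type neighbors are available.
    """
    walk = [start]
    while len(walk) < length:
        cur = walk[-1]
        neighbors = adj[cur]
        if not neighbors:
            break  # dead end: stop the walk early
        same = [n for n in neighbors if node_type[n] == node_type[cur]]
        diff = [n for n in neighbors if node_type[n] != node_type[cur]]
        if same and diff:
            pool = diff if rng.random() < hetero_bias else same
        else:
            pool = same or diff  # only one kind of neighbor exists
        walk.append(rng.choice(pool))
    return walk


rng = random.Random(0)
homogeneous_walk = type_biased_walk(adj, node_type, "u1", 6, 0.0, rng)
heterogeneous_walk = type_biased_walk(adj, node_type, "u1", 6, 1.0, rng)
```

With `hetero_bias=0.0` the walk on this toy graph stays among user nodes, and with `hetero_bias=1.0` it alternates between users and items. In methods of the "2vec" family, walks like these are typically treated as sentences and passed to a skip-gram model to learn node embeddings; whether the thesis does exactly this is not stated in the abstract.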
