(3.235.191.87) 您好!臺灣時間:2021/05/13 04:59
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果

詳目顯示:::

: 
twitterline
研究生:李信宏
研究生(外文):Shing-Hung Lee
論文名稱:以時間為考量之網路錯誤推理診斷
論文名稱(外文):Network Fault Identification with Temporal Consideration
指導教授:劉安之劉安之引用關係
指導教授(外文):An-Chi Liu
學位類別:碩士
校院名稱:逢甲大學
系所名稱:資訊工程學系
學門:工程學門
學類:電資工程學類
論文種類:學術論文
論文出版年:2001
畢業學年度:89
語文別:中文
論文頁數:94
中文關鍵詞:網路管理網路錯誤偵測網路錯誤模型
外文關鍵詞:network managementfault identificationFSMtemporal consideration
相關次數:
  • 被引用被引用:11
  • 點閱點閱:102
  • 評分評分:系統版面圖檔系統版面圖檔系統版面圖檔系統版面圖檔系統版面圖檔
  • 下載下載:10
  • 收藏至我的研究室書目清單書目收藏:0
網際網路的快速成長,伴隨著網路相關問題也越來越多,如網路資源利用分享的問題,網路使用權限的問題,以至於整個網路使用中所無法避免的錯誤發生導致網路不能使用的問題等等,如何快速找出網路問題與解決這些問題是網路管理的重要課題。網管系統的功能多而廣,但是大部分的產品都是協助管理者收集網路流量及錯誤警訊的資料,但無法對收集到的錯誤警訊作自動化的分析,因此也就無法自動而即時的回報發生錯誤的地點及原因,還必須依賴管理人員累積的經驗。這種透過累積的經驗來解決網路錯誤的方法較不系統化,且使得自動化的網路管理難以達成。若是我們依據這些管理人員累積的經驗,設計成自動化的推理機制,讓系統自動分析警訊,並找出錯誤發生的根源來加以解決,將可以大大的減少偵錯的時間。
本篇論文主要是提出以時間考量為基礎之網路錯誤警訊關係與推理診斷系統。網路錯誤與警訊的關係是非常複雜的,要考量的因素也很多,只有考慮空間是不夠的,加入時間才能真正表現出錯誤蔓延的模型。先將網路上常發生的問題找出其相對應的警訊,並觀察這些警訊在時空上的蔓延的情況,建構出網路問題發生時在每一個時間單位所可能出現的警訊集合,把這些警訊集合從錯誤發生到結束連接起來,成為錯誤在時間上的蔓延模型。由網域管理者 (Domain Manager) 管理,在固定的時間週期裡收集網路資訊,網域管理者監聽所管理的網路是否有警訊發生,若管理者接收到警訊,則利用錯誤警訊因果關係來推測,可能是哪些錯誤發生時產生的警訊集,每個時間單位都偵測出這些警訊集,再由整個系統的診斷時間來找出這些警訊集的相關性,歸納出所有滿足這些警訊集發生的錯誤。因為這些警訊都帶有位置的資訊,所以可以輕易的找出這些錯誤發生的位置,並清楚的找出錯誤發生的開始與結束。
In the last few decades, the explosive growth of the Internet has been accompanied by a wide range of internetworking problems related to routing, resource reservation, and administrators. Networks often consist of hundreds or even thousands of interconnected nodes form different manufacturers factories. As a result, we will have more and more complex problems in the network. We know that network faults are not avoidable. Therefore, it is important to find out what and where are problems in the network as soon as possible. The fault management plays an important role in managing a network effectively. The development of a practical and effective system for network fault diagnosis becomes an urgent task.
In this paper, we offer a fault diagnosis mechanism, a two-phase network alarm correlation method, for effective and automated network fault isolation. This mechanism uses a Finite State Machine (FSM) to model classified and refined fault propagation behaviors by adding temporal factor. The FSM graphs of fault propagation behaviors are constructed by several alarm states in every time interval. This two-phase inference diagnosis method includes two phases doing inference work. In the phase I, in the each fixed cycle, domain manager will monitor and listen to alarms in the network. If the domain manager received alarms, it will guess which kind of alarm states were happened with FSM graph model of faults. After this step, in the phase II, the domain manager will know what kind of fault was happened in its domain according alarm states of propagation durations. Finally, domain managers will work in a propagation durations to find the most possible fault in a global view.
In our lab environment, we can successfully recognize ten network faults of TCP/IP. Our key contribution is not only the automated process of alarm collection, alarm correlation, and fault isolation, but also identify the propagation of each fault with a corresponding severity.
摘 要.................................................i
目 錄.................................................ii
圖 表 目 錄............................................iv
表 格 目 錄............................................vi
第一章 導 論..........................................1
1.1 研究動機及目的.....................................1
1.2 系統架構...........................................2
1.3 論文章節概要.......................................3
第二章 相關研究........................................5
2.1 警訊的意義.........................................5
2.2 警訊關係的建立.....................................6
2.3 警訊關係推理的方法及系統............................9
2.4 ACView網路錯誤診斷系統.............................11
第三章 錯誤蔓延行為的知識建立...........................14
3.1 簡單的錯誤蔓延模型..................................14
3.2 錯誤蔓延之時空解析..................................17
3.2.1 警訊加入時間屬性後之分析..........................19
3.2.2 錯誤蔓延行為的三個種類............................20
3.3 錯誤蔓延模型建立...................................24
3.4 知識的表示.........................................28
3.4.1 描述語言.........................................29
第四章 診斷推理之考量與設計.............................31
4.1 錯誤蔓延區間探討...................................31
4.2含有時間考量推理引擎之架構............................31
4.3 含有時間考量推理步驟................................33
4.3.1 兩階段式推理模式.................................34
4.3.2 辨認每個時間單位發生的警訊集合....................35
4.3.3 判斷診斷區間可能發生錯誤..........................41
4.3.4 錯誤種類,位置與嚴重性評估........................44
4.4 多種錯誤同時發生之研究..............................46
4.4.1 錯誤同時發生的現象................................47
4.4.2 錯誤優先權問題的討論..............................49
第五章 系統實作........................................50
5.1 ACView 系統架構....................................50
5.2 實驗環境...........................................51
5.3 錯誤知識庫的輸入...................................52
5.4 警訊推理引擎 - ACEngine II 的實作..................55
5.4.1 推理引擎實作問題.................................56
5.4.2 推理引擎流程圖...................................58
5.4.3 錯誤實驗.........................................61
5.5系統經驗與結果分析...................................65
5.5.1 解決第一版 ACEngine 的問題.......................66
5.5.2 推理系統的診斷正確率..............................70
5.5.3 ACView 系統診斷速度的瓶頸與極限...................71
5.5.4 例外情況的處理...................................72
第六章 結論與未來發展...................................75
參 考 文 獻............................................77
附 錄.................................................81
附錄 A、警訊的種類.....................................81
附錄 B、可偵測之錯誤與其相關的警訊.......................84
附錄 C、可偵測之錯誤與其相關的警訊 (FSM 模型)............85
[APOS96]T. K. Apostolopoulos and V. C. Daskalou, “Network Management Services Using a Temporal Information Model,” Integrated Network Management III, pp. 638-649, 1997.
[BOUL94]A.T. Bouloutas, S. Calo and A. Finkel, "Alarm Correlation and Fault Identification in Communication Networks", IEEE Transactions on Computers, Vol. 42, No. 2/3/4, pp. 523-533, Feb./Mar./Apr. 1994.
[BOUL95]A. T. Bouloutas, S. B. Calo, A. Finkel, and I. Katzela, Distributed fault identification in telecommunication networks, Journal of Network and Systems Management, Vol. 3, pp. 295-312, 1995.
[CHAO99]C. S. Chao, D. L. Yang, A. C. Liu, “Alarm Correlation View,” Proceedings of the IASTED, Philadelphia, U.S.A., May 1999.
[CHAK94]S. Chakravarthy and D. Mishra, “Snoop: An Expressive Event Specification Language for Active Databases,” Data and Knowledge Engineering, 14(10), pp. 1-26, October 1994.
[CHEN96]J. L. Chen and P. H. Huang, A fuzzy expert system for network fault management, Proceedings of IEEE International Conference on Systems, Maintenance, and Cybernetics, Vol. 1, pp. 328-331, 1996.
[CHEN98]J. Y. Chen, D. L. Yang, and An-Chi Liu, A MODEL-based object-oriented topology specification for network management, Proceedings of International Computer Symposium on Computer Network, Internet, and Multimedia, pp. 164-170, Dec. 1998.
[CONC97]Concord Communications, The Web-based network performance reporting and analysis company, http://www.concord.com/.
[GAMB94]D. Gambhir, M. Post, and I. Frisch, “A Framework for Adding Real-Time Distributed Software Fault Detection and Isolation to SNMP-Based System Management,” Journal of Network and System Management, Vol. 12, No. 3, pp. 257-281, 1994.
[GATZ94]S. Gatziu, A. Geppert, and K.R. Dittrich, “Detecting Composite Events in Active Database Systems Using Petri Nets,” In the 4th International Workshop on Research Issues in Data Engineering, pp. 2-9, Feburary 1994.
[GEHA92]N. Gehani, H.V. Jagadish, and O. Shmusli, ”Composite Event Specification in Active Databases: Model & Implementation,” In the 18th International Conference on Very Large Data Bases, pp. 327-338, August 1992.
[GOLD96a]Goldsmidt Germans. Goldszmidt, ‘Distributed Management by Delegation,‘Columbia University, 1996.
[GOLD96b]Goldsmidt Germans,’Network Management Views using Delegated Agents’, The Proceedings of the 6th CAS conference, Nov. 1996.
[GOLD98]G. Goldszmidt and Y. Yemini, Computing MIB views via delegated agents, Proceedings of IEEE Third International Workshop on Systems Management, pp. 86-95, 1998.
[HASA95]Z. Hasa, “An Active Temporal Model for Network Management Databases,” Proceedings of the IFIP/IEEE Fourth International Symposium on Integrated Network Management, Santa Barbara, California, pp. 524-535, May 1995.
[HOUC96]K. Houck, S. Calo, A. Finket, “Towards a Practical Alarm Correlation System”, Integrated Netowrk Management IV, pp 238-249. Chapman & Hall, 1996
[ITI92a]Independence Technologies, Inc., iVIEW SNMP Agent Kit Programmer’s Reference, 1992.
[ITI92b]Independence Technologies, Inc., iVIEW User’s Guide, 1992.
[JAKO93]G. Jakobson and Mark D. Wessman, “Alarm Correlation,” IEEE Network, pp. 52-59, Nov. 1993.
[JORD93]J. F. Jordan and M. E. Paterok, “Event Correlation in Heterogeneous Networks Using OSI Management Framework,” Integrated Network Management, III (C-12), pp. 683-695, 1993.
[KäTK97]S. Kätker and K. Geihs, “A Generic Model for Fault Isolation in Integrated Management Systems”, Journal of Network and Systems Management, Vol. 5, No. 2, pp. 109-130, 1997.
[KATZ95]I. Katzela and M. Schwartz, Schemes for fault identification in communication networks, IEEE/ACM Trans. on Networking, Vol. 3, No. 6, pp. 753-764, Dec. 1995.
[KLIG95]S. Kliger, S. A. Yemini, Y. Yemini, D. Ohsie, and S. Stolfo, “A Coding Approach to Event Correlation,” Proc. 4th Int’l. Symp. on Integrated Network Mngmt., Santa Barbara, CA., May 1995.
[KONA96]A. Konar and A. Mandal, “Uncertainty Management in Expert Systems Using Fuzzy Petri nets,” IEEE Trans. Knowledge and Data Engineering, Vol. 8, No.1, pp. 96-105, September 1996.
[LIU95]劉安之, 網路管理, 資訊與電腦雜誌社, 1995.
[LIU98]G. Liu, A. K. Mok, and E. J. Yang, “Composite Events for Network Event Correlation,” Integrated Network Management, 1999. Distributed Management for the Networked Millennium. Proceedings of the Sixth IFIP/IEEE International Symposium on, pp. 247-260, 1999.
[LIN99]S. F. Lin, A Dynamic Visual Interface for Network Fault Management, Master Thesis, Institute of Information Engineering, Feng Chia University, Jun. 1999.
[LIAN2000]E.T. Liang, D. L. Yang and A. C. Liu, "TCP/IP Alarm Transformation in a Network Management Environment," to appear in ICOIN-14, pp.2C2.1-2C2.7, January 26-28, 2000.
[LU99]J. Z. Lu, A Segment-based TCP/IP Network Alarm Correlation Engine, Master Thesis, Institute of Information Engineering, Feng Chia University, Jun. 1999.
[MAXI90]R.A. Maxion, F.E. Feather, A case study of Ethernet anomalies in a distributed computing environment, IEEE Trans. on Reliability 39 (4) (1990) pp. 433-443.
[MAYE98]A. Mayer, S. Kliger, S. Yemini, Event modeling with the MODEL language: a tutorial introduction, http://versed.smarts.com, 1998.
[MULL95]N.J. Muller, “Managing Desktop Assets,” Internal Journal of Network Management, Vol. 5, No. 2, pp.100-111, Mar.-Apr. 1995.
[MULL96]N.J. Muller, “SNMP’s Remote Monitoring MIB,” Internal Journal of Network Management, Vol. 6, No. 1, pp.17-32, Jan.-Feb. 1996.
[OHTA96]K. Ohta, N. Kato, G. Mansfield, and Y. Nemoto, “Configuring a Network Management System for Efficient Operation,” International Journal of Network Management, Vol. 6, No. 2, pp.108-118, Mar.-Apr. 1996.
[RFC1213]M. Rose and K. McCloghrie, “Management Information Base for Network Management of TCP/IP-based Internets: MIB-II,” RFC 1213, Mar. 1991.
[RFC1757]S. Waldbusser, “Remote Network Monitoring Management Information Base,” RFC1757, Feb. 1995
[RMON94]RMON, http: //nm7.ccl.itri.tw:80/k100/project/rmon/rmon.html.
[ROBE88]Robert N. Cronk, Paul H.Callahan, Lawreence Berstein. “Rule-Base Expert Systerms for Netowrk Management and Operations: An Introduction”, IEEE Network, pp7-21, September 1988.
[ROBE96]Robert D. Gardner, David A. Harle. “Methods and Systems for Alarm Correlation”, IEEE Network, 1996
[ROBE97]Robert D. Gardner, David A. Harle. “Fault Resolution and Alarm Correlation in High-Speed Networks using Database Mining Techniques”, International Conference on ICICS’97, Sept. 1997
[ROUV95]I. Rouvellou, and G.W. Hart, "Automatic Alarm Correlation for Fault Identification," IEEE INFOCOM, Vol. 2, pp. 553-561, April 1995.
[SHEE96]K. R. Sheers, HP OpenView Event Correlation Services (ECS), Hewlett-Packard Journal, Article 4, Oct. 1996.
[SPEC99]SPECTRUM ,“Incorporationg Business Process Management into Network and Systems Management”, http://www.cabletron.com / ,1999
[SUZU97]Motohiro Suzuki, Yoshiaki Kiriha, and Shoichiro nakai, ‘Delegation Agents : Design and Implemention’ , International Symposium on Integrated Network Management, pp. 742-751, 1997.
[TSOT99]J. Tsotras, V. Phalke, anil Kumar, and B. Gopinath, “Supporting Temporal Views in a Management Information Base,” Journal of Network and Systems Management, Vol. 7, No. 2, pp.149-176, 1999.
[WHAN89]K. Y. Whang and S. Brady, “High-performance Expert System-DBMS Interface for Network Management and Control,” Selected Areas in Communications, IEEE Journal on, Vol. 73, pp. 408-417, April 1989.
[WU96]D. M. Wu, K. L. To, and R. P. Zeletin “Four Level TMN Viewpoints for Fault Management”, Journal of Network and System Management, vol. 4, no. 1, pp. 31-47, 1996.
[YEMI96]S. A. Yemini, S. Kliger, E. Mozes, Y. Yemini, and D. Ohsie, “High Speed and Robust Event Correlation”, IEEE Communication Magazine, pp.82-90, May 1996.
[YEMI97]S. Yemini, S. Kliger, E. Mozes, Y. Yemini, and D. Ohsie, “High Speed & Robust Event Correlation,” System Management ARTS, 1997.
[YU99]C. H. Yu, An Experiment-based Knowledge Database of Local Area Network Fault Behavior, Master Thesis, Institute of Information Engineering, Feng Chia University, Jun. 1999.
[ZHAO96]M. Zhao, C. Leckie, and C. Rowles, “An Interactive Fault Diagnosis Expert System for a Helpdesk Application,” Expert System, Vol. 13, No. 3. pp. 203-216, Aug. 1996.
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top
系統版面圖檔 系統版面圖檔