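Graduate student: Hao-Chang Kao (高顥璋)
Thesis title: Storing Historical Version Data in XML Web Warehouse Using Object-Oriented Tree Structure (以物件導向樹狀結構儲存XML網頁資料倉儲之歷史版本資料)
Advisor: Ching-Ming Chao (趙景明)
Degree: Master's
Institution: Soochow University (東吳大學)
Department: Department of Information Science (資訊科學系)
Discipline: Engineering
Field: Electrical and Computer Engineering
Thesis type: Academic thesis
Year of publication: 2005
Academic year of graduation: 93 (ROC calendar)
Language: Chinese
Pages: 91
Keywords (Chinese record): XML Web Warehouse; version; binary tree path; Edit Script
Keywords (English record): sequential Edit Script; binary tree storage structure; version; XML Web Warehouse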
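The Web warehouse in this study is an XML Web Warehouse, dedicated to storing XML Web data. We borrow selected object-oriented techniques: each element of an XML Web document is treated as an object, and the document as a whole is treated as a tree-structured object.
For the historical data of XML Web pages, we observe that a change to an XML page usually affects only some elements rather than the entire document. We therefore treat the XML data captured at each point in time as an independent version and extend versioning down to every element. By mapping version numbers to times, an update stores only the versions of the changed elements and their information, which avoids the cumbersome work of storing the whole document again.
To store the history of changed elements, we use Edit Scripts to record how each element's content evolves. We then adopt a binary-tree-path storage structure to improve on the sequential Edit Script structure, making both storage and retrieval in the XML Web Warehouse more systematic and efficient. The final experiments show that the version binary tree structure, while still saving space, also reduces the heavy computation required by sequential Edit Scripts and achieves better performance.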
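The element-level versioning described in this abstract can be sketched roughly as follows. This is only an illustrative sketch, not the thesis's data model: the names `ElementNode`, `VersionClock`, `update`, `content_at`, and `version_at` are hypothetical, timestamps are assumed to be ISO date strings, and each history entry stores the full element text rather than an Edit Script to keep the example short.

```python
from bisect import bisect_right
from dataclasses import dataclass, field


@dataclass
class ElementNode:
    """One XML element treated as an object in a tree (hypothetical model)."""
    tag: str
    # (version number, text) pairs, appended only when the text changes
    history: list = field(default_factory=list)
    children: list = field(default_factory=list)

    def update(self, version, text):
        # Store a new entry only if this element actually changed.
        if not self.history or self.history[-1][1] != text:
            self.history.append((version, text))

    def content_at(self, version):
        # Latest stored entry whose version number is <= the requested one.
        versions = [v for v, _ in self.history]
        i = bisect_right(versions, version)
        return self.history[i - 1][1] if i else None


class VersionClock:
    """Maps version numbers to capture times (assumed table layout)."""

    def __init__(self):
        self.times = []  # times[k] is the capture time of version k + 1

    def new_version(self, timestamp):
        self.times.append(timestamp)
        return len(self.times)

    def version_at(self, timestamp):
        # Newest version captured at or before the given time (0 if none).
        return bisect_right(self.times, timestamp)


clock = VersionClock()
title = ElementNode("title")
price = ElementNode("price")
root = ElementNode("item", children=[title, price])

v1 = clock.new_version("2005-01-01")
title.update(v1, "Widget")
price.update(v1, "10")

v2 = clock.new_version("2005-02-01")
title.update(v2, "Widget")  # unchanged, so nothing new is stored
price.update(v2, "12")      # changed, so one new entry is stored
```

In this sketch a query by time first resolves the time to a version number through `VersionClock`, then asks each element for its content at that version, so unchanged elements never need a new entry.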
The Web warehouse we study is an XML Web Warehouse that specializes in storing XML Web data. We use selected object-oriented techniques to treat every element of an XML document as an object, and we regard the XML Web data as a whole as a tree-structured object.
An update to XML Web data usually changes only some elements rather than the whole XML document. We regard the historical data captured at each time as an independent version. We separate the temporal data from the XML document and use a mapping between versions and times to extend the versioning of XML Web data to every element. For the historical data of a changed element, we store only the content of each element version, recorded as an Edit Script. The binary tree storage structure we adopt improves on the sequential Edit Script structure. Experiments at the end of this research confirm that the approach works as analyzed.
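The difference between a sequential Edit Script chain and a binary tree arrangement can be illustrated with a small sketch. The record does not describe the thesis's actual structure, so the layout below is an assumption made purely for illustration: version v stores an edit script against version v // 2 (heap-style parent numbering), while the sequential store diffs against v - 1. The names `make_script`, `apply_script`, and `VersionStore` are hypothetical, and the diffing uses Python's standard `difflib`, not the thesis's algorithm.

```python
from difflib import SequenceMatcher


def make_script(old, new):
    """Edit script turning list `old` into list `new`: (start, end, replacement)."""
    ops = SequenceMatcher(a=old, b=new, autojunk=False).get_opcodes()
    return [(i1, i2, new[j1:j2]) for tag, i1, i2, j1, j2 in ops if tag != "equal"]


def apply_script(old, script):
    """Apply an edit script produced by make_script and return the new list."""
    out, pos = [], 0
    for start, end, replacement in script:
        out.extend(old[pos:start])   # copy the untouched run
        out.extend(replacement)      # emit the replacement tokens
        pos = end
    out.extend(old[pos:])
    return out


class VersionStore:
    """Stores version 1 in full and every later version as an edit script."""

    def __init__(self, base_of):
        self.base_of = base_of  # maps a version to the version it is diffed against
        self.scripts = {}
        self.first = None
        self.latest = 0

    def add(self, content):
        self.latest += 1
        v = self.latest
        if v == 1:
            self.first = list(content)
        else:
            base, _ = self.get(self.base_of(v))
            self.scripts[v] = make_script(base, list(content))
        return v

    def get(self, v):
        """Return (content of version v, number of scripts applied)."""
        path = []
        while v > 1:
            path.append(v)
            v = self.base_of(v)
        content = self.first
        for step in reversed(path):
            content = apply_script(content, self.scripts[step])
        return content, len(path)


sequential = VersionStore(lambda v: v - 1)   # each version diffs against the previous one
tree_path = VersionStore(lambda v: v // 2)   # assumed binary tree parent numbering

versions = [[f"item{i}" for i in range(k)] + [f"rev{k}"] for k in range(1, 17)]
for content in versions:
    sequential.add(content)
    tree_path.add(content)

print(sequential.get(16)[1], tree_path.get(16)[1])  # prints "15 4"
```

Under this assumed layout, rebuilding version 16 applies 15 scripts in the sequential chain but only 4 along the tree path (16, 8, 4, 2). The trade-off is that a script against a more distant version may be larger, so this sketch says nothing about the space results reported in the thesis.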
Abstract (Chinese)
Abstract (English)
Table of Contents
List of Figures
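1. Introduction
1.1 Research Background
1.2 Research Motivation
1.3 Research Objectives
1.4 Research Methods and Steps
1.4.1 Designing a Suitable Architecture for Storing Historical Data
1.4.2 Designing the XML Web Warehouse in the Broad Sense and Its Data Model
1.4.3 Devising Algorithms for Storing and Retrieving Historical Version Data
1.4.4 Experiments and Performance Analysis
1.5 Thesis Organization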
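2. Related Technologies
2.1 Introduction to Data Warehousing
2.1.1 Overview of Data Warehouses
2.1.2 Definition of a Data Warehouse
2.1.3 Components and Structure of a Data Warehouse
2.1.4 Applications of Data Warehouses
2.2 Extensible Markup Language (XML)
2.2.1 Overview of XML
2.2.2 Characteristics of XML
2.3 Object-Oriented Technology
2.3.1 Basic Concepts
2.3.2 Components and Main Concepts
2.3.3 Advantages of Object Orientation
2.3.4 Applications and Development of Object Orientation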
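3. Literature Review
3.1 Handling Historical Time in Data Warehouses
3.1.1 Historical Data in Data Warehouses
3.1.2 Changes in Historical Data
3.1.3 Timestamps for Historical Data
3.2 Web Warehouse Systems
3.2.1 WHOWEDA
3.2.2 Xyleme
3.2.3 Other Related Work
3.3 Object-Oriented Technology and Object-Oriented Databases
3.4 Edit Script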
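4. XML Web Warehouse
4.1 The XML Web Warehouse in the Narrow Sense
4.2 Logical Architecture and Advantages of Storing Version Data with Binary Tree Paths
4.3 Data Model of the XML Web Warehouse
4.4 Time Data Conversion Table
5. The XML Web Warehouse in the Broad Sense
5.1 Restrictions on Data Accepted by the XML Web Warehouse
5.2 Matching Data Objects: The MATCH Algorithm
5.3 Data Storage and the XMLDSA Algorithm
5.4 Data Retrieval and the XMLDGA Algorithm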
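6. Architecture Performance Analysis
6.1 Version Data Using Sequential Edit Scripts
6.2 Version Data Using Binary-Tree-Path Edit Scripts
7. System Experiments and Comparison
7.1 Experimental Setup
7.2 Experimental Data
7.3 Analysis of Experimental Results
7.4 Building a Relational Database for Comparison
7.4.1 Storage Structure of the Relational Database
7.4.2 Comparison of Relational and Object-Oriented Databases
8. Conclusions and Future Work
8.1 Conclusions
8.2 Future Work
References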