跳到主要內容

臺灣博碩士論文加值系統

(18.97.9.168) 您好!臺灣時間:2025/01/16 17:04
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

: 
twitterline
研究生:麥舘碩
研究生(外文):Guan-Shuo Mai
論文名稱:鍵連開放的生態資料:一個生態資料分享的新方法
論文名稱(外文):Linked Open Data of Ecology : A New Approach of Data Sharing
指導教授:夏禹九夏禹九引用關係林朝欽林朝欽引用關係
指導教授(外文):Yue-Joe HsiaChau-Chin Lin
學位類別:碩士
校院名稱:國立東華大學
系所名稱:自然資源與環境學系
學門:環境保護學門
學類:環境資源學類
論文種類:學術論文
論文出版年:2011
畢業學年度:99
語文別:中文
論文頁數:88
中文關鍵詞:資源描述框架鍵連資料生態學資料分享
外文關鍵詞:RDFLinked DataEcologyData Sharing
相關次數:
  • 被引用被引用:3
  • 點閱點閱:706
  • 評分評分:
  • 下載下載:109
  • 收藏至我的研究室書目清單書目收藏:0
生態學是一門重視整合與合作的科學,分享生態資料是解決跨領域整合問題的需求。以metadata標準或是知識本體論(ontology) 為導向的生態資料管理系統,由於缺少一致的存取平台與標準,而難以跨越系統的藩籬。有鑑於此,本研究嘗試以近年發展的鍵連資料(Linked Data)風格建立新的生態資料分享方式。鍵連資料是Web發明人Tim Berners-Lee為實現下一代網路─語意網─所提出的資料分享的實務方式。它的四個原則善用了既有的Web架構以及資源描述框架(簡稱RDF)這個單一標準,讓資料在最細的單元彼此連結,形成有別於傳統網路的資料網,並能讓機器協助資料處理作業,更跨越了系統的界線而彼此連結。本研究串連了五個主題或內容相異,但又具有不同程度的關連性的資料集,包括了林火資料庫、林業試驗所的植物與昆蟲標本資料庫、楠溪森林動態樣區資料,以及TaiBNET提供的台灣物種名錄資料庫等等。串連的結果命名為「鍵連開放的生態資料」(簡稱LODE)並發布於資料網上。發布過程可歸納為1)準備原始資料並建立資料模型,2)將資料轉換為RDF的形式,3)將RDF資料載入專屬的系統並以推理器完備資料中隱含的語意,4)透過鍵連資料介面發布資料並在LODE與其他資料網上的資料集之間建立連結。為展現鍵連資料為分散的資料集所帶來的交互操作性,本研究建立了數個操作實例來示範如何尋訪連結以探索未知來源的動態資料、聯合查詢多來源靜態資料集、探索資料間的關係,以及以進階的查詢指令示範資料間的互動等等。藉由幾個現有的資料庫操作實例,本研究討論了鍵連資料對生態資料分享的合適性、資料品質、整合分類學資料時可能忽略的細節、將發布的資料整合進入半自動化的科學工作流程系統的可行性、以及與現有的資料管理方式整合的途徑。最後提出依鍵連資料風格分享生態資料,改進現有資料模型與發布架構的方針。
Ecology is a discipline emphasizing integrative and collaborative approaches, and data sharing could facilitate integration among different ecology subdisciplines. However, the lack of semantics and unified access platform in prevalent ecological information management systems hamper the data sharing considerably. In this study, a newly developing framework to connect data on the Web, Linked Data, is proposed to be a suitable practice for exposing, sharing and connecting pieces of ecological data. The four principles built upon standard Web technologies and Resource Description Framework (RDF) make data linkable and machine-readable in the finest level of granularity, interweaving data silos into the Web of Data. The new-proposed framework for sharing ecological data is named Linked Open Data of Ecology (LODE). As a case study, five existing datasets with different but related contents was published as LODE. The data sets are forest fire database, plant and insect specimen databases established byTFRI, the Nanshi forest dynamics plot census data provided by Providence University, and the database of catalogue of life in Taiwan provided by Academia Sinica. The data publishing workflow can be summarized into 4 steps: 1) to prepare raw data and to build general data models, 2) to transform raw data into RDF data fitting models from step 1, 3) to load RDF data and models into RDF repositories with reasoners to merge data and to complete semantics,4) to publish through Linked Data interface. Using the above schema links among 5 source datasets as well as other datasets on the Web of Data could then be established. The scalability and interpretability of this data framework were demonstrated with dynamic data discovery by link traversing, federated querying multiple static datasets, exploring relationships among data instances, and advanced queries. The suitability of this new approach in ecological data sharing is then discussed. Problems on data quality, integrating taxonomic data, analysis of ecological data automatically under LODE framework using scientific workflow tools such as Kepler, and the need to improve current data models were also explored. This study concludes that Linked Data is suitable for ecological data sharing but improvement of data publishing workflow and the user interface are needed.
1. 前言 1
2. 生態資料管理綜論 3
2.1 資料管理與生態學 3
2.2 生態資料管理方式 6
2.2.1 保存與倉儲 6
2.2.2 Metadata、Metadata標準與資料分享 7
2.3 語意網脈絡下的生態資料管理 11
2.3.1 知識本體論 11
2.3.2 鍵連資料 14
2.3.3 RDF倉儲系統與SPARQL概述 20
3. 材料與方法 23
3.1 準備原始資料以及資料模型 25
3.1.1 準備原始資料 25
3.1.2 統一的資料概念模型設計 27
3.1.3 字彙收集 30
3.2 資料轉換 31
3.2.1 建立對應檔 32
3.2.2 特殊考量與操作 33
3.2.3 轉換資料 35
3.3 將資料載入RDF倉儲系統 39
3.3.1 BigOWLIM系統 40
3.3.2 載入前處理 40
3.3.3 合併資料與關係推理 40
3.4 發布與連結資料 44
3.4.1 透過鍵連資料界面發布資料 45
3.4.2 建立內部與對外連結 46
3.4.3 描述LODE資料集 47
3.5 其他工具 48
3.5.1 SQUIN 48
3.5.2 MAP4RDF 48
3.5.3 RelFinder 48
4. 操作實例 51
4.1 直接瀏覽 51
4.2 物種名與分布地查詢 56
4.3 單一物種於世界尺度的分布 60
4.4 以地找物 62
4.5 探索資料間的關係 64
4.6 衍生範例─SPARQL查詢操作 66
5. 討論 71
5.1 鍵連資料應用於生態學的適宜性 71
5.2 與科學工作流程整合的潛力 72
5.3 原始資料品質與內容問題 75
5.4 引申自鍵連資料框架本身的議題 75
5.5 物種分類與實體身份認定問題 77
6. 後續研究方向 81
6.1 改善現有的資料模型 81
6.2 補足資料轉換與呈現方式 81
6.3 與既有框架與文化整合的資料分享願景 82
7. 引用文獻 83
附錄 87
林朝欽、陸聲山、鄭美如。2006。生態資訊學研究與生態資訊管理之推展。林業研究專訊13(6):4-8。
林朝欽、許正欣、夏禹九、褚侯森、陸聲山。2009。科學工作流程系統(Kepler)在二氧化碳通量計算上之應用。中華林學季刊42(1):75-88。
夏禹九、金恆鑣、林敏雄。2000。長期生態研究的台灣經驗。科學發展月刊 28(9):679-685。
Anonymous. 2009. Data's shameful neglect. Nature 461(7261):145-145.
Arzberger, P., P. Schroeder, A. Beaulieu, G. Bowker, K. Casey, L. Laaksonen, D. Moorman, P. Uhlir, and P. Wouters. 2004. An International Framework to Promote Access to Data. Science 303(5665):1777-1778.
Berkley, C., S. Bowers, M. B. Jones, S. J. Madin, and M. P. Schildhauer. 2009. Improving data discovery for metadata repositories through semantic search.In Proceeding of the International Workshop on Intelligent Systems for Environmental (Knowledge) Engineering and EcoInformatics, Fukuoka, Japan.
Bermudez, L. E. 2004. Ontomet: ontology metadata framework. Ph.D. Thesis. Drexel University, Philadelphia.
Berners-Lee, T. 1998a. Relational Databases on the Semantic Web. http://www.w3.org/DesignIssues/RDB-RDF.html.
Berners-Lee, T. 1998b. Semantic Web Road map. http://www.w3.org/DesignIssues/Semantic.html.
Berners-Lee, T. 1998c. What the Semantic Web can represent. http://www.w3.org/DesignIssues/RDFnot.html.
Berners-Lee, T. 2006. Linked Data. International Journal on Semantic Web and Information Systems 4(2):1. Availabel at: http://www.w3.org/DesignIssues/LinkedData.html.
Berners-Lee, T., J. Hendler, and O. Lassila. 2001. The semantic web. Scientific American 284(5):34-43.
Bizer, C., and R. Cyganiak. 2006. Publishing Relational Databases on the Semantic Web.In The 5th International Semantic Web Conference, Athens, USA.
Bizer, C., T. Heath, D. Ayers, and Y. Raimond. 2007. Interlinking Open Data on the Web. Pages 802-815 In the 4th European Semantic Web Conference.
Bizer, C., T. Heath, and T. Berners-Lee. 2009. Linked Data - The Story So Far. International Journal on Semantic Web and Information Systems 5(3):1-22.
Earl, M. J. 1994. Knowledge as strategy: reflections on Skandia International and Shorko Films. Pages 53-69 Strategic information systems. John Wiley \& Sons, Inc.
Elmasri, R., and S. B. Navathe. 2007. Fundamentals of Database Systems. 5 edition. Addison-Wesley, Harlow, England.
Gray, J. 2007. Jim Gary on eScience: A Transformed Scientific Method. Pages xvii-xxxi in T. Hey, S. Tansley, and K. Tolle, editors. The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research.
Gruber, T. R. 1995. Toward principles for the design of ontologies used for knowledge sharing. Int. J. Hum.-Comput. Stud. 43(5-6):907-928.
Halpin, H., and P. J. Hayes. 2010. When owl:sameAs isn't the Same: An Analysis of Identity Links on the Semantic Web.In Linked Data On the Web 2010, Raleigh, North Carolina, USA.
Heath, T., and C. Bizer. 2011. Linked Data: Evolving the Web into a Global Data Space. Morgan & Claypool Publishers. Available at: http://www.morganclaypool.com/doi/abs/10.2200/S00334ED1V01Y201102WBE001.
Jain, P., P. Hitzler, P. Z. Yeh, K. Verma, and A. P. Sheth. 2009. Linked data is merely more data. Pages 82-86 In Linked Data Meets Artificial Intelligence. Association for the Advancement of Artificial Intelligence Press.
Jones, M. B., M. P. Schildhauer, O. J. Reichman, and S. Bowers. 2006. The New Bioinformatics: Integrating Ecological Data from the Gene to the Biosphere. Annual Review of Ecology, Evolution, and Systematics 37(1):519-544.
Lippincott, C. L. 2000. Effects of Imperata cylindrica (L.) Beauv. (Cogongrass) Invasion on Fire Regime in Florida Sandhill (USA). Natural Areas Journal 20(22):140-149.
Mackenzie, A., A. S. Ball, and S. R. Virdee. 2001. Instant Notes Ecology. 2 edition. BIOS Scientific Publishers Ltd, Oxford, United Kingdom.
Madin, S. J., S. Bowers, M. P. Schildhauer, and M. B. Jones. 2008. Advancing ecological research with ontologies. Trends in Ecology and Evolution 23(3):159-168.
Madin, S. J., S. Bowers, M. P. Schildhauer, S. Krivov, D. D. Pennington, and F. Villa. 2007. An ontology for describing and synthesizing ecological observation data. Ecological Informatics 2:279-296.
Magnuson, J. J. 1990. Long-term ecological research and the invisible present. BioScience 40:495-501.
McCusker, J. P., and D. L. McGuinness. 2010. Towards Identity in Linked Data.In Proceedings of OWL Experiences and Directions Seventh Annual Workshop.
Megrey, B. A., S. A. Macklin, K. Bahl, and P. D. Klawitter. 2007. Metadata federation of pices member countries. North Pacific Marine Science Organization (PICES), Sidney, B.C., Canada.
Michener, W. K. 2006. Meta-information concepts for ecological data management. Ecological Informatics 1(1):3-7.
Michener, W. K., J. W. Brunt, J. Helly, T. B. Kirchner, and S. G. Stafford. 1997. Non-geospatial metadata for the ecological science. Ecological Applications 7:330-342.
Parr, C. S., and M. P. Cummings. 2005. Data Sharing in Ecology and Evolution. Trends in Ecology and Evolution 20(7):362-363.
Pennington, D. D., and W. K. Michener. 2005. The ecogrid and the kepler workflow system: a new platform for conducting ecological analyses. Bulletin of the Ecological Society of America 86(3):169-176.
Pickett, S. T. A., J. Kolasa, and C. G. Jones. 2007. Ecological Enderstanding: the Nature of Theory and the Theory of Nature. 2nd edition. Academic Press, Waltham, Massachusetts, United States.
Pinto, H. S., and J. P. Martin. 2004. Ontologies: how can they be built? Knowledge and Information Systems 6(4):441-464.
Porter, J. H. 2010. A Brief History of Data Sharing in the U.S. Long Term Ecological Research Network. Bulletin of the Ecological Society of America 91(1):14-20.
Reichman, O. J., M. B. Jones, and M. P. Schildhauer. 2011. Challenges and Opportunities of Open Data in Ecology. Science 331(6018):703-705.
San Gil, I., M. White, E. Melendez, and K. Vanderbilt. 2010. Case Studies of Ecological Integrative Information Systems: The Luquillo and Sevilleta Information Management Systems. Pages 18-35 in S. Sánchez-Alonso and I. N. Athanasiadis, editors. Metadata and Semantic Research. Springer Berlin Heidelberg.
Smith, B. 2001. Objects and their environments: from Aristotle to ecological ontology. Pages 79-97 in A. Frank, editor. Life and Motion of Socio-Economic Units. Taylor and Francis.
Sutherland, W. J. 2006. Ecological Census Techniques: A Handbook. Cambridge University Press, Cambridge, UK.
Tuomi, I. 1999. Data is more than knowledge: implications of the reversed knowledge hierarchy for knowledge management and organizational memory. J. Manage. Inf. Syst. 16(3):103-117.
Vanderbilt, K., J. Cushing, J. Gao, N. Kaplan, J. Kruger, C. Leroy, J. Mallett, K. Ramsey, and L. Zeman. 2009. Data integration challenges: an example from the International Long-Term Ecological Research Network (ILTER). Ecological Circuits 2:12-13.
Vanderbilt, K. L., D. Blankman, X. B. Guo, H. L. He, J. H. Li, C. C. Lin, S. S. Lu, Burke, C. J. Ko, A. Ogawa, E. O. Tuama, H. Schentz, S. Wen, and B. v. d. Werf. 2008. Building an information management system for global data sharing: A strategy for the international long term ecological research network.In Environmental Information Management, University of New Mexico.
Ziegler, P., and K. Dittrich. 2004. Three Decades of Data Intecration — all Problems Solved? Pages 3-12 in R. Jacquart, editor. Building the Information Society. Springer Boston.
連結至畢業學校之論文網頁點我開啟連結
註: 此連結為研究生畢業學校所提供,不一定有電子全文可供下載,若連結有誤,請點選上方之〝勘誤回報〞功能,我們會盡快修正,謝謝!
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top