National Digital Library of Theses and Dissertations in Taiwan

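Author: 李佳旻 (Li, Chia Min)
Title: Efficient Large-Scale Image-Based Modeling Using Divide and Conquer Strategy with Two-Layer Clustering
Title (Chinese): 基於雙層的分群策略實施高效率大規模圖像建模
Advisor: 陳冠文 (Chen, Kuan-Wen)
Oral defense committee: 林惠勇 (Lin, Huei-Yung), 陳永昇 (Chen, Yong-Sheng), 陳煥宗 (Chen, Hwann-Tzong), 陳冠文 (Chen, Kuan-Wen)
Oral defense date: 2018-07-31
Degree: Master's
Institution: National Chiao Tung University (國立交通大學)
Department: Institute of Multimedia Engineering (多媒體工程研究所)
Discipline: Computer Science
Field: Software Development
Thesis type: Academic thesis
Publication year: 2018
Graduation academic year: 106 (ROC calendar; 2017-2018)
Language: English
Keywords (translated from Chinese): large-scale, image-based modeling, two-layer clustering, clustering strategy
Keywords (English): large-scale, image-based modeling, divide-and-conquer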


Image-based 3D modeling has been widely developed, but its computational cost and memory requirements are usually the main obstacles, especially when building a large-scale or even city-scale model with only a personal computer. In this paper, we propose an efficient large-scale image-based modeling pipeline that uses a divide-and-conquer strategy based on both location and image similarity. It allows ordinary users to build their own large-scale models with only a PC. In addition, unlike previous methods, which require users to manually select or input thousands of images, our approach only requires users to take multiple videos of the scene arbitrarily, which is easier and more practical for real applications. The main idea of this paper is a divide-and-conquer strategy based on both location information (GPS) and image similarity (feature matching and epipolar geometry). We first divide the videos into multiple small groups of video clips by location clustering, and then divide these groups into smaller clusters of images by image clustering. Finally, we propose a framework for combining multiple small-scale models into a large-scale one. The two-layer clustering substantially reduces the computational requirements, and the experimental results show its feasibility and accuracy. To the best of our knowledge, this is the first work to use two-layer clustering based on both location and image information for large-scale model construction.
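The two-layer idea in the abstract can be illustrated with a minimal Python sketch. It is not the thesis's implementation: the record does not give the clustering algorithms or their parameters, so this sketch swaps in simple thresholded connected components for both layers. The function names (`haversine_m`, `connected_components`, `two_layer_clusters`), the thresholds `radius_m` and `min_sim`, and the precomputed `similarity` matrix (standing in for scores derived from feature matching and epipolar geometry) are all assumptions made for illustration.

```python
from math import asin, cos, radians, sin, sqrt


def haversine_m(a, b):
    """Great-circle distance in metres between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(radians, (a[0], a[1], b[0], b[1]))
    h = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * 6371000.0 * asin(sqrt(h))


def connected_components(n, linked):
    """Group indices 0..n-1 into components; linked(i, j) decides whether i and j join."""
    parent = list(range(n))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i in range(n):
        for j in range(i + 1, n):
            if linked(i, j):
                parent[find(i)] = find(j)

    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


def two_layer_clusters(gps, similarity, radius_m=100.0, min_sim=0.5):
    """Layer 1 groups items by GPS proximity; layer 2 splits each group by image similarity.

    gps: list of (lat, lon) per item (hypothetical input, e.g. one per keyframe).
    similarity: square matrix of pairwise image-similarity scores (assumed precomputed).
    """
    location_groups = connected_components(
        len(gps), lambda i, j: haversine_m(gps[i], gps[j]) <= radius_m
    )
    result = []
    for group in location_groups:
        sub = connected_components(
            len(group), lambda a, b: similarity[group[a]][group[b]] >= min_sim
        )
        # Map positions inside the group back to the original item indices.
        result.append([[group[k] for k in comp] for comp in sub])
    return result
```

The point of the structure is that the expensive pairwise image comparison only has to be evaluated inside each location group, so the all-pairs cost applies to small groups rather than to the whole collection. In a real pipeline the second layer would more likely use a graph-partitioning method than a fixed threshold.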

Abstract (Chinese)
Abstract
Acknowledgements
List of Contents
List of Figures
List of Tables
Chapter 1 Introduction
Chapter 2 Related Work
Chapter 3 System Overview
Chapter 4 Method
4.1 Divide Stage: Clustering
4.1.1 Location Clustering
4.1.2 Extracting Keyframes
4.1.3 Image Clustering
4.2 Conquer Stage: Construction
4.2.1 Build Models
4.2.2 Extract the Connected Images of Image Clustering
4.2.3 Extract the Connected Images of Location Clustering
4.3 Merge Stage: Transformation and Registration
4.3.1 Combine Image Clustering Model
4.3.2 Adjust the Combined Model
4.3.3 Combine and Adjust Location Clustering Model
Chapter 5 Experiment
5.1 Merging the Models of Image Clustering
5.2 Merging the Models of Location Clustering
5.3 Comparison
Chapter 6 Conclusion and Future Work
6.1 Conclusion
6.2 Future Work
Chapter 7 References
