跳到主要內容

臺灣博碩士論文加值系統

(216.73.217.130) 您好!臺灣時間:2026/06/17 05:57
字體大小: 字級放大   字級縮小   預設字形  
回查詢結果 :::

詳目顯示

: 
twitterline
研究生:李佳旻
研究生(外文):Li, Chia Min
論文名稱:基於雙層的分群策略實施高效率大規模圖像建模
論文名稱(外文):Efficient Large-Scale Image-Based Modeling 
Using Divide and Conquer Strategy with Two-Layer Clustering
指導教授:陳冠文陳冠文引用關係
指導教授(外文):Chen, Kuan-Wen
口試委員:林惠勇陳永昇陳煥宗陳冠文
口試委員(外文):Lin, Huei-YungChen, Yong-ShengChen, Hwann-TzongChen, Kuan-Wen
口試日期:2018-07-31
學位類別:碩士
校院名稱:國立交通大學
系所名稱:多媒體工程研究所
學門:電算機學門
學類:軟體發展學類
論文種類:學術論文
論文出版年:2018
畢業學年度:106
語文別:英文
論文頁數:41
中文關鍵詞:大範圍圖像建模雙層分群分群策略
外文關鍵詞:large-scaleimage-based modelingdivide-and-conquer
相關次數:
  • 被引用被引用:0
  • 點閱點閱:244
  • 評分評分:
  • 下載下載:11
  • 收藏至我的研究室書目清單書目收藏:0
雖然基於圖像的3D立體建模已得到廣泛開發,但其計算成本和內存需求通常是主要需要克服的問題,特別是當我們想要建立大規模甚至城市規模的立體模型但只能使用個人電腦時。因此在本文中,我們提出了一種高效率的大規模基於圖像的立體建模流程方法,該方法使用具有位置和圖像相似性的分群策略。允許普通人只需一台個人電腦即可輕鬆構建自己的大型立體模型。此外,與過去需要用戶手動選擇或輸入數千個圖像的方法不同,我們的方法只需要用戶任意拍攝場景的多個影片,對於實際應用來說更容易,也更實用。本文的主要思想是使用基於位置信息(意即GPS)和圖像相似性(意即特徵匹配和極線幾何)的分群策略。我們首先將影片劃分為多個根據位置信息而聚類的小組影片剪輯,然後再將這些小組影片根據圖像相似性劃分為更小的圖像聚類。最後,提出了將多個小規模立體模型組合成大規模立體場景模型的框架。由本文所提出的雙層聚類的方法將大大降低計算要求,實驗結果表明其可行性和準確性。根據我們的目前已知的資訊,這是第一次使用基於位置和圖像信息的雙層聚類來進行大規模立體模型構建。
Image-based modeling has been widely developed, but its computational cost and memory requirement are usually the main issues especially when we want to build a large-scale or even city-scale model but only personal computer can be used. In this paper, we propose an efficient large-scale image-based modeling approach which uses divide and conquer strategy with both location and image similarity. It allows normal people can easily build their own large-scale models with only a PC. In addition, unlike previous methods, which require users to select or input thousands of images manually, our approach only needs users to take multiple videos of the scene arbitrarily and it is easier and more practical for real applications. The main idea of this paper is using divide and conquer strategy based on both location information, i.e. GPS, and image similarity, i.e. feature matching and epipolar geometry. We firstly divide the videos into multiple small groups of video clips with location clustering and then divide these groups into further smaller clusters of images with image clustering. Finally, a framework of combining multiple small-scale models into a large-scale one is proposed. The two-layer clustering will decrease the computational requirements very much and the experimental results show its feasibility and accuracy. This is the first work using two-layer clustering based on both location and image information for large-scale model construction, to our best knowledge.
摘要 I
Abstract II
誌謝 III
List of Contents IV
List of Figures V
List of Tables VI
Chapter 1 Introduction 1
Chapter 2 Related Work 5
Chapter 3 System Overview 8
Chapter 4 Method 12
4.1 Divide Stage: Clustering 12
4.1.1 Location Clustering 12
4.1.2 Extracting keyframes 13
4.1.3 Image Clustering 14
4.2 Conquer Stage: Construction 16
4.2.1 Build Models 17
4.2.2 Extract the Connected Images of Image Clustering 18
4.2.3 Extract the Connected Images of Location Clustering. 22
4.3 Merge Stage: Transformation and Registration 23
4.3.1 Combine Image Clustering Model 23
4.3.2 Adjust the Combined Model 24
4.3.3 Combine and adjust Location Clustering Model 25
Chapter 5 Experiment 27
5.1 Merging the Models of Image Clustering 27
5.2 Merging the Models of Location Clustering 31
5.3 Comparison 34
Chapter 6 Conclusion and Future Work 38
6.1 Conclusion 38
6.2 Future Work 38
Chapter 7 References 40
[1] N. Snavely, S. Seitz, and R. Szeliski, “Photo tourism: Exploring photo collections in 3d,” Proceedings of ACM SIGGRAPH, pp. 835–846, 2006
[2] N. Snavely, S. Seitz, and R. Szeliski, “Modeling the world from internet photo collections,” International Journal of Computer Vision 80, pp. 189–210, 2008
[3] N. Snavely, S. Seitz, and R. Szeliski, “Skeletal graphs for efficient structure from motion,” CVPR, 2008
[4] S. Agarwal, Y. Furukawa, N. Snavely, I. Simon, B. Curless, S.M. Seitz, and R. Szeliski, “Building rome in a day,“ Communications of the ACM, pp. 105–112, 2011
[5] C. Wu, “Towards linear-time incremental structure from motion,” Proceedings of the International Conference on 3D Vision, pp. 127–134, 2013
[6] P. Moulon, P. Monasse, and R. Marlet, “Global fusion of relative motions for robust, accurate and scalable structure from motion,” ICCV, pp. 3248–3255, 2013
[7] R. Roberts, S.N. Sinha, R. Szeliski, and D. Steedly, “Structure from motion for scenes with large duplicate structures,” CVPR, 2011
[8] C. Sweeney, T. Sattler, T. Hollerer, M. Turk, and M. Pollefeys, “Optimizing the viewing graph for structure-from-motion” ICCV, pp. 801–809 ,2015
[9] T. Shen, S. Zhu, T. Fang, R. Zhang, and L. Quan, “Graph-Based Consistent Matching for Structure-from-Motion,” ECCV, 2016
[10] M. Havlena, A. Torii, and T. Pajdla, “Efficient structure from motion by graph optimization,” ECCV, 2010
[1] B. Bhowmick, S. Patra, A. Chatterjee, V.M. Govindu, and S. Banerjee, “Divide and conquer: Efficient large-scale structure from motion using graph partitioning,” ACCV ,2014
[2] C. Sweeney, V. Fragoso, T. Hollerer, and M. Turk, “Large Scale SfM with the Distributed Camera Model,” 3DV ,2016
[3] R. Mur-Artal, J.M.M. Montiel, and J.D. Tardos, “ORB-SLAM: a versatile and accurate monocular SLAM system,” IEEE Transactions on Robotics, 2015
[4] B. Triggs, P. Mclauchlan, R. Hartley, and A. Fitzgibbon, “Bundle adjustment – a modern synthesis. Vision Algorithms” Theory and Practice, LNCS, (2000) pp. 298–372, 2000
[5] S. Agarwal, N. Snavely, S. Seitz, and R. Szeliski, “Bundle adjustment in the large.” Proceedings of the European Conference on Computer Vision, pp. 29-42, 2010
[6] P.J. Besl, and N.D. McKay, “Method for registration of 3-D shapes,” Sensor Fusion IV: Control Paradigms and Data Structures. International Society for Optics and Photonics, Vol. 1611, 1992
[7] T. Shiratori, J. Berclaz, M. Harville, C. Shah, T. Li, Y. Matsushita, and S. Shiller, “Efficient large-scale point cloud registration using loop closures,” 3DV, 2015
[8] D. Ashbrook, and T. Starner, “Learning significant locations and predicting user movement with GPS,” ISWC, 2002
[9] D. Nister, and H. Stewenius, “Scalable recognition with a vocabulary tree,” CVPR, pp. 2161–2168, 2006
[10] U. Von Luxburg, “A tutorial on spectral clustering.” Statistics and computing 17.4, pp. 395-416, 2007
[11] D. Fleet, and Y. Weiss, “Optical Flow Estimation,” Handbook of mathematical models in computer vision, pp 237-257, 2006
[12] J. Shi, and J. Malik, “Normalized cuts and image segmentation,” IEEE Trans. Pattern Anal. Mach. Intell. 22, pp. 888–905 ,2000
[13] D.G. Lowe, “Distinctive image features from scale-invariant keypoints,” IJCV, pp. 91–110, 2004
[14] Helmert Transformation, https://en.wikipedia.org/wiki/Helmert_transformation.
[15] G.A. Watson, and G.A., “Computing Helmert transformations,” Journal of computational and applied mathematics, 2006
連結至畢業學校之論文網頁點我開啟連結
註: 此連結為研究生畢業學校所提供,不一定有電子全文可供下載,若連結有誤,請點選上方之〝勘誤回報〞功能,我們會盡快修正,謝謝!
QRCODE
 
 
 
 
 
                                                                                                                                                                                                                                                                                                                                                                                                               
第一頁 上一頁 下一頁 最後一頁 top