|
|