|
[1] Krizhevsky, A., Sutskever, I., & Hinton, G. E., "Imagenet classification with deep convolutional neural networks", 2012 Advances in neural information processing systems, pp. 1097-1105, 2012. [2] ImageNet Large Scale Visual Recognition Challenge http://www.image-net.org/challenges/LSVRC/ [3] The PASCAL Visual Object Classes http://host.robots.ox.ac.uk/pascal/VOC/ [4] ImageNet dataset http://image-net.org/ [5] COCO dataset http://cocodataset.org/#home [6] Girshick, R., Donahue, J., Darrell, T. and Malik, J., "Rich feature hierarchies for accurate object detection and semantic segmentation." 2014 Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 580-587. 2014. [7] He, K., Zhang, X., Ren, S. and Sun, J., "Spatial pyramid pooling in deep convolutional networks for visual recognition." IEEE transactions on pattern analysis and machine intelligence 37, no. 9 (2015): 1904-1916. [8] Girshick, R., "Fast r-cnn." 2015 Proceedings of the IEEE international conference on computer vision, pp. 1440-1448. 2015. [9] Ren, S., He, K., Girshick, R. and Sun, J., "Faster r-cnn: Towards real-time object detection with region proposal networks." 2015 Advances in neural information processing systems, pp. 91-99. 2015. [10] Dai, J., Li, Y., He, K. and Sun, J., "R-fcn: Object detection via region-based fully convolutional networks." 2016 Advances in neural information processing systems, pp. 379-387. 2016. [11] Sermanet, P., Eigen, D., Zhang, X., Mathieu, M., Fergus, R. and LeCun, Y., "Overfeat: Integrated recognition, localization and detection using convolutional networks." arXiv preprint arXiv:1312.6229 (2013). [12] Redmon, J., Divvala, S., Girshick, R. and Farhadi, A., "You only look once: Unified, real-time object detection." 2016 Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 779-788. 2016. [13] Redmon, J. and Farhadi, A., "YOLO9000: better, faster, stronger." 2017 Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 7263-7271. 2017. [14] Redmon, J. and Farhadi, A., "Yolov3: An incremental improvement." arXiv preprint arXiv:1804.02767 (2018). [15] Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y. and Berg, A.C., "Ssd: Single shot multibox detector." 2016 European conference on computer vision, pp. 21-37. Springer, Cham, 2016. [16] Lin, T.Y., Goyal, P., Girshick, R., He, K. and Dollár, P., "Focal loss for dense object detection." 2017 Proceedings of the IEEE international conference on computer vision, pp. 2980-2988. 2017. [17] Simonyan, K. and Zisserman, A., "Very deep convolutional networks for large-scale image recognition." arXiv preprint arXiv:1409.1556 (2014). [18] Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Andreetto, M. and Adam, H., "Mobilenets: Efficient convolutional neural networks for mobile vision applications." arXiv preprint arXiv:1704.04861 (2017). [19] Zhao, H., Shi, J., Qi, X., Wang, X. and Jia, J., "Pyramid scene parsing network." 2017 Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2881-2890. 2017. [20] Yu, F. and Koltun, V., "Multi-scale context aggregation by dilated convolutions." arXiv preprint arXiv:1511.07122 (2015). [21] Chen, L.C., Papandreou, G., Schroff, F. and Adam, H., "Rethinking atrous convolution for semantic image segmentation." arXiv preprint arXiv:1706.05587 (2017). [22] Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V. and Rabinovich, A., "Going deeper with convolutions." 2015 Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 1-9. 2015. [23] Ioffe, S. and Szegedy, C., "Batch normalization: Accelerating deep network training by reducing internal covariate shift." arXiv preprint arXiv:1502.03167 (2015). [24] Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J. and Wojna, Z., "Rethinking the inception architecture for computer vision." 2016 Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 2818-2826. 2016. [25] Szegedy, C., Ioffe, S., Vanhoucke, V. and Alemi, A.A., "Inception-v4, inception-resnet and the impact of residual connections on learning." 2017 Thirty-First AAAI Conference on Artificial Intelligence. 2017.
|