|
[1] C.-L. Liu, F. Yin, D.-H. Wang, and Q.-F. Wang, “Casia online and offline Chinese handwriting databases,” in Document Analysis and Recognition (ICDAR), 2011 International Conference on, pp. 37–41, IEEE, 2011.
[2] C. Wang, Y. Qi, and X. Wang, “The Chinese characters extraction method based on area voronoi diagram in inscription,” in Virtual Reality and Visualization (ICVRV), 2015 International Conference on, pp. 109–116, IEEE, 2015.
[3] X. Wei, S. Ma, and Y. Jin, “Segmentation of connected Chinese characters based on genetic algorithm,” in Document Analysis and Recognition, 2005. Proceedings. Eighth International Conference on, pp. 645–649, IEEE, 2005.
[4] C. Hong, G. Loudon, Y. Wu, and R. Zitserman, “Segmentation and recognition of continuous handwriting Chinese text,” International journal of pattern recognition and artificial intelligence, vol. 12, no. 02, pp. 223–232, 1998.
[5] S. Zhao, Z. Chi, P. Shi, and Q. Wang, “Handwritten Chinese character segmentation using a two-stage approach,” p. 0179, 2001.
[6] O. E. Agazzi and S.-s. Kuo, “Hidden Markov model based optical character recognition in the presence of deterministic transformations,” Pattern recognition, vol. 26, no. 12, pp. 1813–1826, 1993.
[7] C. Bahlmann, B. Haasdonk, and H. Burkhardt, “Online handwriting recognition with support vector machines-a kernel approach,” in Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition, pp. 49–54, IEEE, 2002.
[8] C. Jawahar, M. P. Kumar, and S. R. Kiran, “A bilingual ocr for Hindi-Telugu documents and its applications,” in Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings., pp. 408–412, IEEE, 2003.
[9] X. Tong and D. A. Evans, “A statistical approach to automatic ocr error correction in context,” in Fourth Workshop on Very Large Corpora, 1996.
[10] B. Epshtein, E. Ofek, and Y. Wexler, “Detecting text in natural scenes with stroke width transform,” in 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 2963–2970, IEEE, 2010.
[11] H. Chen, S. S. Tsai, G. Schroth, D. M. Chen, R. Grzeszczuk, and B. Girod, “Robust text detection in natural images with edge-enhanced maximally stable extremal regions,” in 2011 18th IEEE International Conference on Image Processing, pp. 2609–2612, IEEE, 2011.
[12] J. Matas, O. Chum, M. Urban, and T. Pajdla, “Robust wide-baseline stereo from maximally stable extremal regions,” Image and vision computing, vol. 22, no. 10, pp. 761–767, 2004.
[13] J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, “You only look once: Unified, real-time object detection,” pp. 779–788, 2016.
[14] W. Liu, D. Anguelov, D. Erhan, C. Szegedy, S. Reed, C.-Y. Fu, and A. C. Berg, “Ssd: Single shot multibox detector,” in European conference on computer vision, pp. 21–37, Springer, 2016.
[15] K. He and J. Sun, “Convolutional neural networks at constrained time cost,” in Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 5353–5360, 2015.
[16] Y. Bengio, P. Simard, P. Frasconi, et al., “Learning long-term dependencies with gradient descent is difficult,” IEEE transactions on neural networks, vol. 5, no. 2, pp. 157–166, 1994.
[17] K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” in Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778, 2016.
[18] R. Zhang, Q. Wang, and Y. Lu, “Combination of resnet and center loss based metric learning for handwritten Chinese character recognition,” in 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 5, pp. 25–29, IEEE, 2017.
[19] J. Redmon and A. Farhadi, “Yolo9000: better, faster, stronger,” in Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 7263–7271, 2017.
[20] J. Redmon and A. Farhadi, “Yolov3: An incremental improvement,” arXiv preprint arXiv:1804.02767, 2018.
[21] T.-Y. Lin, P. Dollár, R. Girshick, K. He, B. Hariharan, and S. Belongie, “Feature pyramid networks for object detection,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125, 2017.
[22] R. Laroca, E. Severo, L. A. Zanlorensi, L. S. Oliveira, G. R. Gonçalves, W. R. Schwartz, and D. Menotti, “A robust real-time automatic license plate recognition based on the yolo detector,” in 2018 International Joint Conference on Neural Networks (IJCNN), pp. 1–10, IEEE, 2018.
[23] C. F. G. d. Santos, “Optical character recognition using deep learning,” 2018.
[24] Z. Tian, W. Huang, T. He, P. He, and Y. Qiao, “Detecting text in natural image with connectionist text proposal network,” in European conference on computer vision, pp. 56–72, Springer, 2016.
[25] M. Schuster and K. K. Paliwal, “Bidirectional recurrent neural networks,” IEEE Transactions on Signal Processing, vol. 45, no. 11, pp. 2673–2681, 1997.
[26] X. Zhou, C. Yao, H. Wen, Y. Wang, S. Zhou, W. He, and J. Liang, “East: an efficient and accurate scene text detector,” in Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, pp. 5551–5560, 2017.
[27] K.-H. Kim, S. Hong, B. Roh, Y. Cheon, and M. Park, “Pvanet: Deep but lightweight neural networks for real-time object detection,” arXiv preprint arXiv:1608.08021, 2016.
[28] D. Deng, H. Liu, X. Li, and D. Cai, “Pixellink: Detecting scene text via instance segmentation,” in Thirty-Second AAAI Conference on Artificial Intelligence, 2018.
[29] J. Long, E. Shelhamer, and T. Darrell, “Fully convolutional networks for semantic segmentation,” in Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 3431–3440, 2015.
[30] eragonruan, “text-detection-ctpn.” https://github.com/eragonruan/text-detection-ctpn, 2017. [Online; accessed February-12-2019].
[31] argman, “East.” https://github.com/argman/EAST, 2018. [Online; accessed January-21-2019].
[32] B. Shi, X. Bai, and C. Yao, “An end-to-end trainable neural network for image based sequence recognition and its application to scene text recognition,” IEEE transactions on pattern analysis and machine intelligence, vol. 39, no. 11, pp. 2298–2304, 2016.
[33] A. Graves, S. Fernández, F. Gomez, and J. Schmidhuber, “Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks,” in Proceedings of the 23rd international conference on Machine learning, pp. 369–376, ACM, 2006.
[34] ZJULearning, “pixel_link.” https://github.com/ZJULearning/pixel_link, 2019. [Online; accessed April-10-2019].
[35] N. Otsu, “A threshold selection method from gray-level histograms,” IEEE transactions on systems, man, and cybernetics, vol. 9, no. 1, pp. 62–66, 1979.
|