ABSTRACT
This paper deals with a challenging task of learning from different modalities by tackling the difficulty problem of jointly face recognition between abstract-like sketches, cartoons, caricatures and real-life photographs. Due to the significant variations in the abstract faces, building vision models for recognizing data from these modalities is an extremely challenging. We propose a novel framework termed as Meta-Continual Learning with Knowledge Embedding to address the task of jointly sketch, cartoon, and caricature face recognition. In particular, we firstly present a deep relational network to capture and memorize the relation among different samples. Secondly, we present the construction of our knowledge graph that relates image with the label as the guidance of our meta-learner. We then design a knowledge embedding mechanism to incorporate the knowledge representation into our network. Thirdly, to mitigate catastrophic forgetting, we use a meta-continual model that updates our ensemble model and improves its prediction accuracy. With this meta-continual model, our network can learn from its past. The final classification is derived from our network by learning to compare the features of samples. Experimental results demonstrate that our approach achieves significantly higher performance compared with other state-of-the-art approaches.
Get full access to this Publication
Purchase, subscribe or recommend this publication to your librarian.
Already a Subscriber?Sign In
References
- Gwern Branwen Aaron Gokaslan Anonymous, the Danbooru community. 2019. Danbooru2018: A Large-Scale Crowdsourced and Tagged Anime Illustration Dataset. https://gwern.net/Danbooru2018. https://gwern.net/Danbooru2018 Accessed: DATE.Google Scholar
- H. S. Bhatt, S. Bharadwaj, R. Singh, and M. Vatsa. 2012. Memetically Optimized MCWLD for Matching Sketches With Digital Face Images. IEEE Transactions on Information Forensics and Security, Vol. 7, 5 (Oct 2012), 1522--1535. https://doi.org/10.1109/TIFS.2012.2204252Google Scholar
- IQ Biometrix. 2003. FACES 4.0. Houston, TX: Author (2003).Google Scholar
- Tianshui Chen, Liang Lin, Riquan Chen, Yang Wu, and Xiaonan Luo. 2018. Knowledge-Embedded Representation Learning for Fine-Grained Image Recognition. In Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI-18. International Joint Conferences on Artificial Intelligence Organization, 627--634. https://doi.org/10.24963/ijcai.2018/87Google Scholar
- Tianshui Chen, Weihao Yu, Riquan Chen, and Liang Lin. 2019. Knowledge-Embedded Routing Network for Scene Graph Generation. In Conference on Computer Vision and Pattern Recognition.Google Scholar
- Michael W. Cole, Jeremy R. Reynolds, Jonathan D. Power, Grega Repovs, Alan Anticevic, and Todd S. Braver. 2013. Multi-task connectivity reveals flexible hubs for adaptive task control. Nature Neuroscience, Vol. 16, 9 (2013), 1348--1355. https://doi.org/10.1038/nn.3470Google Scholar
- Lingna Dai, Fei Gao, Rongsheng Li, Jiachen Yu, Xiaoyuan Shen, Huilin Xiong, and Weilun Wu. 2019. Gated Fusion of Discriminant Features for Caricature Recognition. In Intelligence Science and Big Data Engineering. Visual Data Engineering, Zhen Cui, Jinshan Pan, Shanshan Zhang, Liang Xiao, and Jian Yang (Eds.). Springer International Publishing, Cham, 563--573.Google Scholar
- T. de Freitas Pereira, A. Anjos, and S. Marcel. 2019. Heterogeneous Face Recognition Using Domain Specific Units. IEEE Transactions on Information Forensics and Security, Vol. 14, 7 (July 2019), 1803--1816. https://doi.org/10.1109/TIFS.2018.2885284Google Scholar
- Jiankang Deng, Jia Guo, Niannan Xue, and Stefanos Zafeiriou. 2019 a. ArcFace: Additive Angular Margin Loss for Deep Face Recognition. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Google Scholar
- Z. Deng, X. Peng, Z. Li, and Y. Qiao. 2019. Mutual Component Convolutional Neural Networks for Heterogeneous Face Recognition. IEEE Transactions on Image Processing, Vol. 28, 6 (June 2019), 3102--3114. https://doi.org/10.1109/TIP.2019.2894272Google Scholar
- Zhongying Deng, Xiaojiang Peng, and Yu Qiao. 2019 b. Residual compensation networks for heterogeneous face recognition. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 8239--8246.Google Scholar
- Yuke Fang, Weihong Deng, Junping Du, and Jiani Hu. 2020. Identity-aware CycleGAN for face photo-sketch synthesis and recognition. Pattern Recognition, Vol. 102 (2020), 107249. https://doi.org/10.1016/j.patcog.2020.107249Google Scholar
- Martha J. Farah. 2018. Socioeconomic status and the brain: prospects for neuroscience-informed policy. Nature Reviews Neuroscience, Vol. 19, 7 (2018), 428--438. https://doi.org/10.1038/s41583-018-0023--2Google Scholar
- Chaoyou Fu, Xiang Wu, Yibo Hu, Huaibo Huang, and Ran He. 2019. Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. In NeurIPS.Google Scholar
- Jianlong Fu, Heliang Zheng, and Tao Mei. 2017. Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-Grained Image Recognition. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Google Scholar
- C. Galea and R. A. Farrugia. 2016. A Large-Scale Software-Generated Face Composite Sketch Database. In 2016 International Conference of the Biometrics Special Interest Group (BIOSIG). 1--5. https://doi.org/10.1109/BIOSIG.2016.7736902Google Scholar
- Jatin Garg, Skand Vishwanath Peri, Himanshu Tolani, and Narayanan.C Krishna. 2018. Deep Cross Modal Learning for Caricature Verification and Identification (CaVINet). In Proceedings of the 2018 ACM Conference on Multimedia. ACM.Google Scholar
- H. Han, B. F. Klare, K. Bonnen, and A. K. Jain. 2013. Matching Composite Sketches to Face Photos: A Component-Based Approach. IEEE Transactions on Information Forensics and Security, Vol. 8, 1 (Jan 2013), 191--204. https://doi.org/10.1109/TIFS.2012.2228856Google Scholar
- S. Hu, N. Short, B. S. Riggan, M. Chasse, and M. S. Sarfraz. 2017. Heterogeneous Face Recognition: Recent Advances in Infrared-to-Visible Matching. In 2017 12th IEEE International Conference on Automatic Face Gesture Recognition (FG 2017). 883--890. https://doi.org/10.1109/FG.2017.126Google Scholar
- Jing Huo, Wenbin Li, Yinghuan Shi, Yang Gao, and Hujun Yin. 2018. WebCaricature: a benchmark for caricature recognition. In British Machine Vision Conference.Google Scholar
- Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7--9, 2015, Conference Track Proceedings.Google Scholar
- B. F. Klare, S. S. Bucak, A. K. Jain, and T. Akgul. 2012. Towards automated caricature recognition. In 2012 5th IAPR International Conference on Biometrics (ICB). 139--146. https://doi.org/10.1109/ICB.2012.6199771Google Scholar
- S. J. Klum, H. Han, B. F. Klare, and A. K. Jain. 2014. The FaceSketchID System: Matching Facial Composites to Mugshots. IEEE Transactions on Information Forensics and Security, Vol. 9, 12 (Dec 2014), 2248--2263. https://doi.org/10.1109/TIFS.2014.2360825Google Scholar
- Yujia Li, Richard Zemel, Marc Brockschmidt, and Daniel Tarlow. 2016. Gated Graph Sequence Neural Networks. In Proceedings of ICLR'16 proceedings of iclr'16 ed.). https://www.microsoft.com/en-us/research/publication/gated-graph-sequence-neural-networks/Google Scholar
- D. Liu, X. Gao, N. Wang, J. Li, and C. Peng. 2020. Coupled Attribute Learning for Heterogeneous Face Recognition. IEEE Transactions on Neural Networks and Learning Systems (2020), 1--14. https://doi.org/10.1109/TNNLS.2019.2957285Google Scholar
- Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, and Le Song. 2017. SphereFace: Deep Hypersphere Embedding for Face Recognition. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Google Scholar
- J. Lu, V. E. Liong, and J. Zhou. 2018. Simultaneous Local Binary Feature Learning and Encoding for Homogeneous and Heterogeneous Face Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 40, 8 (Aug 2018), 1979--1993. https://doi.org/10.1109/TPAMI.2017.2737538Google Scholar
- Kenneth Marino, Ruslan Salakhutdinov, and Abhinav Gupta. 2017. The More You Know: Using Knowledge Graphs for Image Classification. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Google Scholar
- Julian McAuley and Jure Leskovec. 2012. Image Labeling on a Network: Using Social-Network Metadata for Image Classification. In Computer Vision -- ECCV 2012. 828--841.Google Scholar
- Kieron Messer, Jiri Matas, Josef Kittler, Juergen Luettin, and Gilbert Maitre. 1999. XM2VTSDB: The extended M2VTS database. In Second international conference on audio and video-based biometric person authentication, Vol. 964. 965--966.Google Scholar
- Z. Ming, J. Burie, and M. Muzzamil Luqman. 2019. Dynamic Deep Multi-task Learning for Caricature-Visual Face Recognition. In 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW), Vol. 1. 92--97. https://doi.org/10.1109/ICDARW.2019.00021Google Scholar
- Ashutosh Mishra, Shyam Nandan Rai, Anand Mishra, and C. V. Jawahar. 2016. IIIT-CFW: A Benchmark Database of Cartoon Faces in the Wild. In Computer Vision -- ECCV 2016 Workshops, Gang Hua and Hervé Jégou (Eds.). Springer International Publishing, Cham, 35--47.Google Scholar
- Yohsuke R. Miyamoto, Shengxin Wang, and Maurice A. Smith. 2020. Implicit adaptation compensates for erratic explicit strategy in human motor learning. Nature Neuroscience (2020). https://doi.org/10.1038/s41593-020-0600--3Google Scholar
- Shuxin Ouyang, Timothy Hospedales, Yi-Zhe Song, Xueming Li, Chen Change Loy, and Xiaogang Wang. 2016b. A survey on heterogeneous face recognition: Sketch, infra-red, 3D and low-resolution. Image and Vision Computing, Vol. 56 (2016), 28 -- 48. https://doi.org/10.1016/j.imavis.2016.09.001Google Scholar
- Shuxin Ouyang, Timothy M. Hospedales, Yi-Zhe Song, and Xueming Li. 2016a. ForgetMeNot: Memory-Aware Forensic Facial Sketch Matching. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Google Scholar
- German I. Parisi, Ronald Kemker, Jose L. Part, Christopher Kanan, and Stefan Wermter. 2019. Continual lifelong learning with neural networks: A review. Neural Networks, Vol. 113 (2019), 54 -- 71. https://doi.org/10.1016/j.neunet.2019.01.012Google Scholar
- Chunlei Peng, Xinbo Gao, Nannan Wang, and Jie Li. 2018. Face recognition from multiple stylistic sketches: Scenarios, datasets, and evaluation. Pattern Recognition, Vol. 84 (2018), 262 -- 272. https://doi.org/10.1016/j.patcog.2018.07.014Google Scholar
- Chunlei Peng, Xinbo Gao, Nannan Wang, and Jie Li. 2019 a. Sparse graphical representation based discriminant analysis for heterogeneous face recognition. Signal Processing, Vol. 156 (2019), 46 -- 61. https://doi.org/10.1016/j.sigpro.2018.10.015Google Scholar
- Chunlei Peng, Nannan Wang, Jie Li, and Xinbo Gao. 2019 b. DLFace: Deep local descriptor for cross-modality face recognition. Pattern Recognition, Vol. 90 (2019), 161 -- 171. https://doi.org/10.1016/j.patcog.2019.01.041Google Scholar
- Blake A. Richards, Timothy P. Lillicrap, Philippe Beaudoin, Yoshua Bengio, Rafal Bogacz, Amelia Christensen, Claudia Clopath, Rui Ponte Costa, Archy de Berker, Surya Ganguli, Colleen J. Gillon, Danijar Hafner, Adam Kepecs, Nikolaus Kriegeskorte, Peter Latham, Grace W. Lindsay, Kenneth D. Miller, Richard Naud, Christopher C. Pack, Panayiota Poirazi, Pieter Roelfsema, Jo ao Sacramento, Andrew Saxe, Benjamin Scellier, Anna C. Schapiro, Walter Senn, Greg Wayne, Daniel Yamins, Friedemann Zenke, Joel Zylberberg, Denis Therien, and Konrad P. Kording. 2019. A deep learning framework for neuroscience. Nature Neuroscience, Vol. 22, 11 (2019), 1761--1770. https://doi.org/10.1038/s41593-019-0520--2Google Scholar
- H. Roy and D. Bhattacharjee. 2016. Local-Gravity-Face (LG-face) for Illumination-Invariant and Heterogeneous Face Recognition. IEEE Transactions on Information Forensics and Security, Vol. 11, 7 (July 2016), 1412--1424. https://doi.org/10.1109/TIFS.2016.2530043Google Scholar
- Shreyas Saxena and Jakob Verbeek. 2016. Heterogeneous Face Recognition with CNNs. In Computer Vision -- ECCV 2016 Workshops, Gang Hua and Hervé Jégou (Eds.). Springer International Publishing, Cham, 483--491.Google Scholar
- Flood Sung, Yongxin Yang, Li Zhang, Tao Xiang, Philip H.S. Torr, and Timothy M. Hospedales. 2018. Learning to Compare: Relation Network for Few-Shot Learning. In Computer Vision and Pattern Recognition (CVPR).Google Scholar
- Flood Sung, Li Zhang, Tao Xiang, Timothy M. Hospedales, and Yongxin Yang. 2017. Learning to Learn: Meta-Critic Networks for Sample Efficient Learning. CoRR, Vol. abs/1706.09529 (2017). arxiv: 1706.09529 http://arxiv.org/abs/1706.09529Google Scholar
- Hao Wang, Yitong Wang, Zheng Zhou, Xing Ji, Dihong Gong, Jingchao Zhou, Zhifeng Li, and Wei Liu. 2018. CosFace: Large Margin Cosine Loss for Deep Face Recognition. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Google Scholar
- Q. Wang, Z. Mao, B. Wang, and L. Guo. 2017. Knowledge Graph Embedding: A Survey of Approaches and Applications. IEEE Transactions on Knowledge and Data Engineering, Vol. 29, 12 (Dec 2017), 2724--2743. https://doi.org/10.1109/TKDE.2017.2754499Google Scholar
- X. Wang and X. Tang. 2009. Face Photo-Sketch Synthesis and Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 31, 11 (Nov 2009), 1955--1967. https://doi.org/10.1109/TPAMI.2008.222Google Scholar
- Yan Wang. 2019. Danbooru 2018 Anime Character Recognition Dataset. https://github.com/grapeot/Danbooru2018AnimeCharacterRecognitionDataset. https://github.com/grapeot/Danbooru2018AnimeCharacterRecognitionDatasetGoogle Scholar
- Yandong Wen, Kaipeng Zhang, Zhifeng Li, and Yu Qiao. 2016. A Discriminative Feature Learning Approach for Deep Face Recognition. In Computer Vision -- ECCV 2016, Bastian Leibe, Jiri Matas, Nicu Sebe, and Max Welling (Eds.). Springer International Publishing, Cham, 499--515.Google Scholar
- Xiang Wu, Huaibo Huang, Vishal M Patel, Ran He, and Zhenan Sun. 2019. Disentangled variational representation for heterogeneous face recognition. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 9005--9012.Google Scholar
- S. Yu, H. Han, S. Shan, A. Dantcheva, and X. Chen. 2019. Improving Face Sketch Recognition via Adversarial Sketch-Photo Transformation. In 2019 14th IEEE International Conference on Automatic Face Gesture Recognition (FG 2019). 1--8. https://doi.org/10.1109/FG.2019.8756563Google Scholar
- W. Zhang, X. Wang, and X. Tang. 2011. Coupled information-theoretic encoding for face photo-sketch recognition. In CVPR 2011. 513--520. https://doi.org/10.1109/CVPR.2011.5995324Google Scholar
- Wenbo Zheng, Chao Gou, and Fei-Yue Wang. 2020 a. A novel approach inspired by optic nerve characteristics for few-shot occluded face recognition. Neurocomputing, Vol. 376 (2020), 25 -- 41. https://doi.org/10.1016/j.neucom.2019.09.045Google Scholar
- Wenbo Zheng, Chao Gou, and Lan Yan. 2019. A Relation Hashing Network Embedded with Prior Features for Skin Lesion Classification. In Machine Learning in Medical Imaging, Heung-Il Suk, Mingxia Liu, Pingkun Yan, and Chunfeng Lian (Eds.). Springer International Publishing, Cham, 115--123.Google Scholar
- Wenbo Zheng, Chao Gou, Lan Yan, and Shaocong Mo. 2020 b. Learning to Classify: A Flow-Based Relation Network for Encrypted Traffic Classification. In Proceedings of The Web Conference 2020 (Taipei, Taiwan) (WWW '20). Association for Computing Machinery, New York, NY, USA, 13--22. https://doi.org/10.1145/3366423.3380090Google Scholar
- W. Zheng, C. Gou, L. Yan, and F. Wang. 2019 a. Differential-Evolution-Based Generative Adversarial Networks for Edge Detection. In 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW). 2999--3008.Google Scholar
- W. Zheng, L. Yan, C. Gou, and F. Wang. 2020. Graph Attention Model Embedded With Multi-Modal Knowledge For Depression Detection. In 2020 IEEE International Conference on Multimedia and Expo (ICME). 1--6.Google Scholar
- Wenbo Zheng, Lan Yan, Chao Gou, and Fei-Yue Wang. [n.d.] a. Federated Meta-Learning for Fraudulent Credit Card Detection. In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI-20, Christian Bessiere (Ed.). International Joint Conferences on Artificial Intelligence Organization.Google Scholar
- Wenbo Zheng, Lan Yan, Chao Gou, and Fei-Yue Wang. [n.d.] b. JND-GAN: Human-Vision-Systems Inspired Generative Adversarial Networks for Image-to-Image Translation. GAN, Vol. 50 ( [n.,d.]), 1.Google Scholar
- Wenbo Zheng, Lan Yan, Chao Gou, and Fei-Yue Wang. 2020. Webly Supervised Knowledge Embedding Model for Visual Reasoning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).Google Scholar
- W. Zheng, L. Yan, C. Gou, W. Zhang, and F. Wang. 2019 b. A Relation Network Embedded with Prior Features for Few-Shot Caricature Recognition. In 2019 IEEE International Conference on Multimedia and Expo (ICME). 1510--1515. https://doi.org/10.1109/ICME.2019.00261Google Scholar
Supplemental Material
Index Terms
Learning from the Past: Meta-Continual Learning with Knowledge Embedding for Jointly Sketch, Cartoon, and Caricature Face Recognition
Comments