10.1145/3394171.3413892acmconferencesArticle/Chapter ViewAccess DenialPublication PagesmmConference Proceedings
research-article

Learning from the Past: Meta-Continual Learning with Knowledge Embedding for Jointly Sketch, Cartoon, and Caricature Face Recognition

ABSTRACT

This paper deals with a challenging task of learning from different modalities by tackling the difficulty problem of jointly face recognition between abstract-like sketches, cartoons, caricatures and real-life photographs. Due to the significant variations in the abstract faces, building vision models for recognizing data from these modalities is an extremely challenging. We propose a novel framework termed as Meta-Continual Learning with Knowledge Embedding to address the task of jointly sketch, cartoon, and caricature face recognition. In particular, we firstly present a deep relational network to capture and memorize the relation among different samples. Secondly, we present the construction of our knowledge graph that relates image with the label as the guidance of our meta-learner. We then design a knowledge embedding mechanism to incorporate the knowledge representation into our network. Thirdly, to mitigate catastrophic forgetting, we use a meta-continual model that updates our ensemble model and improves its prediction accuracy. With this meta-continual model, our network can learn from its past. The final classification is derived from our network by learning to compare the features of samples. Experimental results demonstrate that our approach achieves significantly higher performance compared with other state-of-the-art approaches.

Get full access to this Publication

Purchase, subscribe or recommend this publication to your librarian.

References

  1. Gwern Branwen Aaron Gokaslan Anonymous, the Danbooru community. 2019. Danbooru2018: A Large-Scale Crowdsourced and Tagged Anime Illustration Dataset. https://gwern.net/Danbooru2018. https://gwern.net/Danbooru2018 Accessed: DATE.Google ScholarGoogle Scholar
  2. H. S. Bhatt, S. Bharadwaj, R. Singh, and M. Vatsa. 2012. Memetically Optimized MCWLD for Matching Sketches With Digital Face Images. IEEE Transactions on Information Forensics and Security, Vol. 7, 5 (Oct 2012), 1522--1535. https://doi.org/10.1109/TIFS.2012.2204252Google ScholarGoogle Scholar
  3. IQ Biometrix. 2003. FACES 4.0. Houston, TX: Author (2003).Google ScholarGoogle Scholar
  4. Tianshui Chen, Liang Lin, Riquan Chen, Yang Wu, and Xiaonan Luo. 2018. Knowledge-Embedded Representation Learning for Fine-Grained Image Recognition. In Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, IJCAI-18. International Joint Conferences on Artificial Intelligence Organization, 627--634. https://doi.org/10.24963/ijcai.2018/87Google ScholarGoogle Scholar
  5. Tianshui Chen, Weihao Yu, Riquan Chen, and Liang Lin. 2019. Knowledge-Embedded Routing Network for Scene Graph Generation. In Conference on Computer Vision and Pattern Recognition.Google ScholarGoogle Scholar
  6. Michael W. Cole, Jeremy R. Reynolds, Jonathan D. Power, Grega Repovs, Alan Anticevic, and Todd S. Braver. 2013. Multi-task connectivity reveals flexible hubs for adaptive task control. Nature Neuroscience, Vol. 16, 9 (2013), 1348--1355. https://doi.org/10.1038/nn.3470Google ScholarGoogle Scholar
  7. Lingna Dai, Fei Gao, Rongsheng Li, Jiachen Yu, Xiaoyuan Shen, Huilin Xiong, and Weilun Wu. 2019. Gated Fusion of Discriminant Features for Caricature Recognition. In Intelligence Science and Big Data Engineering. Visual Data Engineering, Zhen Cui, Jinshan Pan, Shanshan Zhang, Liang Xiao, and Jian Yang (Eds.). Springer International Publishing, Cham, 563--573.Google ScholarGoogle Scholar
  8. T. de Freitas Pereira, A. Anjos, and S. Marcel. 2019. Heterogeneous Face Recognition Using Domain Specific Units. IEEE Transactions on Information Forensics and Security, Vol. 14, 7 (July 2019), 1803--1816. https://doi.org/10.1109/TIFS.2018.2885284Google ScholarGoogle Scholar
  9. Jiankang Deng, Jia Guo, Niannan Xue, and Stefanos Zafeiriou. 2019 a. ArcFace: Additive Angular Margin Loss for Deep Face Recognition. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Google ScholarGoogle Scholar
  10. Z. Deng, X. Peng, Z. Li, and Y. Qiao. 2019. Mutual Component Convolutional Neural Networks for Heterogeneous Face Recognition. IEEE Transactions on Image Processing, Vol. 28, 6 (June 2019), 3102--3114. https://doi.org/10.1109/TIP.2019.2894272Google ScholarGoogle Scholar
  11. Zhongying Deng, Xiaojiang Peng, and Yu Qiao. 2019 b. Residual compensation networks for heterogeneous face recognition. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 8239--8246.Google ScholarGoogle Scholar
  12. Yuke Fang, Weihong Deng, Junping Du, and Jiani Hu. 2020. Identity-aware CycleGAN for face photo-sketch synthesis and recognition. Pattern Recognition, Vol. 102 (2020), 107249. https://doi.org/10.1016/j.patcog.2020.107249Google ScholarGoogle Scholar
  13. Martha J. Farah. 2018. Socioeconomic status and the brain: prospects for neuroscience-informed policy. Nature Reviews Neuroscience, Vol. 19, 7 (2018), 428--438. https://doi.org/10.1038/s41583-018-0023--2Google ScholarGoogle Scholar
  14. Chaoyou Fu, Xiang Wu, Yibo Hu, Huaibo Huang, and Ran He. 2019. Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. In NeurIPS.Google ScholarGoogle Scholar
  15. Jianlong Fu, Heliang Zheng, and Tao Mei. 2017. Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-Grained Image Recognition. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Google ScholarGoogle Scholar
  16. C. Galea and R. A. Farrugia. 2016. A Large-Scale Software-Generated Face Composite Sketch Database. In 2016 International Conference of the Biometrics Special Interest Group (BIOSIG). 1--5. https://doi.org/10.1109/BIOSIG.2016.7736902Google ScholarGoogle Scholar
  17. Jatin Garg, Skand Vishwanath Peri, Himanshu Tolani, and Narayanan.C Krishna. 2018. Deep Cross Modal Learning for Caricature Verification and Identification (CaVINet). In Proceedings of the 2018 ACM Conference on Multimedia. ACM.Google ScholarGoogle Scholar
  18. H. Han, B. F. Klare, K. Bonnen, and A. K. Jain. 2013. Matching Composite Sketches to Face Photos: A Component-Based Approach. IEEE Transactions on Information Forensics and Security, Vol. 8, 1 (Jan 2013), 191--204. https://doi.org/10.1109/TIFS.2012.2228856Google ScholarGoogle Scholar
  19. S. Hu, N. Short, B. S. Riggan, M. Chasse, and M. S. Sarfraz. 2017. Heterogeneous Face Recognition: Recent Advances in Infrared-to-Visible Matching. In 2017 12th IEEE International Conference on Automatic Face Gesture Recognition (FG 2017). 883--890. https://doi.org/10.1109/FG.2017.126Google ScholarGoogle Scholar
  20. Jing Huo, Wenbin Li, Yinghuan Shi, Yang Gao, and Hujun Yin. 2018. WebCaricature: a benchmark for caricature recognition. In British Machine Vision Conference.Google ScholarGoogle Scholar
  21. Diederik P. Kingma and Jimmy Ba. 2015. Adam: A Method for Stochastic Optimization. In 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, May 7--9, 2015, Conference Track Proceedings.Google ScholarGoogle Scholar
  22. B. F. Klare, S. S. Bucak, A. K. Jain, and T. Akgul. 2012. Towards automated caricature recognition. In 2012 5th IAPR International Conference on Biometrics (ICB). 139--146. https://doi.org/10.1109/ICB.2012.6199771Google ScholarGoogle Scholar
  23. S. J. Klum, H. Han, B. F. Klare, and A. K. Jain. 2014. The FaceSketchID System: Matching Facial Composites to Mugshots. IEEE Transactions on Information Forensics and Security, Vol. 9, 12 (Dec 2014), 2248--2263. https://doi.org/10.1109/TIFS.2014.2360825Google ScholarGoogle Scholar
  24. Yujia Li, Richard Zemel, Marc Brockschmidt, and Daniel Tarlow. 2016. Gated Graph Sequence Neural Networks. In Proceedings of ICLR'16 proceedings of iclr'16 ed.). https://www.microsoft.com/en-us/research/publication/gated-graph-sequence-neural-networks/Google ScholarGoogle Scholar
  25. D. Liu, X. Gao, N. Wang, J. Li, and C. Peng. 2020. Coupled Attribute Learning for Heterogeneous Face Recognition. IEEE Transactions on Neural Networks and Learning Systems (2020), 1--14. https://doi.org/10.1109/TNNLS.2019.2957285Google ScholarGoogle Scholar
  26. Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, and Le Song. 2017. SphereFace: Deep Hypersphere Embedding for Face Recognition. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Google ScholarGoogle Scholar
  27. J. Lu, V. E. Liong, and J. Zhou. 2018. Simultaneous Local Binary Feature Learning and Encoding for Homogeneous and Heterogeneous Face Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 40, 8 (Aug 2018), 1979--1993. https://doi.org/10.1109/TPAMI.2017.2737538Google ScholarGoogle Scholar
  28. Kenneth Marino, Ruslan Salakhutdinov, and Abhinav Gupta. 2017. The More You Know: Using Knowledge Graphs for Image Classification. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Google ScholarGoogle Scholar
  29. Julian McAuley and Jure Leskovec. 2012. Image Labeling on a Network: Using Social-Network Metadata for Image Classification. In Computer Vision -- ECCV 2012. 828--841.Google ScholarGoogle Scholar
  30. Kieron Messer, Jiri Matas, Josef Kittler, Juergen Luettin, and Gilbert Maitre. 1999. XM2VTSDB: The extended M2VTS database. In Second international conference on audio and video-based biometric person authentication, Vol. 964. 965--966.Google ScholarGoogle Scholar
  31. Z. Ming, J. Burie, and M. Muzzamil Luqman. 2019. Dynamic Deep Multi-task Learning for Caricature-Visual Face Recognition. In 2019 International Conference on Document Analysis and Recognition Workshops (ICDARW), Vol. 1. 92--97. https://doi.org/10.1109/ICDARW.2019.00021Google ScholarGoogle Scholar
  32. Ashutosh Mishra, Shyam Nandan Rai, Anand Mishra, and C. V. Jawahar. 2016. IIIT-CFW: A Benchmark Database of Cartoon Faces in the Wild. In Computer Vision -- ECCV 2016 Workshops, Gang Hua and Hervé Jégou (Eds.). Springer International Publishing, Cham, 35--47.Google ScholarGoogle Scholar
  33. Yohsuke R. Miyamoto, Shengxin Wang, and Maurice A. Smith. 2020. Implicit adaptation compensates for erratic explicit strategy in human motor learning. Nature Neuroscience (2020). https://doi.org/10.1038/s41593-020-0600--3Google ScholarGoogle Scholar
  34. Shuxin Ouyang, Timothy Hospedales, Yi-Zhe Song, Xueming Li, Chen Change Loy, and Xiaogang Wang. 2016b. A survey on heterogeneous face recognition: Sketch, infra-red, 3D and low-resolution. Image and Vision Computing, Vol. 56 (2016), 28 -- 48. https://doi.org/10.1016/j.imavis.2016.09.001Google ScholarGoogle Scholar
  35. Shuxin Ouyang, Timothy M. Hospedales, Yi-Zhe Song, and Xueming Li. 2016a. ForgetMeNot: Memory-Aware Forensic Facial Sketch Matching. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Google ScholarGoogle Scholar
  36. German I. Parisi, Ronald Kemker, Jose L. Part, Christopher Kanan, and Stefan Wermter. 2019. Continual lifelong learning with neural networks: A review. Neural Networks, Vol. 113 (2019), 54 -- 71. https://doi.org/10.1016/j.neunet.2019.01.012Google ScholarGoogle Scholar
  37. Chunlei Peng, Xinbo Gao, Nannan Wang, and Jie Li. 2018. Face recognition from multiple stylistic sketches: Scenarios, datasets, and evaluation. Pattern Recognition, Vol. 84 (2018), 262 -- 272. https://doi.org/10.1016/j.patcog.2018.07.014Google ScholarGoogle Scholar
  38. Chunlei Peng, Xinbo Gao, Nannan Wang, and Jie Li. 2019 a. Sparse graphical representation based discriminant analysis for heterogeneous face recognition. Signal Processing, Vol. 156 (2019), 46 -- 61. https://doi.org/10.1016/j.sigpro.2018.10.015Google ScholarGoogle Scholar
  39. Chunlei Peng, Nannan Wang, Jie Li, and Xinbo Gao. 2019 b. DLFace: Deep local descriptor for cross-modality face recognition. Pattern Recognition, Vol. 90 (2019), 161 -- 171. https://doi.org/10.1016/j.patcog.2019.01.041Google ScholarGoogle Scholar
  40. Blake A. Richards, Timothy P. Lillicrap, Philippe Beaudoin, Yoshua Bengio, Rafal Bogacz, Amelia Christensen, Claudia Clopath, Rui Ponte Costa, Archy de Berker, Surya Ganguli, Colleen J. Gillon, Danijar Hafner, Adam Kepecs, Nikolaus Kriegeskorte, Peter Latham, Grace W. Lindsay, Kenneth D. Miller, Richard Naud, Christopher C. Pack, Panayiota Poirazi, Pieter Roelfsema, Jo ao Sacramento, Andrew Saxe, Benjamin Scellier, Anna C. Schapiro, Walter Senn, Greg Wayne, Daniel Yamins, Friedemann Zenke, Joel Zylberberg, Denis Therien, and Konrad P. Kording. 2019. A deep learning framework for neuroscience. Nature Neuroscience, Vol. 22, 11 (2019), 1761--1770. https://doi.org/10.1038/s41593-019-0520--2Google ScholarGoogle Scholar
  41. H. Roy and D. Bhattacharjee. 2016. Local-Gravity-Face (LG-face) for Illumination-Invariant and Heterogeneous Face Recognition. IEEE Transactions on Information Forensics and Security, Vol. 11, 7 (July 2016), 1412--1424. https://doi.org/10.1109/TIFS.2016.2530043Google ScholarGoogle Scholar
  42. Shreyas Saxena and Jakob Verbeek. 2016. Heterogeneous Face Recognition with CNNs. In Computer Vision -- ECCV 2016 Workshops, Gang Hua and Hervé Jégou (Eds.). Springer International Publishing, Cham, 483--491.Google ScholarGoogle Scholar
  43. Flood Sung, Yongxin Yang, Li Zhang, Tao Xiang, Philip H.S. Torr, and Timothy M. Hospedales. 2018. Learning to Compare: Relation Network for Few-Shot Learning. In Computer Vision and Pattern Recognition (CVPR).Google ScholarGoogle Scholar
  44. Flood Sung, Li Zhang, Tao Xiang, Timothy M. Hospedales, and Yongxin Yang. 2017. Learning to Learn: Meta-Critic Networks for Sample Efficient Learning. CoRR, Vol. abs/1706.09529 (2017). arxiv: 1706.09529 http://arxiv.org/abs/1706.09529Google ScholarGoogle Scholar
  45. Hao Wang, Yitong Wang, Zheng Zhou, Xing Ji, Dihong Gong, Jingchao Zhou, Zhifeng Li, and Wei Liu. 2018. CosFace: Large Margin Cosine Loss for Deep Face Recognition. In The IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Google ScholarGoogle Scholar
  46. Q. Wang, Z. Mao, B. Wang, and L. Guo. 2017. Knowledge Graph Embedding: A Survey of Approaches and Applications. IEEE Transactions on Knowledge and Data Engineering, Vol. 29, 12 (Dec 2017), 2724--2743. https://doi.org/10.1109/TKDE.2017.2754499Google ScholarGoogle Scholar
  47. X. Wang and X. Tang. 2009. Face Photo-Sketch Synthesis and Recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 31, 11 (Nov 2009), 1955--1967. https://doi.org/10.1109/TPAMI.2008.222Google ScholarGoogle Scholar
  48. Yan Wang. 2019. Danbooru 2018 Anime Character Recognition Dataset. https://github.com/grapeot/Danbooru2018AnimeCharacterRecognitionDataset. https://github.com/grapeot/Danbooru2018AnimeCharacterRecognitionDatasetGoogle ScholarGoogle Scholar
  49. Yandong Wen, Kaipeng Zhang, Zhifeng Li, and Yu Qiao. 2016. A Discriminative Feature Learning Approach for Deep Face Recognition. In Computer Vision -- ECCV 2016, Bastian Leibe, Jiri Matas, Nicu Sebe, and Max Welling (Eds.). Springer International Publishing, Cham, 499--515.Google ScholarGoogle Scholar
  50. Xiang Wu, Huaibo Huang, Vishal M Patel, Ran He, and Zhenan Sun. 2019. Disentangled variational representation for heterogeneous face recognition. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 33. 9005--9012.Google ScholarGoogle Scholar
  51. S. Yu, H. Han, S. Shan, A. Dantcheva, and X. Chen. 2019. Improving Face Sketch Recognition via Adversarial Sketch-Photo Transformation. In 2019 14th IEEE International Conference on Automatic Face Gesture Recognition (FG 2019). 1--8. https://doi.org/10.1109/FG.2019.8756563Google ScholarGoogle Scholar
  52. W. Zhang, X. Wang, and X. Tang. 2011. Coupled information-theoretic encoding for face photo-sketch recognition. In CVPR 2011. 513--520. https://doi.org/10.1109/CVPR.2011.5995324Google ScholarGoogle Scholar
  53. Wenbo Zheng, Chao Gou, and Fei-Yue Wang. 2020 a. A novel approach inspired by optic nerve characteristics for few-shot occluded face recognition. Neurocomputing, Vol. 376 (2020), 25 -- 41. https://doi.org/10.1016/j.neucom.2019.09.045Google ScholarGoogle Scholar
  54. Wenbo Zheng, Chao Gou, and Lan Yan. 2019. A Relation Hashing Network Embedded with Prior Features for Skin Lesion Classification. In Machine Learning in Medical Imaging, Heung-Il Suk, Mingxia Liu, Pingkun Yan, and Chunfeng Lian (Eds.). Springer International Publishing, Cham, 115--123.Google ScholarGoogle Scholar
  55. Wenbo Zheng, Chao Gou, Lan Yan, and Shaocong Mo. 2020 b. Learning to Classify: A Flow-Based Relation Network for Encrypted Traffic Classification. In Proceedings of The Web Conference 2020 (Taipei, Taiwan) (WWW '20). Association for Computing Machinery, New York, NY, USA, 13--22. https://doi.org/10.1145/3366423.3380090Google ScholarGoogle Scholar
  56. W. Zheng, C. Gou, L. Yan, and F. Wang. 2019 a. Differential-Evolution-Based Generative Adversarial Networks for Edge Detection. In 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW). 2999--3008.Google ScholarGoogle Scholar
  57. W. Zheng, L. Yan, C. Gou, and F. Wang. 2020. Graph Attention Model Embedded With Multi-Modal Knowledge For Depression Detection. In 2020 IEEE International Conference on Multimedia and Expo (ICME). 1--6.Google ScholarGoogle Scholar
  58. Wenbo Zheng, Lan Yan, Chao Gou, and Fei-Yue Wang. [n.d.] a. Federated Meta-Learning for Fraudulent Credit Card Detection. In Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI-20, Christian Bessiere (Ed.). International Joint Conferences on Artificial Intelligence Organization.Google ScholarGoogle Scholar
  59. Wenbo Zheng, Lan Yan, Chao Gou, and Fei-Yue Wang. [n.d.] b. JND-GAN: Human-Vision-Systems Inspired Generative Adversarial Networks for Image-to-Image Translation. GAN, Vol. 50 ( [n.,d.]), 1.Google ScholarGoogle Scholar
  60. Wenbo Zheng, Lan Yan, Chao Gou, and Fei-Yue Wang. 2020. Webly Supervised Knowledge Embedding Model for Visual Reasoning. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).Google ScholarGoogle Scholar
  61. W. Zheng, L. Yan, C. Gou, W. Zhang, and F. Wang. 2019 b. A Relation Network Embedded with Prior Features for Few-Shot Caricature Recognition. In 2019 IEEE International Conference on Multimedia and Expo (ICME). 1510--1515. https://doi.org/10.1109/ICME.2019.00261Google ScholarGoogle Scholar

Supplemental Material

3394171.3413892.mp4

The Video about "Learning from the Past: Meta-Continual Learning with Knowledge Embedding for Jointly Sketch, Cartoon, and Caricature Face Recognition" on ACM MM 2020

Index Terms

  1. Learning from the Past: Meta-Continual Learning with Knowledge Embedding for Jointly Sketch, Cartoon, and Caricature Face Recognition

        Comments

        About Cookies On This Site

        We use cookies to ensure that we give you the best experience on our website.

        Learn more

        Got it!