Image intelligent colorization using generative adversarial networks

Authors

  • Yevgeniya Sulema
  • Boris Topchiiev

DOI:

https://doi.org/10.34185/1562-9945-5-124-2019-09

Keywords:

нейронні мережі, інтелектуальна колоризація, сегментація зображень, оброблення зображень, оброблення мультимодальних даних

Abstract

The paper is devoted to the use of generative adversarial neural networks on the example of solving the problem of image intelligent colorization. Intelligent colorization is the process of changing colors of certain objects and details in an image by using mathematical algorithms. Existing solutions to this problem use standard methods, such as overlaying additional layers on highlighted areas in the image, changing the brightness, saturation, and hue levels. However, all of these methods require active involvement and user skills to locate the objects which need to be repainted, and then also need to adjust other metrics such as shadows, brightness, and exposure on the image to make the converted image look natural. However, machine learning technologies enable automated image processing. The paper presents the software system for intelligent colorization based on the work of two adversarial neural networks: the network-generator and the network-discriminator. The first network is a generator network; it performs input image processing in accordance with the purpose set by the user. The objective of this network is to generate data that is as similar as possible to the original data used in training. The second network is a discriminating network; it evaluates the quality of the result by determining the difference between the sample image that sets the desired processing result and the image that the first network generates. This approach provides high quality training for both networks and, accordingly, high quality of the results. At the stage of data preparation, it is proposed to perform image conversion from RGB color model to YUV or LAB color model that allows to achieve better colorization result. The paper also considers the possibility of moving from image processing to multimodal data processing and proposes to represent input multimodal data about objects on an image in the form of a multimodal multilevel ontological model.

References

“Adobe Photoshop: Match, replace, and mix colors”, 2017, https://helpx.adobe.com/photoshop/using/matching-replacing-mixing-colors.html

Z. Cheng, Q. Yang, B. Sheng, “Deep colorization”, Proceedings of IEEE International Conference on Computer Vision, 2015, pp. 415-423

Junghoon Seo, Taewon Yoon, and Jinwoo Kim, “One-to-one Example-based Automatic Image Coloring Using Deep Convolutional Generative Adversarial Network”, Journal of Advances in Information Technology, 2017, Vol. 8, No. 2, pp. 80-85.

Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio, “Generative Adversarial Networks”, Advances in Neural Information Processing Systems, 2014,pp.1-9.

Olaf Ronneberger, Philipp Fischer, Thomas Brox. ”U-Net: Convolutional Networks for Biomedical Image Segmentation”, Medical Image Computing and Computer-Assisted Intervention, 2015, pp. 234-241.

Tsung-Yi Lin, Piotr Dollar, Ross Girshick, Kaiming He, Bharath Hariharan, Serge Belongie, “Feature Pyramid Networks for Object Detection”, Computer Vision and Pattern Recognition, 2017, pp. 2117-2125.

ImageNet: Summary and Statistics, 2010, http://image-net.org/about-stats

COCO Dataset, 2019, http://cocodataset.org

Michal Podpora, Grzegorz Paweł Korbas, Aleksandra Kawala-Janik, “YUV vs RGB – Choosing a Color Space for Human-Machine Interaction”, Computer Science and Information Systems, 2014, pp. 29-34.

Noor A. Ibraheem, Mokhtar M. Hasan, Rafiqul Z. Khan, Pramod K. Mishra, “Understanding Color Models: A Review”, ARPN Journal of Science and Technology, 2012, Vol. 2, No. 3, pp. 265-275.

Ivanov I., Sopov E., Panfilov I., “Comprehensive approach for solving multimodal data analysis problems based on integration of evolutionary, neural and deep neural network algorithms”, IOP Conf. Series: Materials Science and Engineering, 2018, pp. 1-10.

Shishir Kumar, Prabhat Mahanti, Su-Jing Wang, "Intelligent computational techniques for multimodal data", Multimedia Tools and Applications, 2019, Vol. 78, pp. 23809–23814.

Ye. Sulema, I. Dychka, O. Sulema, “Multimodal Data Representation Models for Virtual, Remote, and Mixed Laboratories Development”, Lecture Notes in Networks and Systems, Springer Cham, 2018, Vol. 47, pp. 559-569.

Anat Levin, Dani Lischinski, Yair Weiss. “Colorization using Optimization” Proceedings of ACM SIGGRAPH, 2004, pp. 689–694.

Published

2019-11-25