Improving deep learning performance by augmenting training data

Authors

  • Soldatenko Dmytro
  • Hnatushenko Viktorija

DOI:

https://doi.org/10.34185/1562-9945-4-147-2023-10

Keywords:

neural network, image recognition, satellite images, image processing, data augmentation, artificial intelligence, convolutional neural network, image classification.

Abstract

Satellite image recognition is a crucial application of computer vision that has the po-tential to be applied in various fields such as disaster management, agriculture, and urban planning. The objective of this study is to determine the optimal amount of input data required and select the most effective methods of augmentation necessary for training a convolutional neural network (CNN) for satellite image recognition. To achieve this, we perform a series of experiments to investigate the effect of input data quantity on several performance metrics, including model accuracy, convergence, and generalization. Additionally, we explore the impact of various data augmentation techniques, such as rotation, scaling, and flipping, on model performance. The study suggests several strategies for identifying the saturation point and mitigating the effects of overtraining, in-cluding early stopping and dropout regularization. The findings from this study can significantly contribute to the development of more ef-ficient satellite recognition models. Furthermore, they can help improve the performance of existing models, in addition to providing guidance for future research. The study emphasizes the importance of carefully selecting input data and augmentation methods to achieve optimal performance in CNNs, which is fundamental in advancing the field of computer vision. In addition to the above, the study investigates the potential of transfer learning by pre-training the model on a related dataset and fine-tuning it on the satellite imagery dataset. This approach can reduce the amount of required data and training time and increase model performance. Overall, this study provides valuable insights into the optimal amount of input data and augmentation techniques for training CNNs for satellite image recognition, and its findings can guide future research in this area.

References

Christian L, Lucas T, Ferenc H, Jose C, Andrew C, Alejandro A, Andrew A, Alykhan T, Johannes T, Zehan W, Wenzhe S. Photo-realistic single image super-resolution using a generative adversarial network. arXiv preprint. 2016.

Shervin Minaee, Yuri Y Boykov, Fatih Porikli, Antonio J Plaza, Nasser Kehtarnavaz, and Demetri Terzopoulos. Image segmentation using deep learning: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021.

Chengyue Gong, Dilin Wang, Meng Li, Vikas Chandra, and Qiang Liu. Keepaugment: A simple information-preserving data augmentation approach. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 1055–1064, 2021.

Buda Mateusz, Maki Atsuto, Mazurowski Maciej A. A systematic study of the class imbalance problem in convolutional neural networks. Neural Networks. 2018;106:249–59.

Trishul C, Yutaka S, Johnson A, Karthik K. Project adam: building an efficient and scalable deep learning training system. In: Proceedings of OSDI. 2014. P. 571–82.

Xiaofeng Z, Zhangyang W, Dong L, Qing L. DADA: deep adversarial data augmentation for extremely low data regime classification. arXiv preprint. 2018.

Dong W, Zhou N, Paul JC, Zhang X. Optimized image resizing using seam carving and scaling. ACM Transactions on Graphics (TOG). 2009 Dec 1;28(5):1-0.

Russo F. A method for estimation and filtering of Gaussian noise in images. IEEE Transactions on Instrumentation and Measurement. 2003 Sep 23;52(4):1148-54.

Published

2023-11-13