A survey on generative adversarial networks for imbalance problems in computer vision tasks
Financiación H2020 / H2020 Funds
Resumen: Any computer vision application development starts off by acquiring images and data, then preprocessing and pattern recognition steps to perform a task. When the acquired images are highly imbalanced and not adequate, the desired task may not be achievable. Unfortunately, the occurrence of imbalance problems in acquired image datasets in certain complex real-world problems such as anomaly detection, emotion recognition, medical image analysis, fraud detection, metallic surface defect detection, disaster prediction, etc., are inevitable. The performance of computer vision algorithms can significantly deteriorate when the training dataset is imbalanced. In recent years, Generative Adversarial Neural Networks (GANs) have gained immense attention by researchers across a variety of application domains due to their capability to model complex real-world image data. It is particularly important that GANs can not only be used to generate synthetic images, but also its fascinating adversarial learning idea showed good potential in restoring balance in imbalanced datasets. In this paper, we examine the most recent developments of GANs based techniques for addressing imbalance problems in image data. The real-world challenges and implementations of synthetic image generation based on GANs are extensively covered in this survey. Our survey first introduces various imbalance problems in computer vision tasks and its existing solutions, and then examines key concepts such as deep generative image models and GANs. After that, we propose a taxonomy to summarize GANs based techniques for addressing imbalance problems in computer vision tasks into three major categories: 1. Image level imbalances in classification, 2. object level imbalances in object detection and 3. pixel level imbalances in segmentation tasks. We elaborate the imbalance problems of each group, and provide GANs based solutions in each group. Readers will understand how GANs based techniques can handle the problem of imbalances and boost performance of the computer vision algorithms.
Idioma: Inglés
DOI: 10.1186/s40537-021-00414-0
Año: 2021
Publicado en: Journal of Big Data 8, 1 (2021), 27 [59 pp]
ISSN: 2196-1115

Factor impacto JCR: 10.835 (2021)
Categ. JCR: COMPUTER SCIENCE, THEORY & METHODS rank: 6 / 111 = 0.054 (2021) - Q1 - T1
Factor impacto CITESCORE: 14.4 - Decision Sciences (Q1) - Computer Science (Q1)

Factor impacto SCIMAGO: 2.592 - Computer Networks and Communications (Q1) - Information Systems and Management (Q1) - Hardware and Architecture (Q1)

Financiación: info:eu-repo/grantAgreement/EC/H2020/814225/EU/DIGItal MANufacturing Technologies for Zero-defect Industry 4.0 Production/DIGIMAN4.0
Tipo y forma: Article (Published version)
Área (Departamento): Área Ing. Procesos Fabricación (Dpto. Ingeniería Diseño Fabri.)

Creative Commons You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.


Exportado de SIDERAL (2023-05-18-14:04:51)


Visitas y descargas

Este artículo se encuentra en las siguientes colecciones:
Articles > Artículos por área > Ingeniería de los Procesos de Fabricación



 Record created 2021-03-08, last modified 2023-05-19


Versión publicada:
 PDF
Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)