ANALYSIS OF METHODS AND ALGORITHMS FOR RECOGNITION AND IDENTIFICATION OF IMAGES BY THEIR SEPARATE FRAGMENTS
DOI:
https://doi.org/10.28925/2663-4023.2024.24.363375Keywords:
image recognition; image processing methods; image processing algorithms; methods of identification; Software.Abstract
The theoretical foundations of building decision-making systems based on the results of image recognition accompanied by texts are considered. The approximate structure of the image recognition system is given. The basis of image recognition systems is the selection of text inscriptions on existing photos, their pre-processing, selection of isolated areas on the image, performance of mathematical operations on individual groups of pixels to bring them to known forms and comparison with them. The description of various methods of image preprocessing is performed. An analysis of the feasibility of using such methods of image binarization as adaptive Bradley-Roth binarization, median filtering, Gaussian filtering, methods of balanced histograms and class variances, discriminant analysis, logistic, probit regression, etc. was carried out. Different algorithms for dividing the image into separate areas for the purpose of their further recognition are considered. among them the moving average algorithm, the algorithm for estimating the probability of finding an object in a selected area based on boundary analysis, Category-independent object proposals, Constrained Parametric Min-Cuts, Multiscale combinatorical grouping, Selective Search, etc. A comparison of different implementations of image processing algorithms to ensure effective recognition, classification and identification of images is performed. Improvement of individual implementations of image processing algorithms allows to reduce their processing time, which is important for working with large data sets. The main focus of the research is on choosing the most effective methods for recognizing inscriptions on images, improving the algorithms that implement them, with the aim of building recognition systems aimed at processing large data sets.
Downloads
References
Ivanytska, A., Zubyk, L., Gololobov, D., Isaienkov, Y., Grynkevych, G., & Bychkov, O. (2023). The system for recognizing useful information of the client’s ID-card based on machine learning technologies. CEUR Workshop Proceedings, Vol. 3687, 115–120.
Mafi, M., Martin, H., Andrian, J., Barreto, A., Cabrerizo, M., & Adjouadi, M. (2019). A Comprehensive Survey on Impulse and Gaussian Denoising Filters for Digital Images. Signal Processing, 157, 236–260.
Mousavirad, S. J., & Ebrahimpour-Komleh, H. (2017). Multilevel image thresholding using entropy of histogram and recently developed population-based metaheuristic algorithms. Evolutionary Intelligence, 10(1–2), 45–75. https://doi.org/10.1007/s12065-017-0152-y
Ma, G., & Yue, X. (2022). An improved whale optimization algorithm based on multilevel threshold image segmentation using the Otsu method. Eng. Appl. Artif. Intell. 113. https://doi.org/10.1016/j. engappai.2022.104960
Abd Elaziz, M., & Lu, S. (2019). Many-objectives multilevel thresholding image segmentation using knee evolutionary algorithm. Expert Syst. Appl. 125, 305–316.
Yadav, R., & Pandey, M. (2022). Image segmentation techniques: A survey. Proceedings of data analytics and management: ICDAM 2021, 1, 231–239.
Endres, I., & Hoiem, D. (2014). Category-Independent Object Proposals with Diverse Ranking. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(2), 222–234. https://doi.org/10.1109/TPAMI.2013.122
Beideman, C., Chandrasekaran, K., & Chao, Xu. (2020). Licensed under Creative Commons License CC-BY Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2020), 17, 1–21.
Aranguren, I., Valdivia, A., Pérez-Cisneros, M., Oliva, D., & Osuna-Enciso, V. (2022). Digital image thresholding by using a lateral inhibition 2d histogram and a mutated electromagnetic field optimization. Multimed Tools Appl. 81(7), 10023–10049.
Maire, M. & Yu, S. X. (2013). Progressive multigrid eigensolvers for multiscale spectral segmentation. ICCV, 2184–2191.
Nadimi-Shahraki, M. H., Taghian. S., Mirjalili, S., & Faris, H. (2020). Mtde: An effective multi-trial vector-based differential evolution algorithm and its applications for engineering design problems. Appl. Soft. Comput. 97. https://doi.org/10.1016/j.asoc.2020.106761
Houssein, E. H., Saad, M. R., Hashim, F. A., Shaban, H., Hassaballah, M. (2020). Lévy flight distribution: A new metaheuristic algorithm for solving engineering optimization problems. Eng. Appl. Artif. Intell. 94. https://doi.org/10.1016/j.engappai.2020.103731
Ahmadianfar, I., Bozorg-Haddad, O, & Chu, X. (2020). Gradient-based optimizer: A new metaheuristic optimization algorithm. Inf. Sci. 540, 131–159. https://doi.org/10.1016/j.ins.2020.06.037
Abdel-Basset, M., Mohamed, R., & Abouhawwash, M. (2022). A new fusion of whale optimizer algorithm with kapur’s entropy for multi-threshold image segmentation: Analysis and validations. Artif. Intell. Rev. 55(8), 6389–6459.
Salehnia, T., MiarNaeimi, F., Izadi, S., Ahmadi, M., Montazerolghaem, A., Mirjalili, S., & Abualigah, L. (2024). A mtis method using a combined of whale and moth-flame optimization algorithms, 625–651.
Sharma, A., Chaturvedi, R., & Bhargava, A. (2022). A novel opposition based improved firefly algorithm for multilevel image segmentation. Multimed Tools Appl. 81(11), 15521–15544.
Chauhan, D., & Yadav, A. (2023). A crossover-based optimization algorithm for multilevel image segmentation. Soft. Comput. 1–33.
Thapliyal, S., & Kumar, N. (2024). Ascaeo: accelerated sine cosine algorithm hybridized with equilibrium optimizer with application in image segmentation using multilevel thresholding. Evolving Syst. 1–62
Liu, Q., Li, N., Jia, H., Qi, Q., & Abualigah, L. (2023). A chimp-inspired remora optimization algorithm for multilevel thresholding image segmentation using cross entropy. Artif. Intell. Rev. 56(1), 159–216.
Chen, Y., Wang, M., Heidari, A., Shi, B., Hu, Z., Zhang, Q., Chen, H., Mafarja, M., & Turabieh, H. (2022). Multi-threshold image segmentation using a multi-strategy shuffled frog leaping algorithm. Expert Syst. Appl. 194. https://doi.org/10.1016/j.eswa.2022.116511
Houssein, E. H., Abdalkarim, N., Hussain, K., & Mohamed, E. (2024). Accurate multilevel thresholding image segmentation via oppositional snake optimization algorithm: Real cases with liver disease. Comput. Biol. Med. 169.
Kumar, B. V., Oliva, D., & Suganthan, P. (2022). Differential Evolution: From Theory to Practice. Springer Singapore. https://doi.org/10.1007/978-981-16-8082-3
Reisenhofer, R., Bosse, S., Kutyniok, G., & Wiegand, T. (2018). A haar wavelet-based perceptual similarity index for image quality assessment. Signal Process: Image Commun. 61, 33–43.
Uijlings, J. R. R., van de Sande, K. E. A., Gevers, T. & Smeulders, A. W. M. (2013). Selective Search for Object Recognition. Int. J. Comput. Vis. 104, 154–171. https://doi.org/10.1007/s11263-013-0620-5
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Людмила Зубик, Сергій Пужай-Череда , Олександр Сапельников, Денис Калугін, Максим Котляр
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.