i-manager's Journal on Image Processing (JIP)


Volume 12 Issue 1 January - March 2025

Research Paper

Attention-Enhanced Deep Learning Model for Parkinson's Diagnosis

Sakshi Mishra*
Bhilai Institute of Technology, Durg, Chhattisgarh, India.
Mishra, S. (2025). Attention-Enhanced Deep Learning Model for Parkinson's Diagnosis. i-manager’s Journal on Image Processing, 12(1), 1-12. https://doi.org/10.26634/jip.12.1.21789

Abstract

This study presents an AI-based system for early detection of Parkinson's disease using the Inception V3 and Xception deep learning models enhanced with an attention mechanism. The system analyzes hand-drawn spiral images, which serve as biomarkers for Parkinson's symptoms such as tremors and micrographia. The proposed model extracts critical features from these images using pre-trained convolutional neural networks (CNNs) enhanced with attention layers, ensuring effective classification. The dataset includes spiral drawings from both healthy individuals and Parkinson's patients, allowing the model to learn distinguishing features. The Inception V3 model achieved 100% accuracy, while the Xception model attained 88% accuracy in Parkinson's detection. To evaluate the models' performance, graphs of accuracy and loss against epochs were plotted to track learning trends. A confusion matrix was generated to analyze misclassifications, and a classification report provided insights into precision, recall, and F1-score. A comparative bar chart was also used to highlight the performance difference between the Inception V3 and Xception models. This AI-driven approach provides a non-invasive, cost-effective, and automated diagnostic tool, improving early diagnosis and assisting healthcare professionals in timely intervention.
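The channel re-weighting idea behind attention layers on a pre-trained backbone can be sketched minimally. The snippet below is an illustrative squeeze-and-excitation-style weighting over a random feature map, not the paper's exact architecture; the feature map shape and temperature parameter are assumptions for demonstration.

```python
import numpy as np

def channel_attention(feature_maps, temperature=1.0):
    """Weight each channel of a CNN feature map by a softmax attention score.

    feature_maps: array of shape (H, W, C), standing in for the output of a
    pre-trained backbone such as Inception V3 (random data here).
    """
    # Squeeze: global average pooling yields one descriptor per channel.
    descriptors = feature_maps.mean(axis=(0, 1))      # shape (C,)
    # Excite: softmax turns descriptors into weights summing to 1.
    scores = np.exp(descriptors / temperature)
    weights = scores / scores.sum()                   # shape (C,)
    # Re-weight channels so informative ones dominate classification.
    return feature_maps * weights[None, None, :]

feats = np.random.rand(8, 8, 32)
attended = channel_attention(feats)
```

In a full model the attended maps would be pooled and passed to a dense classifier head; here the point is only how attention redistributes emphasis across channels.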

Research Paper

Infrared and Visible Image Fusion using Contrast and Edge-Preserving Filters with Image Statistics

Srikanth M. V.* , Jakkampudi Tanmayi Sai**, Aravapalli Nikhil Chowdary***, Jannu Ram Charan****, Bathina Sai Krishna*****
*-***** Department of Electronics and Communication Engineering, Usha Rama College of Engineering and Technology, Andhra Pradesh, India.
Srikanth, M. V., Sai, J. T., Chowdary, A. N., Charan, J. R., and Krishna, B. S. (2025). Infrared and Visible Image Fusion using Contrast and Edge-Preserving Filters with Image Statistics. i-manager’s Journal on Image Processing, 12(1), 13-21. https://doi.org/10.26634/jip.12.1.21787

Abstract

Infrared (IR) and visible image fusion is a crucial technique in data fusion and image processing. It allows for the accurate integration of thermal radiation and texture details from source images. However, current methods frequently overlook the challenge of high-contrast fusion, resulting in suboptimal performance when replacing thermal radiation target information in IR images with high-contrast information from visible images. To overcome this limitation, a contrast-balanced framework for IR and visible image fusion has been developed. The approach includes a contrast-balance strategy for processing visible images, reducing energy while compensating for overexposed areas in detail. Additionally, a contrast-preserving guided filter decomposes each image into energy and detail layers to filter high-contrast information effectively. To extract active information from the detail layer and brightness information from the energy layer, an image statistics technique and a Gaussian distribution of image entropy scheme are introduced for fusing the detail and energy layers, and the final fused result is obtained by combining them. Comprehensive experiments demonstrate that the proposed method effectively reduces contrast issues while preserving fine details, and it outperformed leading techniques in both qualitative and quantitative evaluations.
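The two-scale decomposition-and-fusion flow can be sketched as follows. This is a simplified stand-in: a plain box filter replaces the paper's contrast-preserving guided filter, the energy layers are averaged rather than fused by the Gaussian-entropy scheme, and the inputs are random arrays instead of real IR/visible pairs.

```python
import numpy as np

def box_blur(img, k=3):
    """Edge-padded mean filter; a simplification of the paper's
    contrast-preserving guided filter."""
    pad = k // 2
    padded = np.pad(img, pad, mode="edge")
    out = np.zeros_like(img, dtype=float)
    for dy in range(k):
        for dx in range(k):
            out += padded[dy:dy + img.shape[0], dx:dx + img.shape[1]]
    return out / (k * k)

def fuse(ir, vis):
    """Two-scale fusion: smooth 'energy' layers carry brightness and are
    averaged; 'detail' layers are fused by keeping the stronger response."""
    e_ir, e_vis = box_blur(ir), box_blur(vis)
    d_ir, d_vis = ir - e_ir, vis - e_vis
    energy = 0.5 * (e_ir + e_vis)
    detail = np.where(np.abs(d_ir) >= np.abs(d_vis), d_ir, d_vis)
    return energy + detail

ir = np.random.rand(16, 16)
vis = np.random.rand(16, 16)
fused = fuse(ir, vis)
```

Note the design choice the paper refines: a max-absolute rule on the detail layer keeps the sharpest edges from either modality, while the energy-layer rule controls overall contrast.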

Research Paper

Multilevel Thresholding using K-Point Strategy Improved Convergence Based Whale Optimization Algorithm for Image Segmentation

Rajesh Babu G.* , Palla Lakshmi Himaja**, Yalamanchili Vidya Sri***, Bandaru Naga Karthik****, Puppala Yaswanth Naga Sai Kiran*****
*-***** Department of Electronics and Communication Engineering, Usha Rama College of Engineering and Technology, Andhra Pradesh, India.
Babu G. R., Himaja, P. L., Sri, Y. V., Karthik, B. N., and Kiran, P. Y. N. S. (2025). Multilevel Thresholding using K-Point Strategy Improved Convergence Based Whale Optimization Algorithm for Image Segmentation. i-manager’s Journal on Image Processing, 12(1), 22-39. https://doi.org/10.26634/jip.12.1.21733

Abstract

The current study presents an innovative multilevel image segmentation method utilizing an improved Whale Optimization Algorithm (WOA). While WOA has shown promise in various optimization tasks, its performance can be limited by a tendency to become trapped in local optima. To address this challenge, the study proposes the K-point Strategy Improved Convergence WOA (KSICWOA), which enhances optimization efficiency by incorporating a nonlinear convergence factor, an adaptive weight coefficient, and a k-point initialization strategy. The proposed KSICWOA is then applied alongside Otsu's between-class variance and Kapur's entropy as objective functions to determine optimal thresholds for multilevel grayscale image segmentation. Experimental results on benchmark functions as well as real-time images demonstrate that KSICWOA surpasses conventional optimization techniques in terms of search accuracy and convergence speed while effectively avoiding local optima, providing average improvements of 28.3%, 25.61%, and 7.1% in PSNR, SSIM, and FSIM over the standard WOA. Additionally, tests conducted on standard image segmentation datasets confirm that the KSICWOA-Kapur method accurately and efficiently identifies multilevel thresholds.
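Kapur's entropy, one of the two objective functions above, can be illustrated on a toy histogram. The sketch below uses exhaustive search over a tiny 8-level histogram, which is exactly the brute force that a metaheuristic like KSICWOA replaces at realistic scales (256 grey levels, several thresholds); the histogram values are made up for demonstration.

```python
import numpy as np
from itertools import combinations

def kapur_entropy(hist, thresholds):
    """Sum of Shannon entropies of the classes that `thresholds` induce
    on a grey-level histogram (Kapur's criterion, to be maximized)."""
    p = hist / hist.sum()
    bounds = [0] + [t + 1 for t in sorted(thresholds)] + [len(p)]
    total = 0.0
    for lo, hi in zip(bounds[:-1], bounds[1:]):
        w = p[lo:hi].sum()
        if w <= 0:
            return -np.inf              # empty class: invalid split
        q = p[lo:hi] / w
        q = q[q > 0]
        total += -(q * np.log(q)).sum()
    return total

def best_thresholds(hist, k):
    """Exhaustive search over all k-threshold combinations."""
    levels = range(len(hist) - 1)
    return max(combinations(levels, k), key=lambda t: kapur_entropy(hist, t))

# Toy 8-level histogram with two clear modes.
hist = np.array([30, 25, 2, 1, 1, 3, 28, 26], dtype=float)
t = best_thresholds(hist, 1)
```

With 256 levels and four thresholds the combination count explodes, which is why the paper searches the threshold space with KSICWOA instead.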

Research Paper

Animal Detection in Fields using Image Processing

Jay Kumar Appari* , Maahi Kamble**, Nupur Choudhary***, Amar Kumar Dey****
*-**** Department of Electronics and Telecommunication Engineering, Bhilai Institute of Technology, Durg, Chhattisgarh, India.
Appari, J. K., Kamble, M., Choudhary, N., and Dey, A. K. (2025). Animal Detection in Fields using Image Processing. i-manager’s Journal on Image Processing, 12(1), 40-49. https://doi.org/10.26634/jip.12.1.21688

Abstract

Agriculture is one of the primary means of sustaining a livelihood. Low crop productivity is one of the issues facing farmers in the country, and crop destruction by wild animals is a major contributor to it. Agricultural fields must be protected from unwanted intrusion by animals. Traditionally, farmers use crackers, electric fences, direct observation, etc., to keep animals away from their fields, but these methods pose risks that harm both humans and animals. In the proposed system, the presence of animals is detected using image processing and machine learning. The damage to crops caused by wild animals is increasing dramatically in India and frequently poses threats to humans and animals alike. As wild animals continue to cause increasing damage around human settlements, tolerance has become difficult, so an effective solution has been developed to address this situation. Against that background, the goal of this study is to detect wild animals before they enter crop fields and to trigger appropriate scare-away mechanisms in real time. This paper presents an overview of the methodologies employed in this prototype model, including image segmentation, feature extraction, and classification techniques. Overall, this study highlights the significance of image processing technologies in advancing the understanding of these models and promoting sustainable relations between humans and wildlife.
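The detection-then-alert idea can be sketched with simple background differencing, a common first stage before segmentation and classification. This is an illustrative stand-in only: the paper's pipeline uses learned classifiers, and the thresholds and synthetic frames below are assumptions.

```python
import numpy as np

def detect_motion(background, frame, diff_thresh=0.2, area_thresh=0.01):
    """Flag an intrusion when enough pixels differ from the background.

    Returns (alarm, change_mask); thresholds are illustrative, not tuned.
    """
    mask = np.abs(frame.astype(float) - background.astype(float)) > diff_thresh
    changed_fraction = mask.mean()
    return changed_fraction > area_thresh, mask

background = np.zeros((32, 32))
frame = background.copy()
frame[10:20, 10:20] = 1.0          # a bright "animal" enters the field
alarm, mask = detect_motion(background, frame)
```

In the full system, the change mask would be segmented into regions, features extracted per region, and a classifier would decide whether the region is an animal before any scare-away mechanism fires.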

Research Paper

Hybrid Approach for Denoising and Segmentation: N2S with Swin Transformer-Enhanced U-Net

Ashwini G.* , Ramashri T.**
*-** Department of Electronics and Communication Engineering, Sri Venkateswara University College of Engineering, Tirupati, Andhra Pradesh, India.
Ashwini, G., and Ramashri, T. (2025). Hybrid Approach for Denoising and Segmentation: N2S with Swin Transformer-Enhanced U-Net. i-manager's Journal on Image Processing, 12(1), 50-62. https://doi.org/10.26634/jip.12.1.21658

Abstract

Accurate segmentation in medical imaging, particularly for modalities such as Chest X-rays, CT scans, and microscopic images, is critical for diagnosis and treatment. However, noisy and low-quality data can significantly affect performance. This paper presents a novel framework that integrates Noise2Split denoising with a Hybrid Swin Transformer U-Net to enhance segmentation accuracy in these challenging medical imaging tasks. By combining Noise2Split's effective noise reduction with the Swin Transformer's advanced feature extraction and U-Net's robust segmentation architecture, the model efficiently addresses both noise and segmentation challenges. The Swin Transformer effectively captures both local and global context, while the skip connections in U-Net contribute to recovering detailed high-resolution features. Extensive experiments on Chest X-rays, CT scans, and microscopic images demonstrate that this integrated model performs better than traditional methods in terms of segmentation accuracy, making it a valuable tool for clinical applications where imaging quality is compromised.
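The step that distinguishes a Swin Transformer from a plain Vision Transformer is computing self-attention inside non-overlapping local windows. That partitioning (and its inverse, needed around the attention step) can be sketched as below; the feature-map size and window size are arbitrary illustrative choices, and the attention computation itself is omitted.

```python
import numpy as np

def window_partition(x, window_size):
    """Split an (H, W, C) feature map into non-overlapping windows, the
    step that lets Swin compute self-attention locally per window."""
    H, W, C = x.shape
    assert H % window_size == 0 and W % window_size == 0
    x = x.reshape(H // window_size, window_size,
                  W // window_size, window_size, C)
    return x.transpose(0, 2, 1, 3, 4).reshape(-1, window_size, window_size, C)

def window_reverse(windows, window_size, H, W):
    """Inverse operation: stitch attended windows back into an (H, W, C) map."""
    C = windows.shape[-1]
    x = windows.reshape(H // window_size, W // window_size,
                        window_size, window_size, C)
    return x.transpose(0, 2, 1, 3, 4).reshape(H, W, C)

feat = np.arange(8 * 8 * 2, dtype=float).reshape(8, 8, 2)
wins = window_partition(feat, 4)        # 4 windows, each 4x4 with 2 channels
restored = window_reverse(wins, 4, 8, 8)
```

Windowed attention keeps cost linear in image size (attention is quadratic only in the small window), which is why it suits high-resolution medical scans; shifted windows in alternating layers restore the cross-window context.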

Research Paper

Malaria Detection using Advanced U-Net Deep Learning Model

Kurella Devi Satwika* , Padala Sasidhar Reddy**, Shaik Basheer***, Karri Meghana Rani****, Venkatakrishnamoorthy T.*****, Anusha B.******
*-****, ****** Department of Electronics and Communication Technology, Sasi Institute of Technology and Engineering, Tadepalligudem, Andhra Pradesh, India.
***** Department of Electronics and Communication Engineering, Sasi Institute of Technology and Engineering, Tadepalligudem, Andhra Pradesh, India.
Satwika, K. D., Reddy, P. S., Basheer, S., Rani, K. M., Venkatakrishnamoorthy, T., and Anusha, B. (2025). Malaria Detection using Advanced U-Net Deep Learning Model. i-manager’s Journal on Image Processing, 12(1), 63-70. https://doi.org/10.26634/jip.12.1.21689

Abstract

Malaria continues to affect human lives extensively around the world, requiring urgent medical diagnostic procedures. This paper presents an improved version of the U-Net deep learning method that identifies malaria within microscopic blood smear images. The segmentation-based feature extraction within U-Net offers superior performance compared to ordinary deep learning methods, leading to better detection results. U-Net delivers precise localization of diseased areas, which boosts accuracy, whereas a CNN focuses on whole-image classification and an ANN struggles with complex spatial patterns. The experimental outcomes indicate that U-Net surpasses ANN and CNN approaches by delivering higher sensitivity and specificity. The model provides exact detection results, avoids human error, and shortens diagnostic time. The system is suited for practical deployment and offers good performance even when resources are limited. Data augmentation techniques improve generalization, making the system robust across different datasets. The result is a modern, deep-learning-based system for automated malaria detection.
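The sensitivity and specificity used to compare U-Net against ANN and CNN baselines can be computed pixel-wise from binary segmentation masks, as sketched below. The masks are synthetic toy data, not blood-smear results from the paper.

```python
import numpy as np

def sensitivity_specificity(pred, truth):
    """Pixel-wise sensitivity (recall on infected pixels) and specificity
    (recall on healthy pixels) for binary segmentation masks."""
    pred, truth = pred.astype(bool), truth.astype(bool)
    tp = np.logical_and(pred, truth).sum()    # infected, found
    tn = np.logical_and(~pred, ~truth).sum()  # healthy, left alone
    fn = np.logical_and(~pred, truth).sum()   # infected, missed
    fp = np.logical_and(pred, ~truth).sum()   # healthy, flagged
    return tp / (tp + fn), tn / (tn + fp)

truth = np.zeros((10, 10), dtype=bool)
truth[2:6, 2:6] = True                 # 16 "parasite" pixels
pred = truth.copy()
pred[2, 2] = False                     # one missed parasite pixel
pred[8, 8] = True                      # one false alarm
sens, spec = sensitivity_specificity(pred, truth)
```

In a malaria screen, sensitivity is the clinically critical number (a missed parasite is worse than a false alarm), which is why the abstract reports both rather than plain accuracy.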