i-manager's Journal on Software Engineering (JSE)


Volume 15 Issue 3 January - March 2021

Article

A Review on Data Mining and Big Data Analytics for Healthcare Solutions

Hiralkumari B. Chauhan* , Rahul K. Kher **
* Department of Information Technology, Bhailalbhai and Bhikhabhai Institute of Technology, Vallabh Vidyanagar, Gujarat, India.
** Department of Electronics and Communication Engineering, G H Patel College of Engineering and Technology (GCET), Gujarat, India.
Chauhan, H. B., and Kher, R. K. (2021). A Review on Data Mining and Big Data Analytics for Healthcare Solutions. i-manager's Journal on Software Engineering, 15(3), 1-4. https://doi.org/10.26634/jse.15.3.18254

Abstract

A personalized health care system model is proposed that provides e-health services in term of prediction, disease diagnosis and prevention strategy. Medical Internet of Things (MIoT) is utilized for generating various smart devices, sensors and mobile application. The collected data from hospitals, clinics, and laboratories will be stored on the cloud with the help of cloud and fog computing. Big data analytics and data mining algorithms can provide real time data analysis, testing and decision making. The required percussions will suggest an emergency service to user, if needed.

Research Paper

Identification of the Types of Skin Cancers from Skin Cancer Images and Covid-19 Detection on Chest X-Ray Images using Deep Learning

Debabrat Bharali* , Wahengbam Shibananda Singh **, Kuldeep Nath***, Afrin Haque ****, Dipu Medhi *****
*-***** Department of Computer Science, Regional Institute of Science and Technology, Meghalaya, India.
Bharali, D., Singh, W. S., Nath, K., Haque, A., and Medhi, D. (2021). Identification of the Types of Skin Cancers from Skin Cancer Images and Covid-19 Detection on Chest X-Ray Images using Deep Learning. i-manager's Journal on Software Engineering, 15(3), 5-20. https://doi.org/10.26634/jse.15.3.18329
World Health Organization : COVID-19 - Global literature on coronavirus disease
https://pesquisa.bvsalud.org/global-literature-on-novel-coronavirus-2019-ncov/resource/en/covidwho-1630418
ProQuest Central | ID: covidwho-1630418

Abstract

COVID-19 is a very deadly disease, which has killed thousands and infected millions of people worldwide. More recently in the year 2021, one of its mutants known as "The Delta Variant" has ravaged our country. It is also currently the chief cause of increasing cases in some North-Eastern states like Manipur and Arunachal Pradesh. Different measures have been adopted by the Government in collaboration with local social bodies to identify the infected individuals, detect the level of infection and also vaccinating individuals to shield them from this deadly disease. The current paper is also focused on one such stage, which is quite critical at this juncture, and will use the power of Artificial Intelligence to appropriately identify COVID-19 affected individuals using chest X-Ray images. When implemented, it will make it easier to identify the infection of the lungs by COVID-19. More specifically, the proposed methodology seeks to establish a chain of processes that can help in detecting the infection in the lungs using an advanced and novel image pre-processing with a prediction fusion-based deep learning-based identification system. The image pre-processing technique will initially improve the raw images by selectively optimizing the chromatic intensity and brightness of needy pixels using a deep learning-based Conditional Random Field (CRF) that uses the sigmoidal function. The enhanced image samples are made to undergo training with GoogLeNet and MobileNet deep learning models so that during the testing phase a prediction-fusion approach can be implemented to generate more robust prediction results. An exhaustive implementation with a standard dataset has revealed that the proposed approach can provide a mean accuracy of 98.63%, with the Covid and Normal classes showing 97.17% and 99.22% accuracies respectively. Another deadly disease that has infected thousands of people worldwide is skin cancer. Using the similar technical approach described above, a technique for identifying the type of skin cancer has been developed and experimented by using a standard dataset. Good accuracy of 85.42% has been achieved despite some classes having a comparatively lesser number of image samples. Finally, a Graphical User Interface (GUI) has also been developed by using the trained deep learning files of GoogLeNet and MobileNet so that a user can simply enter the desired image and check the type of prediction/class.

Research Paper

An Effective Model for Choosing Career in Schools using Data Mining Techniques

R. Thirumalaiselvi * , P. Narayanan**
*-** Department of Computer Science, Government Arts College for Men, Chennai, Tamil Nadu, India.
Thirumalaiselvi, R., and Narayanan, P. (2021). An Effective Model for Choosing Career in Schools using Data Mining Techniques. i-manager's Journal on Software Engineering, 15(3), 21-28. https://doi.org/10.26634/jse.15.3.18319

Abstract

An increasing number of future labour force will not just comprise highly skilled human resources, but would also seek to hire personnel with sound technical, analytical, and soft skills to get engaged in cross-cultural and multi-lingual organizational setup. The major challenge of higher education is the lack of knowledge of the talents of students, so their chances of success decreases. Choosing a wrong course leads to incompletion and getting a job becomes tougher. Predicting student skills early can help mentors to advice students in a timely manner and improve student success. In this paper, a model has been created using Naïve Bayes, J48, Random Forest, and Support Vector Machine (SVM) classification algorithm, with 100 attributes. Among the models built, Naïve Bayes and Random Forest algorithm yielded better accuracy rating. In this research work, we attempt to explore dynamic dataset by applying data mining methods to explore student's insights based on characteristics related to academic, technical, environment and interpersonal factors. The model has been tested and found to be performing well in constraint based learning environment.

Research Paper

Classification of the DDoS Attack over Flash Crowd with DNN using World Cup 1998 and CAIDA 2007 Datasets

Ch. Sekhar* , K. Venkata Rao **, M. H. M. Krishna Prasad ***
*,*** Department of Computer Science and Engineering, JNTU Kakinada, Andhra Pradesh, India.
** Department of Computer Science and Engineering, Vignan's IIT Kakinada, Andhra Pradesh, India.
Sekhar, C., Rao, K. V., and Prasad, M. H. M. K. (2021). Classification of the DDoS Attack over Flash Crowd with DNN using World Cup 1998 and CAIDA 2007 Datasets. i-manager's Journal on Software Engineering, 15(3), 29-36. https://doi.org/10.26634/jse.15.3.18353

Abstract

Present day's e-commerce business has tremendously increased as everyone got Internet on their hands through their mobile devices. E-commerce big giants like Amazon, Alibaba, Flipkart, etc. have come up with surprise sales with huge discounts on the products called Flash Events (FE) or Flash Sales (FS). It attracts the customers to purchase the product on such specified dates. Huge client requests were coming into the servers on these days. Based on this scenario, attackers target these networks to degrade the performance of e-commerce portals by generating huge fake server requests called Distributed Denial of Service (DDoS) attacks. Network attacks caused during Flash Events (FE), Flash Sales (FS) are considered as Flash Crowd attacks (FC). With FC attacks, the performance of the server is reduced as well as it affects the clients by not sending proper responses. In this paper, the two datasets to CAIDA and WC 1998 datasets have been considered. WC 1998 dataset deals with flash crowd and CAIDA dataset have DDoS attack information. Similar features from both datasets have been taken and the flash crowd and DDoS attacks have been classified using the Deep Neural Network (DNN) approach. The accuracy of discriminating the DDoS and FC/FE with an accuracy of 70.49 % at 100 epochs and 72.1 % at 1000 epochs has been achieved.

Research Paper

Parkinson’s Disease Prediction using Machine Learning

Preethi S.* , Priyadharshini K. S. **, Kavitha A. ***
*-*** Department of Information Technology, Bannari Amman Institute of Technology, Erode, Tamil Nadu, India.
Preethi, S., Priyadharshini, K. S., and Kavitha, A. (2021). Parkinson's Disease Prediction using Machine Learning. i-manager's Journal on Software Engineering, 15(3), 37-41. https://doi.org/10.26634/jse.15.3.18133

Abstract

Parkinson's disease is a disorder which is identified with loss of neurons and neurologic function. It is a condition that arises when fifty to seventy five percent of the neuronal cells are affected. The symptoms include muscle rigidity, tremors and change in the speech and gait. The genetic factor also increases the risk of Parkinson's disease in a person. Some researchers also suggest that Parkinson's disease is also caused by environmental factors and excessive medications. With the advancement of deep learning and machine learning technologies, disease prediction has received additional attention from big data researchers, and numerous studies have been conducted with a choice of different mechanisms. Studies have shown that about 90% of patients with this disease suffer from certain degree of speech impairment. Therefore, we have chosen voice data as an input for our model. The proposed methodology presents how algorithm works best for identification of disease with high accuracy by splitting the dataset. XGBoost algorithm has been applied on the dataset in order to get accuracy expected out of the model.