
A comprehensive review of machine learning techniques on diabetes detection


Diabetes mellitus has been an increasing concern owing to its high morbidity, and the average age of individuals affected by this disease has now decreased to the mid-twenties. Given the high prevalence, it is necessary to address this problem effectively. Many researchers and doctors have developed detection techniques based on artificial intelligence to better approach problems that are missed due to human error. Approaches deployed by various practitioners include data mining techniques with algorithms such as density-based spatial clustering of applications with noise and ordering points to identify the clustering structure; machine vision systems that learn from facial images to gain better features for model training; and diagnosis via the presentation of iridocyclitis, in which the disease is detected through iris patterns. Machine learning classifiers such as support vector machines, logistic regression, and decision trees have been comparatively discussed by various authors. Deep learning models such as artificial neural networks and recurrent neural networks have been considered, with a primary focus on long short-term memory and convolutional neural network architectures in comparison with other machine learning models. Evaluation measures such as the root-mean-square error, mean absolute error, area under the curve, and graphs with varying criteria are commonly used. In this study, challenges pertaining to data inadequacy and model deployment are discussed. The future scope of such methods is also discussed, and new methods are expected to enhance the performance of existing models, allowing them to attain greater insight into the conditions on which the prevalence of the disease depends.


Given the growing population, it is necessary to develop systems to augment health and mitigate increasing concerns around the world. As scientific research continues to advance, the development of such systems is becoming more efficient. Healthcare systems are designed to provide people with the requirements for good health and to perform the detection and diagnosis of diseases and conditions correctly and with greater efficiency than conventional methods. In general, patients are often highly concerned about the quality of the healthcare system and the facilities available to provide treatment. The benefits of improvements in healthcare systems tend to affect people who have prevailing ailments more directly, and this group comprises the majority of people affected by diseases such as diabetes, blood sugar, and blood pressure issues [1]. According to the National Diabetes Statistics Report 2020, 1 in 10 people in the United States has diabetes, and new cases of type-1 and type-2 diabetes have significantly increased among young people. As health and healthcare form critical pillars of a healthy society, it is necessary to use the capabilities of computational methods and artificial intelligence [2] to develop new methods for application in healthcare systems to promote a healthier society and reduce the risk of such diseases in our generations, further increasing the quality of life.

There has been a huge impact in the medical world with the advancement of technology. Health outcomes may depend on a matter of seconds for individuals who may not be able to reach a hospital or receive emergency treatment. Technology bridges this gap in distance and resources for all people to whom its benefits are extended. Various technologies have been developed using magnetic resonance imaging machines in video technology. Internet-based applications can provide patients with customizable services. After a few clinical visits, the remainder of the work can be fulfilled through high-tech services such as telehealth. Clinicians can communicate with patients through the Internet to better serve their needs [3]. One example of the use of video technology involves the provision of such mechanization in case of emergency to patients in trauma in rural and urban areas where clinical care may be unavailable [4]. Technology has the capacity to enable home healthcare with better productivity and security [5]. Data accuracy and availability have been proposed as among the most significant problems faced by hospitals, which involve maintaining and further processing patient data. Various algorithms in the field of machine learning and deep learning have been beneficially applied in practice to medical treatments. Some state-of-the-art ideas emerge from the massive implementation of technologies such as the creation of matching algorithms and natural language processing [6]. Data mining can be used to extract data directly, instead of relying on expert knowledge. These methods are considered to produce unique and distinctive patterns to create personalized plans for each hospital [7].

Diabetes mellitus (DM) is one of the most archetypal diseases worldwide. It is a disease in which a person’s body is unable to use energy from food efficiently. Four major types of diabetes are known: type-1, type-2, gestational, and other forms, with type-1 and type-2 being the most common [8]. Type-1 diabetes usually occurs in the young age group of 30–40 and is insulin dependent. Patients are required to take doses of insulin for their entire lives. In contrast, type-2 predominates among the over-40 age cohort and is often related to patients’ weight. Type-2 diabetes is known to have greater prevalence globally, accounting for more than 90% of cases [9]. Because this disease has long been near the top of global rankings of serious diseases, many researchers and doctors have proposed algorithms and methods for its treatment and detection. The implementation of these algorithms is rooted in the disciplines of deep learning and machine learning. With predictive analysis supported by neural networks, specifically convolutional neural networks (CNNs) and recurrent neural networks (RNNs), such methods have the ability to determine sentiments and learn high-level features automatically [10, 11]. Some researchers have applied machine learning algorithms such as gradient boosted trees to create predictive models of the progression from prediabetes to diabetes, with the aim of providing early diagnosis for better treatment and reducing further risks [12]. Another study applied a modified support vector machine (SVM) algorithm as an efficient method for both linear and non-linear data [13].

Since the advent of artificial intelligence and related technologies, such computational methods have been applied to real-time detection models in almost every field. The use of data mining, machine learning, deep learning, and computer vision has drastically reduced the difficulty of studying newer techniques that can significantly improve existing methods. In the next section, the algorithms and methods are surveyed.

Application of latest technology in diabetes detection

The algorithms used in data mining, machine learning, or any field of artificial intelligence perform predictive modeling, that is, the use of data and statistics to predict future outcomes based on historical data. The most common symptoms of diabetes include abnormal metabolism, hyperglycemia, and an associated risk of specific complications affecting the eyes, kidneys, and nervous system, which are major parts of the body. Such symptoms are used to gather data, and the modeling is then performed based on age and gender categories. One such algorithm is ordering points to identify the clustering structure (OPTICS). OPTICS is an advanced version of density-based spatial clustering of applications with noise (DBSCAN) and is designed to overcome the drawbacks of DBSCAN. The data clustering method used alongside it is balanced iterative reducing and clustering using hierarchies (BIRCH), which selects the most suitable data for further analysis. The naïve Bayes (NB) data mining technique is then used, while BIRCH and OPTICS are used to cluster similar types of data and to identify the correct algorithm for better accuracy [14]. Apache Spark is among the fastest growing platforms for health analysis. It operates more rapidly than the Hadoop platform, making it more easily usable and applicable to clinical practice [11].

Another such application, in the field of computer vision, is a computer-assisted non-invasive DM detection system. This system provides immediate results from facial images. The model examines four blocks of facial skin: the forehead, left cheek, right cheek, and nose bridge. Feature extraction is then performed using a local binary pattern, followed by classification using k-nearest neighbor (KNN) and SVM classifiers. All these features are connected with software that shows the results in real time [15].
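The local binary pattern step can be illustrated with a minimal sketch. It assumes the basic 3x3 operator on a grayscale image stored as nested lists; the function names `lbp_code` and `lbp_histogram`, the clockwise bit ordering, and the `>=` comparison are illustrative choices, not details taken from ref. [15].

```python
def lbp_code(patch):
    """Return the 8-bit local binary pattern code of a 3x3 grayscale patch."""
    center = patch[1][1]
    # Neighbours visited clockwise, starting at the top-left pixel (assumed order).
    neighbours = [
        patch[0][0], patch[0][1], patch[0][2],
        patch[1][2], patch[2][2], patch[2][1],
        patch[2][0], patch[1][0],
    ]
    code = 0
    for bit, value in enumerate(neighbours):
        if value >= center:  # neighbour at least as bright as the centre sets the bit
            code |= 1 << bit
    return code


def lbp_histogram(image):
    """Histogram of LBP codes over all interior pixels of a 2-D list image."""
    hist = [0] * 256
    for r in range(1, len(image) - 1):
        for c in range(1, len(image[0]) - 1):
            patch = [row[c - 1:c + 2] for row in image[r - 1:r + 2]]
            hist[lbp_code(patch)] += 1
    return hist
```

In a pipeline of this kind, one histogram per facial block would typically serve as the feature vector passed to the KNN or SVM classifier.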

Islam et al. [16] applied different algorithms to a diabetes symptom dataset, including NB, random forest (RF), logistic regression (LR), and a decision tree (DT). First, the dataset with the patient symptoms is entered into the system, on which the predictive algorithms are applied. After this, the dataset is input to the database and the performance or accuracy achieved by the model is observed. The most suitable algorithm is commonly selected based on the highest accuracy and best performance. The user’s data is again taken as the input for the algorithm for further training and evaluation to increase the accuracy of the model in real time.

Irido-diagnosis is a predictive system in which the disease is detected through iris patterns. It was used for the detection of diabetes via the following method. Data were gathered from diabetic and non-diabetic patients, and pre-processing was performed on them. The next step is image segmentation, wherein the iris of the eye is separated from the rest of the image in the dataset. Normalization is then performed, wherein the circular iris is converted to a rectangle using polar mapping. Feature extraction is performed on this final image. A gray-level co-occurrence matrix, which is a tool for examining texture, is used to characterize the images. This process assigns numbers from 1 to 8 to image areas by calculating how often pairs of pixels with specific values and in a specified spatial relationship occur in the image. From this matrix, features such as contrast, correlation, dissimilarity, homogeneity, variance, and entropy are pertinent to providing high-quality data.
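As a rough illustration of the gray-level co-occurrence matrix, the sketch below counts horizontally adjacent pixel pairs and derives four of the texture features named above. It assumes the image is already quantized to integer levels indexed from 0 (so eight levels would be 0 to 7 rather than 1 to 8); the function names and the default offset are hypothetical.

```python
import math


def glcm(image, levels, dr=0, dc=1):
    """Count co-occurrences of gray levels for pixel pairs offset by (dr, dc)."""
    matrix = [[0] * levels for _ in range(levels)]
    rows, cols = len(image), len(image[0])
    for r in range(rows):
        for c in range(cols):
            r2, c2 = r + dr, c + dc
            if 0 <= r2 < rows and 0 <= c2 < cols:
                matrix[image[r][c]][image[r2][c2]] += 1
    return matrix


def glcm_features(matrix):
    """Contrast, dissimilarity, homogeneity and entropy of a co-occurrence matrix."""
    total = sum(sum(row) for row in matrix)
    feats = {"contrast": 0.0, "dissimilarity": 0.0, "homogeneity": 0.0, "entropy": 0.0}
    for i, row in enumerate(matrix):
        for j, count in enumerate(row):
            if count == 0:
                continue  # empty cells contribute nothing
            p = count / total  # normalised co-occurrence probability
            feats["contrast"] += p * (i - j) ** 2
            feats["dissimilarity"] += p * abs(i - j)
            feats["homogeneity"] += p / (1 + (i - j) ** 2)
            feats["entropy"] -= p * math.log2(p)
    return feats
```

The offset `(dr, dc)` encodes the "specified spatial relationship"; other directions or distances can be obtained by changing it.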

These are just some of the authors among many who have contributed to the literature in this field. Below, a detailed survey of the methodology used by other researchers in this field is provided.

Machine learning in diabetes detection

Machine learning is a method by which a computational system learns the features of input data. Such methods have proven effective for the detection of diabetes. Many machine learning algorithms have been developed, including supervised, unsupervised, and reinforcement learning methods. This is practical because machine learning methods are driven by data: with massive amounts of data fed into the database, machine learning can save considerable human effort. Models are trained on this data and provide the most suitable output based on the input data. The models can be trained on any parameters that are practical and meet medical requirements. Some examine facial features, while others use blood report data obtained from patients. Because there are many symptoms of the disease, the parameters vary accordingly. With many proposed methods, researchers have probed various algorithms and tweaked numerous hyperparameters to obtain results that seem most suitable for real-life applications.

Choudhury and Gupta [17] used different algorithms to classify people into two categories: high- and low-risk individuals. They used an SVM to establish a hyperplane for categorization, a KNN classification technique for assigning new data to groups, DT, RF, and NB classifiers, and the binary classifier LR. On comparing the classification results in the form of a confusion matrix, as shown in Fig. 1, the LR algorithm was found to be the most efficient and accurate, while the DT algorithm achieved the lowest accuracy.

Fig. 1

Classification results of SVM, KNN, NB, DTs and LR in the form of true positives (TP), false positives (FP), true negatives (TN) and false negatives (FN), which are the entries of a confusion matrix [17]
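The standard measures derived from the four confusion-matrix entries can be written out as a short sketch. The function name `confusion_metrics` and the choice to return 0.0 when a denominator is zero are assumptions made for illustration.

```python
def confusion_metrics(tp, fp, tn, fn):
    """Derive common classification measures from confusion-matrix counts."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total
    # Guard against empty denominators (assumed convention: report 0.0).
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}
```

Accuracy alone can be misleading when the classes are imbalanced, which is why precision, recall, and the F-measure are often reported alongside it.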

Shukla [18] used an LR algorithm on a dataset and showed that the maximum accuracy was obtained when parameters such as glucose, body mass index (BMI), and pregnancies were used; the feature weights are represented as a bar chart in Fig. 2. The author also attempted to show that the disease depends predominantly on features that seem minor to us but have been noted by doctors as relevant in possibly leading to a higher risk of disease later. The LR model trained with the dominant features showed an accuracy of 82.92%. For the model forecast, the reported probability was 0.458 for class zero and 0.572 for class one, where class one corresponds to the person being diabetic.

Fig. 2

Weight of each feature contributing to the outcome variable [18]
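How a logistic regression model turns weighted features into class probabilities can be sketched as follows. The weights, bias, and feature order in the usage note are made up for illustration and are not the values learned in ref. [18].

```python
import math


def sigmoid(z):
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_proba(weights, bias, features):
    """Return (P(class 0), P(class 1)) for one sample under a fitted LR model."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    p_one = sigmoid(z)
    return 1.0 - p_one, p_one
```

For example, `predict_proba([0.03, 0.08, 0.1], -5.0, [120.0, 30.0, 2.0])` would score a hypothetical sample described by glucose, BMI, and number of pregnancies. The magnitude of each weight, on comparably scaled features, is what a bar chart of feature weights would display.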

Dalakleidi et al. [19] used two datasets: PID (Case 1) and Hippokrateion (Case 2). The PID dataset was split into 50% for training and 50% for testing, whereas the Hippokrateion dataset was split into 70% for training and 30% for testing. They used binary logistic regression (BLM) and the logistic model tree (LMT) algorithm, which combines LR and DT learning in simple models. The models’ performance was measured using classification accuracy (ACC) and area under the curve (AUC). In Case 1, BLM achieved an ACC of 80.47 and an AUC of 0.85, whereas LMT achieved an ACC of 77.6 and an AUC of 0.84. In Case 2, BLM outperformed LMT with an ACC of 93.45, whereas LMT had an ACC of 92.86.
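The AUC can be computed directly from its rank interpretation: the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative one. The sketch below uses a simple pairwise count, which is quadratic in the sample size and intended only for illustration; the function name is hypothetical.

```python
def auc_score(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half a correct ranking
    return wins / (len(positives) * len(negatives))
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect separation of the two classes.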

Islam et al. [20] analyzed a dataset using the NB, LR, and RF algorithms, applying 10-fold cross-validation and percentage-split evaluation techniques. Figure 3 shows their proposed architecture. The dataset contained records of 520 people who were asked about possible reasons for diabetes. After data pre-processing, there were a total of 314 positive values and 186 negative values. A positive value indicates that the person is diabetic, and a negative value indicates that they are not. The best result was achieved using the RF algorithm, with an accuracy of 99%. Thus, it is an effective algorithm for a newly created dataset. Figure 4 shows how each algorithm performed on modelling and prediction.

Fig. 3

Proposed architecture of the detection system [16]

Fig. 4

Performance of each algorithm on the newly acquired dataset [20]
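The 10-fold cross-validation procedure mentioned above can be sketched as an index generator. This version assumes contiguous, unshuffled folds; in practice the records would usually be shuffled (and often stratified by class) first. The function name is illustrative.

```python
def k_fold_indices(n_samples, k):
    """Yield (train_indices, test_indices) for k contiguous folds."""
    # Spread any remainder over the first folds so sizes differ by at most one.
    fold_sizes = [n_samples // k + (1 if i < n_samples % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test = list(range(start, start + size))
        train = list(range(0, start)) + list(range(start + size, n_samples))
        yield train, test
        start += size
```

Each record is used for testing exactly once and for training in the remaining folds, and the reported accuracy is typically the mean over the k test folds.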

Harris et al. [21] performed clinical diagnosis for the detection of non-insulin dependent diabetes mellitus (NIDDM) using weighted linear regression. The relationship between the prevalence of retinopathy and duration of NIDDM was determined according to individual years of duration and assessed using weighted linear regression with weights for each year’s data being inversely proportional to the binomial variance. The author stated that the retinopathy condition is an important parameter for the early diagnosis of the disease. It typically appears almost 4–7 years earlier than the clinical diagnosis of the disease. Figure 5 provides an accurate graph of the obtained results.

Fig. 5

Prevalence of diabetic retinopathy among patients at the time of clinical diagnosis of NIDDM
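A weighted linear regression of this kind can be sketched in closed form. The weights follow the description above (inversely proportional to the binomial variance p(1 - p)/n of each year's prevalence estimate). The function names are hypothetical, and reading an onset estimate from the point where the fitted line crosses zero prevalence is an assumption about how such a fit could be used, not a confirmed detail of ref. [21].

```python
def binomial_weights(prevalences, counts):
    """Weights inversely proportional to the binomial variance p(1 - p) / n.

    Assumes every prevalence lies strictly between 0 and 1.
    """
    return [n / (p * (1.0 - p)) for p, n in zip(prevalences, counts)]


def weighted_linear_fit(x, y, weights):
    """Return (slope, intercept) minimising the weighted squared error."""
    sw = sum(weights)
    mx = sum(w * xi for w, xi in zip(weights, x)) / sw
    my = sum(w * yi for w, yi in zip(weights, y)) / sw
    sxy = sum(w * (xi - mx) * (yi - my) for w, xi, yi in zip(weights, x, y))
    sxx = sum(w * (xi - mx) ** 2 for w, xi in zip(weights, x))
    slope = sxy / sxx
    return slope, my - slope * mx
```

With duration since diagnosis on the x-axis and retinopathy prevalence on the y-axis, `-intercept / slope` gives the duration at which the fitted prevalence reaches zero; a negative value places that point before clinical diagnosis.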

Ameena and Ashadevi [22] used the R language to build models based on SVM, DT, RF, and LR. They used a dataset of 768 women, all of whom were older than 20 years, with the following features: BMI, blood sugar, number of pregnancies, and diabetes pedigree function. They defined two classes: 1, which affirmed diabetes, and 0, which negated it. On comparing the accuracies, the authors concluded that the RF algorithm produced the most correct estimations, with an accuracy of almost 77%.

Daanouni et al. [23] used the KNN and DT algorithms on two datasets, the first with 2000 instances and the second with 768. They used eight features or attributes to train the model, such as BMI, glucose, blood sugar, and pregnancy. The authors used 80% of the data for training and the remaining 20% for testing, and optimized the hyperparameters to reduce the loss. Results were obtained on two types of data: with and without pre-processing. The results were compared using receiver operating characteristic (ROC) curves. The authors concluded that KNN achieved a maximum accuracy of 97.53% and an AUC of 0.9689. Table 1 compares the accuracies obtained when training the model using the KNN classifier.

Table 1 Comparison of the KNN and DT models on the dataset with and without pre-processing [23]
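A minimal KNN classifier with an 80/20 split might look like the sketch below. It assumes numeric feature vectors, Euclidean distance, and an unshuffled split; `k=3` and both function names are illustrative choices rather than the settings used in ref. [23].

```python
import math
from collections import Counter


def train_test_split(rows, train_fraction=0.8):
    """Split rows into a leading training part and a trailing test part."""
    cut = int(len(rows) * train_fraction)
    return rows[:cut], rows[cut:]


def knn_predict(train_x, train_y, query, k=3):
    """Predict the majority label among the k nearest training points."""
    distances = sorted(
        (math.dist(row, query), label) for row, label in zip(train_x, train_y)
    )
    votes = Counter(label for _, label in distances[:k])
    return votes.most_common(1)[0][0]
```

Because KNN relies on distances, features on very different scales (for example glucose versus number of pregnancies) are usually normalized first, which is one reason pre-processing can change the results noticeably.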

Sisodia D and Sisodia DS [24] used three classifiers: SVM, NB, and DT. The classification was performed on the PIMA Indian diabetes dataset taken from the UCI repository. Accuracy was measured using internal 10-fold cross-validation. Accuracy, F-measure, recall, precision, and ROC measures were used. The attributes used were glucose concentration, blood pressure, BMI, age, skin fold thickness, number of pregnancies, 2-h insulin concentration, pedigree function, and class 0 or 1. On modeling, the authors found that NB showed the maximum accuracy, with 586 correctly classified instances. Figure 6 shows the classifiers used along with the number of classified instances.

Fig. 6

Results of the different classification algorithms on prediction modelling, where NB outperforms the other two

Ahuja et al. [25] used the dataset from the UCI repository containing 768 records of women, of whom 268 were diabetic and the remaining 500 were not. The authors used eight features for classification and applied a feature selection technique, linear discriminant analysis (LDA), to extract the important features required for classification. They used five types of machine learning classifiers: SVM, DT, LR, RF, and a multilayer perceptron. The authors used four measures for evaluation: accuracy, precision, recall, and F score. Based on these measures, the authors concluded that the multilayer perceptron yields the best results. Table 2 reports the results for different values of k in k-fold validation.

Table 2 Accuracy results of different classifiers at different values of k-fold validation (%)

Alehegn et al. [26] used the PIMA Indian diabetes dataset with eight features to train on and the 130 D hospital dataset with a larger number of values. Four classification methods were used: RF, KNN, NB, and the J48 DT algorithm. J48 is an upgraded version of the Iterative Dichotomiser 3 (ID3) classification algorithm. A 10-fold cross-validation was used, with 90% of the data for training and 10% for testing. The authors built a hybrid model consisting of all of the above algorithms. They concluded that NB and J48 are good for large data computations, and that the KNN classifier is better for smaller datasets. Figure 7 shows the different algorithms used along with the correctly and incorrectly identified instances.

Fig. 7

Accuracy of the classification algorithms on the PIMA Indian diabetes dataset. The proposed hybrid model shows the maximum accuracy
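Decision tree learners in the ID3 family choose splits by information gain, that is, the reduction in label entropy obtained by partitioning on a feature. The sketch below shows that criterion for a categorical feature; it illustrates the ID3 idea only and is not the J48 implementation, which applies further refinements.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum(
        (count / total) * math.log2(count / total)
        for count in Counter(labels).values()
    )


def information_gain(feature_values, labels):
    """Entropy reduction obtained by splitting the labels on a categorical feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - remainder
```

At each node, the feature with the highest gain is selected, and the procedure recurses on each resulting subset.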

Further work by other researchers is summarized in Table 3, which surveys the machine learning algorithms used in their methods.

Table 3 A comprehensive study of the machine learning methods used by other researchers

Deep learning in diabetes detection

Deep learning is a computational field that usually requires high computational power. It focuses on neural networks: their types, training epochs, and input, hidden, and output layers. The input layer is the first layer, and the hidden layers are responsible for all the calculations and manipulations, such as convolutions and pooling. The output layer determines the number of classes for the classification. Because data augmentation, which means tweaking the data to increase accuracy, is also available in deep learning, it finds many applications in image training. Adding layers generally increases the capacity of the network for classification. Because of its many advantages, deep learning has been widely used in the medical field to compute results with high accuracy. There are different types of networks, the most prominent being artificial neural networks (ANNs), deep neural networks (DNNs), CNNs, and RNNs. Many researchers working on disease detection compare machine learning and deep learning algorithms to determine which provides the maximum accuracy.
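The flow of data from the input layer through the hidden layers to the output layer can be sketched as a forward pass. This example assumes fully connected layers with sigmoid activations only (no convolution or pooling), and the representation of a layer as a `(weights, biases)` pair is an illustrative choice.

```python
import math


def sigmoid(z):
    """Logistic activation function."""
    return 1.0 / (1.0 + math.exp(-z))


def dense(inputs, weights, biases):
    """One fully connected layer: each output neuron has a weight row and a bias."""
    return [
        sigmoid(sum(w * x for w, x in zip(row, inputs)) + b)
        for row, b in zip(weights, biases)
    ]


def forward(inputs, layers):
    """Propagate inputs through a list of (weights, biases) layers in order."""
    activations = inputs
    for weights, biases in layers:
        activations = dense(activations, weights, biases)
    return activations
```

Training consists of adjusting the weights and biases, typically by backpropagation, so that the final activations match the known labels; that step is omitted here.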

Daanouni et al. [23] used ANNs and DNNs on two datasets of 2000 and 768 instances. They included eight attributes with the label of output as 1 for positive and 0 for negative results. The network was trained on two types of data: pre-processed and non-pre-processed. The DNN model seems to achieve high accuracy on both the data obtained, with an accuracy of 98% for the pre-processed dataset and 99.5% for dataset 1. On dataset 2, the non-pre-processed data had an accuracy of 80.99% and the other 96.35%. Hence, the authors concluded that DNN is an optimal classifier for diabetes detection.

Rakshit et al. [40] used R, SQL, and Python in a Microsoft Azure machine learning studio environment with the PIMA diabetes dataset, in which 80% was used for training and the other 20% for testing. This dataset is primarily concerned with diabetes in women. It contains eight attributes that are important for building a two-class neural network model. Figure 8 shows the general representation of the neural network. The hidden layer had 100 nodes, with the output layer connected to the nth hidden layer. With the model trained for over 1000 epochs with a learning rate of 0.01, they achieved an accuracy of 83.3% on a dataset with 262 negative cases and 131 positive cases.

Fig. 8

General representation of a neural network, where xm denotes the input weights and yp the output weights

Sapon et al. [41] presented diabetes prediction using supervised ANNs. The dataset comprised approximately 250 patients with 27 variables or features, where exactly 50% was used for training and the remaining 50% for testing with the MATLAB tool. The gradient algorithms used were the Fletcher-Powell conjugate gradient, Polak-Ribière conjugate gradient, and scaled conjugate gradient. These algorithms were used to train the model, and the results were analyzed using the correlation coefficient (CC) R. The results of these algorithms were plotted at different epochs against the mean square error. Based on a comparison of the values of R, the authors concluded that the scaled conjugate gradient achieved the highest value, 0.88, followed by the Fletcher-Powell conjugate gradient with a value of 0.097219 and, finally, the Polak-Ribière conjugate gradient with 0.056466.
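The correlation coefficient R between network outputs and targets is, in the usual definition, the Pearson coefficient. The sketch below assumes that definition; the function name is illustrative and the source does not specify the exact computation used in MATLAB.

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)
```

Values near 1 indicate that predictions track the targets closely, whereas values near 0 indicate almost no linear relationship.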

Refs. [31, 42] both performed detection using an ANN. Ref. [42] used the PIMA Indian population dataset for women in Phoenix, while ref. [31] applied the algorithm to a questionnaire-based dataset containing data on 1487 people with positive and negative results. The features common to both models were BMI, age, weight, marital status, and pregnancies; ref. [31] additionally collected a large number of variables, such as alcohol and meat consumption, cigarette smoking count, beverage variety and counts, and exercise and sleep routines. Regarding the structure of the ANN, ref. [31] had approximately 15 hidden nodes, whereas the number varied from 0 to 5 in ref. [42]. Ref. [31] achieved an accuracy of 73.52%, compared with 80.21% on the test data for ref. [42].

Ref. [43] conducted a comparative study of neural networks in diabetes detection. Using the PIMA Indian diabetes set again, they used the eight common features required for model preparation. With the first 576 cases used for testing, they used a 10-fold cross-validation technique to estimate the results. The author used a multilayer neural network (MLNN) with 50 neurons for each hidden layer and an output layer using the non-sigmoid activation function, and a probabilistic neural network (PNN) with a single hidden layer. The author reported accuracies of 79.62 for the MLNN model and 78.65 for the PNN.

Ref. [44] used the PIMA diabetes set and built a model using an ANN with eight features. The author explained the different functions used for pre-processing and model training. The activation function was a sigmoid function, and backpropagation was used to calculate the gradient of the loss function. The error function computed the final error to be approximately 8% at the end of the model building. The results were validated using ROC and RMSE. The author achieved an ROC area of 0.88 and an RMSE of 0.39, which corresponds to a fair classifier. Figure 9 shows the results plotted as lines.

Fig. 9

Actual values (blue line) plotted against predicted values (red line)
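The gap between the actual and predicted curves is what the RMSE and the related MAE summarize. A minimal sketch of both, with illustrative function names:

```python
import math


def rmse(actual, predicted):
    """Root-mean-square error between two equal-length sequences."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def mae(actual, predicted):
    """Mean absolute error between two equal-length sequences."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)
```

RMSE penalizes large individual errors more heavily than MAE because the errors are squared before averaging.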

Ref. [29] trained an ANN on a dataset consisting of over 30000 instances and 11 features. With 12 hidden layers, the values of the layers were calculated using the sigmoid function. Bagging and boosting methods were implemented: bagging was used to reduce the variance of the model, and boosting was used to reduce the error. The neural network with bagging achieved an accuracy of 85.324%, followed by the ANN model with boosting with an accuracy of 84.815%, and then the plain ANN with 84.532%. Finally, the author validated the comparison using an ROC.
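Bagging trains several copies of a base learner on bootstrap resamples of the training data and combines them by majority vote. The sketch below shows the mechanism with a deliberately simple stand-in learner (a one-feature threshold) instead of an ANN; `fit_threshold`, its assumption that class 1 has the larger feature values, and the other names are hypothetical.

```python
import random
from collections import Counter


def bootstrap_sample(rows, rng):
    """Draw len(rows) rows with replacement."""
    return [rng.choice(rows) for _ in rows]


def fit_threshold(sample):
    """Toy base learner: split a 1-D feature halfway between the class means."""
    zeros = [x for x, y in sample if y == 0]
    ones = [x for x, y in sample if y == 1]
    if not zeros or not ones:
        constant = 1 if ones else 0  # degenerate resample with a single class
        return lambda x: constant
    cut = (sum(zeros) / len(zeros) + sum(ones) / len(ones)) / 2
    return lambda x: 1 if x > cut else 0


def bagged_predict(models, x):
    """Majority vote over the predictions of all bagged models."""
    votes = Counter(model(x) for model in models)
    return votes.most_common(1)[0][0]
```

For example, `models = [fit_threshold(bootstrap_sample(data, random.Random(i))) for i in range(25)]` builds an ensemble from `(feature, label)` pairs. Averaging over resamples is what dampens the variance of any single model.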

Ref. [45] used machine learning and deep learning techniques to detect DM. The dataset used here was the PIMA Indian diabetes dataset, consisting of 768 instances and eight features to train on, with 500 instances belonging to the non-diabetic class and the remaining 268 to the diabetic class. Sixty percent of the data was selected for training and the remaining 40% for testing; a CNN was used. The network was composed of three layers, with the classification performed by the output layer. The prediction accuracy achieved using the model was 76.81%.

Further work by other researchers is summarized in Table 4, which surveys the deep learning algorithms used in their methods.

Table 4 A comprehensive study of the deep learning methods used by other researchers

Challenges and future scope

Although many methods and algorithms have been proposed in the field of machine learning and deep learning, many challenges remain, as mentioned by the authors in their works. The first concern that comes to mind when building a model is data. Refs. [32, 33, 49] are among those that raised this problem. One of the biggest problems encountered during the survey was finding articles and papers that did not rely on the popular PIMA Indian dataset. This dataset is often chosen because it is well established, which helps ensure that the models provide good results. The available datasets are either too small or inadequate, or they lack real-time data. Small datasets lead to overfitting, in which the model shows high accuracy during training but cannot deal with newer testing data. Hence, the model is not feasible for real-time implementation. Some authors dealt with continuous glucose monitoring (CGM) data, which was real-time, but the model training with that data was not efficient. The datasets were also selected from particular regions that are not representative of a common system, as different regions have different people and lifestyles. Generally, researchers spend 80% of their time cleaning and managing data for model training. Hence, data complexity leads to higher cost and maintenance charges. The next step is feature selection. While some authors neglected some of the features, others grouped them for feasible training. Every dataset poses the problem of having appropriate features to cater to the needs of a single algorithm. After all the data are made available, the technical stacks are finalized. Many tools are available to construct machine learning models, but choosing a model to optimize performance is also necessary. The next challenge is debugging. This is easier when tools such as Jupyter Notebooks are used, where the code is divided into cells, and more difficult when the model runs in automated batch processes.
In addition, as only a few diabetes datasets are available on the Internet, more public data should be made available for research. More research should be performed using heart rate (HR) signals, as they require less bandwidth and their computational complexity is low. They can also be used on cloud or mobile devices. HR signals should also be used to detect other cardiac diseases. In some cases, authors required a time-series dataset. Since such datasets are not available from any online resource, it is difficult to replicate such work. Such special models require extensive tuning and large datasets for both training and testing.

The next part is the construction of an actual model. To achieve high accuracy, many parameters must be adjusted. Random states, kernels, the number of trees, and various other hyperparameters are considered while creating a model. Selecting a correct algorithm with suitable hyperparameters should also be performed precisely. Some classification models train on only a single parameter, which reduces the accuracy of the model in real-time detection. It is evident from the analysis of these schemes in all classes that most of them either rely on a single data input parameter or use feature selection that is not optimal. Along with such restrictions on parameters, a few classification-based schemes depend entirely on particular kinds of hardware devices, which reduces the availability and adaptability of these schemes.

A healthcare-based machine learning model is only useful if it can be used for the benefit of people, so model deployment in practical applications is critical. Many authors have proposed deploying models on mobile platforms. In real-life implementations, only engineers with a background in and experience with cloud servers and DevOps can deploy models. In this ongoing process, many issues need to be considered, such as how frequently the predictions need to be displayed or the number of applications required for model processing. Although considerable precautions were taken to ensure there were no discrepancies in this study, no study can claim to be perfect, and there is always scope for improvement. More probing studies that provide deeper insights into what drives the predictive power of models, rather than only pre-defined measures such as accuracy, precision, F1 score, ROC, and AUC, would be beneficial in the future. For classification, to distinguish between diabetic and normal profiles, clustering-based schemes provide accurate results. However, most clustering algorithms struggle with plug-and-play use, meaning that they usually require human intervention during classification and analysis, which introduces the possibility of error.

Despite the above challenges, it is reasonable to expect that they can be overcome in the future. Scholars and clinicians will continue to work toward the construction of larger and better datasets and to design more efficient models and algorithms for better classification accuracy. Any disease that occurs on a wide scale, such as diabetes, can be better controlled with the help of artificial intelligence techniques and automation. State-of-the-art, efficient models can be built on existing studies to provide early detection of diabetes and help people change their lifestyle accordingly. Because deep learning often performs better than other approaches, it should be combined with different algorithms to achieve better accuracy and performance; such hybrid schemes play an important role in improving the performance of models. Through early detection, patients can be treated much sooner, which reduces the risk of the heart problems associated with diabetes. Any model that is deployed on mobile platforms should be accessible to the general population and be representative of it. An implication of this survey is that ML models that have yielded good results can be used by future researchers as a starting point: they can be refined further, or combined into a pipeline or an ensemble of accurate and efficient models, to predict the disease more reliably. Such models can be further improved to automate the resulting system so that it can handle new data without problems.
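One simple way to combine several models into an ensemble is hard (majority) voting, in which each model casts one vote per patient and the most common label wins. The sketch below illustrates the idea only; the three prediction lists are made-up toy values rather than outputs of any model in the reviewed studies, and the function name is an assumption.

```python
from collections import Counter

def majority_vote(predictions_per_model):
    """Combine per-model label lists into one list by hard (majority) voting."""
    combined = []
    for votes in zip(*predictions_per_model):  # one tuple of votes per sample
        combined.append(Counter(votes).most_common(1)[0][0])
    return combined

def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

# Toy example (hypothetical values): each model is wrong on a different sample.
y_true  = [1, 0, 1, 1, 0]
model_a = [0, 0, 1, 1, 0]  # wrong on the first sample
model_b = [1, 1, 1, 1, 0]  # wrong on the second sample
model_c = [1, 0, 1, 1, 1]  # wrong on the last sample

ensemble = majority_vote([model_a, model_b, model_c])
print([accuracy(y_true, m) for m in (model_a, model_b, model_c)], accuracy(y_true, ensemble))
```

The ensemble corrects every individual mistake here only because the three models err on different samples; models that make the same mistakes gain little from voting. An odd number of voters avoids ties for a two-class problem.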


Diabetes can be devastating after a certain period if it is not detected or diagnosed correctly. Many machine learning methods have been discussed, starting from basic algorithms such as LR, SVM, NB, and DTs, including DT variants such as ID3, C4.5, C5.0, J48, and CART. Ensemble methods, such as bagging, boosting, and RF, are further used to enhance the accuracy and performance of models. These techniques have been implemented on platforms such as Python and MATLAB, and the models have been evaluated using measures such as the area under the curve and confusion matrices, or error measures such as the RMSE and MAE. Machine learning has been introduced in medical diagnosis systems because it has proven to be accurate in detection, successful in application to treatments, and more cost effective. Although the above are very strong classifiers, we believe that deep learning, which is a subset of machine learning, has the advantage that it can learn from large amounts of unstructured and unlabeled data. Deep learning models are more complex and are often more accurate. The deep learning models discussed range from basic ANNs to convolutional networks and RNNs, including LSTM and Bi-LSTM; temporal and deep belief networks have also been discussed. However, deep learning has shortcomings, such as increased computational time and resources and the need for frequent adjustment of parameters. Deep learning performs particularly well on image datasets; therefore, image data would be advantageous for diabetes diagnosis. Most researchers have implemented several machine and deep learning algorithms to compare their performance on the same data, while others have combined two or three methods to gain more accuracy in a single system.
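As a reminder of how the two error measures mentioned above differ, the sketch below computes the MAE and RMSE for a handful of predicted values. The numbers are made-up toy values styled as blood glucose readings and are not drawn from any reviewed study.

```python
import math

def mae(y_true, y_pred):
    """Mean absolute error: the average size of the errors."""
    return sum(abs(t - p) for t, p in zip(y_true, y_pred)) / len(y_true)

def rmse(y_true, y_pred):
    """Root-mean-square error: squares the errors first, so large misses weigh more."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))

# Toy example (hypothetical values): the errors are 10, 10 and 30.
measured  = [100.0, 150.0, 200.0]
predicted = [110.0, 140.0, 230.0]

print(round(mae(measured, predicted), 2), round(rmse(measured, predicted), 2))
```

The RMSE is never smaller than the MAE on the same data, and the gap between them grows when a few predictions are far off, as the single error of 30 is here.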

Researchers, clinical practitioners, and people in the industry widely believe that artificial intelligence has the power to change the current situation, in which medication and detection are delayed by human error. Automation makes it possible to construct efficient and reliable medical detection systems. Machine learning, by means of its powerful predictive and classification models, plays an important role in achieving this.

Availability of data and materials

All relevant data and material are presented in the main paper.



Abbreviations

Ordering points to identify cluster structure
Density-based spatial clustering of applications with noise
Diabetes mellitus
Balanced iterative reducing and clustering algorithm using hierarchies
Long short-term memory
Iterative Dichotomiser 3
Support vector machine
K-nearest neighbors
True positive
True negative
False positive
False negative
Binary logistic regression
Logistic model tree algorithm
Classification accuracy
Area under the curve
Linear discriminant analysis
Artificial neural network
Recurrent neural network
Convolutional neural network
Deep neural network
Mean absolute error
Correlation coefficient
Time lag
Receiver operating characteristic curve
Extreme learning machine
Non-insulin dependent diabetes mellitus
Body mass index
Decision tree
Logistic regression
Random forest
Naïve Bayes
Multilayer neural network
Root-mean-square error
Empirical mode decomposition




The authors are grateful to the Department of Electronics and Communication Engineering, Nirma University, and the Department of Chemical Engineering, School of Technology, Pandit Deendayal Energy University, for permission to publish this research.


Not applicable.

Author information




All the authors made substantial contributions to this manuscript. TS and MS participated in drafting the manuscript; TS wrote the main manuscript. All the authors discussed the results and their implications for the manuscript at all stages. The authors read and approved the final manuscript.

Corresponding author

Correspondence to Manan Shah.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.


About this article


Cite this article

Sharma, T., Shah, M. A comprehensive review of machine learning techniques on diabetes detection. Vis. Comput. Ind. Biomed. Art 4, 30 (2021).


  • Machine learning
  • Deep learning
  • Health care
  • Diabetes detection