تعداد نشریات | 43 |
تعداد شمارهها | 1,637 |
تعداد مقالات | 13,304 |
تعداد مشاهده مقاله | 29,859,817 |
تعداد دریافت فایل اصل مقاله | 11,940,864 |
A GA Approach for Tuning Membership Functions of a Fuzzy Expert System for Heart Disease Prognosis Development Risk | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Journal of Computing and Security | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
مقاله 2، دوره 4، شماره 1، فروردین 2017، صفحه 13-23 اصل مقاله (641.86 K) | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
نوع مقاله: Research Article | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
شناسه دیجیتال (DOI): 10.22108/jcs.2018.106877.0 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
نویسندگان | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Rana Akhoondi1؛ Rahil Hosseini* 1؛ Mahdi Mazinani2 | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1Department of Computer Engineering, Shahr-e-Qods branch, Islamic Azad University, Tehran, Iran. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
2Department of Electronic Engineering, Shahr-e-Qods Branch, Islamic Azad University, Tehran, Iran. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
چکیده | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Application of soft computing hybrid models have been concentrated to cope with uncertainty in the medical expert systems, recently. Heart disease is one of the mortal diseases that can be controlled in early stages. In this paper a hybrid Fuzzy-GA model for the Heart Disease Prediction (HDP) problem has been proposed. For this, first a Fuzzy Expert System (FES) using Mamdani model was presented. Then the membership functions parameters of the FES were optimized using the hybrid Fuzzy-Genetic Algorithm (Fuzzy-GA). The reason of selecting fuzzy method was its high potential to address the uncertainty sources in the knowledge of medical experts. Performance of the FES and Fuzzy-GA model were evaluated using a real dataset of 380 patients collected from Parsian Hospital in Karaj, Iran. Accuracy of the designed FES before optimization was 85.52%. After optimization using the hybrid Fuzzy-GA, the accuracy of this system was increased to 92.37%. The proposed hybrid model competes with its counterparts in terms of interpretability and accuracy in prognosis process of the heart disease. This model is promising for early diagnosis of the heart disease and saving more people lives. | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
کلیدواژهها | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
FES؛ Genetic Algorithm؛ Fuzzy-GA System؛ HDP | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
سایر فایل های مرتبط با مقاله
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
اصل مقاله | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The prognosis and control of disease is much easier and less expensive than their treatment. Controlling the heart disease can prevent sever damage to patients. Heart is an important organ of human body and its disruption may lead to death. Expert systems can assist prognosis of the heart disease and consequently early diagnosis of disease and its survival rates [1, 2]. This paper aims at prediction of the heart disease and improvement of the survival rate of this mortal disease. The prognosis is always simpler, and cheaper than treatment. However several studies have been reported on diagnosis of this disease, the HDP has less been concentrated. The objective is to optimize parameters of the FES using a hybrid fuzzy-Genetic algorithm design a FES for HDP to enhance the accuracy while keeping its interpretability. The role of fuzzy model is to manage the uncertainty in the symptoms and knowledge of experts. It has the capability of computing with words and reasoning in presence of vague and uncertain knowledge of experts [3, 4]. The FES was designed based on Mamdani inference model; the inputs and output of system were considered as fuzzy sets. A FES suffers from static behavior; i.e., it cannot learn from the environment. To address this issue, soft computing paradigm suggest to integrate fuzzy models with evolutionary algorithms. Evolutionary computation is simulation of the process of biological evolution of human intelligence and behavioral learning in computer systems for the purpose of optimization. To optimize intelligent models such as neural network and fuzzy systems, the GA has been frequently applied. Between different evolutionary algorithms, GA has been frequently and commonly applied model. This paper focuses on the GA capability for selection and combination of superior genes have been concentrated in the process of evolution and optimization of the FES. For this, GA was combined with the FES to optimize its parameters. Efficiency of a FES can improve through the following processes [5]:
This study is an attempt toward improving FES dynamic behavior with combining it with the evolutionary nature of the GA during the tuning task. The paper structure is organized as follows: Section 2 presents an overview of related works for prediction of heart disease, in Section 3 the design details of the FES for (HDP) has been explained, followed by the hybrid Fuzzy-GA method for tuning the FES. Experimental results and performance evaluation of the proposed method along with a comparison between the proposed method in this study and related works are presented in Section 4.Then Discussion and conclusion have been drawn in Section 5.
In this section, an overview of related works reported for (HDP) is presented. A prediction model for heart disease based on hybrid Artificial Neural Network and Gray Wolf Optimization (ANN-GWO) algorithm was designed in [6]. This paper combines two intelligent methods i.e., (GWO) and (ANN) and applied it to the HDP. This algorithm takes advantage of the ANN to search the relation between I/O variables and the GWO to find the optimized weights for the ANN and avoid trapping in the local optima. Efficiency of the hybrid (ANN-GWO) model in comparison with the back-propagation neural network (BPNN) in terms of Root Mean Square Error (RMSE). The results reveals that this model ameliorate the time to convergence of the algorithm and the overall accuracy of the model. An intelligent hybrid model to assess the risk of the HDP was designed in [7]. The objective of this model was to design a rule set for the HDP through combinations of the Rough Set (RS) theory for addressing uncertainty, and swarm intelligence in the Particle Swarm Optimization (PSO) with learning capability of the ANN and Fuzzy logic (FL). The RS theory was combined with the PSO into the Intelligent Hybrid Quick Reduced PSO (IHQRPSO) algorithm to find a set of optimum features. This feature set was used in the ANN to classify samples into healthy and unhealthy categories. The severity of disease was predicted using the rules of the FL. Accuracy of the IHQRPSO algorithm was 95.25%. Another system was designed for prediction of heart disease based on hybrid genetic fuzzy model in [8]. The objective of this work was diagnosis of heart disease based on commonly applied soft computing models; i.e., GA and FL. The proposed system takes advantages of the GA and FL to improve accuracy of prediction rates of heart disease in patients. The model accuracy on the UCI heart disease dataset was reported as 86% with the 90% specificity and sensitivity of 80%. A classification system for healthcare data was presented using Genetic fuzzy method in [9]. This paper integrates fuzzified standard additive model (SAM) with the GA, to address uncertainty and time complexity issues. This method was evaluated using the Wisconsin breast cancer (WBCD) and Cleveland heart disease (CHD) datasets in the UCI repository. Their method was computationally expensive. A model was proposed for prediction of heart disease based on firefly algorithm in [10]. This paper utilizes rough sets theory and interval type-2 FL system (IT2FLS). Their learning process utilizes the fuzzy C-mean clustering algorithm and adjusting parameter is conducted through chaos theory combined with firefly and the GA. A system was proposed for classification of Electrocardiogram (ECG) heart beat with a modified algorithm based on Artificial Bee Colony in [11]. It was applied to the ECG signals which was obtained from the MITBIH database. The result was compared to the related classifiers. A HDP model using a coronary artery disease (CAD) was reported in [12]. This model was optimized through an adaptive Neuro-fuzzy inference system (ANFIS) with combination of a linear discriminant analysis. The prediction rate of the ANFIS-LDA method was 80.2%. A neural network with back propagation learning was presented to model diagnosis of heart disease in [13]. The proposed system uses 13 features for heart disease four class prediction model. Their experimental results shows that classification accuracy of 92% on the UCI heart disease dataset. A classification system was presented for (HDP) using K-Nearest Neighborhood (KNN) and the GA in [14]. In this paper, a hybrid algorithm using combination of the KNN with the GA was proposed. Results shows that this algorithm enhances the diagnosis of the heart disease. The accuracy of this system was equal to 95.73%. A hybrid PSO and FES was proposed for the diagnosis of CAD in [15] on the Cleveland and Hungarian Heart Disease datasets. The PSO adjusts the parameters of the fuzzy membership functions (MFs). After MFs tuning process, the accuracy of the FES reached to 93.27%. The main advantage of this approach is the capability of interpretation of the FES output compared with related studies. A FES for CAD prediction was reported in [16]. In this paper, clinical parameters were employed for developing an expert system to predict the CAD (CAD) disease in early stages. The sensitivity of this system was reported 95.85% with the specificity of 83.33%. A medical diagnosis system was proposed for CD using a GA in [17]. In this paper, a FES was proposed for classifying the patients suffering from cardiovascular diseases. Then the membership functions are genetically tuned. The performance of this system was about 2% more than the other methods. A prediction model for CAD was presented using data mining tasks in [18]. This paper applies clustering model to define number of clusters. Then, a decision tree and artificial neural network were applied to classify the patients with the CAD. This model error was reported as 0.074. The summery of an analytical review of the related works has been presented in Table 1.
This section presents the design details of the FES for HDP followed by the hybrid Fuzzy-GA approach.
The FES was used for HDP includes the following main components: 1) Inference engine, and 2) knowledge-base; the former performs the reasoning process using and the later includes the fuzzy rules. The details of the FES and its rules for prognosis of the heart disease was reported by the authors in [19]. This section provides a brief review of this system and its improvement are explained. In this study, the design of the FES was based on our previously reported work in [19]. This study extends the design of this FES for HDP and improves its accuracy through the improved rule extraction method explained in the proposed hybrid Fuzzy-GA method explained in the next sections. The rest of this section briefly explains the design details of the FES. More details are provided in [19]. The rules of the FES was designed using knowledge of the cardiologists. This system includes 6 linguistic variables (5 input variables and one output variable). The MFs of the fuzzy linguistic variable are shown in Table 2. Gaussian membership function was considered in the FES as shown bellows [19]: −(x−c)2 f (x, σ, c) = e
2σ2 (1) Where the parameter c and σ2 are mean and variance, respectively.
The rules of the FES can be generated through considering all the combinations of variables and their linguistic terms. However, this way we would end up with a huge number of rules which may deteriorate the overall Table 1. Review of Intelligent Models Applied to (HDP)
Table 2. The Linguistic Terms of Input /Output Variables [19]
performance of the FES. One of the methods to extract the most important and effective rules is using the lookup table technique. In this approach, those rules which are compatible with majority of samples are selected. For this, at first each sample is considered as a fuzzy rule and evaluated separately on dataset. Then those fuzzy rules that are fired more by samples in the dataset and covers a greater area of problem state space are selected. The FES for HDP designed in our previous study [19], has been modified through the rule improvement process using look-up table technique. The improved FES was used for tuning and optimization of the parameters of the MF through the Fuzzy-GA tuning approach. Tuning these parameters can ameliorate the accuracy of FES and improve its interpretability.
In this section, the steps of the hybrid Fuzzy-GA method for tuning membership function parameters in a FES are explained as follows:
#parameters to adjust = (#variables × #linguistic terms × #parameters in a Gaussian MF) (2) In this study, this value is computed as 3 ∗ 3 ∗ 2 (for blood pressure. cholestrol and heart rate variables) +(3 ∗ 2 ∗ 2) (for blood sugar. smoking and heart disease variables) = 30.
was specified in previous step, are sorted to create an individual. Each of these parameters are considered as a gene in chromosome. Initialization of parameters are performed using the knowledge of experts. The size of population was considered to be 20.
The above mentioned steps for tuning of FIS using GA are illustrated in the flowchart presented in Figure 1. The fuzzy rule set after applying this Fuzzy-GA is presented in Table 4. Table 3. The Different Crossover and Mutation Parameters
Table 4. The Improved Rule Set of the FES
Figure 1. The Flowchart of FGA to Tune FIS.
Representation of the structure of chromosome is an important stage in design of the Fuzzy-GA. This paper works on the optimization of the MF parameters of the FES. Then MF parameters were considered as gens in a chromosome. The Gaussian membership function includes two parameters, σ.µ, that show standard deviation and mean, respectively. In fact, for optimization of database of a FES, the parameters of Gaussian membership functions are required to be optimized. The designed chromosome for proposed Fuzzy-GA approach is illustrated 0.26 Best: 0.0753618 Mean: 0.129177 0.24
0.22 0.2 Fitness Value 0.18
0.16 0.14 0.12 0.1 0.08 0.06 0 50 100 150 200 250 300 350 400 Generation Figure 2. Execute the GA to Evolve the FIS.
σ1 c1 σ2 c2 σ3 c3 σ4 c4 σ5 c5 σ6 c6 σ7 c7 σ8 c8 σ9 c9 σ10 c10 . . .
low normal high 1 0.8 Degree of Membership 0.6
0.4 0.2 0 6 8 10 12 14 16 18 20 22 Blood Pressure Figure 3. The Chromosome Representation for the Fuzzy-GA Method. in Figure 3.
The genetic operators directly affect the convergence property of the GA. The cross-over combines the fittest selected parents. The mutation operation provides variation for the GA. High crossover and low mutation rates can improve convergence of the GA. One of the common problems associated to evolutionary algorithm such as GA is premature convergence. This problem occurs when the algorithm traps in local optima. To address this problem, the following solutions have been considered:
This study applies and controls these parameters to avoid early convergence of the algorithm. To select the appropriate probability of mutation and crossover, the hybrid Fuzzy-GA was executed using different rates for crossover and mutation as shown in Table 3. The best performance of GA was obtained using 0.95 and 0.05 Pcrossover and Pmutation, respectively. 1
0.9 0.8 True Positive Rate (Sensitivity) 0.7
0.6 0.5 0.4 0.3 0.2 0.1 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 False Positive Rate (1-Specificity) Figure 4. The ROC Curve of FES for HDP.
The Fuzzy-GA method for HDP was evaluated and performance of the proposed method was investigated using an ROC curve analysis. Performance evaluation was experimented for the Fuzzy-GA for prognosis of the heart disease on real data of patients.
The dataset includes 380 patients, which was collected from Parsian Hospital in Karaj, Iran. In this dataset 32.1% of samples was related to male and remaining to female. The dataset includes 55.26% of high risk patients and the rest are healthy people. The samples actual labels defined by medical experts were compared to outputs of FES and Fuzzy-GA method. The input features are Gender, Age, Blood pressure, Blood sugar, Cholesterol, Heart rate and Smoking.
One of the common evaluation method for classification models is ROC (Receiver Operating Characteristic) curve analysis. An ROC curve demonstrates the trade-off between system benefits and its cost as the observer changes the decision threshold. The ROC curve plots True positive (TP) on the y-axis against False Positive (FP) on the x-axis. The performance of each classifier is represented by the area under the ROC curve. The area under the ROC curve is a measure of the performance of the classifier [22, 23]. True positive (TP): If the object is positive (abnormal) and is classified as positive. False negative (FN): If the object is positive and it is classified as negative (normal). True negative (TN): If an object is negative and is classified as negative. False positive (FP): If an object is negative and is classified as positive. According to [22, 23], the ROC curve depicts the trade-off between the classifier’s benefits (tp rates) and its cost (fp rates). Each point of this curve shows (fp rates, tp rates) a decision based on selected threshold. The area under ROC curve (AUC) has been frequently employed as a performance measure. The performance of fuzzy system and Fuzzy-GA system for HDP were investigated using an ROC curve analysis and related metrics as follows and shown in Figures 4 and 5. 1
0.9 0.8 True Positive Rate (Sensitivity) 0.7
0.6 0.5 0.4 0.3 0.2 0.1 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 False Positive Rate (1-Specificity) Figure 5. The ROC Curve of Fuzzy-GA system for HDP. Table 5. Confusion Matrix of Fuzzy System
Table 6. Confusion Matrix of Fuzzy-GA System
Accuracy = Specificity = Sensitivity = TN + TP TN + TP + FN + FP TN
TN + FN TP FP + TP (3) (4) (5)
The performance of 2-class classifier for HDP was demonstrated using the confusion matrix. Confusion matrix shows all the predictions of 2-class classifier for data. A classification system predicts the class label of an object [24, 25]. The confusion matrix of fuzzy system and the Fuzzy-GA system shown in Tables 5 and 6, respectively. According to Table 6, after applying the Fuzzy-GA approach, the number of the TP and TN circumstances have increased (26 more samples are correctly classified) while number of misclassified samples has decreased from 55 samples in FES to 29 samples. The ROC analysis results before and after optimization have been compared in Table 7. According to the obtained results in this Table, the Fuzzy-GA system for HDP outperforms the FES. After optimization and tuning of parameters of fuzzy system using Fuzzy-GA algorithm, its accuracy and sensitivity (tp rate) have increased. The AUC (area under the ROC curve) of Fuzzy-GA with 96.23% is greater than the fuzzy system
Table 7. Comparison of Performance of Fuzzy System and Fuzzy-GA System Using the ROC Curve Criteria.
with an accuracy of 91.77%.
Comparison of the proposed methods in this study with the other related works for prediction of heart disease has been presented in Table 8. A Mamdani fuzzy inference model was used which has a high interpretability to interact with the medical experts. The performance of the proposed model is competitive with its counterpart methods for its high accuracy and the interpretability, i.e.; lower number of input variables and less fuzzy rules. Also the proposed FES works with input features that are not expensive to achieve for all the people. The Fuzzy-GA method for tuning the parameters of membership functions in FES was initialized using the knowledge of experts. The extensive usage and simple implementation and high interpretability of generated solutions are the reasons for selection of Genetic Algorithm for optimization of FES. The advantages of proposed Fuzzy-GA method for HDP have been summarized as follows:
According to the Table 8, the classification system in [11] which uses heart rate (ECG) as input, has higher accuracy than the systems that use clinical parameters as input data. The Fuzzy-GA model proposed in this paper compared to the work in [11], utilizes simple features with lower complexity. Furthermore, the aim of the proposed Fuzzy-GA method was to maintain the trade-off between the FES accuracy and its interpretability. The accuracy of proposed Fuzzy-GA model is competitive while providing higher interpretability in the input feature sets and the process of the design of the FESs.
In this paper, a Fuzzy-GA system for HDP was proposed. The Fuzzy-GA system has improved accuracy of the FES. The designed system predicts the risk of heart disease development. Performance of the proposed model was investigated applying a dataset including 380 patients collected from Parsian Hospital in Karaj, Iran. The dataset includes actual diagnosis of medical experts. Accuracy of the FES was 85.52% and after tuning with Fuzzy-GA method was reached to 92.37%. The Fuzzy-GA method in comparison with other related works has higher interpretability to interact with the medical experts by using fuzzy inference model with less input parameters. Our future work is to optimize the (HDP) system using other evolutionary algorithms. Table 8. Comparison of the Accuracy of the Related Works With Proposed Works in This Study.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
مراجع | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
آمار تعداد مشاهده مقاله: 767 تعداد دریافت فایل اصل مقاله: 471 |