SlideShare une entreprise Scribd logo
1  sur  4
Integrated Intelligent Research (IIR) International Journal of Business Intelligents
Volume: 05 Issue: 01 June 2016 Page No.108-111
ISSN: 2278-2400
108
Holistic Forecasting of Onset of Diabetes through
Data Mining Techniques
S.Sankaranarayanan1
, T.Pramananda Perumal2
Research Scholar, R & D Centre, Bharathiar University, Coimbatore, India &
1
Associate professor of Computer Science, Government Arts College, Kumbakonam, India
2
Principal, Presidency College, Chennai, India
Email:profsankaranarayanan@yahoo.in1
, pramanandaperumal@yahoo.com2
Abstract— Diabetes is one of the modern day diseases that poses
serious threat for the affected and is ever challenging for
physicians who are involved in its management and
control.Type2 diabetes mellitus ranges in exponential rating day
by day in its increase. Mere not being aware of the facts and
causes that can lead to such state, unawareness about diabetic
symptoms and late detection make diabetic condition
unmanageable and is really a challenging task to be faced all
victims. This paper suggests holistic measures and means by
which any common man can get into it to check whether he / she
is a would-be victim of Diabetes through simple checking of
symptoms that may lead to Diabetic condition, analyses the
factual causes of the aforesaid disease. This would certainly
make a person to ensure for the locus-centric state of whether of
being a diabetic or not. The problem of diagnosing the onset and
incidence of Diabetes is addressed more with a data mining
approach in mind. As the success of any data mining approach is
solely dependant on the underlying dataset upon which learning
is manifested and taken for, this paper inspects more on locating
prima-facie symptoms of diabetes disorder. A sagacious insight
of analyzing the actual causes of diabetes is set and hence a
comprehensive set of data for diabetic condition is proposed
here. Subjecting this data to data analysis through simple data
mining techniques v.i.z., FP-Growth and Apriori would certainly
model a holistic inference engine that could help a doctor to be
more astute in confirming the diabetic condition of patients.
Association rules are also being inducted based on both of these
approaches. A heuristic computer aided diagnosis (CAD) system
for diabetes can be built upon this.
Keywords— Holistic forecasting of Diabetes, Association rule
mining (ARM) for Diabetes, Augmented dataset for Diabetes,
Computer Aided Diagnosis (CAD) for Diabetes
I. INTRODUCTION
Diabetes mellitus, a metabolic disorder is a state of the body by
which it is not properly coping with the synthesis of glucose
level to blood stream and other organs of the body normally due
to lack of Insulin hormone. Type1, the Insulin Dependant
Diabetes Mellitus is the worst case compared to type2 in which
no Insulin is required for the management of this disorder and
sustainability. Dietary modifications, bodily exercise are more
useful and sometimes anti-diabetic drugs are believed to offer
remedy in such cases. A complex analysis by a trained and
competent doctor marks the first phase in the treatment process.
While the basic causes of diabetes are genetic
susceptibility or insulin resistance, other contributing factors are
as being given here below
 Sedentary lifestyle
 Physical inactivity and lack of exercise
 Excessive sleep
 Overeating and consequent obesity
 Mental stress and strain
The early signs of diabetes should not be ignored at any cost
because once intensified, this disease can lead to tough
complications like heart disease, stroke, foot ulcers, kidney
failure, and eye damage. Normally before the incidence of
diabetic symptoms, nobody cares about this deadly disease.It is
customary on the part of the doctors to investigate the leading
signs and symptoms with a set of pre-determined bio-parameters
and conclude whether a person is really diabetic or not. Only
patents’ attributes such as blood glucose level (BGL), glycated
hemoglobin (HbA1c), age, sex etc. are normally taken for
consideration by field experts for disease analysis. Though an
appreciable amount of efforts have been put by both doctors and
patients, diabetes remains to be an ever challenging for its
management. Also no mechanism is available as of now to
accurately predict whether a non-diabetic or pre-diabetic would
probably develop diabetes in future. The reason is mostly
because of that the latent causes and the corresponding
parameters are often ignored or omitted miserably by medical
experts. And now time has come for us to pour a holistic insight
into this disease and to appropriately inspect the ground reasons
which incidentally cause diabetes culminated. In this paper,
various holistic measures are suggested to fill the gap which
prevails in early prediction of diabetes. The success rate of this
holistic system of prediction using various data mining
techniques would certainly work out to a 100% hit and success.
II. THE BACKGROUND
A. Data mining for diabetic health care
Data mining of databases nowadays has become a dependable
mechanism to predict decisions including diseases like heart
attack, diabetic mellitus and other complicated ailments [1].
Data mining addresses to both knowledge extraction and
machine learning mostly by using Classification, Clustering and
Association techniques. Applying a particular data mining
technique suiting a particular application is domain centric.
Classification consists of predicting a certain outcome or goal
based on a given instance of input values. It uses a training set,
Integrated Intelligent Research (IIR) International Journal of Business Intelligents
Volume: 05 Issue: 01 June 2016 Page No.108-111
ISSN: 2278-2400
109
which maps a set of attributes to the predictor attribute. The
algorithm tries to discover relationships between the attributes
that would make it possible to predict the outcome. Next the
algorithm is given a data set not seen before which contains the
same set of attributes, except for the class label which is not yet
known. Diabetes disease diagnosis via proper interpretation of
the diabetes data is an important classification problem.
Statistical analysis of attribute oriented data and mining
associative logic rules from a larger data are also the classical
approaches to data mining. Decision tree is yet another model in
which a tree node denotes a test on parametric attributes and the
leaves can model classes [2]. Cluster analysis deals with data
objects without a consultative (known) class label and thus
contributes to unsupervised learning. Data mining algorithm is
used for testing the accuracy in predicting diabetic status [3].
B. Data set for Diabetic analysis through Data mining
Diabetic patients are often classified by well-known benchmark
data set available at the University of California- the Irvine data
repository, the so-called “Pima Indians” diabetic database and
dataset (PIDD), which is a collection of medical diagnostic
reports of 768 examples from a native-American population
living near Phoenix, Arizona. This data set was used to develop
algorithms to predict whether a patient shows signs of diabetes
according to 1990 World Health Organization criteria. Thus
WHO has proposed a set of nine attributes as depicted in Table I;
all are physiological parameters and medical test results for
diagnosing diabetes disease.
Table I – PIDD
Item.No Item
1 Pregnancy recurrence / times
2 Plasma glucose level from Oral Glucose
Forbearance Test -OGFT(mg /dL)
3 Diastolic blood tension (mm/Hg)
4 Triceps skin fold thickness (mm)
5 2-hour serum insulin (μU/ml)
6 BMI, the Body Mass Value (kg/m2)
7 Diabetes Pedigree function
8 Age (years)
9 Disease Status (0-Healthy, 1-Diabetes)
C. Holistic observation of Diabetic Symptoms
Diabetic disease condition of patients is often diagnosed by
Expert-doctors with a set of pre-conceived parameters, the
so-called patient symptoms as depicted in Table1. Yet they lag in
to have a holistic insight which is required to be put in place in
lieu of the traditional and existing diabetic analysis pattern by
which diabetes can be predicted comprehensively and
holistically. To understand more of diabetic symptoms
holistically, it is worthwhile to see the factors proposed by the
consultant of Endocrinology and Diabetology, Moolchand
Medcity, New Delhi. Dr Sanjiv Bhambani portrays the top 12
symptoms of diabetes disorder [4] as follows:
 Disproportionate increase of Thirst (Polydypsia)
 Extreme dryness of Mouth
 Excessive Hunger (Polyphagia)
 Frequent Urination and associate Urine Infection
(Polyuria)
 Sudden Weight Loss
 Fatigue
 Blurred Vision
 Headaches
 Change in Skin Complexion ,Dry skin, Itching
 Slow healing Wounds
 More skin and / or Yeast infections
 Numbness of Limbs, Tingling of hands and feet
D. The proposed data set for Holistic mining of Diabetes
By exploring the root causes which are reasonably responsible
for patients to have the incidence of Diabetes, there is need to
append with more of attributes in the existing system which
evaluates and predicts Diabetes disease. This augmentation in
the diabetes dataset would certainly make the diabetic diagnosis
more viable, reliable and exploring. Any computer aided
diagnosis system designed upon this augmentation would be
steadfast and consistent in predicting the incidence of diabetes in
any case as to be either pre-diabetic or would-be diabetic. The
proposed data set is deliberated here in Table II.
Table II – (a) the Augmented Dataset for Diabetic Analysis
Item.No Item
1 Pregnancy recurrence / times
2 Plasma glucose level from Oral Glucose
Forbearance Test -OGFT(mg /dL)
3 Diastolic blood tension (mm/Hg)
4 Triceps skin fold thickness (mm)
5 2-hour serum insulin (μU/ml)
6 BMI, the Body mass value (kg/m2)
7 Diabetes Pedigree function
8 Age (years)
9 Polydypsia & Dried Mouth (1-Yes, 0-No)
10 Polyphagia (1-Yes, 0-No)
11 Polyuria (1-Yes, 0-No)
12 Abrupt Weight Loss (1-Yes, 0-No)
13 Visionary Blurring (1-Yes, 0-No)
Table II – (b) the Augmented Dataset for Diabetic Analysis
14 Sudden change in Skin Contour & Itching &
Infections (1-Yes, 0-No)
15 Numbness of Hands - Limbs and Tingling of
Palms – Soles (1-Yes, 0-No)
16 Excessively Fatigue and Headache Status
(1-Yes, 0-No)
17 Disease Status (0-Healthy, 1-Diabetes)
Any database built on the above attributes would certainly be
information rich in all possible directions that would make it
more dependable in the task of predicting diabetes. It has 17
attributes; all being numeric variables out of which the last
attribute is the target variable which predicts the status of
Integrated Intelligent Research (IIR) International Journal of Business Intelligents
Volume: 05 Issue: 01 June 2016 Page No.108-111
ISSN: 2278-2400
110
diabetes holistically.These attributes are, the PIDD proposed
plus the holistic parameters identified here in this paper. They
are summarized here below:
1. Pregnancy recurrence / times
2. Two Hour plasma glucose - OGFT
3. Diastolic blood tension
4. Triceps skin fold thickness
5. Two hour serum insulin
6. BMI
7. Diabetes pedigree function
8. Age
Apart from the PIDD attributes as given above, other factors
which can also have a determining role in diabetic condition are
carefully identified and appended to be the new attributes here in
this paper. They are merely conditional attributes as given below
as to be either prevalent or not (1- if prevalent, 0-if not prevalent)
9. Polydypsia & Dried Mouth
10. Polyphagia
11. Polyuria
12. Abrupt Weight Loss
13. Visionary Blurring
14. Sudden change in Skin Contour & Itching ,
Infections
15. Numbness of Hands /Limbs and Tingling of Palms/
Soles
16. Excessively Fatigue and Headache Status
All the above 16 values holistically predict the disease status, the
target variable i.e if the subjected patient is diabetic within 3
years (class=1) or not (class=0).
III. LITERATURE SURVEY
Surveying of literature was conducted for the above problem
area and extracted that most of the studies [2], [3], [5], [6] have
applied data mining only to the conventional PIDD and the
results obtained of these studies were not stupendous because of
the fact that these have missed to make mention of all possible
factors that have either active or passive role in causing diabetes,
as mentioned here in the augmented dataset given in Table II.
But data mining study if made on this proposed dataset, a
superset to PIDD would be more viable in precisely predicting
the diabetic condition of human race.
IV. DATA MINING METHODS FOR DIABETES DATA
Data mining process can be extremely useful for medical
practitioners for extracting hidden medical knowledge.
Prediction models are the core data mining methods mainly used
in healthcare and engineering field and the techniques used in
such prediction are shown in Fig. 1. Performance evaluation is
done by comparing various models used and accuracy is
measured thereof.
Fig. 1 Steps in Data mining predicting models
A. Data mining through Association Rule Mining (ARM)
ARM establishes relation among the set of items included in a
database [5] by means of a strategy similar to a market-bucket
analysis. Any co-occurrence of instances of data items of a
database are identified as relationships and decomposed as rules
by ARM. Thus association rule mining can mine probable
causes of diabetes that can be used for better clinical diagnosis
[6]. The generalized framework of this method is summarized
here.
An association rule has the form X⇒Y, where X, Y⊂ I, the data
instance set & X∩Y=Ø. The rule X⇒Y holds in the transaction
set D
X also contain Y. The rule X⇒Y has support s in the transaction
ntains X∪Y [6], [7]. The
rules so generated are usually pruned according to some notion
of confidence. An efficient data storage used here is the T-tree,
the total support tree which uses set enumeration to store
frequent itemsets [6].
B. Apriori and Frequent Pattern growth
Ariori algorithm finds frequent itemsets from the database and
thus identifies association rules. Frequent pattern growth
(FP-Growth) also discovers frequent itemset without candidate
itemset generation. FP-Growth is a two-step process as given
below:
Step 1: A data structure called the Frequent Pattern tree
(FP-tree) is built over the given data set
Step 2: Frequent itemsets are formulated from the FP-tree
V. RESULTS AND DISCUSSION
Association rule mining generates very useful rules for the
diabetes dataset proposed here. Apriori and FP-growth have
generated same set of rules from the augmented dataset. These
rules provide valuable knowledge and have the potential to
improve clinical decision making in diabetes disease. Some of
the rules which comply to 100.0% confidence are summarized
here below:
1. IF (Age >=40) AND (Polyphagia = 1) AND (Weight
Loss = 0) THEN Not Diabetic
2. IF (Age >=65) AND (Polyphagia = 1) AND (Weight
Loss = 0) AND (Numbness/Tingling =1) THEN Not
Diabetic
3. IF (Age >=70) AND (Polyphagia = 1) AND (Weight
Loss = 1) AND (Numbness/Tingling =1) AND
(Polyuria = 0) THEN Not Diabetic
Integrated Intelligent Research (IIR) International Journal of Business Intelligents
Volume: 05 Issue: 01 June 2016 Page No.108-111
ISSN: 2278-2400
111
4. IF (Diastolic Blood Tension >= 72) AND (Diastolic
Blood Tension <= 92) AND (BMI<=25) AND (Age <=
44) AND (Polydypsia = 1) AND (Weight Loss = 0)
AND (Skin change = 1) THEN Not Diabetic
5. IF (Diastolic Blood Tension = 69) AND (BMI<=25)
AND (Age <= 56) AND (Polydypsia = 1) AND
(Weight Loss = 0) AND (Numbness/Tingling =1)
THEN Not Diabetic
6. IF (Diastolic Blood Tension >=92) AND (BMI=30)
AND (Polydypsia = 1)THEN Diabetic
7. IF (Age > 45) AND (Polydypsia = 1) AND (Visionary
Blurring = 1) THEN Diabetic
8. IF (Age >=40) AND (Polydypsia = 1) AND
(Polyphagia = 1) AND (Weight Loss = 1) THEN
Diabetic
9. IF (Age >=53) AND (Polyphagia = 1) AND (Weight
Loss = 1) AND (Numbness/Tingling =1) THEN
Diabetic
10. IF (Age >=58) AND (Polyuria= 1) AND (Weight Loss
= 1) AND (Numbness/Tingling =1) AND (Fatigue= 1)
THEN Diabetic
11. IF (OGFT > 132) AND (BMI>27) AND (Polydypsia =
1) AND (Weight Loss =1) AND (Fatigue = 1) THEN
Diabetic
REFERENCES
[1] Sellappan Palaniappan and Rafiah Awang, “Intelligent disease
prediction system using data mining techniques” International Journal
of Computer Science and Network Security, vol.8 (8), pp.343-350,
2008.
[2] S. Sankaranarayanan and T. Pramananda Perumal, “Diabetic
diagnosis through data mining” Indian Journal of Innovations and
Developments, Vol 3(2), pp. 30-34, September 2014.
[3] Ravi Sanakal and T. Jayakumari, “Prognosis of Diabetes Using Data
mining Approach-Fuzzy C Means Clustering and Support Vector
Machine"International Journal of Computer Trends and Technology,
Vol 11(2), pp.94-98, May 2014.
[4] Dr Sanjiv Bhambani in “Ten natural ways to control Diabetes” by
Bipasha at url www.The FitIndian.com.
[5] S. Sankaranarayanan and T. Pramananda Perumal, “A predictive
approach for diabetes mellitus disease through Data mining
technologies”, CPS-IEEE/ iExplore, 978-1-4799-2876
/13$31.00©2013, IEEE, DOI10.1109/WCCCT.2014.65, pp.231-233,
2014.
[6] Milan Zorman e.t al. “Mining Diabetes Database with Decision Trees
and Association Rules”, 2002.
[7] Carlos Ordonnez, “Comparing Association Rules and Decision Trees
for Disease Prediction”, HIKM’06, November 11, 2006.

Contenu connexe

Similaire à Holistic Forecasting of Onset of Diabetes through Data Mining Techniques

Cuidadomdicoendiabetes 2009-091110224108-phpapp01
Cuidadomdicoendiabetes 2009-091110224108-phpapp01Cuidadomdicoendiabetes 2009-091110224108-phpapp01
Cuidadomdicoendiabetes 2009-091110224108-phpapp01
hattye Board
 
Iisrt zz srinivas ravi
Iisrt zz srinivas raviIisrt zz srinivas ravi
Iisrt zz srinivas ravi
IISRT
 
Untitled presentation.pptx
Untitled presentation.pptxUntitled presentation.pptx
Untitled presentation.pptx
AkshayaPriyaJanartha
 

Similaire à Holistic Forecasting of Onset of Diabetes through Data Mining Techniques (20)

Dexlife Handbook DRAFT
Dexlife Handbook DRAFTDexlife Handbook DRAFT
Dexlife Handbook DRAFT
 
Preliminary Diagnostic System for Endocrine related diseases
Preliminary Diagnostic System for Endocrine related diseasesPreliminary Diagnostic System for Endocrine related diseases
Preliminary Diagnostic System for Endocrine related diseases
 
Approach to Support Diabetes through Data Visualization
Approach to Support Diabetes through Data Visualization Approach to Support Diabetes through Data Visualization
Approach to Support Diabetes through Data Visualization
 
IRJET- Diabetes Prediction by Machine Learning over Big Data from Healthc...
IRJET-  	  Diabetes Prediction by Machine Learning over Big Data from Healthc...IRJET-  	  Diabetes Prediction by Machine Learning over Big Data from Healthc...
IRJET- Diabetes Prediction by Machine Learning over Big Data from Healthc...
 
Futuro en el tratamiento de la DM2
Futuro en el tratamiento de la DM2Futuro en el tratamiento de la DM2
Futuro en el tratamiento de la DM2
 
A Systematic Review Of Type-2 Diabetes By Hadoop Map-Reduce
A Systematic Review Of Type-2 Diabetes By Hadoop Map-ReduceA Systematic Review Of Type-2 Diabetes By Hadoop Map-Reduce
A Systematic Review Of Type-2 Diabetes By Hadoop Map-Reduce
 
Cuidadomdicoendiabetes 2009-091110224108-phpapp01
Cuidadomdicoendiabetes 2009-091110224108-phpapp01Cuidadomdicoendiabetes 2009-091110224108-phpapp01
Cuidadomdicoendiabetes 2009-091110224108-phpapp01
 
Evolving Role of Digital Biomarkers in Healthcare
Evolving Role of Digital Biomarkers in HealthcareEvolving Role of Digital Biomarkers in Healthcare
Evolving Role of Digital Biomarkers in Healthcare
 
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
Analysis and Prediction of Diabetes Diseases using Machine Learning Algorithm...
 
Iisrt zz srinivas ravi
Iisrt zz srinivas raviIisrt zz srinivas ravi
Iisrt zz srinivas ravi
 
Pre-Diabetes State: Anthropometric and Haematological Parameters
Pre-Diabetes State: Anthropometric and Haematological ParametersPre-Diabetes State: Anthropometric and Haematological Parameters
Pre-Diabetes State: Anthropometric and Haematological Parameters
 
Non-invasive Diagnostic Tools: Cardiometabolic Risk Assessment and Prediction
Non-invasive Diagnostic Tools: Cardiometabolic Risk Assessment and PredictionNon-invasive Diagnostic Tools: Cardiometabolic Risk Assessment and Prediction
Non-invasive Diagnostic Tools: Cardiometabolic Risk Assessment and Prediction
 
Early Stage Diabetic Disease Prediction and Risk Minimization using Machine L...
Early Stage Diabetic Disease Prediction and Risk Minimization using Machine L...Early Stage Diabetic Disease Prediction and Risk Minimization using Machine L...
Early Stage Diabetic Disease Prediction and Risk Minimization using Machine L...
 
Untitled presentation.pptx
Untitled presentation.pptxUntitled presentation.pptx
Untitled presentation.pptx
 
PSEDM-DOH WorkshopDiabetes Management Training Using Insulin v_7 - 20170321.pptx
PSEDM-DOH WorkshopDiabetes Management Training Using Insulin v_7 - 20170321.pptxPSEDM-DOH WorkshopDiabetes Management Training Using Insulin v_7 - 20170321.pptx
PSEDM-DOH WorkshopDiabetes Management Training Using Insulin v_7 - 20170321.pptx
 
C14 idf atlas 2015 eng
C14 idf atlas 2015 engC14 idf atlas 2015 eng
C14 idf atlas 2015 eng
 
Dentists Against Diabetes
Dentists Against DiabetesDentists Against Diabetes
Dentists Against Diabetes
 
Data Visualization: New Health Data Value
Data Visualization: New Health Data ValueData Visualization: New Health Data Value
Data Visualization: New Health Data Value
 
Diabetes prediction using machine learning
Diabetes prediction using machine learningDiabetes prediction using machine learning
Diabetes prediction using machine learning
 
Using Big Data to Drive Diabetes Management and Care
Using Big Data to Drive Diabetes Management and CareUsing Big Data to Drive Diabetes Management and Care
Using Big Data to Drive Diabetes Management and Care
 

Plus de ijcnes

Secured Seamless Wi-Fi Enhancement in Dynamic Vehicles
Secured Seamless Wi-Fi Enhancement in Dynamic VehiclesSecured Seamless Wi-Fi Enhancement in Dynamic Vehicles
Secured Seamless Wi-Fi Enhancement in Dynamic Vehicles
ijcnes
 

Plus de ijcnes (20)

A Survey of Ontology-based Information Extraction for Social Media Content An...
A Survey of Ontology-based Information Extraction for Social Media Content An...A Survey of Ontology-based Information Extraction for Social Media Content An...
A Survey of Ontology-based Information Extraction for Social Media Content An...
 
Economic Growth of Information Technology (It) Industry on the Indian Economy
Economic Growth of Information Technology (It) Industry on the Indian EconomyEconomic Growth of Information Technology (It) Industry on the Indian Economy
Economic Growth of Information Technology (It) Industry on the Indian Economy
 
An analysis of Mobile Learning Implementation in Shinas College of Technology...
An analysis of Mobile Learning Implementation in Shinas College of Technology...An analysis of Mobile Learning Implementation in Shinas College of Technology...
An analysis of Mobile Learning Implementation in Shinas College of Technology...
 
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...
 
Challenges of E-government in Oman
Challenges of E-government in OmanChallenges of E-government in Oman
Challenges of E-government in Oman
 
Power Management in Micro grid Using Hybrid Energy Storage System
Power Management in Micro grid Using Hybrid Energy Storage SystemPower Management in Micro grid Using Hybrid Energy Storage System
Power Management in Micro grid Using Hybrid Energy Storage System
 
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...
 
Feature Extraction in Content based Image Retrieval
Feature Extraction in Content based Image RetrievalFeature Extraction in Content based Image Retrieval
Feature Extraction in Content based Image Retrieval
 
Challenges and Mechanisms for Securing Data in Mobile Cloud Computing
Challenges and Mechanisms for Securing Data in Mobile Cloud ComputingChallenges and Mechanisms for Securing Data in Mobile Cloud Computing
Challenges and Mechanisms for Securing Data in Mobile Cloud Computing
 
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...
 
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...
 
An Effective and Scalable AODV for Wireless Ad hoc Sensor Networks
An Effective and Scalable AODV for Wireless Ad hoc Sensor NetworksAn Effective and Scalable AODV for Wireless Ad hoc Sensor Networks
An Effective and Scalable AODV for Wireless Ad hoc Sensor Networks
 
Secured Seamless Wi-Fi Enhancement in Dynamic Vehicles
Secured Seamless Wi-Fi Enhancement in Dynamic VehiclesSecured Seamless Wi-Fi Enhancement in Dynamic Vehicles
Secured Seamless Wi-Fi Enhancement in Dynamic Vehicles
 
Virtual Position based Olsr Protocol for Wireless Sensor Networks
Virtual Position based Olsr Protocol for Wireless Sensor NetworksVirtual Position based Olsr Protocol for Wireless Sensor Networks
Virtual Position based Olsr Protocol for Wireless Sensor Networks
 
Mitigation and control of Defeating Jammers using P-1 Factorization
Mitigation and control of Defeating Jammers using P-1 FactorizationMitigation and control of Defeating Jammers using P-1 Factorization
Mitigation and control of Defeating Jammers using P-1 Factorization
 
An analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining TechniquesAn analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining Techniques
 
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...
 
Priority Based Multi Sen Car Technique in WSN
Priority Based Multi Sen Car Technique in WSNPriority Based Multi Sen Car Technique in WSN
Priority Based Multi Sen Car Technique in WSN
 
Semantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based SystemSemantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based System
 
Investigation on Challenges in Cloud Security to Provide Effective Cloud Comp...
Investigation on Challenges in Cloud Security to Provide Effective Cloud Comp...Investigation on Challenges in Cloud Security to Provide Effective Cloud Comp...
Investigation on Challenges in Cloud Security to Provide Effective Cloud Comp...
 

Dernier

XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
ssuser89054b
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
ankushspencer015
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
dollysharma2066
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 

Dernier (20)

Unleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leapUnleashing the Power of the SORA AI lastest leap
Unleashing the Power of the SORA AI lastest leap
 
Unit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdfUnit 1 - Soil Classification and Compaction.pdf
Unit 1 - Soil Classification and Compaction.pdf
 
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...Booking open Available Pune Call Girls Pargaon  6297143586 Call Hot Indian Gi...
Booking open Available Pune Call Girls Pargaon 6297143586 Call Hot Indian Gi...
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Wakad Call Me 7737669865 Budget Friendly No Advance Booking
 
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar  ≼🔝 Delhi door step de...
Call Now ≽ 9953056974 ≼🔝 Call Girls In New Ashok Nagar ≼🔝 Delhi door step de...
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank  Design by Working Stress - IS Method.pdfIntze Overhead Water Tank  Design by Working Stress - IS Method.pdf
Intze Overhead Water Tank Design by Working Stress - IS Method.pdf
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Palanpur 7001035870 Whatsapp Number, 24/07 Booking
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
Thermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.pptThermal Engineering -unit - III & IV.ppt
Thermal Engineering -unit - III & IV.ppt
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 

Holistic Forecasting of Onset of Diabetes through Data Mining Techniques

  • 1. Integrated Intelligent Research (IIR) International Journal of Business Intelligents Volume: 05 Issue: 01 June 2016 Page No.108-111 ISSN: 2278-2400 108 Holistic Forecasting of Onset of Diabetes through Data Mining Techniques S.Sankaranarayanan1 , T.Pramananda Perumal2 Research Scholar, R & D Centre, Bharathiar University, Coimbatore, India & 1 Associate professor of Computer Science, Government Arts College, Kumbakonam, India 2 Principal, Presidency College, Chennai, India Email:profsankaranarayanan@yahoo.in1 , pramanandaperumal@yahoo.com2 Abstract— Diabetes is one of the modern day diseases that poses serious threat for the affected and is ever challenging for physicians who are involved in its management and control.Type2 diabetes mellitus ranges in exponential rating day by day in its increase. Mere not being aware of the facts and causes that can lead to such state, unawareness about diabetic symptoms and late detection make diabetic condition unmanageable and is really a challenging task to be faced all victims. This paper suggests holistic measures and means by which any common man can get into it to check whether he / she is a would-be victim of Diabetes through simple checking of symptoms that may lead to Diabetic condition, analyses the factual causes of the aforesaid disease. This would certainly make a person to ensure for the locus-centric state of whether of being a diabetic or not. The problem of diagnosing the onset and incidence of Diabetes is addressed more with a data mining approach in mind. As the success of any data mining approach is solely dependant on the underlying dataset upon which learning is manifested and taken for, this paper inspects more on locating prima-facie symptoms of diabetes disorder. A sagacious insight of analyzing the actual causes of diabetes is set and hence a comprehensive set of data for diabetic condition is proposed here. Subjecting this data to data analysis through simple data mining techniques v.i.z., FP-Growth and Apriori would certainly model a holistic inference engine that could help a doctor to be more astute in confirming the diabetic condition of patients. Association rules are also being inducted based on both of these approaches. A heuristic computer aided diagnosis (CAD) system for diabetes can be built upon this. Keywords— Holistic forecasting of Diabetes, Association rule mining (ARM) for Diabetes, Augmented dataset for Diabetes, Computer Aided Diagnosis (CAD) for Diabetes I. INTRODUCTION Diabetes mellitus, a metabolic disorder is a state of the body by which it is not properly coping with the synthesis of glucose level to blood stream and other organs of the body normally due to lack of Insulin hormone. Type1, the Insulin Dependant Diabetes Mellitus is the worst case compared to type2 in which no Insulin is required for the management of this disorder and sustainability. Dietary modifications, bodily exercise are more useful and sometimes anti-diabetic drugs are believed to offer remedy in such cases. A complex analysis by a trained and competent doctor marks the first phase in the treatment process. While the basic causes of diabetes are genetic susceptibility or insulin resistance, other contributing factors are as being given here below  Sedentary lifestyle  Physical inactivity and lack of exercise  Excessive sleep  Overeating and consequent obesity  Mental stress and strain The early signs of diabetes should not be ignored at any cost because once intensified, this disease can lead to tough complications like heart disease, stroke, foot ulcers, kidney failure, and eye damage. Normally before the incidence of diabetic symptoms, nobody cares about this deadly disease.It is customary on the part of the doctors to investigate the leading signs and symptoms with a set of pre-determined bio-parameters and conclude whether a person is really diabetic or not. Only patents’ attributes such as blood glucose level (BGL), glycated hemoglobin (HbA1c), age, sex etc. are normally taken for consideration by field experts for disease analysis. Though an appreciable amount of efforts have been put by both doctors and patients, diabetes remains to be an ever challenging for its management. Also no mechanism is available as of now to accurately predict whether a non-diabetic or pre-diabetic would probably develop diabetes in future. The reason is mostly because of that the latent causes and the corresponding parameters are often ignored or omitted miserably by medical experts. And now time has come for us to pour a holistic insight into this disease and to appropriately inspect the ground reasons which incidentally cause diabetes culminated. In this paper, various holistic measures are suggested to fill the gap which prevails in early prediction of diabetes. The success rate of this holistic system of prediction using various data mining techniques would certainly work out to a 100% hit and success. II. THE BACKGROUND A. Data mining for diabetic health care Data mining of databases nowadays has become a dependable mechanism to predict decisions including diseases like heart attack, diabetic mellitus and other complicated ailments [1]. Data mining addresses to both knowledge extraction and machine learning mostly by using Classification, Clustering and Association techniques. Applying a particular data mining technique suiting a particular application is domain centric. Classification consists of predicting a certain outcome or goal based on a given instance of input values. It uses a training set,
  • 2. Integrated Intelligent Research (IIR) International Journal of Business Intelligents Volume: 05 Issue: 01 June 2016 Page No.108-111 ISSN: 2278-2400 109 which maps a set of attributes to the predictor attribute. The algorithm tries to discover relationships between the attributes that would make it possible to predict the outcome. Next the algorithm is given a data set not seen before which contains the same set of attributes, except for the class label which is not yet known. Diabetes disease diagnosis via proper interpretation of the diabetes data is an important classification problem. Statistical analysis of attribute oriented data and mining associative logic rules from a larger data are also the classical approaches to data mining. Decision tree is yet another model in which a tree node denotes a test on parametric attributes and the leaves can model classes [2]. Cluster analysis deals with data objects without a consultative (known) class label and thus contributes to unsupervised learning. Data mining algorithm is used for testing the accuracy in predicting diabetic status [3]. B. Data set for Diabetic analysis through Data mining Diabetic patients are often classified by well-known benchmark data set available at the University of California- the Irvine data repository, the so-called “Pima Indians” diabetic database and dataset (PIDD), which is a collection of medical diagnostic reports of 768 examples from a native-American population living near Phoenix, Arizona. This data set was used to develop algorithms to predict whether a patient shows signs of diabetes according to 1990 World Health Organization criteria. Thus WHO has proposed a set of nine attributes as depicted in Table I; all are physiological parameters and medical test results for diagnosing diabetes disease. Table I – PIDD Item.No Item 1 Pregnancy recurrence / times 2 Plasma glucose level from Oral Glucose Forbearance Test -OGFT(mg /dL) 3 Diastolic blood tension (mm/Hg) 4 Triceps skin fold thickness (mm) 5 2-hour serum insulin (μU/ml) 6 BMI, the Body Mass Value (kg/m2) 7 Diabetes Pedigree function 8 Age (years) 9 Disease Status (0-Healthy, 1-Diabetes) C. Holistic observation of Diabetic Symptoms Diabetic disease condition of patients is often diagnosed by Expert-doctors with a set of pre-conceived parameters, the so-called patient symptoms as depicted in Table1. Yet they lag in to have a holistic insight which is required to be put in place in lieu of the traditional and existing diabetic analysis pattern by which diabetes can be predicted comprehensively and holistically. To understand more of diabetic symptoms holistically, it is worthwhile to see the factors proposed by the consultant of Endocrinology and Diabetology, Moolchand Medcity, New Delhi. Dr Sanjiv Bhambani portrays the top 12 symptoms of diabetes disorder [4] as follows:  Disproportionate increase of Thirst (Polydypsia)  Extreme dryness of Mouth  Excessive Hunger (Polyphagia)  Frequent Urination and associate Urine Infection (Polyuria)  Sudden Weight Loss  Fatigue  Blurred Vision  Headaches  Change in Skin Complexion ,Dry skin, Itching  Slow healing Wounds  More skin and / or Yeast infections  Numbness of Limbs, Tingling of hands and feet D. The proposed data set for Holistic mining of Diabetes By exploring the root causes which are reasonably responsible for patients to have the incidence of Diabetes, there is need to append with more of attributes in the existing system which evaluates and predicts Diabetes disease. This augmentation in the diabetes dataset would certainly make the diabetic diagnosis more viable, reliable and exploring. Any computer aided diagnosis system designed upon this augmentation would be steadfast and consistent in predicting the incidence of diabetes in any case as to be either pre-diabetic or would-be diabetic. The proposed data set is deliberated here in Table II. Table II – (a) the Augmented Dataset for Diabetic Analysis Item.No Item 1 Pregnancy recurrence / times 2 Plasma glucose level from Oral Glucose Forbearance Test -OGFT(mg /dL) 3 Diastolic blood tension (mm/Hg) 4 Triceps skin fold thickness (mm) 5 2-hour serum insulin (μU/ml) 6 BMI, the Body mass value (kg/m2) 7 Diabetes Pedigree function 8 Age (years) 9 Polydypsia & Dried Mouth (1-Yes, 0-No) 10 Polyphagia (1-Yes, 0-No) 11 Polyuria (1-Yes, 0-No) 12 Abrupt Weight Loss (1-Yes, 0-No) 13 Visionary Blurring (1-Yes, 0-No) Table II – (b) the Augmented Dataset for Diabetic Analysis 14 Sudden change in Skin Contour & Itching & Infections (1-Yes, 0-No) 15 Numbness of Hands - Limbs and Tingling of Palms – Soles (1-Yes, 0-No) 16 Excessively Fatigue and Headache Status (1-Yes, 0-No) 17 Disease Status (0-Healthy, 1-Diabetes) Any database built on the above attributes would certainly be information rich in all possible directions that would make it more dependable in the task of predicting diabetes. It has 17 attributes; all being numeric variables out of which the last attribute is the target variable which predicts the status of
  • 3. Integrated Intelligent Research (IIR) International Journal of Business Intelligents Volume: 05 Issue: 01 June 2016 Page No.108-111 ISSN: 2278-2400 110 diabetes holistically.These attributes are, the PIDD proposed plus the holistic parameters identified here in this paper. They are summarized here below: 1. Pregnancy recurrence / times 2. Two Hour plasma glucose - OGFT 3. Diastolic blood tension 4. Triceps skin fold thickness 5. Two hour serum insulin 6. BMI 7. Diabetes pedigree function 8. Age Apart from the PIDD attributes as given above, other factors which can also have a determining role in diabetic condition are carefully identified and appended to be the new attributes here in this paper. They are merely conditional attributes as given below as to be either prevalent or not (1- if prevalent, 0-if not prevalent) 9. Polydypsia & Dried Mouth 10. Polyphagia 11. Polyuria 12. Abrupt Weight Loss 13. Visionary Blurring 14. Sudden change in Skin Contour & Itching , Infections 15. Numbness of Hands /Limbs and Tingling of Palms/ Soles 16. Excessively Fatigue and Headache Status All the above 16 values holistically predict the disease status, the target variable i.e if the subjected patient is diabetic within 3 years (class=1) or not (class=0). III. LITERATURE SURVEY Surveying of literature was conducted for the above problem area and extracted that most of the studies [2], [3], [5], [6] have applied data mining only to the conventional PIDD and the results obtained of these studies were not stupendous because of the fact that these have missed to make mention of all possible factors that have either active or passive role in causing diabetes, as mentioned here in the augmented dataset given in Table II. But data mining study if made on this proposed dataset, a superset to PIDD would be more viable in precisely predicting the diabetic condition of human race. IV. DATA MINING METHODS FOR DIABETES DATA Data mining process can be extremely useful for medical practitioners for extracting hidden medical knowledge. Prediction models are the core data mining methods mainly used in healthcare and engineering field and the techniques used in such prediction are shown in Fig. 1. Performance evaluation is done by comparing various models used and accuracy is measured thereof. Fig. 1 Steps in Data mining predicting models A. Data mining through Association Rule Mining (ARM) ARM establishes relation among the set of items included in a database [5] by means of a strategy similar to a market-bucket analysis. Any co-occurrence of instances of data items of a database are identified as relationships and decomposed as rules by ARM. Thus association rule mining can mine probable causes of diabetes that can be used for better clinical diagnosis [6]. The generalized framework of this method is summarized here. An association rule has the form X⇒Y, where X, Y⊂ I, the data instance set & X∩Y=Ø. The rule X⇒Y holds in the transaction set D X also contain Y. The rule X⇒Y has support s in the transaction ntains X∪Y [6], [7]. The rules so generated are usually pruned according to some notion of confidence. An efficient data storage used here is the T-tree, the total support tree which uses set enumeration to store frequent itemsets [6]. B. Apriori and Frequent Pattern growth Ariori algorithm finds frequent itemsets from the database and thus identifies association rules. Frequent pattern growth (FP-Growth) also discovers frequent itemset without candidate itemset generation. FP-Growth is a two-step process as given below: Step 1: A data structure called the Frequent Pattern tree (FP-tree) is built over the given data set Step 2: Frequent itemsets are formulated from the FP-tree V. RESULTS AND DISCUSSION Association rule mining generates very useful rules for the diabetes dataset proposed here. Apriori and FP-growth have generated same set of rules from the augmented dataset. These rules provide valuable knowledge and have the potential to improve clinical decision making in diabetes disease. Some of the rules which comply to 100.0% confidence are summarized here below: 1. IF (Age >=40) AND (Polyphagia = 1) AND (Weight Loss = 0) THEN Not Diabetic 2. IF (Age >=65) AND (Polyphagia = 1) AND (Weight Loss = 0) AND (Numbness/Tingling =1) THEN Not Diabetic 3. IF (Age >=70) AND (Polyphagia = 1) AND (Weight Loss = 1) AND (Numbness/Tingling =1) AND (Polyuria = 0) THEN Not Diabetic
  • 4. Integrated Intelligent Research (IIR) International Journal of Business Intelligents Volume: 05 Issue: 01 June 2016 Page No.108-111 ISSN: 2278-2400 111 4. IF (Diastolic Blood Tension >= 72) AND (Diastolic Blood Tension <= 92) AND (BMI<=25) AND (Age <= 44) AND (Polydypsia = 1) AND (Weight Loss = 0) AND (Skin change = 1) THEN Not Diabetic 5. IF (Diastolic Blood Tension = 69) AND (BMI<=25) AND (Age <= 56) AND (Polydypsia = 1) AND (Weight Loss = 0) AND (Numbness/Tingling =1) THEN Not Diabetic 6. IF (Diastolic Blood Tension >=92) AND (BMI=30) AND (Polydypsia = 1)THEN Diabetic 7. IF (Age > 45) AND (Polydypsia = 1) AND (Visionary Blurring = 1) THEN Diabetic 8. IF (Age >=40) AND (Polydypsia = 1) AND (Polyphagia = 1) AND (Weight Loss = 1) THEN Diabetic 9. IF (Age >=53) AND (Polyphagia = 1) AND (Weight Loss = 1) AND (Numbness/Tingling =1) THEN Diabetic 10. IF (Age >=58) AND (Polyuria= 1) AND (Weight Loss = 1) AND (Numbness/Tingling =1) AND (Fatigue= 1) THEN Diabetic 11. IF (OGFT > 132) AND (BMI>27) AND (Polydypsia = 1) AND (Weight Loss =1) AND (Fatigue = 1) THEN Diabetic REFERENCES [1] Sellappan Palaniappan and Rafiah Awang, “Intelligent disease prediction system using data mining techniques” International Journal of Computer Science and Network Security, vol.8 (8), pp.343-350, 2008. [2] S. Sankaranarayanan and T. Pramananda Perumal, “Diabetic diagnosis through data mining” Indian Journal of Innovations and Developments, Vol 3(2), pp. 30-34, September 2014. [3] Ravi Sanakal and T. Jayakumari, “Prognosis of Diabetes Using Data mining Approach-Fuzzy C Means Clustering and Support Vector Machine"International Journal of Computer Trends and Technology, Vol 11(2), pp.94-98, May 2014. [4] Dr Sanjiv Bhambani in “Ten natural ways to control Diabetes” by Bipasha at url www.The FitIndian.com. [5] S. Sankaranarayanan and T. Pramananda Perumal, “A predictive approach for diabetes mellitus disease through Data mining technologies”, CPS-IEEE/ iExplore, 978-1-4799-2876 /13$31.00©2013, IEEE, DOI10.1109/WCCCT.2014.65, pp.231-233, 2014. [6] Milan Zorman e.t al. “Mining Diabetes Database with Decision Trees and Association Rules”, 2002. [7] Carlos Ordonnez, “Comparing Association Rules and Decision Trees for Disease Prediction”, HIKM’06, November 11, 2006.