SlideShare une entreprise Scribd logo
1  sur  4
Integrated Intelligent Research (IIR) International Journal of Business Intelligents
Volume: 05 Issue: 01 June 2016 Page No.12-15
ISSN: 2278-2400
12
A Study on Prediction Techniques of Data Mining in
Educational Domain
T. Durga Nadarajan1
, Pramananda Perumal2
, B.Jagadhesan3
1
Research Scholar, Dept of Computer Science ,Presidency College,Chennai-600005, India
2
Principal, Presidency College,Chennai-600005, India
3
Associate Professor,D.B.Jain College,Chennai-600097, India
durga.nadan@gmail.com,pramanandaperumal@yahoo.com,jagadhesanb@gmail.com
Abstract—Data mining techniques are efficient for automated
discovery of understandable patterns in large databases.
Number of techniques available are Association rules mining
or market basket analysis, supervised classification, Cluster
analysis, etc. Each of these techniques have its own
significance. As data mining plays major role in almost all
the areas, many researches are being done recently. This
paper focuses on different applications of data mining in the
field of Education. The methodologies used in students
performance prediction, grouping up of students, visualizing
the data while analyzing the predictions and technical
manipulations in online tests have been discussed. The
mandatory areas to be focused before commencing the
research are also discussed. These methodologies are useful
in building exact data model through which predictions can
be made easily. The future scope of this paper is to predict
the students choices before they enter into graduation level.
Hopefully considering some essential factors and choosing
proper methodology for prediction by the researchers better
performance is assured .
Keywords- Clustering, Association Rule, Predication,
Decision trees, KDD,EDM,Regression,CART,K-nearest
Neighbor
I. INTRODUCTION
The necessity for data mining development was evolved due
to the rapid growth of the volume of stored data. One can
extract valuable knowledge from a very large database using
these techniques. Ordinary methods could no longer produce
results showing hidden patterns in vast amount of data. Using
advanced methods derived from pattern recognition
descriptive model can be constructed using data mining on
input data. Prediction and description of data can be served
with different data models. KDD-Knowledge Discovery and
Data mining is a multidisciplinary area to extract knowledge
from the data using different models and tools.Data mining
techniques depend upon a lot of careful analysis and a good
understanding. It is perhaps best to identify the best technique
to solve the problem. Applications of data mining include
aviation, astronomy, banking and finance, weather, crime
prevention, healthcare, E-commerce, Education and many
more [1]. The large volume of data in various fiels are stored
in the form of documents, files, images, videos, files, sound
and many other formats. Proper decision making requires
accurate method of knowledge extraction from the large
repository of data [2]. Much of the innovation in the field is
coming from the academic community . Data mining in
education is an emerging field in
which methods are developed from a educational
environments [3].Data mining techniques suggested for the
educational data mining are regression, Nearest Neighbor,
Clustering, Classification he knowledge discovered from these
tools can be used for performance prediction, Attrition Risk
Detection, Data Visualization, Intelligent Feedback, Course
Recommendations, Students Skill Estimation, Behavior
Detection, Manipulations in On-line Examinations, and so on
as per user’s demand[4].
II. RELATED WORKS
Katrina Sin and Loganathan Muthu have discussed about
various techniques adapted for educational data mining. Also,
various applications in learning is also discussed elaborately
with few tools implemented.Monika Goyal and Rajan Vohra
have presented mining techniques like classification and
clustering for various applications. Also, they have discussed
to analyse student performance analysis, course curriculum
design, grouping of students etc(9).Student performance
prediction is done by Dina Abdulaziz Alhammadi. He has
used K-means,Naive Bayes algorithm and compared the
algorithms for highest accuracy. He has concluded C4.5 to be
the best algorithm(7).Nikita Jain, Vishal Srivastava have
presented data mining concepts with ANN(5), most suitable
way of solving problems . It has a high degree of tolerance,
robustness, parallel processing, adaptiveness and so on. They
concluded that these techniques are used in scientific,
commercial, educational platforms.Student’s performance
analysis is done by Mythili and Mohammad using various
methodologies .After comparing this technique with other
methodologies they concluded that random forest
performance is better and best suitable for predicting students
performance(11).Data mining software is explained by Han,
Kamber. They have also discussed methods in different
dimension and the relationship has been identified during
these processes.Brijesh Kumar Baradwaj, Saurabh Pal[4] have
described about datamining basic concepts and various
methods of predicting the results with suitable input of data
III. METHODOLOGIES
Knowledge discovery in database(KDD) is extracting
information from a large source of data. Many techniques are
available to operate on big volumes of data mainly to
discover hidden patterns and possible relationships which are
helpful in decision making. The steps in extraction of
knowledge from the data are shown in Figure 1. Some basic
requirements for a data mining research to be successful are
 Mandatory is data
 Adequate & well cleaned data
Integrated Intelligent Research (IIR) International Journal of Business Intelligents
Volume: 05 Issue: 01 June 2016 Page No.12-15
ISSN: 2278-2400
13
 Problems are well defined.
 Problem should be in such a way that it cannot be
solved by simple query
 Actionable results to be found
Number of techniques available are Association rules mining or
market basket analysis, supervised classification, Cluster analysis,
etc. Each of these techniques have its own significance. Ordinary
methods could no longer produce results showing hidden
patterns in vast amount of data These methods and techniques
are to be explained briefly to have better understanding.
Classification
Mmost widely used data mining technique is classification.
Supervised learning requires training dataset where testing
requires additional data. This involves classification and
learning. Decision trees can be constructed to identify and
describe aread of high risk which are the evaluated with the
help of flow chart like tree structure. Another often used
classification technique is Naïve Bayes method. In this
method, only one scan of whole data is needed and also, if at
some stage there are additional training data then each training
sample may increase or decrease the probability that a
hypothesis is correct. The evaluation criteria for this technique
are speed, robustness, scalability, flexibility and time
complexity.
Clustering
Objects or things are made into groups so that groups can be
commented rather than individuals . Cluster analysis method
can deal with large as well as small set of data . This method
also scales well with datasets in which number of attributes is
large. This method does not require more than one scan of
disk resident data. With minimal input parameters it is able to
give the desirable output even though data analyst has limited
knowledge on the dataset. Classification can also be combined
for fruitful results, but very expensive. This approach can deal
with any cluster shape and the results are independent of the
data input order.
Predication
Predication can adapt Regression technique. Relationship
between many independent and dependent variables with
suitable data can be modeled with the help of Regression. In
data mining independent variables are attributes which are
already known and response variables are the ones to be
predicted. Many real-world problems are combination of both
prediction and others. Combined techniques like neural
networks, regression, decision trees are necessary to predict
the future . Classification and prediction can use the same
models frequently. CART decision tree algorithm can be used
for building both the classification trees which is to classify
categorical response variables and regression trees often used
to forecast continuous response variables. Neural network is
another technique for creating both classification and
regression models.
Association
Association rules mining or market basket analysis involves
set of association rules to correlate between large number of
databases. This includes applications in marketing, customer
segmentation, e-commerce, shopping behavior analysis and
many more. The process of mining can be made more
efficient using Apriori algorithm. One major disadvantage is
the large number of rules. Many software packages are
available for association mining. It should be noted that
lowering the support threshold results in many more frequent
item sets and often increases the number of candidate item
sets and maximum length of frequent item sets resulting in
cost increases.
Neural Networks
Neural network is associated with weights with each of its
connection in which the dataset is input or output units. This
methodology is very useful to infer an accurate pattern and the
trend can be detected in complicated dataset where it cannot
be identified by humans or computers. The input weights are
adjusted so that correct class is found. This is well suited for
continuous predictions or forecasting in the research.
Decision Trees
Decision tree method is a popular and relatively simple super
supervised classification method that involves each node of
the tree specifying a test of some attribute and each branch
from the node corresponding to one of the values of the
attribute. Many methods are adapted for decision tree.
Specifically Classification (CART) and Chi Square Automatic
Interaction Detection (CHAID) are popular.
Nearest Neighbor
Nearest neighbor algorithm classifies each record by
combining classes of K records where k>=1. The dataset
should be similar to that of a historical dataset. This technique
is also named as K-nearest neighbor technique.[4]
IV. APPLICATIONS IN EDUCATION
The areas of data mining applications are automotive,
manufacturing, robotics, aviation, students performance
analysis, prediction, telecommunication, banking sectors,
finance, scientific research , commercial, government sectors
and many more.[3]
For extraction of useful patterns, it is very mandatory to
employ data mining process. It comprises of five elements
which are as follows:
 Extraction, conversion and loading transaction
 Storage and management of data in database system
 Data access granted to business analyst and
professionals.
 Analyzing the data using software
 Last but not least, data to be presented in an easy and
useful format may be in graphs or charts[5]
Integrated Intelligent Research (IIR) International Journal of Business Intelligents
Volume: 05 Issue: 01 June 2016 Page No.12-15
ISSN: 2278-2400
14
EDM
Educational Data Mining (EDM)[10] develops various
methods or techniques for educational institutions. Its main
aim is to extract various data from these institutions to
understand students. To analyze students performance, data
enrolment, performance prediction, Students learning practice
to improve their skill set, course curriculum changes in many
institutions are some main areas in which data mining is
applied. Current educational practice is a major problem
which affects the students performance. This can be taken as
one of the most emerging area of research.[9][10][11]
Data Visualization
Reports on educational data become more and more complex
as education grow bigger. Data can be visualized using
different techniques to easily identify the trends and relations
in the data just by looking on the visual reports. The results of
the analysis are always presented in statistical formats. This
helps us to analyze informations like students enter and exit,
materials used by students for study purpose, the trends ,
studying patterns, downloads done by students, pages browsed
by them, total time taken for study and understand, sequence
of activities during study and so on. Focussed studies in
educational data system are daily, seasonal and hourly
behavioral patterns. It emphasizes decision making.
Visualization plays an important role for administrators to
analyze information usage. It helps educators and students to
visualize learning traces.
Student Performance Prediction
In students performance prediction, the design, development
or implementation of new models, frameworks and algorithms
for predicting or measuring students performance are carried
out through various methodologies. Mainly classification
technique plays an important role which combines datasets
individually and analysis is being done on quantitative
manner. Most common areas focused are marks analysis,
academic analytics, quantitative evaluations and many more.
Based on logged data Grade analysis, performance analysis
are done using various techniques. Regression technique is
one of the models deployed to predict many areas in
educational sectors. Decision trees, Naïve Bayes algorithms
have also proved in this area of data mining. These techniques
are widely helpful in predicting exam results thereby
improving the students performance. These techniques are
applied for distance education and e-learning also for
predicting the students performance.[6][7][10]
Clustering Students
Depending upon personal characteristics, customized features
clusters are created in this category. This is done to develop a
personalized learning system by the instructor which promotes
well organized learning methods. Generally clustering
algorithm looks for a group of similar characters from a
sequenced database. Hierarchical clustering algorithm is used
in e-learning systems for effective study. This algorithm
group up students based on their features, profile, learning
style etc. k-means, agglomerative clustering ,model based
clustering, hierarchical clustering are few often used
clustering algorithms.[9]
Manipulations in Online Examination
Nowadays examinations are conducted online and possibility
of malpractice and fraud are more. These are not only done
by the students but mainly by the hackers. Many data mining
techniques are there to build models to detect and prevent the
malpractices. Scandals in businesses and journalism are very
common for the hackers. Models are built on the basis of
stress , common cheating practices attempted by students to
get a better grade in the exams.
V. OPEN ISSUES AND FUTURE SCOPE
Everything is data and there are myriad of opportunities to
deal with them. Major problem faced are some techniques
have high processing time with less accuracy and vice versa.
Educational models have to be built with greater potentiality.
Researchers should concentrate in selecting the best
classification and clustering algorithm to build models in this
area. . With the help of the above survey not only the students
performance but also the students preferences for higher
education based on some essential factors can also be
focussed for future research purpose.
VI. CONCLUSIONS
Various educational applications like students performance
prediction, manipulations in online exams, clustering students
and many more have been discussed by many authors. Basic
requirements before commencement of a research and various
methodologies like clustering, classification, prediction,
association, neural network have been analysed. Hopefully
this survey will prove to be helpful for the researchers in this
area of educational data mining to choose their best technique
for students performance prediction to build their model . This
can also be extended in predicting the student’s preferences
for higher education.First of all I would like to extend my
gratitude to Almighty who always support me in my
endeavours. I extend my heartfelt thanks to my guide ,
colleagues, my family and friends for their encouragement
and thankful to all those who helped me at every stage of my
work.
References
[1] Mrs.Bharati M.Ramageri, “Data mining techniques and
Applications”, Indian Journal of Computer Science and
Engineering Vol.1 No.4,2008
[2] G.K.Gupta, ”Ïntroduction to Data mining with Case
studies”,pp.05-19, 2006
[3] Heikki Mannila, “Data Mining: machine learning, statistics and
databases”, IEEE, 1996
[4] Katrina Sin Loganathan Muthu, Application of Big Data in
Education and learning Analytics-A Literature Review,
ICTACT Vol5 Issue:4,Jul-2015
[5] J Han and M Kamber, “Data Mining: Concepts and Techniques”
2000
[6] Brijesh Kumar Baradwaj, Saurabh Pal “Mining Educational
Data to Analyze Students”,IJACSA,Vol.2, No.6,2011
[7] Nikita Jain, vishal Srivatsava, ”Data mining Techniques:A
Survey Paper”, (IJRET)International Journal of Research in
Engineering and Technology,Vol.2 Issue:11,Nov- 2013
[8] Naeimah Delavari, Somnuk Phon Amnuaisuk, ”Data mining
Application in higher Learning Institutions”, Informatics in
Education,Vol.7,No.1,2008
[9] Dina Abdulaziz Alhammadi, ”Data Mining in Education An
Experimental Study”, International Journal of Computer
Applications Vol.62,N0.15,Jan-2013
Integrated Intelligent Research (IIR) International Journal of Business Intelligents
Volume: 05 Issue: 01 June 2016 Page No.12-15
ISSN: 2278-2400
15
[10] Dr.Mohd Maqsood Ali,”Role of Data mining in Education
sector”, International Journal of Computer Science and Mobile
Computing Vol.2,Issue.4,Apr2013
[11] Monika Goyal and Rajan Vohra, “Applications of Data mining
in Higher Education”, IJCSI International Journal of Computer
Science Issues,Vol.2,Issue:2,No.1,Mar-2012
[12] Dr.Mohd Maqsood Ali,”Role of Data mining in Education
Sector”,International Journal of Computer Science and Mobile
Computing Vol. 2,Issue.4,Apr2013
[13] Richard A.Huebner, Ä survey of educational data mining
research”, Research in Higher Education Journal,2012
[14] Mrs.M.S.Mythili Dr.A.R.Mohamed Shanavas, “Än analysis of
students performance using classification algorithms”, ISOR
[15] Kumar,V An Empirical Study of the Applications of Data
mining techniques in higher education”(IJACSA) International
Journal of Advanced Computer Science and Applications,80-84
[16] Z.J Kovacic ”Early prediction of student success: Mining
student enrolment data”, Proceedings of Information Science
and IT Education Conference 2010

Contenu connexe

Similaire à A Study on Prediction Techniques of Data Mining in Educational Domain

Assessment of Decision Tree Algorithms on Student’s Recital
Assessment of Decision Tree Algorithms on Student’s RecitalAssessment of Decision Tree Algorithms on Student’s Recital
Assessment of Decision Tree Algorithms on Student’s RecitalIRJET Journal
 
IRJET- Medical Data Mining
IRJET- Medical Data MiningIRJET- Medical Data Mining
IRJET- Medical Data MiningIRJET Journal
 
Selecting the correct Data Mining Method: Classification & InDaMiTe-R
Selecting the correct Data Mining Method: Classification & InDaMiTe-RSelecting the correct Data Mining Method: Classification & InDaMiTe-R
Selecting the correct Data Mining Method: Classification & InDaMiTe-RIOSR Journals
 
Applying Classification Technique using DID3 Algorithm to improve Decision Su...
Applying Classification Technique using DID3 Algorithm to improve Decision Su...Applying Classification Technique using DID3 Algorithm to improve Decision Su...
Applying Classification Technique using DID3 Algorithm to improve Decision Su...IJMER
 
A Survey of Modern Data Classification Techniques
A Survey of Modern Data Classification TechniquesA Survey of Modern Data Classification Techniques
A Survey of Modern Data Classification Techniquesijsrd.com
 
IRJET - Employee Performance Prediction System using Data Mining
IRJET - Employee Performance Prediction System using Data MiningIRJET - Employee Performance Prediction System using Data Mining
IRJET - Employee Performance Prediction System using Data MiningIRJET Journal
 
Privacy preservation techniques in data mining
Privacy preservation techniques in data miningPrivacy preservation techniques in data mining
Privacy preservation techniques in data miningeSAT Journals
 
Privacy preservation techniques in data mining
Privacy preservation techniques in data miningPrivacy preservation techniques in data mining
Privacy preservation techniques in data miningeSAT Publishing House
 
Data Mining System and Applications: A Review
Data Mining System and Applications: A ReviewData Mining System and Applications: A Review
Data Mining System and Applications: A Reviewijdpsjournal
 
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEYCLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEYEditor IJMTER
 
Exploratory data analysis for business MODULE 1.pptx
Exploratory data analysis for business MODULE 1.pptxExploratory data analysis for business MODULE 1.pptx
Exploratory data analysis for business MODULE 1.pptxYashwanthKumar306128
 
A Systematic Review On Educational Data Mining
A Systematic Review On Educational Data MiningA Systematic Review On Educational Data Mining
A Systematic Review On Educational Data MiningKatie Robinson
 
An Empirical Study of the Applications of Classification Techniques in Studen...
An Empirical Study of the Applications of Classification Techniques in Studen...An Empirical Study of the Applications of Classification Techniques in Studen...
An Empirical Study of the Applications of Classification Techniques in Studen...IJERA Editor
 
Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...nalini manogaran
 
Review of Algorithms for Crime Analysis & Prediction
Review of Algorithms for Crime Analysis & PredictionReview of Algorithms for Crime Analysis & Prediction
Review of Algorithms for Crime Analysis & PredictionIRJET Journal
 
Comprehensive Survey of Data Classification & Prediction Techniques
Comprehensive Survey of Data Classification & Prediction TechniquesComprehensive Survey of Data Classification & Prediction Techniques
Comprehensive Survey of Data Classification & Prediction Techniquesijsrd.com
 

Similaire à A Study on Prediction Techniques of Data Mining in Educational Domain (20)

Assessment of Decision Tree Algorithms on Student’s Recital
Assessment of Decision Tree Algorithms on Student’s RecitalAssessment of Decision Tree Algorithms on Student’s Recital
Assessment of Decision Tree Algorithms on Student’s Recital
 
L016136369
L016136369L016136369
L016136369
 
IRJET- Medical Data Mining
IRJET- Medical Data MiningIRJET- Medical Data Mining
IRJET- Medical Data Mining
 
Selecting the correct Data Mining Method: Classification & InDaMiTe-R
Selecting the correct Data Mining Method: Classification & InDaMiTe-RSelecting the correct Data Mining Method: Classification & InDaMiTe-R
Selecting the correct Data Mining Method: Classification & InDaMiTe-R
 
IJET-V2I6P22
IJET-V2I6P22IJET-V2I6P22
IJET-V2I6P22
 
Applying Classification Technique using DID3 Algorithm to improve Decision Su...
Applying Classification Technique using DID3 Algorithm to improve Decision Su...Applying Classification Technique using DID3 Algorithm to improve Decision Su...
Applying Classification Technique using DID3 Algorithm to improve Decision Su...
 
A Survey of Modern Data Classification Techniques
A Survey of Modern Data Classification TechniquesA Survey of Modern Data Classification Techniques
A Survey of Modern Data Classification Techniques
 
IRJET - Employee Performance Prediction System using Data Mining
IRJET - Employee Performance Prediction System using Data MiningIRJET - Employee Performance Prediction System using Data Mining
IRJET - Employee Performance Prediction System using Data Mining
 
Privacy preservation techniques in data mining
Privacy preservation techniques in data miningPrivacy preservation techniques in data mining
Privacy preservation techniques in data mining
 
Privacy preservation techniques in data mining
Privacy preservation techniques in data miningPrivacy preservation techniques in data mining
Privacy preservation techniques in data mining
 
Data Mining System and Applications: A Review
Data Mining System and Applications: A ReviewData Mining System and Applications: A Review
Data Mining System and Applications: A Review
 
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEYCLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
CLASSIFICATION ALGORITHM USING RANDOM CONCEPT ON A VERY LARGE DATA SET: A SURVEY
 
Exploratory data analysis for business MODULE 1.pptx
Exploratory data analysis for business MODULE 1.pptxExploratory data analysis for business MODULE 1.pptx
Exploratory data analysis for business MODULE 1.pptx
 
A Systematic Review On Educational Data Mining
A Systematic Review On Educational Data MiningA Systematic Review On Educational Data Mining
A Systematic Review On Educational Data Mining
 
An Empirical Study of the Applications of Classification Techniques in Studen...
An Empirical Study of the Applications of Classification Techniques in Studen...An Empirical Study of the Applications of Classification Techniques in Studen...
An Empirical Study of the Applications of Classification Techniques in Studen...
 
Data Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope SurveyData Mining Applications And Feature Scope Survey
Data Mining Applications And Feature Scope Survey
 
50120130406032
5012013040603250120130406032
50120130406032
 
Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...Anomaly detection via eliminating data redundancy and rectifying data error i...
Anomaly detection via eliminating data redundancy and rectifying data error i...
 
Review of Algorithms for Crime Analysis & Prediction
Review of Algorithms for Crime Analysis & PredictionReview of Algorithms for Crime Analysis & Prediction
Review of Algorithms for Crime Analysis & Prediction
 
Comprehensive Survey of Data Classification & Prediction Techniques
Comprehensive Survey of Data Classification & Prediction TechniquesComprehensive Survey of Data Classification & Prediction Techniques
Comprehensive Survey of Data Classification & Prediction Techniques
 

Plus de ijcnes

A Survey of Ontology-based Information Extraction for Social Media Content An...
A Survey of Ontology-based Information Extraction for Social Media Content An...A Survey of Ontology-based Information Extraction for Social Media Content An...
A Survey of Ontology-based Information Extraction for Social Media Content An...ijcnes
 
Economic Growth of Information Technology (It) Industry on the Indian Economy
Economic Growth of Information Technology (It) Industry on the Indian EconomyEconomic Growth of Information Technology (It) Industry on the Indian Economy
Economic Growth of Information Technology (It) Industry on the Indian Economyijcnes
 
An analysis of Mobile Learning Implementation in Shinas College of Technology...
An analysis of Mobile Learning Implementation in Shinas College of Technology...An analysis of Mobile Learning Implementation in Shinas College of Technology...
An analysis of Mobile Learning Implementation in Shinas College of Technology...ijcnes
 
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...ijcnes
 
Challenges of E-government in Oman
Challenges of E-government in OmanChallenges of E-government in Oman
Challenges of E-government in Omanijcnes
 
Power Management in Micro grid Using Hybrid Energy Storage System
Power Management in Micro grid Using Hybrid Energy Storage SystemPower Management in Micro grid Using Hybrid Energy Storage System
Power Management in Micro grid Using Hybrid Energy Storage Systemijcnes
 
Holistic Forecasting of Onset of Diabetes through Data Mining Techniques
Holistic Forecasting of Onset of Diabetes through Data Mining TechniquesHolistic Forecasting of Onset of Diabetes through Data Mining Techniques
Holistic Forecasting of Onset of Diabetes through Data Mining Techniquesijcnes
 
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...ijcnes
 
Feature Extraction in Content based Image Retrieval
Feature Extraction in Content based Image RetrievalFeature Extraction in Content based Image Retrieval
Feature Extraction in Content based Image Retrievalijcnes
 
Challenges and Mechanisms for Securing Data in Mobile Cloud Computing
Challenges and Mechanisms for Securing Data in Mobile Cloud ComputingChallenges and Mechanisms for Securing Data in Mobile Cloud Computing
Challenges and Mechanisms for Securing Data in Mobile Cloud Computingijcnes
 
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...ijcnes
 
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...ijcnes
 
An Effective and Scalable AODV for Wireless Ad hoc Sensor Networks
An Effective and Scalable AODV for Wireless Ad hoc Sensor NetworksAn Effective and Scalable AODV for Wireless Ad hoc Sensor Networks
An Effective and Scalable AODV for Wireless Ad hoc Sensor Networksijcnes
 
Secured Seamless Wi-Fi Enhancement in Dynamic Vehicles
Secured Seamless Wi-Fi Enhancement in Dynamic VehiclesSecured Seamless Wi-Fi Enhancement in Dynamic Vehicles
Secured Seamless Wi-Fi Enhancement in Dynamic Vehiclesijcnes
 
Virtual Position based Olsr Protocol for Wireless Sensor Networks
Virtual Position based Olsr Protocol for Wireless Sensor NetworksVirtual Position based Olsr Protocol for Wireless Sensor Networks
Virtual Position based Olsr Protocol for Wireless Sensor Networksijcnes
 
Mitigation and control of Defeating Jammers using P-1 Factorization
Mitigation and control of Defeating Jammers using P-1 FactorizationMitigation and control of Defeating Jammers using P-1 Factorization
Mitigation and control of Defeating Jammers using P-1 Factorizationijcnes
 
An analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining TechniquesAn analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining Techniquesijcnes
 
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...ijcnes
 
Priority Based Multi Sen Car Technique in WSN
Priority Based Multi Sen Car Technique in WSNPriority Based Multi Sen Car Technique in WSN
Priority Based Multi Sen Car Technique in WSNijcnes
 
Semantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based SystemSemantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based Systemijcnes
 

Plus de ijcnes (20)

A Survey of Ontology-based Information Extraction for Social Media Content An...
A Survey of Ontology-based Information Extraction for Social Media Content An...A Survey of Ontology-based Information Extraction for Social Media Content An...
A Survey of Ontology-based Information Extraction for Social Media Content An...
 
Economic Growth of Information Technology (It) Industry on the Indian Economy
Economic Growth of Information Technology (It) Industry on the Indian EconomyEconomic Growth of Information Technology (It) Industry on the Indian Economy
Economic Growth of Information Technology (It) Industry on the Indian Economy
 
An analysis of Mobile Learning Implementation in Shinas College of Technology...
An analysis of Mobile Learning Implementation in Shinas College of Technology...An analysis of Mobile Learning Implementation in Shinas College of Technology...
An analysis of Mobile Learning Implementation in Shinas College of Technology...
 
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...
A Survey on the Security Issues of Software Defined Networking Tool in Cloud ...
 
Challenges of E-government in Oman
Challenges of E-government in OmanChallenges of E-government in Oman
Challenges of E-government in Oman
 
Power Management in Micro grid Using Hybrid Energy Storage System
Power Management in Micro grid Using Hybrid Energy Storage SystemPower Management in Micro grid Using Hybrid Energy Storage System
Power Management in Micro grid Using Hybrid Energy Storage System
 
Holistic Forecasting of Onset of Diabetes through Data Mining Techniques
Holistic Forecasting of Onset of Diabetes through Data Mining TechniquesHolistic Forecasting of Onset of Diabetes through Data Mining Techniques
Holistic Forecasting of Onset of Diabetes through Data Mining Techniques
 
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...
A Survey on Disease Prediction from Retinal Colour Fundus Images using Image ...
 
Feature Extraction in Content based Image Retrieval
Feature Extraction in Content based Image RetrievalFeature Extraction in Content based Image Retrieval
Feature Extraction in Content based Image Retrieval
 
Challenges and Mechanisms for Securing Data in Mobile Cloud Computing
Challenges and Mechanisms for Securing Data in Mobile Cloud ComputingChallenges and Mechanisms for Securing Data in Mobile Cloud Computing
Challenges and Mechanisms for Securing Data in Mobile Cloud Computing
 
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...
Detection of Node Activity and Selfish & Malicious Behavioral Patterns using ...
 
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...
Optimal Channel and Relay Assignment in Ofdmbased Multi-Relay Multi-Pair Two-...
 
An Effective and Scalable AODV for Wireless Ad hoc Sensor Networks
An Effective and Scalable AODV for Wireless Ad hoc Sensor NetworksAn Effective and Scalable AODV for Wireless Ad hoc Sensor Networks
An Effective and Scalable AODV for Wireless Ad hoc Sensor Networks
 
Secured Seamless Wi-Fi Enhancement in Dynamic Vehicles
Secured Seamless Wi-Fi Enhancement in Dynamic VehiclesSecured Seamless Wi-Fi Enhancement in Dynamic Vehicles
Secured Seamless Wi-Fi Enhancement in Dynamic Vehicles
 
Virtual Position based Olsr Protocol for Wireless Sensor Networks
Virtual Position based Olsr Protocol for Wireless Sensor NetworksVirtual Position based Olsr Protocol for Wireless Sensor Networks
Virtual Position based Olsr Protocol for Wireless Sensor Networks
 
Mitigation and control of Defeating Jammers using P-1 Factorization
Mitigation and control of Defeating Jammers using P-1 FactorizationMitigation and control of Defeating Jammers using P-1 Factorization
Mitigation and control of Defeating Jammers using P-1 Factorization
 
An analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining TechniquesAn analysis and impact factors on Agriculture field using Data Mining Techniques
An analysis and impact factors on Agriculture field using Data Mining Techniques
 
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...
A Study on Code Smell Detection with Refactoring Tools in Object Oriented Lan...
 
Priority Based Multi Sen Car Technique in WSN
Priority Based Multi Sen Car Technique in WSNPriority Based Multi Sen Car Technique in WSN
Priority Based Multi Sen Car Technique in WSN
 
Semantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based SystemSemantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based System
 

Dernier

Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network DevicesChandrakantDivate1
 
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...drmkjayanthikannan
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdfKamal Acharya
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxSCMS School of Architecture
 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...Amil baba
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptNANDHAKUMARA10
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdfKamal Acharya
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxSCMS School of Architecture
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptxJIT KUMAR GUPTA
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . pptDineshKumar4165
 
Engineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planesEngineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planesRAJNEESHKUMAR341697
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Call Girls Mumbai
 
Wadi Rum luxhotel lodge Analysis case study.pptx
Wadi Rum luxhotel lodge Analysis case study.pptxWadi Rum luxhotel lodge Analysis case study.pptx
Wadi Rum luxhotel lodge Analysis case study.pptxNadaHaitham1
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXssuser89054b
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaOmar Fathy
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startQuintin Balsdon
 

Dernier (20)

Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Computer Networks Basics of Network Devices
Computer Networks  Basics of Network DevicesComputer Networks  Basics of Network Devices
Computer Networks Basics of Network Devices
 
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
Unit 4_Part 1 CSE2001 Exception Handling and Function Template and Class Temp...
 
School management system project Report.pdf
School management system project Report.pdfSchool management system project Report.pdf
School management system project Report.pdf
 
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
Call Girls in South Ex (delhi) call me [🔝9953056974🔝] escort service 24X7
 
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptxS1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
S1S2 B.Arch MGU - HOA1&2 Module 3 -Temple Architecture of Kerala.pptx
 
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
NO1 Top No1 Amil Baba In Azad Kashmir, Kashmir Black Magic Specialist Expert ...
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptxHOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
HOA1&2 - Module 3 - PREHISTORCI ARCHITECTURE OF KERALA.pptx
 
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
COST-EFFETIVE  and Energy Efficient BUILDINGS ptxCOST-EFFETIVE  and Energy Efficient BUILDINGS ptx
COST-EFFETIVE and Energy Efficient BUILDINGS ptx
 
Thermal Engineering Unit - I & II . ppt
Thermal Engineering  Unit - I & II . pptThermal Engineering  Unit - I & II . ppt
Thermal Engineering Unit - I & II . ppt
 
Engineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planesEngineering Drawing focus on projection of planes
Engineering Drawing focus on projection of planes
 
Integrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - NeometrixIntegrated Test Rig For HTFE-25 - Neometrix
Integrated Test Rig For HTFE-25 - Neometrix
 
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
Bhubaneswar🌹Call Girls Bhubaneswar ❤Komal 9777949614 💟 Full Trusted CALL GIRL...
 
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced LoadsFEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
FEA Based Level 3 Assessment of Deformed Tanks with Fluid Induced Loads
 
Wadi Rum luxhotel lodge Analysis case study.pptx
Wadi Rum luxhotel lodge Analysis case study.pptxWadi Rum luxhotel lodge Analysis case study.pptx
Wadi Rum luxhotel lodge Analysis case study.pptx
 
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX
 
Introduction to Serverless with AWS Lambda
Introduction to Serverless with AWS LambdaIntroduction to Serverless with AWS Lambda
Introduction to Serverless with AWS Lambda
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 

A Study on Prediction Techniques of Data Mining in Educational Domain

  • 1. Integrated Intelligent Research (IIR) International Journal of Business Intelligents Volume: 05 Issue: 01 June 2016 Page No.12-15 ISSN: 2278-2400 12 A Study on Prediction Techniques of Data Mining in Educational Domain T. Durga Nadarajan1 , Pramananda Perumal2 , B.Jagadhesan3 1 Research Scholar, Dept of Computer Science ,Presidency College,Chennai-600005, India 2 Principal, Presidency College,Chennai-600005, India 3 Associate Professor,D.B.Jain College,Chennai-600097, India durga.nadan@gmail.com,pramanandaperumal@yahoo.com,jagadhesanb@gmail.com Abstract—Data mining techniques are efficient for automated discovery of understandable patterns in large databases. Number of techniques available are Association rules mining or market basket analysis, supervised classification, Cluster analysis, etc. Each of these techniques have its own significance. As data mining plays major role in almost all the areas, many researches are being done recently. This paper focuses on different applications of data mining in the field of Education. The methodologies used in students performance prediction, grouping up of students, visualizing the data while analyzing the predictions and technical manipulations in online tests have been discussed. The mandatory areas to be focused before commencing the research are also discussed. These methodologies are useful in building exact data model through which predictions can be made easily. The future scope of this paper is to predict the students choices before they enter into graduation level. Hopefully considering some essential factors and choosing proper methodology for prediction by the researchers better performance is assured . Keywords- Clustering, Association Rule, Predication, Decision trees, KDD,EDM,Regression,CART,K-nearest Neighbor I. INTRODUCTION The necessity for data mining development was evolved due to the rapid growth of the volume of stored data. One can extract valuable knowledge from a very large database using these techniques. Ordinary methods could no longer produce results showing hidden patterns in vast amount of data. Using advanced methods derived from pattern recognition descriptive model can be constructed using data mining on input data. Prediction and description of data can be served with different data models. KDD-Knowledge Discovery and Data mining is a multidisciplinary area to extract knowledge from the data using different models and tools.Data mining techniques depend upon a lot of careful analysis and a good understanding. It is perhaps best to identify the best technique to solve the problem. Applications of data mining include aviation, astronomy, banking and finance, weather, crime prevention, healthcare, E-commerce, Education and many more [1]. The large volume of data in various fiels are stored in the form of documents, files, images, videos, files, sound and many other formats. Proper decision making requires accurate method of knowledge extraction from the large repository of data [2]. Much of the innovation in the field is coming from the academic community . Data mining in education is an emerging field in which methods are developed from a educational environments [3].Data mining techniques suggested for the educational data mining are regression, Nearest Neighbor, Clustering, Classification he knowledge discovered from these tools can be used for performance prediction, Attrition Risk Detection, Data Visualization, Intelligent Feedback, Course Recommendations, Students Skill Estimation, Behavior Detection, Manipulations in On-line Examinations, and so on as per user’s demand[4]. II. RELATED WORKS Katrina Sin and Loganathan Muthu have discussed about various techniques adapted for educational data mining. Also, various applications in learning is also discussed elaborately with few tools implemented.Monika Goyal and Rajan Vohra have presented mining techniques like classification and clustering for various applications. Also, they have discussed to analyse student performance analysis, course curriculum design, grouping of students etc(9).Student performance prediction is done by Dina Abdulaziz Alhammadi. He has used K-means,Naive Bayes algorithm and compared the algorithms for highest accuracy. He has concluded C4.5 to be the best algorithm(7).Nikita Jain, Vishal Srivastava have presented data mining concepts with ANN(5), most suitable way of solving problems . It has a high degree of tolerance, robustness, parallel processing, adaptiveness and so on. They concluded that these techniques are used in scientific, commercial, educational platforms.Student’s performance analysis is done by Mythili and Mohammad using various methodologies .After comparing this technique with other methodologies they concluded that random forest performance is better and best suitable for predicting students performance(11).Data mining software is explained by Han, Kamber. They have also discussed methods in different dimension and the relationship has been identified during these processes.Brijesh Kumar Baradwaj, Saurabh Pal[4] have described about datamining basic concepts and various methods of predicting the results with suitable input of data III. METHODOLOGIES Knowledge discovery in database(KDD) is extracting information from a large source of data. Many techniques are available to operate on big volumes of data mainly to discover hidden patterns and possible relationships which are helpful in decision making. The steps in extraction of knowledge from the data are shown in Figure 1. Some basic requirements for a data mining research to be successful are  Mandatory is data  Adequate & well cleaned data
  • 2. Integrated Intelligent Research (IIR) International Journal of Business Intelligents Volume: 05 Issue: 01 June 2016 Page No.12-15 ISSN: 2278-2400 13  Problems are well defined.  Problem should be in such a way that it cannot be solved by simple query  Actionable results to be found Number of techniques available are Association rules mining or market basket analysis, supervised classification, Cluster analysis, etc. Each of these techniques have its own significance. Ordinary methods could no longer produce results showing hidden patterns in vast amount of data These methods and techniques are to be explained briefly to have better understanding. Classification Mmost widely used data mining technique is classification. Supervised learning requires training dataset where testing requires additional data. This involves classification and learning. Decision trees can be constructed to identify and describe aread of high risk which are the evaluated with the help of flow chart like tree structure. Another often used classification technique is Naïve Bayes method. In this method, only one scan of whole data is needed and also, if at some stage there are additional training data then each training sample may increase or decrease the probability that a hypothesis is correct. The evaluation criteria for this technique are speed, robustness, scalability, flexibility and time complexity. Clustering Objects or things are made into groups so that groups can be commented rather than individuals . Cluster analysis method can deal with large as well as small set of data . This method also scales well with datasets in which number of attributes is large. This method does not require more than one scan of disk resident data. With minimal input parameters it is able to give the desirable output even though data analyst has limited knowledge on the dataset. Classification can also be combined for fruitful results, but very expensive. This approach can deal with any cluster shape and the results are independent of the data input order. Predication Predication can adapt Regression technique. Relationship between many independent and dependent variables with suitable data can be modeled with the help of Regression. In data mining independent variables are attributes which are already known and response variables are the ones to be predicted. Many real-world problems are combination of both prediction and others. Combined techniques like neural networks, regression, decision trees are necessary to predict the future . Classification and prediction can use the same models frequently. CART decision tree algorithm can be used for building both the classification trees which is to classify categorical response variables and regression trees often used to forecast continuous response variables. Neural network is another technique for creating both classification and regression models. Association Association rules mining or market basket analysis involves set of association rules to correlate between large number of databases. This includes applications in marketing, customer segmentation, e-commerce, shopping behavior analysis and many more. The process of mining can be made more efficient using Apriori algorithm. One major disadvantage is the large number of rules. Many software packages are available for association mining. It should be noted that lowering the support threshold results in many more frequent item sets and often increases the number of candidate item sets and maximum length of frequent item sets resulting in cost increases. Neural Networks Neural network is associated with weights with each of its connection in which the dataset is input or output units. This methodology is very useful to infer an accurate pattern and the trend can be detected in complicated dataset where it cannot be identified by humans or computers. The input weights are adjusted so that correct class is found. This is well suited for continuous predictions or forecasting in the research. Decision Trees Decision tree method is a popular and relatively simple super supervised classification method that involves each node of the tree specifying a test of some attribute and each branch from the node corresponding to one of the values of the attribute. Many methods are adapted for decision tree. Specifically Classification (CART) and Chi Square Automatic Interaction Detection (CHAID) are popular. Nearest Neighbor Nearest neighbor algorithm classifies each record by combining classes of K records where k>=1. The dataset should be similar to that of a historical dataset. This technique is also named as K-nearest neighbor technique.[4] IV. APPLICATIONS IN EDUCATION The areas of data mining applications are automotive, manufacturing, robotics, aviation, students performance analysis, prediction, telecommunication, banking sectors, finance, scientific research , commercial, government sectors and many more.[3] For extraction of useful patterns, it is very mandatory to employ data mining process. It comprises of five elements which are as follows:  Extraction, conversion and loading transaction  Storage and management of data in database system  Data access granted to business analyst and professionals.  Analyzing the data using software  Last but not least, data to be presented in an easy and useful format may be in graphs or charts[5]
  • 3. Integrated Intelligent Research (IIR) International Journal of Business Intelligents Volume: 05 Issue: 01 June 2016 Page No.12-15 ISSN: 2278-2400 14 EDM Educational Data Mining (EDM)[10] develops various methods or techniques for educational institutions. Its main aim is to extract various data from these institutions to understand students. To analyze students performance, data enrolment, performance prediction, Students learning practice to improve their skill set, course curriculum changes in many institutions are some main areas in which data mining is applied. Current educational practice is a major problem which affects the students performance. This can be taken as one of the most emerging area of research.[9][10][11] Data Visualization Reports on educational data become more and more complex as education grow bigger. Data can be visualized using different techniques to easily identify the trends and relations in the data just by looking on the visual reports. The results of the analysis are always presented in statistical formats. This helps us to analyze informations like students enter and exit, materials used by students for study purpose, the trends , studying patterns, downloads done by students, pages browsed by them, total time taken for study and understand, sequence of activities during study and so on. Focussed studies in educational data system are daily, seasonal and hourly behavioral patterns. It emphasizes decision making. Visualization plays an important role for administrators to analyze information usage. It helps educators and students to visualize learning traces. Student Performance Prediction In students performance prediction, the design, development or implementation of new models, frameworks and algorithms for predicting or measuring students performance are carried out through various methodologies. Mainly classification technique plays an important role which combines datasets individually and analysis is being done on quantitative manner. Most common areas focused are marks analysis, academic analytics, quantitative evaluations and many more. Based on logged data Grade analysis, performance analysis are done using various techniques. Regression technique is one of the models deployed to predict many areas in educational sectors. Decision trees, Naïve Bayes algorithms have also proved in this area of data mining. These techniques are widely helpful in predicting exam results thereby improving the students performance. These techniques are applied for distance education and e-learning also for predicting the students performance.[6][7][10] Clustering Students Depending upon personal characteristics, customized features clusters are created in this category. This is done to develop a personalized learning system by the instructor which promotes well organized learning methods. Generally clustering algorithm looks for a group of similar characters from a sequenced database. Hierarchical clustering algorithm is used in e-learning systems for effective study. This algorithm group up students based on their features, profile, learning style etc. k-means, agglomerative clustering ,model based clustering, hierarchical clustering are few often used clustering algorithms.[9] Manipulations in Online Examination Nowadays examinations are conducted online and possibility of malpractice and fraud are more. These are not only done by the students but mainly by the hackers. Many data mining techniques are there to build models to detect and prevent the malpractices. Scandals in businesses and journalism are very common for the hackers. Models are built on the basis of stress , common cheating practices attempted by students to get a better grade in the exams. V. OPEN ISSUES AND FUTURE SCOPE Everything is data and there are myriad of opportunities to deal with them. Major problem faced are some techniques have high processing time with less accuracy and vice versa. Educational models have to be built with greater potentiality. Researchers should concentrate in selecting the best classification and clustering algorithm to build models in this area. . With the help of the above survey not only the students performance but also the students preferences for higher education based on some essential factors can also be focussed for future research purpose. VI. CONCLUSIONS Various educational applications like students performance prediction, manipulations in online exams, clustering students and many more have been discussed by many authors. Basic requirements before commencement of a research and various methodologies like clustering, classification, prediction, association, neural network have been analysed. Hopefully this survey will prove to be helpful for the researchers in this area of educational data mining to choose their best technique for students performance prediction to build their model . This can also be extended in predicting the student’s preferences for higher education.First of all I would like to extend my gratitude to Almighty who always support me in my endeavours. I extend my heartfelt thanks to my guide , colleagues, my family and friends for their encouragement and thankful to all those who helped me at every stage of my work. References [1] Mrs.Bharati M.Ramageri, “Data mining techniques and Applications”, Indian Journal of Computer Science and Engineering Vol.1 No.4,2008 [2] G.K.Gupta, ”Ïntroduction to Data mining with Case studies”,pp.05-19, 2006 [3] Heikki Mannila, “Data Mining: machine learning, statistics and databases”, IEEE, 1996 [4] Katrina Sin Loganathan Muthu, Application of Big Data in Education and learning Analytics-A Literature Review, ICTACT Vol5 Issue:4,Jul-2015 [5] J Han and M Kamber, “Data Mining: Concepts and Techniques” 2000 [6] Brijesh Kumar Baradwaj, Saurabh Pal “Mining Educational Data to Analyze Students”,IJACSA,Vol.2, No.6,2011 [7] Nikita Jain, vishal Srivatsava, ”Data mining Techniques:A Survey Paper”, (IJRET)International Journal of Research in Engineering and Technology,Vol.2 Issue:11,Nov- 2013 [8] Naeimah Delavari, Somnuk Phon Amnuaisuk, ”Data mining Application in higher Learning Institutions”, Informatics in Education,Vol.7,No.1,2008 [9] Dina Abdulaziz Alhammadi, ”Data Mining in Education An Experimental Study”, International Journal of Computer Applications Vol.62,N0.15,Jan-2013
  • 4. Integrated Intelligent Research (IIR) International Journal of Business Intelligents Volume: 05 Issue: 01 June 2016 Page No.12-15 ISSN: 2278-2400 15 [10] Dr.Mohd Maqsood Ali,”Role of Data mining in Education sector”, International Journal of Computer Science and Mobile Computing Vol.2,Issue.4,Apr2013 [11] Monika Goyal and Rajan Vohra, “Applications of Data mining in Higher Education”, IJCSI International Journal of Computer Science Issues,Vol.2,Issue:2,No.1,Mar-2012 [12] Dr.Mohd Maqsood Ali,”Role of Data mining in Education Sector”,International Journal of Computer Science and Mobile Computing Vol. 2,Issue.4,Apr2013 [13] Richard A.Huebner, Ä survey of educational data mining research”, Research in Higher Education Journal,2012 [14] Mrs.M.S.Mythili Dr.A.R.Mohamed Shanavas, “Än analysis of students performance using classification algorithms”, ISOR [15] Kumar,V An Empirical Study of the Applications of Data mining techniques in higher education”(IJACSA) International Journal of Advanced Computer Science and Applications,80-84 [16] Z.J Kovacic ”Early prediction of student success: Mining student enrolment data”, Proceedings of Information Science and IT Education Conference 2010