SlideShare a Scribd company logo
1 of 15
Download to read offline
The Design and Development of an
Expert System Prototype for Enhancing
Exam Quality


                                                   Presenter: Paul DeCarlo
                                                   Professor: Nouhad Rizk
                                                   Date: 6 / 11/ 2010



DeCarlo, P., & Rizk, N. (2010). The Design and Development of an Expert
System Prototype for Enhancing Exam Quality. International Conference on
Introduction
 Data mining or knowledge discovery in databases (KDD) is the
  automatic extraction of implicit and interesting patterns from
  large data collections (Klosgen & Zytkow, 2002).
 Rule discovery is one of the most popular data mining
  techniques, especially in EDM(Educational Data Mining),
  because it shows the teacher information that has been
  discovered and does so in an intuitive way (Romero & Ventura,
  2004).
 Conventional rule-based expert systems, use human expert
  knowledge to solve real-world problems that normally would
  require human intelligence. Expert knowledge is often
  represented in the form of rules or as data within the computer.
  Rule-based expert systems have played an important role in
  modern intelligent systems and their applications in strategic
  goal setting, planning, design, scheduling, fault monitoring,
  diagnosis and so on (Abraham, 2005).
Study Purpose
 Course evaluations are typically done once per semester at college
  universities. Furthermore, students who drop a course are usually
  not considered in these evaluations.

 Course evaluations typically do not consider evaluating the efficacy
  of course materials including assignments, textbooks, review
  materials, etc.

 Evaluations done at intervals, would be able to capture issues as
  they happen and include students who are intending on dropping.
  This could inform the educator of what is actually happening in their
  course instead of providing information after the fact.

 Most of the current data mining tools are too complex for educators
  to use, thus a system which can automate this process to create a
  human readable evaluations dynamically is of much importance if a
  system of this type is expected to be adopted by educators.
What is Exam Quality?
    Exam quality refers to how well an examination of learned material
    reflects the information provided in course learning materials. Think of
    the way validity is defined in research methods.
What is Association Rule
Learning?
   Association rules are a data mining technique used to discover relations
    between variables in large example sets.

    Support is defined as the probability that an example contains a subset X
    when randomly chosen from the total set of responses. Support of an
    association rule ‗A=>B‘ is defined as the ‗support of (A union B)‘.

   Confidence refers to the likelihood that for a transaction containing A, how
    likely is it that it also contains B. Confidence of an association rule ‗A=>B‘ is
    defined as the ‗probability that an example contains B given A divided by
    the probability that an example contains A‘. This is the same as the
    ‗support of (A union B) divided by the support of (A)‘.

  Algorithm developed by Rakesh Agrawal (1993).
1. Minimum support is applied to find all frequent itemsets in a database.
2. These frequent itemsets and the minimum confidence constraint are used
   to form rules.

    The rules this technique produces can be interpreted as easily as they can
    be read. For example, a typical rule may take the form {studies daily} =>
    {has a high GPA}. This would mean that the feature 'studies daily‘ implies a
    given example {has a high GPA}.
Data Collection Driven by W-CAT
model
 We must have a set of data from which to mine our rules. There has been
  research done applying data mining techniques to CMS logs (i.e. Moodle).
  Our system seeks to include subjective data and requires a survey interface
  to collect current data. Some of the information we ask in the survey could
  be gathered from a CMS (homework / exam review completion).
 To drive our data collection process and the overall workflow of our system,
  we used the Witty Cat model developed by Dr. Nouhad Rizk at the
  University of Houston to guide the creation of our survey.

                                The Witty Cat Model
Example Survey Questions &
    Results

•Data gathered using
the open-source
LimeSurvey online
survey software.
•The responses can
be considered valid,
as invitations to the
survey are distributed
using a secure token
system.
The RapidMiner Process Tree
            •We used the open-source data mining tool,
            RapidMiner for the generation of our rulesets.
            This tool allows for visualization and handling
            of remote databases.
            •In our initial study (DeCarlo & Rizk 2010), the
            survey data was cleansed by converting the
            numerical grades to nominals A-F. These
            were then converted to binomial data.
            •Questions which used a 5 point ranking scale
            were discretized into bins and processed as
            binomial data.
            •Frequent Itemsets were generated and we
            then applied then generated our association
            rules using a built-in implementation of
            Agrawal’s Apriori algorithm.
Rules generated in our case
 study




Results from 50 students enrolled in a College level Computer
Organization and Design
Course at the University of Houston Fall 2009.
Impact of Pilot Study on Instructor
Methodology
 Our study showed that 62% of students owned the course textbook. Of
  that 62% only 8% found it useful. This information allowed the instructor
  to consider teaching more from the text, removing the text completely ,
  or adopting an alternative text. Further inquiry from the students
  revealed that they were in favor of a better textbook. Specifically one
  which offered more MIPS programming overview. This was
  corroborated by 62% of students supporting an increase in
  programming exercises.

 Our rules implied that students who viewed the video->expected to do
  well on exam2‘ and students who studied primarily using the review
  video->received F. We can consider the pairing of these rules to imply
  that the review video instills false confidence in students. I personally
  inform my students now that there is a review video, but research
  indicates it may lead to a failing grade if used on its own.

 From these examples we can see that a system of this type may be
  beneficial to both students and instructors.
Development Issues raised in Pilot
Study
 We need to implement safe guards to protect against meaningless or
  contradictory rule generation. By seeking pre-defined rules this can cut
  the computational resources needed to generate our rules and solve
  both of these issues while remaining adaptive.


 Other techniques may prove to be more useful to achieve a dynamic
  assessment. For example neural networks . A neural network of
  Moodle data combined with information on the final evaluation of the
  students has been used to obtain models that permit to predict what
  students are in situation to surpass a course(Calvo-Florez 2006). Other
  researchers suggest using a combination of techniques to achieve more
  interesting results (Romero 2007).
Current state of WittyCat
 We have created a stand-alone data collection system which does not
  rely on LimeSurvey.


 We are implementing the automation of the Apriori algorithm on
  collected data for one-click rule generation.


 We are currently developing an inference engine to provide backward
  chaining driven explanations of the conclusions arrived at in a human-
  readable format.
Overview of the Desired Final
Product




A dynamic, adaptive, specified, intelligent, assessment tool with expert
adaptation.
How you can contribute
 If you are teaching an online course with examination given at intervals,
  we can use your data and generate feedback. We are also interested in
  your subjective critiques of the W-CAT analysis.


 To use the system simply visit wittycat.volatileassertion.com and
  register. Next, watch the instructional video and import your students
  and course materials as outlined in your syllabus.


 Your participation can help us determine the subjective validity of the
  assessments produced by our tool


 You may contact Paul DeCarlo at pjdecarlo@uh.edu or Dr. Nouhad Rizk
  at njrizk@uh.edu for further information.
Suggestions, Questions,
Critiques?

More Related Content

What's hot

Predicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithmsPredicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithmsIJDKP
 
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVEDATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVEIJDKP
 
STUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUE
STUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUESTUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUE
STUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUEIJDKP
 
Data science lecture4_doaa_mohey
Data science lecture4_doaa_moheyData science lecture4_doaa_mohey
Data science lecture4_doaa_moheyDoaa Mohey Eldin
 
Predicting students performance using classification techniques in data mining
Predicting students performance using classification techniques in data miningPredicting students performance using classification techniques in data mining
Predicting students performance using classification techniques in data miningLovely Professional University
 
Application of hidden markov model in question answering systems
Application of hidden markov model in question answering systemsApplication of hidden markov model in question answering systems
Application of hidden markov model in question answering systemsijcsa
 
IRJET- Evaluation Technique of Student Performance in various Courses
IRJET- Evaluation Technique of Student Performance in various CoursesIRJET- Evaluation Technique of Student Performance in various Courses
IRJET- Evaluation Technique of Student Performance in various CoursesIRJET Journal
 
Machine learning for Data Science
Machine learning for Data ScienceMachine learning for Data Science
Machine learning for Data ScienceDr. Vaibhav Kumar
 
STUDENT PERFORMANCE ANALYSIS USING DECISION TREE
STUDENT PERFORMANCE ANALYSIS USING DECISION TREESTUDENT PERFORMANCE ANALYSIS USING DECISION TREE
STUDENT PERFORMANCE ANALYSIS USING DECISION TREEAkshay Jain
 
A Survey on the Classification Techniques In Educational Data Mining
A Survey on the Classification Techniques In Educational Data MiningA Survey on the Classification Techniques In Educational Data Mining
A Survey on the Classification Techniques In Educational Data MiningEditor IJCATR
 
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerStudy and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerIJERA Editor
 
Machine learning in Data Science
Machine learning in Data ScienceMachine learning in Data Science
Machine learning in Data ScienceDr. Vaibhav Kumar
 
EDM_IJTIR_Article_201504020
EDM_IJTIR_Article_201504020EDM_IJTIR_Article_201504020
EDM_IJTIR_Article_201504020Ritika Saxena
 
IRJET- Predictive Analytics for Placement of Student- A Comparative Study
IRJET- Predictive Analytics for Placement of Student- A Comparative StudyIRJET- Predictive Analytics for Placement of Student- A Comparative Study
IRJET- Predictive Analytics for Placement of Student- A Comparative StudyIRJET Journal
 
Predicting More from Less: Synergies of Learning
Predicting More from Less: Synergies of LearningPredicting More from Less: Synergies of Learning
Predicting More from Less: Synergies of LearningCS, NcState
 
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and PredictionUsing ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Predictionijtsrd
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 

What's hot (20)

Predicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithmsPredicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithms
 
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVEDATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
DATA MINING IN EDUCATION : A REVIEW ON THE KNOWLEDGE DISCOVERY PERSPECTIVE
 
STUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUE
STUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUESTUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUE
STUDENTS’ PERFORMANCE PREDICTION SYSTEM USING MULTI AGENT DATA MINING TECHNIQUE
 
Data science lecture4_doaa_mohey
Data science lecture4_doaa_moheyData science lecture4_doaa_mohey
Data science lecture4_doaa_mohey
 
Predicting students performance using classification techniques in data mining
Predicting students performance using classification techniques in data miningPredicting students performance using classification techniques in data mining
Predicting students performance using classification techniques in data mining
 
Application of hidden markov model in question answering systems
Application of hidden markov model in question answering systemsApplication of hidden markov model in question answering systems
Application of hidden markov model in question answering systems
 
IRJET- Evaluation Technique of Student Performance in various Courses
IRJET- Evaluation Technique of Student Performance in various CoursesIRJET- Evaluation Technique of Student Performance in various Courses
IRJET- Evaluation Technique of Student Performance in various Courses
 
final
finalfinal
final
 
Machine learning for Data Science
Machine learning for Data ScienceMachine learning for Data Science
Machine learning for Data Science
 
STUDENT PERFORMANCE ANALYSIS USING DECISION TREE
STUDENT PERFORMANCE ANALYSIS USING DECISION TREESTUDENT PERFORMANCE ANALYSIS USING DECISION TREE
STUDENT PERFORMANCE ANALYSIS USING DECISION TREE
 
A Survey on the Classification Techniques In Educational Data Mining
A Survey on the Classification Techniques In Educational Data MiningA Survey on the Classification Techniques In Educational Data Mining
A Survey on the Classification Techniques In Educational Data Mining
 
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using RapidminerStudy and Analysis of K-Means Clustering Algorithm Using Rapidminer
Study and Analysis of K-Means Clustering Algorithm Using Rapidminer
 
System Adoption: Socio-Technical Integration
System Adoption: Socio-Technical IntegrationSystem Adoption: Socio-Technical Integration
System Adoption: Socio-Technical Integration
 
Machine learning in Data Science
Machine learning in Data ScienceMachine learning in Data Science
Machine learning in Data Science
 
EDM_IJTIR_Article_201504020
EDM_IJTIR_Article_201504020EDM_IJTIR_Article_201504020
EDM_IJTIR_Article_201504020
 
IRJET- Predictive Analytics for Placement of Student- A Comparative Study
IRJET- Predictive Analytics for Placement of Student- A Comparative StudyIRJET- Predictive Analytics for Placement of Student- A Comparative Study
IRJET- Predictive Analytics for Placement of Student- A Comparative Study
 
B05110409
B05110409B05110409
B05110409
 
Predicting More from Less: Synergies of Learning
Predicting More from Less: Synergies of LearningPredicting More from Less: Synergies of Learning
Predicting More from Less: Synergies of Learning
 
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and PredictionUsing ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
Using ID3 Decision Tree Algorithm to the Student Grade Analysis and Prediction
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 

Similar to ICELW Conference Slides

The Architecture of System for Predicting Student Performance based on the Da...
The Architecture of System for Predicting Student Performance based on the Da...The Architecture of System for Predicting Student Performance based on the Da...
The Architecture of System for Predicting Student Performance based on the Da...Thada Jantakoon
 
Analyzing undergraduate students’ performance in various perspectives using d...
Analyzing undergraduate students’ performance in various perspectives using d...Analyzing undergraduate students’ performance in various perspectives using d...
Analyzing undergraduate students’ performance in various perspectives using d...Alexander Decker
 
An Empirical Study of the Applications of Classification Techniques in Studen...
An Empirical Study of the Applications of Classification Techniques in Studen...An Empirical Study of the Applications of Classification Techniques in Studen...
An Empirical Study of the Applications of Classification Techniques in Studen...IJERA Editor
 
Analysis Of Data Mining Model For Successful Implementation Of Data Warehouse...
Analysis Of Data Mining Model For Successful Implementation Of Data Warehouse...Analysis Of Data Mining Model For Successful Implementation Of Data Warehouse...
Analysis Of Data Mining Model For Successful Implementation Of Data Warehouse...Scott Bou
 
A Comprehensive Review of Relevant Techniques used in Course Recommendation S...
A Comprehensive Review of Relevant Techniques used in Course Recommendation S...A Comprehensive Review of Relevant Techniques used in Course Recommendation S...
A Comprehensive Review of Relevant Techniques used in Course Recommendation S...IRJET Journal
 
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...IJDKP
 
Student Performance Evaluation in Education Sector Using Prediction and Clust...
Student Performance Evaluation in Education Sector Using Prediction and Clust...Student Performance Evaluation in Education Sector Using Prediction and Clust...
Student Performance Evaluation in Education Sector Using Prediction and Clust...IJSRD
 
Student Assestment Questionnaire
Student Assestment QuestionnaireStudent Assestment Questionnaire
Student Assestment QuestionnaireMichael Mendoza
 
Online Examination and Evaluation System
Online Examination and Evaluation SystemOnline Examination and Evaluation System
Online Examination and Evaluation SystemIRJET Journal
 
Analysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry SystemAnalysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry SystemIJSRD
 
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...IJCNCJournal
 
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...IJCNCJournal
 
Correlation based feature selection (cfs) technique to predict student perfro...
Correlation based feature selection (cfs) technique to predict student perfro...Correlation based feature selection (cfs) technique to predict student perfro...
Correlation based feature selection (cfs) technique to predict student perfro...IJCNCJournal
 
Data-Driven Learning Strategy
Data-Driven Learning StrategyData-Driven Learning Strategy
Data-Driven Learning StrategyJessie Chuang
 
Indexing based Genetic Programming Approach to Record Deduplication
Indexing based Genetic Programming Approach to Record DeduplicationIndexing based Genetic Programming Approach to Record Deduplication
Indexing based Genetic Programming Approach to Record Deduplicationidescitation
 
Exploring Semantic Question Generation Methodology and a Case Study for Algor...
Exploring Semantic Question Generation Methodology and a Case Study for Algor...Exploring Semantic Question Generation Methodology and a Case Study for Algor...
Exploring Semantic Question Generation Methodology and a Case Study for Algor...IJCI JOURNAL
 
IRJET- Analysis of Student Performance using Machine Learning Techniques
IRJET- Analysis of Student Performance using Machine Learning TechniquesIRJET- Analysis of Student Performance using Machine Learning Techniques
IRJET- Analysis of Student Performance using Machine Learning TechniquesIRJET Journal
 

Similar to ICELW Conference Slides (20)

The Architecture of System for Predicting Student Performance based on the Da...
The Architecture of System for Predicting Student Performance based on the Da...The Architecture of System for Predicting Student Performance based on the Da...
The Architecture of System for Predicting Student Performance based on the Da...
 
Analyzing undergraduate students’ performance in various perspectives using d...
Analyzing undergraduate students’ performance in various perspectives using d...Analyzing undergraduate students’ performance in various perspectives using d...
Analyzing undergraduate students’ performance in various perspectives using d...
 
An Empirical Study of the Applications of Classification Techniques in Studen...
An Empirical Study of the Applications of Classification Techniques in Studen...An Empirical Study of the Applications of Classification Techniques in Studen...
An Empirical Study of the Applications of Classification Techniques in Studen...
 
Analysis Of Data Mining Model For Successful Implementation Of Data Warehouse...
Analysis Of Data Mining Model For Successful Implementation Of Data Warehouse...Analysis Of Data Mining Model For Successful Implementation Of Data Warehouse...
Analysis Of Data Mining Model For Successful Implementation Of Data Warehouse...
 
A Comprehensive Review of Relevant Techniques used in Course Recommendation S...
A Comprehensive Review of Relevant Techniques used in Course Recommendation S...A Comprehensive Review of Relevant Techniques used in Course Recommendation S...
A Comprehensive Review of Relevant Techniques used in Course Recommendation S...
 
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
WEB-BASED DATA MINING TOOLS : PERFORMING FEEDBACK ANALYSIS AND ASSOCIATION RU...
 
Student Performance Evaluation in Education Sector Using Prediction and Clust...
Student Performance Evaluation in Education Sector Using Prediction and Clust...Student Performance Evaluation in Education Sector Using Prediction and Clust...
Student Performance Evaluation in Education Sector Using Prediction and Clust...
 
Student Assestment Questionnaire
Student Assestment QuestionnaireStudent Assestment Questionnaire
Student Assestment Questionnaire
 
Online Examination and Evaluation System
Online Examination and Evaluation SystemOnline Examination and Evaluation System
Online Examination and Evaluation System
 
T0 numtq0n tk=
T0 numtq0n tk=T0 numtq0n tk=
T0 numtq0n tk=
 
Analysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry SystemAnalysis on Student Admission Enquiry System
Analysis on Student Admission Enquiry System
 
IJET-V2I6P22
IJET-V2I6P22IJET-V2I6P22
IJET-V2I6P22
 
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
 
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
CORRELATION BASED FEATURE SELECTION (CFS) TECHNIQUE TO PREDICT STUDENT PERFRO...
 
Correlation based feature selection (cfs) technique to predict student perfro...
Correlation based feature selection (cfs) technique to predict student perfro...Correlation based feature selection (cfs) technique to predict student perfro...
Correlation based feature selection (cfs) technique to predict student perfro...
 
Data-Driven Learning Strategy
Data-Driven Learning StrategyData-Driven Learning Strategy
Data-Driven Learning Strategy
 
Introduction to machine learning
Introduction to machine learningIntroduction to machine learning
Introduction to machine learning
 
Indexing based Genetic Programming Approach to Record Deduplication
Indexing based Genetic Programming Approach to Record DeduplicationIndexing based Genetic Programming Approach to Record Deduplication
Indexing based Genetic Programming Approach to Record Deduplication
 
Exploring Semantic Question Generation Methodology and a Case Study for Algor...
Exploring Semantic Question Generation Methodology and a Case Study for Algor...Exploring Semantic Question Generation Methodology and a Case Study for Algor...
Exploring Semantic Question Generation Methodology and a Case Study for Algor...
 
IRJET- Analysis of Student Performance using Machine Learning Techniques
IRJET- Analysis of Student Performance using Machine Learning TechniquesIRJET- Analysis of Student Performance using Machine Learning Techniques
IRJET- Analysis of Student Performance using Machine Learning Techniques
 

ICELW Conference Slides

  • 1. The Design and Development of an Expert System Prototype for Enhancing Exam Quality Presenter: Paul DeCarlo Professor: Nouhad Rizk Date: 6 / 11/ 2010 DeCarlo, P., & Rizk, N. (2010). The Design and Development of an Expert System Prototype for Enhancing Exam Quality. International Conference on
  • 2. Introduction  Data mining or knowledge discovery in databases (KDD) is the automatic extraction of implicit and interesting patterns from large data collections (Klosgen & Zytkow, 2002).  Rule discovery is one of the most popular data mining techniques, especially in EDM(Educational Data Mining), because it shows the teacher information that has been discovered and does so in an intuitive way (Romero & Ventura, 2004).  Conventional rule-based expert systems, use human expert knowledge to solve real-world problems that normally would require human intelligence. Expert knowledge is often represented in the form of rules or as data within the computer. Rule-based expert systems have played an important role in modern intelligent systems and their applications in strategic goal setting, planning, design, scheduling, fault monitoring, diagnosis and so on (Abraham, 2005).
  • 3. Study Purpose  Course evaluations are typically done once per semester at college universities. Furthermore, students who drop a course are usually not considered in these evaluations.  Course evaluations typically do not consider evaluating the efficacy of course materials including assignments, textbooks, review materials, etc.  Evaluations done at intervals, would be able to capture issues as they happen and include students who are intending on dropping. This could inform the educator of what is actually happening in their course instead of providing information after the fact.  Most of the current data mining tools are too complex for educators to use, thus a system which can automate this process to create a human readable evaluations dynamically is of much importance if a system of this type is expected to be adopted by educators.
  • 4. What is Exam Quality?  Exam quality refers to how well an examination of learned material reflects the information provided in course learning materials. Think of the way validity is defined in research methods.
  • 5. What is Association Rule Learning?  Association rules are a data mining technique used to discover relations between variables in large example sets.  Support is defined as the probability that an example contains a subset X when randomly chosen from the total set of responses. Support of an association rule ‗A=>B‘ is defined as the ‗support of (A union B)‘.  Confidence refers to the likelihood that for a transaction containing A, how likely is it that it also contains B. Confidence of an association rule ‗A=>B‘ is defined as the ‗probability that an example contains B given A divided by the probability that an example contains A‘. This is the same as the ‗support of (A union B) divided by the support of (A)‘.  Algorithm developed by Rakesh Agrawal (1993). 1. Minimum support is applied to find all frequent itemsets in a database. 2. These frequent itemsets and the minimum confidence constraint are used to form rules.  The rules this technique produces can be interpreted as easily as they can be read. For example, a typical rule may take the form {studies daily} => {has a high GPA}. This would mean that the feature 'studies daily‘ implies a given example {has a high GPA}.
  • 6. Data Collection Driven by W-CAT model  We must have a set of data from which to mine our rules. There has been research done applying data mining techniques to CMS logs (i.e. Moodle). Our system seeks to include subjective data and requires a survey interface to collect current data. Some of the information we ask in the survey could be gathered from a CMS (homework / exam review completion).  To drive our data collection process and the overall workflow of our system, we used the Witty Cat model developed by Dr. Nouhad Rizk at the University of Houston to guide the creation of our survey. The Witty Cat Model
  • 7. Example Survey Questions & Results •Data gathered using the open-source LimeSurvey online survey software. •The responses can be considered valid, as invitations to the survey are distributed using a secure token system.
  • 8. The RapidMiner Process Tree •We used the open-source data mining tool, RapidMiner for the generation of our rulesets. This tool allows for visualization and handling of remote databases. •In our initial study (DeCarlo & Rizk 2010), the survey data was cleansed by converting the numerical grades to nominals A-F. These were then converted to binomial data. •Questions which used a 5 point ranking scale were discretized into bins and processed as binomial data. •Frequent Itemsets were generated and we then applied then generated our association rules using a built-in implementation of Agrawal’s Apriori algorithm.
  • 9. Rules generated in our case study Results from 50 students enrolled in a College level Computer Organization and Design Course at the University of Houston Fall 2009.
  • 10. Impact of Pilot Study on Instructor Methodology  Our study showed that 62% of students owned the course textbook. Of that 62% only 8% found it useful. This information allowed the instructor to consider teaching more from the text, removing the text completely , or adopting an alternative text. Further inquiry from the students revealed that they were in favor of a better textbook. Specifically one which offered more MIPS programming overview. This was corroborated by 62% of students supporting an increase in programming exercises.  Our rules implied that students who viewed the video->expected to do well on exam2‘ and students who studied primarily using the review video->received F. We can consider the pairing of these rules to imply that the review video instills false confidence in students. I personally inform my students now that there is a review video, but research indicates it may lead to a failing grade if used on its own.  From these examples we can see that a system of this type may be beneficial to both students and instructors.
  • 11. Development Issues raised in Pilot Study  We need to implement safe guards to protect against meaningless or contradictory rule generation. By seeking pre-defined rules this can cut the computational resources needed to generate our rules and solve both of these issues while remaining adaptive.  Other techniques may prove to be more useful to achieve a dynamic assessment. For example neural networks . A neural network of Moodle data combined with information on the final evaluation of the students has been used to obtain models that permit to predict what students are in situation to surpass a course(Calvo-Florez 2006). Other researchers suggest using a combination of techniques to achieve more interesting results (Romero 2007).
  • 12. Current state of WittyCat  We have created a stand-alone data collection system which does not rely on LimeSurvey.  We are implementing the automation of the Apriori algorithm on collected data for one-click rule generation.  We are currently developing an inference engine to provide backward chaining driven explanations of the conclusions arrived at in a human- readable format.
  • 13. Overview of the Desired Final Product A dynamic, adaptive, specified, intelligent, assessment tool with expert adaptation.
  • 14. How you can contribute  If you are teaching an online course with examination given at intervals, we can use your data and generate feedback. We are also interested in your subjective critiques of the W-CAT analysis.  To use the system simply visit wittycat.volatileassertion.com and register. Next, watch the instructional video and import your students and course materials as outlined in your syllabus.  Your participation can help us determine the subjective validity of the assessments produced by our tool  You may contact Paul DeCarlo at pjdecarlo@uh.edu or Dr. Nouhad Rizk at njrizk@uh.edu for further information.