Students' academic performance using a clustering technique


saniacorreya

Analyzing students' performance using a clustering tool


Students' Academic Performance
Knowledge Discovery from Data

Introduction
• Our project aims to analyze students' academic performance and to find out whether there is any general pattern in their marks.
• We analyze both the internal and the external marks of each student.
• We followed the KDD (Knowledge Discovery from Data) steps described below to prepare and mine our data.

Learning the application domain
• Learning the application domain is the first step in the KDD process.
• We need a clear understanding of the application domain and of our objectives.
• The institution considered for mining is the MCA batch of Rajagiri College of Social Sciences.
• We collected the academic records of previous years from the Department of Computer Science.

Create a target data set: data selection
• We selected the marks of the 2007-2010 batch for analyzing the pattern.
• There were around 45 records (one per student).
• Both the internal and the external marks of each student were selected in order to find the performance pattern.

Data cleaning & preprocessing
• Data cleaning is the step where noise and irrelevant data are removed from the data set.
• This is a very important preprocessing step because the outcome depends on the quality of the selected data.
• Typical tasks: remove duplicate records, enter logically correct values for missing records (absent students), remove unnecessary data fields, and standardize the data format.

• There was not much duplicate or unnecessary data in the collected records; the dataset was already partially clean.
• Each student's internal marks and external marks were stored in different records.
• By applying data integration, these records were merged into one record per student.
• The new dataset consists of the internal and external mark details of each student in one record.
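As a rough sketch of this integration step, the two sets of records can be joined on a student identifier. The dictionary layout and the IDs below are hypothetical; the slides do not describe how the original records were keyed.

```python
def integrate(internal, external):
    """Merge internal and external mark records into one record per student."""
    merged = {}
    for student_id, marks in internal.items():
        merged[student_id] = {"internal": marks, "external": None}
    for student_id, marks in external.items():
        # Create the record if the student had no internal entry.
        merged.setdefault(student_id, {"internal": None, "external": None})
        merged[student_id]["external"] = marks
    return merged


# Hypothetical marks keyed by an assumed student ID.
internal = {"S01": [18, 17], "S02": [12, 14]}
external = {"S01": [60, 55], "S02": [41, 47]}
records = integrate(internal, external)
```

A student who appears in only one source keeps `None` on the other side, which makes missing records easy to spot before mining.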

Data reduction & transformation
• The data is transformed into an appropriate form to make it ready for the data mining step.
• The dataset contains the marks of 5 theory papers and 2 lab papers for each of the 5 semesters.
• These marks are transformed into the sum of internal marks and the sum of external marks for each student, to make the pattern easier to analyze.
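The reduction described above amounts to replacing the seven paper marks with two totals. A minimal sketch, assuming the totals are taken per semester (the slides cluster each semester separately) and using made-up marks:

```python
def summarize(record):
    """Reduce per-paper marks to one internal total and one external total."""
    return {
        "internal_total": sum(record["internal"]),
        "external_total": sum(record["external"]),
    }


# One hypothetical semester: 5 theory papers followed by 2 lab papers.
semester_record = {
    "internal": [18, 17, 15, 19, 16, 20, 18],
    "external": [60, 55, 52, 63, 58, 70, 66],
}
totals = summarize(semester_record)
```

Each student is then described by a single pair (internal total, external total), which is the two-dimensional point that the clustering step works on.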

Cluster Analysis
• The data mining technique we used here is clustering.
• A cluster is a collection of data objects that are similar to one another within the same cluster and dissimilar to objects in other clusters.
• We first partitioned the data into groups based on data similarity and then assigned labels to the groups.
KDD step: Choosing functions of data mining

K-Means Partitioning
• The k-means algorithm takes an input parameter k and partitions a set of n objects into k clusters.
• Here we set the number of clusters to 4.
• Each object is assigned to the cluster whose center is nearest to it.
• For each semester we computed the clusters separately and labeled them Excellent, Good, Fair, and Poor.
KDD step: Choosing mining algorithms
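The partitioning and labeling steps can be illustrated with a plain-Python version of k-means (Lloyd's algorithm). This is a sketch only: the authors used the Orange tool, not this code. The sample marks, the deterministic initialization, and the rule of ranking clusters by centroid total to assign the four labels are all assumptions.

```python
import math


def kmeans(points, k, iterations=100):
    """Plain Lloyd's k-means on 2-D points; returns (centroids, assignments)."""
    # Deterministic start (an assumption, for reproducibility): take k points
    # evenly spaced through the list sorted by total marks.
    ordered = sorted(points, key=lambda p: p[0] + p[1])
    step = (len(ordered) - 1) / (k - 1) if k > 1 else 0
    centroids = [ordered[round(i * step)] for i in range(k)]

    assignments = []
    for _ in range(iterations):
        # Assignment step: each point goes to its nearest cluster center.
        new_assignments = [
            min(range(k), key=lambda c: math.dist(p, centroids[c])) for p in points
        ]
        if new_assignments == assignments:
            break  # no point changed cluster, so the algorithm has converged
        assignments = new_assignments
        # Update step: move each center to the mean of its members.
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # an empty cluster keeps its previous center
                centroids[c] = (
                    sum(m[0] for m in members) / len(members),
                    sum(m[1] for m in members) / len(members),
                )
    return centroids, assignments


def label_clusters(centroids):
    """Rank clusters by centroid total and name them from Poor to Excellent."""
    names = ["Poor", "Fair", "Good", "Excellent"]
    order = sorted(range(len(centroids)), key=lambda c: sum(centroids[c]))
    return {cluster: names[rank] for rank, cluster in enumerate(order)}


# Hypothetical (internal total, external total) pairs for one semester.
marks = [(40, 150), (42, 160), (70, 250), (72, 260),
         (100, 340), (102, 350), (125, 430), (128, 440)]
centroids, assignments = kmeans(marks, k=4)
labels = label_clusters(centroids)
student_labels = [labels[a] for a in assignments]
```

Because the external totals span a wider range than the internal totals, they dominate the Euclidean distance here; scaling both totals to a common range before clustering is a common refinement.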

The tool used for pattern evaluation is Orange.

Data mining: search for patterns of interest
• From the mining process we found that the clusters of all 5 semesters followed the same pattern of performance.
• A student with high internal marks also has high external marks, and a student with low internal marks has low external marks.
• There is a direct relation between the internal and the external marks.
• In some cases this relation does not hold, for example:
  • being absent for an internal exam but scoring high marks in the externals (or vice versa).
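One way to quantify the direct relation described above is the Pearson correlation coefficient between internal and external totals. The slides do not report such a figure, so the sketch below uses made-up totals purely to show the calculation, including the effect of one exceptional student.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical totals that follow the reported pattern.
internal = [40, 70, 100, 125]
external = [150, 250, 340, 430]
r = pearson(internal, external)

# Same data plus one student absent for internals but strong in externals.
r_with_exception = pearson(internal + [0], external + [400])
```

A value of `r` close to 1 indicates a strong direct relation; the single exceptional student pulls `r_with_exception` well below `r`, which is why such cases need to be examined separately.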

CONCLUSION
• A student's performance in the university exam can be predicted with the help of the student's internal marks, because there is a direct relation between the internal and the external marks.
• A student with low internal marks is likely to get low external marks too.

Use of discovered knowledge
representation



Slide 5 (figure): Internal & External Dataset


Slides 14-27 (figures): Orange cluster analysis with the number of clusters set to 4; cluster plots for Semesters 1 to 5 (clusters labeled Poor, Fair, Good, Excellent); centroid analysis for Semesters 1 to 5; combined centroid analysis.
