SlideShare une entreprise Scribd logo
1  sur  6
Project Report
                     Version 1.1
                    May 6, 2010



WKU Job Applicant’s Profile Evaluator using KNN

                  Vijayeandra Parthepan
                  Mohnish Thallavajhula
              Professor: Dr. Huanjing Wang




              Submitted in partial fulfillment
                  Of the requirements of
                   CS565 Data Mining




          Western Kentucky University
Project Report
                                                                                                                                                 12/05/11



                                                                 Table of Contents

1.0. Introduction ................................................................................................................................................ 3
2.0. Motivation ................................................................................................................................................... 3
3.0. Dataset Description .................................................................................................................................. 3
4.0. Approaches ................................................................................................................................................ 4
5.0. Future Work ................................................................................................................................................ 5
5.0. Results ......................................................................................................................................................... 5
6.0. Conclusion .................................................................................................................................................. 6
7.0. Referrences ................................................................................................................................................ 6




                                                                                      2
Project Report
                                                                                                  12/05/11


1.0.   INTRODUCTION:

                 K Nearest Neighbor (KNN) is the supervised data mining pattern recognition algorithm. It
       classifies objects based on closest training exam-nearest neighbor algorithm. It is amongst the simplest of
       all machine learning algorithms. An object is classified by a majority vote of its neighbors. K is small
       positive integer and it is usually previously set.
                 WKU job applicant’s profile evaluator using KNN analyzes the status of the current job applicant
       based on the applicant’s details and classifies the applicant to the group of jobs that the applicant can apply.




2.0.   MOTIVATION:

                 The potential employee’s who wish to find some jobs in the university are not sure which jobs
       they are most likely to get and hence they may end up applying to jobs which may not suit their profile. So,
       in order to make their job search more accurate, we are going to compare their profile with already existing
       employee’s and provides them the job suggestions. We are going to analyze the status of WKU employees
       using KNN. The KNN algorithm classifies the new employee to a particular class based on the existing
       records. The k – “nearest” details of the existing job assignments will be considered and the job applicant
       will be classified into which group the applicant belongs to.




3.0.    DATASET DESCRIPTION:

   Training data is the existing assignments of the jobs.


   Sample Training Data:
   A G 3.0 CS        2
   B UG 2.5 ANY 3
   C G 3.0 MPH 5


   Test data is the details of the Job Applicant.


   Sample Test Data:
   G 3.5 CS 5




                                                            3
Project Report
                                                                                                      12/05/11
   Test Data Description:
   Training data has:
   Class Name               in 1st column
   Qualification            in 2nd column
   GPA                      in 3rd column
   Department               in 4th column
   Years of experience      in 5th column


   Training Data Description:
   Qualification            in 1st column
   GPA                      in 2nd column
   Department               in 3rd column
   Years of experience      in 4th column




4.0.   APPROACHES:


                   After calculating the group to which the Job Applicant belongs to, the list of jobs that the Job
       applicant can apply is displayed.


                   The algorithm of the k-nearest neighbor that we apply in our project is as follows,
                   1. Calculate the “distance” from the test record to the training records.
                   2. Find the “k - nearest” training records.
                   3. Check the majority class from the k – nearest training records.
                   4. The class label for the training record is predicted as the class with the majority votes/weight
                   among the k – nearest training.


       We are classifying the job applicants based on their details into different classes of jobs.


                   Group A: {Graduate Assistant, Research Assistant}
                   Group B: {Lab Assistant, Desk Clerk, Night Clerk}
                   Group C: {Shuttle driver, Receptionist}


       The application has been developed using C# .NET.




                                                            4
Project Report
                                                                                                     12/05/11


5.0.   FUTURE WORK:


                 Convert the Windows implementation into Web Application.
                 Provide direct application process to the jobs by taking the applicant’s details.




6.0.   RESULTS:


   Screen shot of the help menu:




                                                          5
Project Report
                                                                                                12/05/11

    Screen shot of the main menu:




7.0.   CONCLUSION:


       By implementing k – NN, the applicant is classified into a particular group of jobs. Thus, the job
   application process is simplified. Since we have implemented k – NN, the implementation is much simpler than
   it’s counter parts i.e. Decision Trees, Naïve Bayes, Support Vector Machines.


8.0.    REFERRENCES:


   http://en.wikipedia.org/wiki/K-nearest_neighbor_algorithm




                                                        6

Contenu connexe

Similaire à Dm project report

IRJET- Stabilization of Black Cotton Soil using Rice Husk Ash and Lime
IRJET- Stabilization of Black Cotton Soil using Rice Husk Ash and LimeIRJET- Stabilization of Black Cotton Soil using Rice Husk Ash and Lime
IRJET- Stabilization of Black Cotton Soil using Rice Husk Ash and LimeIRJET Journal
 
IRJET- Student Placement Prediction using Machine Learning
IRJET- Student Placement Prediction using Machine LearningIRJET- Student Placement Prediction using Machine Learning
IRJET- Student Placement Prediction using Machine LearningIRJET Journal
 
IRJET- A Review on Object Tracking based on KNN Classifier
IRJET- A Review on Object Tracking based on KNN ClassifierIRJET- A Review on Object Tracking based on KNN Classifier
IRJET- A Review on Object Tracking based on KNN ClassifierIRJET Journal
 
Robotics-Based Learning in the Context of Computer Programming
Robotics-Based Learning in the Context of Computer ProgrammingRobotics-Based Learning in the Context of Computer Programming
Robotics-Based Learning in the Context of Computer ProgrammingJacob Storer
 
CompSci: 221 Winter 2017 Search Engine for UCI
CompSci: 221 Winter 2017 Search Engine for UCICompSci: 221 Winter 2017 Search Engine for UCI
CompSci: 221 Winter 2017 Search Engine for UCISoham Kulkarni
 
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)Qazi Maaz Arshad
 
Hemraj_Patil_Resume
Hemraj_Patil_ResumeHemraj_Patil_Resume
Hemraj_Patil_ResumeHemraj Patil
 
Image Forgery / Tampering Detection Using Deep Learning and Cloud
Image Forgery / Tampering Detection Using Deep Learning and CloudImage Forgery / Tampering Detection Using Deep Learning and Cloud
Image Forgery / Tampering Detection Using Deep Learning and CloudIRJET Journal
 
USING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLING
USING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLINGUSING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLING
USING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLINGIRJET Journal
 
Web based Software Development
Web based Software DevelopmentWeb based Software Development
Web based Software Developmentdaveparky
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBoston Institute of Analytics
 
CV INSPECTION USING NLP AND MACHINE LEARNING
CV INSPECTION USING NLP AND MACHINE LEARNINGCV INSPECTION USING NLP AND MACHINE LEARNING
CV INSPECTION USING NLP AND MACHINE LEARNINGIRJET Journal
 
DataScience SG | Undergrad Series | 26th Sep 19
DataScience SG | Undergrad Series | 26th Sep 19DataScience SG | Undergrad Series | 26th Sep 19
DataScience SG | Undergrad Series | 26th Sep 19Yong Siang (Ivan) Tan
 
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...IRJET Journal
 

Similaire à Dm project report (20)

Jasa
JasaJasa
Jasa
 
IRJET- Stabilization of Black Cotton Soil using Rice Husk Ash and Lime
IRJET- Stabilization of Black Cotton Soil using Rice Husk Ash and LimeIRJET- Stabilization of Black Cotton Soil using Rice Husk Ash and Lime
IRJET- Stabilization of Black Cotton Soil using Rice Husk Ash and Lime
 
IRJET- Student Placement Prediction using Machine Learning
IRJET- Student Placement Prediction using Machine LearningIRJET- Student Placement Prediction using Machine Learning
IRJET- Student Placement Prediction using Machine Learning
 
IRJET- A Review on Object Tracking based on KNN Classifier
IRJET- A Review on Object Tracking based on KNN ClassifierIRJET- A Review on Object Tracking based on KNN Classifier
IRJET- A Review on Object Tracking based on KNN Classifier
 
Robotics-Based Learning in the Context of Computer Programming
Robotics-Based Learning in the Context of Computer ProgrammingRobotics-Based Learning in the Context of Computer Programming
Robotics-Based Learning in the Context of Computer Programming
 
CompSci: 221 Winter 2017 Search Engine for UCI
CompSci: 221 Winter 2017 Search Engine for UCICompSci: 221 Winter 2017 Search Engine for UCI
CompSci: 221 Winter 2017 Search Engine for UCI
 
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)
Cse443 Project Report - LPU (Modern Big Data Analysis with SQL Specialization)
 
Hemraj_Patil_Resume
Hemraj_Patil_ResumeHemraj_Patil_Resume
Hemraj_Patil_Resume
 
Image Forgery / Tampering Detection Using Deep Learning and Cloud
Image Forgery / Tampering Detection Using Deep Learning and CloudImage Forgery / Tampering Detection Using Deep Learning and Cloud
Image Forgery / Tampering Detection Using Deep Learning and Cloud
 
Niranjan_Ingale
Niranjan_IngaleNiranjan_Ingale
Niranjan_Ingale
 
USING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLING
USING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLINGUSING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLING
USING IMAGE CLASSIFICATION TO INCENTIVIZE RECYCLING
 
Web based Software Development
Web based Software DevelopmentWeb based Software Development
Web based Software Development
 
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis ProjectBank Loan Approval Analysis: A Comprehensive Data Analysis Project
Bank Loan Approval Analysis: A Comprehensive Data Analysis Project
 
praveen resume
praveen resumepraveen resume
praveen resume
 
Mounika Gottumukkala - Resume
Mounika Gottumukkala - ResumeMounika Gottumukkala - Resume
Mounika Gottumukkala - Resume
 
praveen resume
praveen resumepraveen resume
praveen resume
 
Resume
ResumeResume
Resume
 
CV INSPECTION USING NLP AND MACHINE LEARNING
CV INSPECTION USING NLP AND MACHINE LEARNINGCV INSPECTION USING NLP AND MACHINE LEARNING
CV INSPECTION USING NLP AND MACHINE LEARNING
 
DataScience SG | Undergrad Series | 26th Sep 19
DataScience SG | Undergrad Series | 26th Sep 19DataScience SG | Undergrad Series | 26th Sep 19
DataScience SG | Undergrad Series | 26th Sep 19
 
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
 

Plus de Vijayeandra Parthepan (11)

Olat design
Olat designOlat design
Olat design
 
Faculty travel forms
Faculty travel formsFaculty travel forms
Faculty travel forms
 
Presentation
PresentationPresentation
Presentation
 
Orp project
Orp projectOrp project
Orp project
 
Sr sof orp
Sr sof orpSr sof orp
Sr sof orp
 
Desing document orp2
Desing document orp2Desing document orp2
Desing document orp2
 
Olat report
Olat reportOlat report
Olat report
 
Report
ReportReport
Report
 
Dbms project.ppt
Dbms project.pptDbms project.ppt
Dbms project.ppt
 
Project description
Project descriptionProject description
Project description
 
Design document travel_forms
Design document travel_formsDesign document travel_forms
Design document travel_forms
 

Dernier

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 

Dernier (20)

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 

Dm project report

  • 1. Project Report Version 1.1 May 6, 2010 WKU Job Applicant’s Profile Evaluator using KNN Vijayeandra Parthepan Mohnish Thallavajhula Professor: Dr. Huanjing Wang Submitted in partial fulfillment Of the requirements of CS565 Data Mining Western Kentucky University
  • 2. Project Report 12/05/11 Table of Contents 1.0. Introduction ................................................................................................................................................ 3 2.0. Motivation ................................................................................................................................................... 3 3.0. Dataset Description .................................................................................................................................. 3 4.0. Approaches ................................................................................................................................................ 4 5.0. Future Work ................................................................................................................................................ 5 5.0. Results ......................................................................................................................................................... 5 6.0. Conclusion .................................................................................................................................................. 6 7.0. Referrences ................................................................................................................................................ 6 2
  • 3. Project Report 12/05/11 1.0. INTRODUCTION: K Nearest Neighbor (KNN) is the supervised data mining pattern recognition algorithm. It classifies objects based on closest training exam-nearest neighbor algorithm. It is amongst the simplest of all machine learning algorithms. An object is classified by a majority vote of its neighbors. K is small positive integer and it is usually previously set. WKU job applicant’s profile evaluator using KNN analyzes the status of the current job applicant based on the applicant’s details and classifies the applicant to the group of jobs that the applicant can apply. 2.0. MOTIVATION: The potential employee’s who wish to find some jobs in the university are not sure which jobs they are most likely to get and hence they may end up applying to jobs which may not suit their profile. So, in order to make their job search more accurate, we are going to compare their profile with already existing employee’s and provides them the job suggestions. We are going to analyze the status of WKU employees using KNN. The KNN algorithm classifies the new employee to a particular class based on the existing records. The k – “nearest” details of the existing job assignments will be considered and the job applicant will be classified into which group the applicant belongs to. 3.0. DATASET DESCRIPTION: Training data is the existing assignments of the jobs. Sample Training Data: A G 3.0 CS 2 B UG 2.5 ANY 3 C G 3.0 MPH 5 Test data is the details of the Job Applicant. Sample Test Data: G 3.5 CS 5 3
  • 4. Project Report 12/05/11 Test Data Description: Training data has: Class Name in 1st column Qualification in 2nd column GPA in 3rd column Department in 4th column Years of experience in 5th column Training Data Description: Qualification in 1st column GPA in 2nd column Department in 3rd column Years of experience in 4th column 4.0. APPROACHES: After calculating the group to which the Job Applicant belongs to, the list of jobs that the Job applicant can apply is displayed. The algorithm of the k-nearest neighbor that we apply in our project is as follows, 1. Calculate the “distance” from the test record to the training records. 2. Find the “k - nearest” training records. 3. Check the majority class from the k – nearest training records. 4. The class label for the training record is predicted as the class with the majority votes/weight among the k – nearest training. We are classifying the job applicants based on their details into different classes of jobs. Group A: {Graduate Assistant, Research Assistant} Group B: {Lab Assistant, Desk Clerk, Night Clerk} Group C: {Shuttle driver, Receptionist} The application has been developed using C# .NET. 4
  • 5. Project Report 12/05/11 5.0. FUTURE WORK: Convert the Windows implementation into Web Application. Provide direct application process to the jobs by taking the applicant’s details. 6.0. RESULTS: Screen shot of the help menu: 5
  • 6. Project Report 12/05/11 Screen shot of the main menu: 7.0. CONCLUSION: By implementing k – NN, the applicant is classified into a particular group of jobs. Thus, the job application process is simplified. Since we have implemented k – NN, the implementation is much simpler than it’s counter parts i.e. Decision Trees, Naïve Bayes, Support Vector Machines. 8.0. REFERRENCES: http://en.wikipedia.org/wiki/K-nearest_neighbor_algorithm 6