SlideShare a Scribd company logo
1 of 18
Download to read offline
PROJECT : IMAGE CLASSIFICATION
December 20, 2015
Name: Mayank
RUID: 165004149
Name: Ramakanth Vemula
RUID: 167004695
Name: Sanjivi Muttena
RUID: 164005979
1
PROBLEM 1 - RESULTS OF EXECUTION OF ALGORITHMS
1) Algorithm 1 : NaiveBayes
All the results are obtained by testing over 1000 testing data points for digits
Table 1: Naive Bayes - Digit - Time
Time Taken For Digit Data(in seconds)
Training Data Time 1 Time 2 Time 3 Time 4 Time 5 Time 6 Time 7 Mean SD
500 24.82 24.634 24.629 25.135 24.386 24.937 25.018 24.794 0.241
1000 25.822 25.651 25.739 26.052 25.73 25.882 26.273 25.878 0.200
1500 26.828 26.472 26.671 26.936 27.284 27.168 26.917 26.896 0.256
2000 28.026 27.417 27.821 27.696 27.882 27.948 27.461 27.750 0.218
2500 28.353 27.972 28.113 28.271 27.835 27.589 28.032 28.023 0.240
3000 29.161 28.528 29.037 28.624 28.883 28.698 29.035 28.852 0.222
3500 31.204 30.978 30.472 30.598 31.102 31.092 30.893 30.905 0.253
4000 31.864 31.792 31.316 31.927 30.998 31.734 30.727 31.479 0.436
4500 32.013 32.232 32.516 31.958 32.163 32.427 32.143 32.207 0.189
5000 32.812 32.741 32.583 32.831 32.482 32.912 32.812 32.739 0.141
Figure 1: Naive Bayes Digit Time
2
Table 2: Naive Bayes - Digit - Accuracy
Accuracy For Digit Data(%)
Data Point Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Mean SD
500 71.7 71.3 72.1 70.8 71.2 70.9 72.1 71.442 0.495
1000 73.5 73.9 74.1 73.1 74.2 72.4 72.7 73.41 0.65
1500 77.7 75.6 76.4 77.3 75.9 76.2 77.2 76.61 0.73
2000 75.6 77.3 76.8 77.1 75.9 76.3 76.5 76.5 0.573
2500 76.3 77.6 78.2 76.1 76.4 77.7 77.3 77.08 0.75
3000 77.4 78.4 76.9 76.5 77.2 78.1 76.7 77.31 0.65
3500 75.9 77.1 76.3 76.8 75.8 77.2 76.2 76.47 0.52
4000 77.6 77.9 77.4 76 76.9 77.6 77.1 77.21 0.58
4500 77.1 78.3 77.6 78.4 77.3 78.6 76.9 77.74 0.63
5000 77 77 77 77 77 77 77 77 0
Figure 2: Naive Bayes Digit Accuracy
3
Table 3: Naive Bayes - Face - Time
Time Taken For Face Data(in seconds)
Data Point Time 1 Time 2 Time 3 Time 4 Time 5 Time 6 Time 7 Mean SD
500 5.184 5.211 5.176 5.125 5.094 5.182 5.146 5.159 0.0372
1000 5.734 5.761 5.627 5.695 5.814 5.703 5.83 5.737 0.065
1500 5.974 6.041 6.264 6.103 6.029 6.146 6.042 6.085 0.088
2000 6.292 6.462 6.381 6.315 6.402 6.311 6.298 6.351 0.059
2500 6.831 6.743 6.796 6.753 6.693 6.731 6.801 6.764 0.044
3000 7.104 7.098 7.147 7.183 7.112 7.065 7.15 7.12 0.036
3500 7.351 7.401 7.372 7.301 7.391 7.41 7.396 7.374 0.035
4000 7.741 7.769 7.809 7.791 7.81 7.739 7.796 7.77 0.027
4500 8.09 8.171 8.014 7.995 8.151 8.073 8.102 8.085 0.0601
5000 8.351 8.406 8.397 8.358 8.382 8.413 8.319 8.375 0.0314
Figure 3: Naive Bayes Face Time
4
Table 4: Naive Bayes- Face- Accuracy
Accuracy For Face Data(%)
Data Point Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Mean SD
500 59.9 64.8 58.4 59.3 70.1 65.6 61.6 62.81 3.90
1000 79.5 78.4 82.1 76.5 80.8 73.4 81.1 78.82 2.810
1500 84.6 88.1 80.4 82.9 84.1 83 80.9 83.42 2.38
2000 86.5 88.9 87.7 87.2 89.7 88.1 87.5 87.94 0.99
2500 88.6 87.9 90.1 88.9 87.4 89.3 89.8 88.85 0.90
3000 88.4 88.2 89.8 90.3 87.9 88.4 89.5 88.92 0.85
3500 90.1 89.7 89.3 90.3 89.8 90.2 88.9 89.75 0.47
4000 89.4 88.9 89.9 90.7 89.6 90.5 89.6 89.8 0.58
4500 90.2 90.8 89.7 91.1 90.9 89.8 90.3 90.4 0.50
5000 91.8 91.8 91.8 91.8 91.8 91.8 91.8 91.8 0
Figure 4: Naive Bayes Face Accuracy
5
Table 5: Perceptron - Digit - Time
Time Taken For Digit Data(in seconds)
Digits training Data Set Size Time 1 Time 2 Time 3 Mean SD
500 27.3 21.9 26.4 25.2 2.36
1000 47.3 46.8 46.34 46.81 0.392
1500 70.36 68.2 67.6 68.72 1.18
2000 97.8 102.7 105.4 101.966 3.145720197
2500 130.1 124.1 125.3 126.5 2.592296279
3000 146.2 135.5 141.1 140.93 4.369
3500 158.7 161.4 166.5 162.2 3.234
4000 170.8 183.8 189.15 181.25 7.705
4500 184.8 199.5 220.3 201.53 14.563
5000 201.4 221.1 215.9 212.8 8.33
Figure 5: Perceptron Digit Time
6
Table 6: Perceptron - Digit - Accuracy
Accuracy For Digit Data(in seconds)
Digits training Data Set Size Run 1 Run 2 Run 3 Mean SD
500 67.1 63.1 71.2 67.133 2.863
1000 74.5 78.3 72.3 75.033 2.478
1500 77.2 79.6 73.8 76.866 2.379
2000 80.1 78.1 76.4 78.2 1.512
2500 77.3 81.5 79.3 79.36 1.715
3000 79.1 76.2 80.4 78.566 1.755
3500 79.4 82.1 77.1 79.533 1.35
4000 81.1 77.2 80.7 79.66 1.95
4500 83.2 80.2 79.1 80.833 1.732
5000 80.1 78.2 81.1 79.8 1.202
Figure 6: Perceptron Digit Accuracy
7
Table 7: Perceptron - Face - Time
Time Taken For Face Data(in seconds)
Digits training Data Set Size Time 1 Time 2 Time 3 Mean SD
500 2.4 2.1 2.3 2.266 0.124
1000 4.5 4.9 4.8 4.733 0.169
1500 7.7 7.6 7.4 7.56 0.124
2000 9.7 9.8 10.1 9.866 0.169
2500 12.4 12.2 12.7 12.433 0.205
3000 15.1 14.9 15.2 15.066 0.124
3500 17.3 17.6 17.5 17.466 0.124
4000 20.1 20.2 19.2 19.83 0.449
4500 22.7 22.6 22.4 22.566 0.125
5000 25.6 25.4 25.8 25.6 0.163
Figure 7: Perceptron Face Time
8
Table 8: Perceptron - Face - Accuracy
Accuracy For Face Data(in seconds)
Digits training Data Set Size Run 1 Run 2 Run 3 Mean SD
500 67.1 70.4 73 70.16 2.091
1000 74.4 75.1 72.1 73.866 1.281
1500 77.4 79.6 76 77.666 1.48
2000 80.1 78.4 81.2 79.9 1.152
2500 82.3 81.2 80 81.16 0.939
3000 84.2 81.9 84.3 83.1 1.108
3500 84.4 85.7 83.5 84.6 0.90
4000 86.4 89.1 87.4 88.25 1.35
4500 87.2 88.1 88.5 87.93 0.54
5000 90.1 89.1 88.9 89.366 0.525
Figure 8: Perceptron Face Accuracy
9
Table 9: Mira - Digit - Time
Time Taken For Digit Data(in seconds)
Digits training Data Set Size Time 1 Time 2 Time 3 Mean SD
500 24.3 23.6 24.9 24.26 0.531
1000 46.1 47.4 48.2 47.233 0.865
1500 67.2 69.8 68.9 68.633 1.078
2000 91.7 93.2 94.3 93.06 1.065
2500 109.4 114.8 115.8 113.33 2.81
3000 138.5 144.7 141.9 141.7 2.53
3500 152.6 158.1 162.2 157.63 3.93
4000 183.2 186.3 178.1 182.5333333 3.380663972
4500 203.8 193.5 199 198.7666667 4.208193067
5000 227.3 220.5 233.1 226.966 5.149
Figure 9: Mira Digit Time
10
Table 10: Mira - Digit - Accuracy
Accuracy For Digit Data(in seconds)
Digits training Data Set Size Run 1 Run 2 Run 3 Mean SD
500 72.3 76.4 73.1 73.93 1.536
1000 74.1 76.2 77.5 75.93 1.40
1500 75.1 76.4 73.8 75.1 1.061
2000 82.3 80 78.1 80.13 1.717
2500 79.4 80.3 81.2 80.3 0.734
3000 80.1 78.9 79.9 79.633 0.524
3500 82 80.4 81.3 81.23 0.8
4000 80.1 79.5 78.8 79.46 0.3
4500 81 80.2 82.1 81.1 0.778
5000 79.2 80.4 80.8 80.13 0.679
Figure 10: Mira Face Accuracy
11
Table 11: Mira - Face - Time
Time Taken For Face Data(in seconds)
Digits training Data Set Size Time 1 Time 2 Time 3 Mean SD
500 5.732 5.845 5.912 5.829 0.07428025
1000 8.325 8.826 8.967 8.706 0.275
1500 11.458 12.241 11.251 11.65 0.426
2000 13.696 13.176 14.218 13.696 0.425
2500 16.885 16.903 17.48 17.089 0.276
3000 18.74 18.539 18.436 18.571 0.126
3500 20.561 20.704 20.019 20.428 0.295
4000 22.498 22.014 22.328 22.28 0.200
4500 24.893 24.481 23.947 24.440 0.387
5000 28.115 28.947 28.571 28.544 0.340
Figure 11: Mira Face Time
12
Table 12: Mira - Face -Accuracy
Accuracy For Face Data(in seconds)
Digits training Data Set Size Run 1 Run 2 Run 3 Mean SD
500 67.8 66.2 69.3 67.766 1.26
1000 72.5 74.8 75.9 74.4 1.416
1500 76.1 77.8 77.1 77 0.697
2000 79.1 78.3 77.9 78.433 0.498
2500 81.5 81.8 82.6 81.96 0.464
3000 84.1 83.7 83.9 83.9 0.163
3500 86.4 86.5 85.9 86.266 0.262
4000 86.9 87.1 87 87 0.0816
4500 89.1 88.7 90.1 89.3 0.588
5000 88.9 91.6 88.5 89.66 1.376
Figure 12: Mira Face Accuracy
13
PROBLEM 3 - DISCUSSION OF ALGORITHMS AND RESULTS
(A) Naive Bayes:
A naive Bayes classifier models a joint distribution over a label Y and a set of observed
random variables, or features, {F1,F2,...Fn}, using the assumption that the full joint
distribution can be factored as follows (features are conditionally independent given
the label):
P(F1 ...Fn,Y ) = P(Y )
i
P(Fi |Y )
To classify a datum, we can find the most probable label given the feature values for
each pixel, using Bayes theorem:
P(y|f1 ... fn) =
P(f1 ... fn|y)P(y)
P(f1 ... fn)
=
P(y) i = 1m
P(fi |y)
P(f1 ... fn)
ar gmaxy P(y|f1 ... fn) = ar gmaxy
P(y) m
i=1 P(fi |y)
P(f1 ... fn)
ar gmaxy P(y|f1 ... fn) = ar gmaxy P(y)
m
i=1
P(fi |y)
Because multiplying many probabilities together often results in underflow, we will
instead compute log probabilities which have the same argmax.
ar gmaxy logP(y)
m
i=1
P(fi |y) = ar gmaxy log(P(y, f1 ... fn))
= ar gmaxy log(P(y)+
m
i=1
logP(fi |y))
Use math.log(), a built-in Python function to compute logarithms.
Parameter Estimation:
Our naive Bayes model has several parameters to estimate. One parameter is the prior
distribution over labels (digits, or face/not-face), P(Y ). We can estimate P(Y ) directly
14
from the training data:
ˆP(y) =
c(y)
n
where c(y) is the number of training instances with label y and n is the total number of
training instances. The other parameters to estimate are the conditional probabilities
of our features given each label y: P(Fi |Y = y). We do this for each possible feature
value (fi ∈ 0,1).
ˆP(Fi = fi |Y = y) =
c(fi , y)
fi
c(fi , y)
where c(fi , y) is the number of times pixel Fi took value fi in the training examples of
label y.
Smoothing: Your current parameter estimates are unsmoothed, that is, you are using
the empirical estimates for the parameters P(fi |y). These estimates are rarely adequate
in real systems. Minimally, we need to make sure that no parameter ever receives
an estimate of zero, but good smoothing can boost accuracy quite a bit by reducing
overfitting. In this project, we use Laplace smoothing, which adds k counts to every
possible observation value:
P(Fi = fi |Y = y) =
c(Fi = fi ,Y = y)+k
fi
(c(Fi = fi ,Y = y)+k)
If k=0, the probabilities are unsmoothed. As k grows larger, the probabilities are
smoothed more and more. You can use your validation set to determine a good value
for k.
Conclusion: As we increase the training data from 10% to 100% we see that the accu-
racy jumps from a mere 62.8% to a reasonable 91% in face recognition and 71% to 77%
in digit recognition.The trade off is time, more the data - more processing, as from the
data collected we see the difference of 3 seconds in training face data and 8 seconds for
training digit data.
Also we see the increase in accuracy to minimize after some point. In our project
data we see that there is not much difference in accuracy between 50% training data
and 100% training data. So we could improve training time on this by using 50% data
15
instead of training on 100%.
(B) Perceptron:
The perceptron algorithm uses a weight vector to make decisions unlike Native Bayes
which uses probability. The weight vector here is represented by ωy
for each class y.
For a given feature list f , the perceptron algorithm computes the class y whose weight
vector is most similar to the input vector f . The feature vector defined in our code is a
map from pixel locations to indicators of whether they are on or not.
Formally, given a feature vector f (in our case, a map from pixel locations to indicators
of whether they are on), we score each class with:
score(f , y) =
i
fi w
y
i
The class with highest score is chosen as the predicted label for that data instance
Before classifying the training set, the weights have to be learnt by the algorithm. For
this, the training set is scanned one instance at a time. When we come to an instance
(f , y), we find the label with highest score:
y = arg maxy score(f , y )
We compare y i.e the result obtained by previous equations, to the true label y. If the
two labels are equal (y = y) then we’ve gotten the instance correct and can proceed
with the other items in the training set. Otherwise, we guessed a false positive y in
place of y. That means that wy
should have scored f higher, and wy
should have
scored f lower. In order to prevent this error in the future we update these two weight
vectors accordingly.
wy
+ = f
wy
− = f
Conclusion: We ran our algorithm for 3 iterations. We see a huge difference in accuracy,
from 67% to 90%(digits) and 70% to 90%(faces). This happens becuase perceptron
iterates repeatedly through the training data. This leads us to higher training time.
16
Perceptron makes weight corrections and starts to improve after each iteration, this is
also the reason behind the significant accuracy rate.
(C) Mira:
Similar to a perceptron classifier, the MIRA classifier also keeps a weight vector wy
of
each label y. Here too we scan over the data, one instance at a time. When we come to
an instance (f , y), we find the label with highest score:
y = arg maxy score(f , y )
We compare y to the true label y. If the labels are equal( y = y) , we’ve gotten the
instance correct, and we do nothing. Otherwise, we guessed y but we should have
guessed y. The difference between mira and perceptron is that in Mira we update the
weight vectors of these labels with variable step size:
ωy
= ωy
+τf
ωy
= ωy
−τf
Here τ > 0 and is chosen such that it minimizes :
minu (1/2) c||(ω )c
−ωc
||2
2
subject to the condition that
(ω )y
f ≥ (ω )y
f +1
This is equivalent to
minτ||τf ||2
2 subject to τ ≥
(ωy
−ωy
)f +1
2||f ||2
2
and τ ≥ 0
We can notice that, ωy
f ≥ ωy
f , so the condition τ ≥ 0 is always true given τ ≥
(ωy
−ωy
)f +1
2||f ||2
2
Solving the problem we get,
17
τ =
(ωy
−ωy
)f +1
2||f ||2
2
We cap the maximum possible value of τ by a positive constant C,
τ = min C,τ =
(ωy
−ωy
)f +1
2||f ||2
2
Conclusion:The accuracy increased from 67% to approx 90% as we increased the
training data. Similar to perceptron the accuracy improves as we the iterations, this
happens because the weights gets updated.Training time is high when compared to
Naive Bayes but is similar to perceptron.

More Related Content

What's hot

Applied Business Statistics ,ken black , ch 15
Applied Business Statistics ,ken black , ch 15Applied Business Statistics ,ken black , ch 15
Applied Business Statistics ,ken black , ch 15AbdelmonsifFadl
 
Educational certificate diploma engineering jashim
Educational certificate diploma engineering jashimEducational certificate diploma engineering jashim
Educational certificate diploma engineering jashimJashim Pd-2
 
Insulated copper-flexi-bars
Insulated copper-flexi-barsInsulated copper-flexi-bars
Insulated copper-flexi-barsTeknomega S.r.l.
 
Electic circuits fundamentals thomas floyd, david buchla 8th edition
Electic circuits fundamentals thomas floyd, david buchla 8th editionElectic circuits fundamentals thomas floyd, david buchla 8th edition
Electic circuits fundamentals thomas floyd, david buchla 8th edition명중 김
 
Forecasting model primer_v1
Forecasting model primer_v1Forecasting model primer_v1
Forecasting model primer_v1Vinod Krishnan
 
Macarena count to 100
Macarena count to 100Macarena count to 100
Macarena count to 100rpower1
 
P3 แบบทดสอบคิดคำนวณคล่อง
P3 แบบทดสอบคิดคำนวณคล่องP3 แบบทดสอบคิดคำนวณคล่อง
P3 แบบทดสอบคิดคำนวณคล่องKhunnawang Khunnawang
 
Lista completa de asignaciones CADIVI: 2004-2012
Lista completa de asignaciones CADIVI: 2004-2012Lista completa de asignaciones CADIVI: 2004-2012
Lista completa de asignaciones CADIVI: 2004-2012Álvaro Muñoz
 
کتاب المساٸل۔جلد۔2۔مفتی سلمان منصورپوری.pdf
کتاب المساٸل۔جلد۔2۔مفتی سلمان منصورپوری.pdfکتاب المساٸل۔جلد۔2۔مفتی سلمان منصورپوری.pdf
کتاب المساٸل۔جلد۔2۔مفتی سلمان منصورپوری.pdfFaizan ali Siddiqui
 
Introduction to Engineering Experimentation 3rd Edition by Wheeler Solutions ...
Introduction to Engineering Experimentation 3rd Edition by Wheeler Solutions ...Introduction to Engineering Experimentation 3rd Edition by Wheeler Solutions ...
Introduction to Engineering Experimentation 3rd Edition by Wheeler Solutions ...jicywig
 
Tablas esta ii c6120713
Tablas esta ii c6120713Tablas esta ii c6120713
Tablas esta ii c6120713LLendy GIl
 
MTAP Grade 3 Session 2 new.pptx
MTAP Grade 3 Session 2 new.pptxMTAP Grade 3 Session 2 new.pptx
MTAP Grade 3 Session 2 new.pptxLuisSalenga1
 

What's hot (20)

Applied Business Statistics ,ken black , ch 15
Applied Business Statistics ,ken black , ch 15Applied Business Statistics ,ken black , ch 15
Applied Business Statistics ,ken black , ch 15
 
Educational certificate diploma engineering jashim
Educational certificate diploma engineering jashimEducational certificate diploma engineering jashim
Educational certificate diploma engineering jashim
 
Insulated copper-flexi-bars
Insulated copper-flexi-barsInsulated copper-flexi-bars
Insulated copper-flexi-bars
 
Electic circuits fundamentals thomas floyd, david buchla 8th edition
Electic circuits fundamentals thomas floyd, david buchla 8th editionElectic circuits fundamentals thomas floyd, david buchla 8th edition
Electic circuits fundamentals thomas floyd, david buchla 8th edition
 
Forecasting model primer_v1
Forecasting model primer_v1Forecasting model primer_v1
Forecasting model primer_v1
 
Macarena count to 100
Macarena count to 100Macarena count to 100
Macarena count to 100
 
Testimonial.PDF
Testimonial.PDFTestimonial.PDF
Testimonial.PDF
 
Activity book for children 1 pdf
Activity book for children 1   pdfActivity book for children 1   pdf
Activity book for children 1 pdf
 
P3 แบบทดสอบคิดคำนวณคล่อง
P3 แบบทดสอบคิดคำนวณคล่องP3 แบบทดสอบคิดคำนวณคล่อง
P3 แบบทดสอบคิดคำนวณคล่อง
 
Lista completa de asignaciones CADIVI: 2004-2012
Lista completa de asignaciones CADIVI: 2004-2012Lista completa de asignaciones CADIVI: 2004-2012
Lista completa de asignaciones CADIVI: 2004-2012
 
Present value tables
Present value tablesPresent value tables
Present value tables
 
Math paper 1 to 4
Math paper 1 to 4Math paper 1 to 4
Math paper 1 to 4
 
F.table
F.tableF.table
F.table
 
UAP certificate
UAP certificateUAP certificate
UAP certificate
 
کتاب المساٸل۔جلد۔2۔مفتی سلمان منصورپوری.pdf
کتاب المساٸل۔جلد۔2۔مفتی سلمان منصورپوری.pdfکتاب المساٸل۔جلد۔2۔مفتی سلمان منصورپوری.pdf
کتاب المساٸل۔جلد۔2۔مفتی سلمان منصورپوری.pdf
 
Introduction to Engineering Experimentation 3rd Edition by Wheeler Solutions ...
Introduction to Engineering Experimentation 3rd Edition by Wheeler Solutions ...Introduction to Engineering Experimentation 3rd Edition by Wheeler Solutions ...
Introduction to Engineering Experimentation 3rd Edition by Wheeler Solutions ...
 
College Testimonial
College TestimonialCollege Testimonial
College Testimonial
 
Tablas esta ii c6120713
Tablas esta ii c6120713Tablas esta ii c6120713
Tablas esta ii c6120713
 
MTAP Grade 3 Session 2 new.pptx
MTAP Grade 3 Session 2 new.pptxMTAP Grade 3 Session 2 new.pptx
MTAP Grade 3 Session 2 new.pptx
 
EVS - UKG.pdf
EVS - UKG.pdfEVS - UKG.pdf
EVS - UKG.pdf
 

Viewers also liked

Veterinary Stethoscope
Veterinary StethoscopeVeterinary Stethoscope
Veterinary StethoscopeJo Essenburg
 
βιβλια σε ροδες
βιβλια σε ροδεςβιβλια σε ροδες
βιβλια σε ροδεςDimitris Gkotzos
 
Caracteristica serviciilor de acces la internet fix si mobil in RM
Caracteristica serviciilor de acces la internet fix si mobil in RMCaracteristica serviciilor de acces la internet fix si mobil in RM
Caracteristica serviciilor de acces la internet fix si mobil in RMmoldovaictsummit2016
 
Brochure TIMBRE Information System
Brochure TIMBRE Information SystemBrochure TIMBRE Information System
Brochure TIMBRE Information SystemMarco Pesce
 
AcademicRecord-4952863-08_Jan_2017
AcademicRecord-4952863-08_Jan_2017AcademicRecord-4952863-08_Jan_2017
AcademicRecord-4952863-08_Jan_2017Pam Harris
 
ICT SMEs from emerging markets as early adopters of new marketing techniques (1)
ICT SMEs from emerging markets as early adopters of new marketing techniques (1)ICT SMEs from emerging markets as early adopters of new marketing techniques (1)
ICT SMEs from emerging markets as early adopters of new marketing techniques (1)moldovaictsummit2016
 
Clearswift f5 implementation_technical guide
Clearswift f5 implementation_technical guideClearswift f5 implementation_technical guide
Clearswift f5 implementation_technical guideMarco Essomba
 
Ideo presentation ( builders)
Ideo presentation ( builders)Ideo presentation ( builders)
Ideo presentation ( builders)BCSPRODUCTIONS
 

Viewers also liked (12)

Veterinary Stethoscope
Veterinary StethoscopeVeterinary Stethoscope
Veterinary Stethoscope
 
βιβλια σε ροδες
βιβλια σε ροδεςβιβλια σε ροδες
βιβλια σε ροδες
 
Caracteristica serviciilor de acces la internet fix si mobil in RM
Caracteristica serviciilor de acces la internet fix si mobil in RMCaracteristica serviciilor de acces la internet fix si mobil in RM
Caracteristica serviciilor de acces la internet fix si mobil in RM
 
p.052-053_Bloggers
p.052-053_Bloggersp.052-053_Bloggers
p.052-053_Bloggers
 
Tarea 1
Tarea 1Tarea 1
Tarea 1
 
Brochure TIMBRE Information System
Brochure TIMBRE Information SystemBrochure TIMBRE Information System
Brochure TIMBRE Information System
 
AcademicRecord-4952863-08_Jan_2017
AcademicRecord-4952863-08_Jan_2017AcademicRecord-4952863-08_Jan_2017
AcademicRecord-4952863-08_Jan_2017
 
ICT SMEs from emerging markets as early adopters of new marketing techniques (1)
ICT SMEs from emerging markets as early adopters of new marketing techniques (1)ICT SMEs from emerging markets as early adopters of new marketing techniques (1)
ICT SMEs from emerging markets as early adopters of new marketing techniques (1)
 
New Resume
New ResumeNew Resume
New Resume
 
Clearswift f5 implementation_technical guide
Clearswift f5 implementation_technical guideClearswift f5 implementation_technical guide
Clearswift f5 implementation_technical guide
 
Ideo presentation ( builders)
Ideo presentation ( builders)Ideo presentation ( builders)
Ideo presentation ( builders)
 
Evaluation part 2
Evaluation part 2Evaluation part 2
Evaluation part 2
 

Similar to ARitificial Intelligence - Project - Data Classification

Sampling: An an often overlooked art in exploratory data analysis
Sampling: An an often overlooked art in exploratory data analysisSampling: An an often overlooked art in exploratory data analysis
Sampling: An an often overlooked art in exploratory data analysisEli Bressert
 
face recognition using Principle Componet Analysis
face recognition using Principle Componet Analysisface recognition using Principle Componet Analysis
face recognition using Principle Componet AnalysisAbhilash Kotawar
 
Appendix  A  Future value in.docx
Appendix  A  Future value in.docxAppendix  A  Future value in.docx
Appendix  A  Future value in.docxrossskuddershamus
 
Comparing Machine Learning Algorithms in Text Mining
Comparing Machine Learning Algorithms in Text MiningComparing Machine Learning Algorithms in Text Mining
Comparing Machine Learning Algorithms in Text MiningAndrea Gigli
 
Image orientation classification analysis
Image orientation classification analysisImage orientation classification analysis
Image orientation classification analysisRohit Dandona
 
Machine Learning Model for M.S admissions
Machine Learning Model for M.S admissionsMachine Learning Model for M.S admissions
Machine Learning Model for M.S admissionsOmkar Rane
 
Algebra and Trigonometry 9th Edition Larson Solutions Manual
Algebra and Trigonometry 9th Edition Larson Solutions ManualAlgebra and Trigonometry 9th Edition Larson Solutions Manual
Algebra and Trigonometry 9th Edition Larson Solutions Manualkejeqadaqo
 
Incremental statistics for partitioned tables in 11g by wwf from ebay COC
Incremental statistics for partitioned tables in 11g  by wwf from ebay COCIncremental statistics for partitioned tables in 11g  by wwf from ebay COC
Incremental statistics for partitioned tables in 11g by wwf from ebay COCLouis liu
 
01_introduction_lab.pdf
01_introduction_lab.pdf01_introduction_lab.pdf
01_introduction_lab.pdfzehiwot hone
 
Regression and Classification with R
Regression and Classification with RRegression and Classification with R
Regression and Classification with RYanchang Zhao
 
Presentation1 Associate Professor
Presentation1 Associate ProfessorPresentation1 Associate Professor
Presentation1 Associate ProfessorSanjeevKarmakar
 
Assignment help australia
Assignment help australiaAssignment help australia
Assignment help australiaMark Jack
 
Web trafic time series forecasting
Web trafic time series forecastingWeb trafic time series forecasting
Web trafic time series forecastingKorivi Sravan Kumar
 
R Activity in Biostatistics
R Activity in BiostatisticsR Activity in Biostatistics
R Activity in BiostatisticsLarry Sultiz
 
Numerical estimation 1
Numerical estimation 1Numerical estimation 1
Numerical estimation 1Isaac Ayuba M.
 
딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)
딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)
딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)Hansol Kang
 

Similar to ARitificial Intelligence - Project - Data Classification (20)

Sampling: An an often overlooked art in exploratory data analysis
Sampling: An an often overlooked art in exploratory data analysisSampling: An an often overlooked art in exploratory data analysis
Sampling: An an often overlooked art in exploratory data analysis
 
face recognition using Principle Componet Analysis
face recognition using Principle Componet Analysisface recognition using Principle Componet Analysis
face recognition using Principle Componet Analysis
 
Learning R
Learning RLearning R
Learning R
 
Appendix  A  Future value in.docx
Appendix  A  Future value in.docxAppendix  A  Future value in.docx
Appendix  A  Future value in.docx
 
Comparing Machine Learning Algorithms in Text Mining
Comparing Machine Learning Algorithms in Text MiningComparing Machine Learning Algorithms in Text Mining
Comparing Machine Learning Algorithms in Text Mining
 
Image orientation classification analysis
Image orientation classification analysisImage orientation classification analysis
Image orientation classification analysis
 
Machine Learning Model for M.S admissions
Machine Learning Model for M.S admissionsMachine Learning Model for M.S admissions
Machine Learning Model for M.S admissions
 
2. intro. to six sigma
2. intro. to six sigma2. intro. to six sigma
2. intro. to six sigma
 
Algebra and Trigonometry 9th Edition Larson Solutions Manual
Algebra and Trigonometry 9th Edition Larson Solutions ManualAlgebra and Trigonometry 9th Edition Larson Solutions Manual
Algebra and Trigonometry 9th Edition Larson Solutions Manual
 
Incremental statistics for partitioned tables in 11g by wwf from ebay COC
Incremental statistics for partitioned tables in 11g  by wwf from ebay COCIncremental statistics for partitioned tables in 11g  by wwf from ebay COC
Incremental statistics for partitioned tables in 11g by wwf from ebay COC
 
01_introduction_lab.pdf
01_introduction_lab.pdf01_introduction_lab.pdf
01_introduction_lab.pdf
 
histgram[1].ppt
histgram[1].ppthistgram[1].ppt
histgram[1].ppt
 
Regression and Classification with R
Regression and Classification with RRegression and Classification with R
Regression and Classification with R
 
Presentation1 Associate Professor
Presentation1 Associate ProfessorPresentation1 Associate Professor
Presentation1 Associate Professor
 
Assignment help australia
Assignment help australiaAssignment help australia
Assignment help australia
 
Web trafic time series forecasting
Web trafic time series forecastingWeb trafic time series forecasting
Web trafic time series forecasting
 
R Activity in Biostatistics
R Activity in BiostatisticsR Activity in Biostatistics
R Activity in Biostatistics
 
Numerical estimation 1
Numerical estimation 1Numerical estimation 1
Numerical estimation 1
 
딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)
딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)
딥러닝 중급 - AlexNet과 VggNet (Basic of DCNN : AlexNet and VggNet)
 
REvit training
REvit trainingREvit training
REvit training
 

Recently uploaded

Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...only4webmaster01
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Pooja Nehwal
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsJoseMangaJr1
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...amitlee9823
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...amitlee9823
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...amitlee9823
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...amitlee9823
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 

Recently uploaded (20)

Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
Thane Call Girls 7091864438 Call Girls in Thane Escort service book now -
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Saket (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore...
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night StandCall Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Hsr Layout ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 

ARitificial Intelligence - Project - Data Classification

  • 1. PROJECT : IMAGE CLASSIFICATION December 20, 2015 Name: Mayank RUID: 165004149 Name: Ramakanth Vemula RUID: 167004695 Name: Sanjivi Muttena RUID: 164005979
  • 2. 1 PROBLEM 1 - RESULTS OF EXECUTION OF ALGORITHMS 1) Algorithm 1 : NaiveBayes All the results are obtained by testing over 1000 testing data points for digits Table 1: Naive Bayes - Digit - Time Time Taken For Digit Data(in seconds) Training Data Time 1 Time 2 Time 3 Time 4 Time 5 Time 6 Time 7 Mean SD 500 24.82 24.634 24.629 25.135 24.386 24.937 25.018 24.794 0.241 1000 25.822 25.651 25.739 26.052 25.73 25.882 26.273 25.878 0.200 1500 26.828 26.472 26.671 26.936 27.284 27.168 26.917 26.896 0.256 2000 28.026 27.417 27.821 27.696 27.882 27.948 27.461 27.750 0.218 2500 28.353 27.972 28.113 28.271 27.835 27.589 28.032 28.023 0.240 3000 29.161 28.528 29.037 28.624 28.883 28.698 29.035 28.852 0.222 3500 31.204 30.978 30.472 30.598 31.102 31.092 30.893 30.905 0.253 4000 31.864 31.792 31.316 31.927 30.998 31.734 30.727 31.479 0.436 4500 32.013 32.232 32.516 31.958 32.163 32.427 32.143 32.207 0.189 5000 32.812 32.741 32.583 32.831 32.482 32.912 32.812 32.739 0.141 Figure 1: Naive Bayes Digit Time
  • 3. 2 Table 2: Naive Bayes - Digit - Accuracy Accuracy For Digit Data(%) Data Point Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Mean SD 500 71.7 71.3 72.1 70.8 71.2 70.9 72.1 71.442 0.495 1000 73.5 73.9 74.1 73.1 74.2 72.4 72.7 73.41 0.65 1500 77.7 75.6 76.4 77.3 75.9 76.2 77.2 76.61 0.73 2000 75.6 77.3 76.8 77.1 75.9 76.3 76.5 76.5 0.573 2500 76.3 77.6 78.2 76.1 76.4 77.7 77.3 77.08 0.75 3000 77.4 78.4 76.9 76.5 77.2 78.1 76.7 77.31 0.65 3500 75.9 77.1 76.3 76.8 75.8 77.2 76.2 76.47 0.52 4000 77.6 77.9 77.4 76 76.9 77.6 77.1 77.21 0.58 4500 77.1 78.3 77.6 78.4 77.3 78.6 76.9 77.74 0.63 5000 77 77 77 77 77 77 77 77 0 Figure 2: Naive Bayes Digit Accuracy
  • 4. 3 Table 3: Naive Bayes - Face - Time Time Taken For Face Data(in seconds) Data Point Time 1 Time 2 Time 3 Time 4 Time 5 Time 6 Time 7 Mean SD 500 5.184 5.211 5.176 5.125 5.094 5.182 5.146 5.159 0.0372 1000 5.734 5.761 5.627 5.695 5.814 5.703 5.83 5.737 0.065 1500 5.974 6.041 6.264 6.103 6.029 6.146 6.042 6.085 0.088 2000 6.292 6.462 6.381 6.315 6.402 6.311 6.298 6.351 0.059 2500 6.831 6.743 6.796 6.753 6.693 6.731 6.801 6.764 0.044 3000 7.104 7.098 7.147 7.183 7.112 7.065 7.15 7.12 0.036 3500 7.351 7.401 7.372 7.301 7.391 7.41 7.396 7.374 0.035 4000 7.741 7.769 7.809 7.791 7.81 7.739 7.796 7.77 0.027 4500 8.09 8.171 8.014 7.995 8.151 8.073 8.102 8.085 0.0601 5000 8.351 8.406 8.397 8.358 8.382 8.413 8.319 8.375 0.0314 Figure 3: Naive Bayes Face Time
  • 5. 4 Table 4: Naive Bayes- Face- Accuracy Accuracy For Face Data(%) Data Point Run 1 Run 2 Run 3 Run 4 Run 5 Run 6 Run 7 Mean SD 500 59.9 64.8 58.4 59.3 70.1 65.6 61.6 62.81 3.90 1000 79.5 78.4 82.1 76.5 80.8 73.4 81.1 78.82 2.810 1500 84.6 88.1 80.4 82.9 84.1 83 80.9 83.42 2.38 2000 86.5 88.9 87.7 87.2 89.7 88.1 87.5 87.94 0.99 2500 88.6 87.9 90.1 88.9 87.4 89.3 89.8 88.85 0.90 3000 88.4 88.2 89.8 90.3 87.9 88.4 89.5 88.92 0.85 3500 90.1 89.7 89.3 90.3 89.8 90.2 88.9 89.75 0.47 4000 89.4 88.9 89.9 90.7 89.6 90.5 89.6 89.8 0.58 4500 90.2 90.8 89.7 91.1 90.9 89.8 90.3 90.4 0.50 5000 91.8 91.8 91.8 91.8 91.8 91.8 91.8 91.8 0 Figure 4: Naive Bayes Face Accuracy
  • 6. 5 Table 5: Perceptron - Digit - Time Time Taken For Digit Data(in seconds) Digits training Data Set Size Time 1 Time 2 Time 3 Mean SD 500 27.3 21.9 26.4 25.2 2.36 1000 47.3 46.8 46.34 46.81 0.392 1500 70.36 68.2 67.6 68.72 1.18 2000 97.8 102.7 105.4 101.966 3.145720197 2500 130.1 124.1 125.3 126.5 2.592296279 3000 146.2 135.5 141.1 140.93 4.369 3500 158.7 161.4 166.5 162.2 3.234 4000 170.8 183.8 189.15 181.25 7.705 4500 184.8 199.5 220.3 201.53 14.563 5000 201.4 221.1 215.9 212.8 8.33 Figure 5: Perceptron Digit Time
  • 7. 6 Table 6: Perceptron - Digit - Accuracy Accuracy For Digit Data(in seconds) Digits training Data Set Size Run 1 Run 2 Run 3 Mean SD 500 67.1 63.1 71.2 67.133 2.863 1000 74.5 78.3 72.3 75.033 2.478 1500 77.2 79.6 73.8 76.866 2.379 2000 80.1 78.1 76.4 78.2 1.512 2500 77.3 81.5 79.3 79.36 1.715 3000 79.1 76.2 80.4 78.566 1.755 3500 79.4 82.1 77.1 79.533 1.35 4000 81.1 77.2 80.7 79.66 1.95 4500 83.2 80.2 79.1 80.833 1.732 5000 80.1 78.2 81.1 79.8 1.202 Figure 6: Perceptron Digit Accuracy
  • 8. 7 Table 7: Perceptron - Face - Time Time Taken For Face Data(in seconds) Digits training Data Set Size Time 1 Time 2 Time 3 Mean SD 500 2.4 2.1 2.3 2.266 0.124 1000 4.5 4.9 4.8 4.733 0.169 1500 7.7 7.6 7.4 7.56 0.124 2000 9.7 9.8 10.1 9.866 0.169 2500 12.4 12.2 12.7 12.433 0.205 3000 15.1 14.9 15.2 15.066 0.124 3500 17.3 17.6 17.5 17.466 0.124 4000 20.1 20.2 19.2 19.83 0.449 4500 22.7 22.6 22.4 22.566 0.125 5000 25.6 25.4 25.8 25.6 0.163 Figure 7: Perceptron Face Time
  • 9. 8 Table 8: Perceptron - Face - Accuracy Accuracy For Face Data(in seconds) Digits training Data Set Size Run 1 Run 2 Run 3 Mean SD 500 67.1 70.4 73 70.16 2.091 1000 74.4 75.1 72.1 73.866 1.281 1500 77.4 79.6 76 77.666 1.48 2000 80.1 78.4 81.2 79.9 1.152 2500 82.3 81.2 80 81.16 0.939 3000 84.2 81.9 84.3 83.1 1.108 3500 84.4 85.7 83.5 84.6 0.90 4000 86.4 89.1 87.4 88.25 1.35 4500 87.2 88.1 88.5 87.93 0.54 5000 90.1 89.1 88.9 89.366 0.525 Figure 8: Perceptron Face Accuracy
  • 10. 9 Table 9: Mira - Digit - Time Time Taken For Digit Data(in seconds) Digits training Data Set Size Time 1 Time 2 Time 3 Mean SD 500 24.3 23.6 24.9 24.26 0.531 1000 46.1 47.4 48.2 47.233 0.865 1500 67.2 69.8 68.9 68.633 1.078 2000 91.7 93.2 94.3 93.06 1.065 2500 109.4 114.8 115.8 113.33 2.81 3000 138.5 144.7 141.9 141.7 2.53 3500 152.6 158.1 162.2 157.63 3.93 4000 183.2 186.3 178.1 182.5333333 3.380663972 4500 203.8 193.5 199 198.7666667 4.208193067 5000 227.3 220.5 233.1 226.966 5.149 Figure 9: Mira Digit Time
  • 11. 10 Table 10: Mira - Digit - Accuracy Accuracy For Digit Data(in seconds) Digits training Data Set Size Run 1 Run 2 Run 3 Mean SD 500 72.3 76.4 73.1 73.93 1.536 1000 74.1 76.2 77.5 75.93 1.40 1500 75.1 76.4 73.8 75.1 1.061 2000 82.3 80 78.1 80.13 1.717 2500 79.4 80.3 81.2 80.3 0.734 3000 80.1 78.9 79.9 79.633 0.524 3500 82 80.4 81.3 81.23 0.8 4000 80.1 79.5 78.8 79.46 0.3 4500 81 80.2 82.1 81.1 0.778 5000 79.2 80.4 80.8 80.13 0.679 Figure 10: Mira Face Accuracy
  • 12. 11 Table 11: Mira - Face - Time Time Taken For Face Data(in seconds) Digits training Data Set Size Time 1 Time 2 Time 3 Mean SD 500 5.732 5.845 5.912 5.829 0.07428025 1000 8.325 8.826 8.967 8.706 0.275 1500 11.458 12.241 11.251 11.65 0.426 2000 13.696 13.176 14.218 13.696 0.425 2500 16.885 16.903 17.48 17.089 0.276 3000 18.74 18.539 18.436 18.571 0.126 3500 20.561 20.704 20.019 20.428 0.295 4000 22.498 22.014 22.328 22.28 0.200 4500 24.893 24.481 23.947 24.440 0.387 5000 28.115 28.947 28.571 28.544 0.340 Figure 11: Mira Face Time
  • 13. 12 Table 12: Mira - Face -Accuracy Accuracy For Face Data(in seconds) Digits training Data Set Size Run 1 Run 2 Run 3 Mean SD 500 67.8 66.2 69.3 67.766 1.26 1000 72.5 74.8 75.9 74.4 1.416 1500 76.1 77.8 77.1 77 0.697 2000 79.1 78.3 77.9 78.433 0.498 2500 81.5 81.8 82.6 81.96 0.464 3000 84.1 83.7 83.9 83.9 0.163 3500 86.4 86.5 85.9 86.266 0.262 4000 86.9 87.1 87 87 0.0816 4500 89.1 88.7 90.1 89.3 0.588 5000 88.9 91.6 88.5 89.66 1.376 Figure 12: Mira Face Accuracy
  • 14. 13 PROBLEM 3 - DISCUSSION OF ALGORITHMS AND RESULTS (A) Naive Bayes: A naive Bayes classifier models a joint distribution over a label Y and a set of observed random variables, or features, {F1,F2,...Fn}, using the assumption that the full joint distribution can be factored as follows (features are conditionally independent given the label): P(F1 ...Fn,Y ) = P(Y ) i P(Fi |Y ) To classify a datum, we can find the most probable label given the feature values for each pixel, using Bayes theorem: P(y|f1 ... fn) = P(f1 ... fn|y)P(y) P(f1 ... fn) = P(y) i = 1m P(fi |y) P(f1 ... fn) ar gmaxy P(y|f1 ... fn) = ar gmaxy P(y) m i=1 P(fi |y) P(f1 ... fn) ar gmaxy P(y|f1 ... fn) = ar gmaxy P(y) m i=1 P(fi |y) Because multiplying many probabilities together often results in underflow, we will instead compute log probabilities which have the same argmax. ar gmaxy logP(y) m i=1 P(fi |y) = ar gmaxy log(P(y, f1 ... fn)) = ar gmaxy log(P(y)+ m i=1 logP(fi |y)) Use math.log(), a built-in Python function to compute logarithms. Parameter Estimation: Our naive Bayes model has several parameters to estimate. One parameter is the prior distribution over labels (digits, or face/not-face), P(Y ). We can estimate P(Y ) directly
  • 15. 14 from the training data: ˆP(y) = c(y) n where c(y) is the number of training instances with label y and n is the total number of training instances. The other parameters to estimate are the conditional probabilities of our features given each label y: P(Fi |Y = y). We do this for each possible feature value (fi ∈ 0,1). ˆP(Fi = fi |Y = y) = c(fi , y) fi c(fi , y) where c(fi , y) is the number of times pixel Fi took value fi in the training examples of label y. Smoothing: Your current parameter estimates are unsmoothed, that is, you are using the empirical estimates for the parameters P(fi |y). These estimates are rarely adequate in real systems. Minimally, we need to make sure that no parameter ever receives an estimate of zero, but good smoothing can boost accuracy quite a bit by reducing overfitting. In this project, we use Laplace smoothing, which adds k counts to every possible observation value: P(Fi = fi |Y = y) = c(Fi = fi ,Y = y)+k fi (c(Fi = fi ,Y = y)+k) If k=0, the probabilities are unsmoothed. As k grows larger, the probabilities are smoothed more and more. You can use your validation set to determine a good value for k. Conclusion: As we increase the training data from 10% to 100% we see that the accu- racy jumps from a mere 62.8% to a reasonable 91% in face recognition and 71% to 77% in digit recognition.The trade off is time, more the data - more processing, as from the data collected we see the difference of 3 seconds in training face data and 8 seconds for training digit data. Also we see the increase in accuracy to minimize after some point. In our project data we see that there is not much difference in accuracy between 50% training data and 100% training data. So we could improve training time on this by using 50% data
  • 16. 15 instead of training on 100%. (B) Perceptron: The perceptron algorithm uses a weight vector to make decisions unlike Native Bayes which uses probability. The weight vector here is represented by ωy for each class y. For a given feature list f , the perceptron algorithm computes the class y whose weight vector is most similar to the input vector f . The feature vector defined in our code is a map from pixel locations to indicators of whether they are on or not. Formally, given a feature vector f (in our case, a map from pixel locations to indicators of whether they are on), we score each class with: score(f , y) = i fi w y i The class with highest score is chosen as the predicted label for that data instance Before classifying the training set, the weights have to be learnt by the algorithm. For this, the training set is scanned one instance at a time. When we come to an instance (f , y), we find the label with highest score: y = arg maxy score(f , y ) We compare y i.e the result obtained by previous equations, to the true label y. If the two labels are equal (y = y) then we’ve gotten the instance correct and can proceed with the other items in the training set. Otherwise, we guessed a false positive y in place of y. That means that wy should have scored f higher, and wy should have scored f lower. In order to prevent this error in the future we update these two weight vectors accordingly. wy + = f wy − = f Conclusion: We ran our algorithm for 3 iterations. We see a huge difference in accuracy, from 67% to 90%(digits) and 70% to 90%(faces). This happens becuase perceptron iterates repeatedly through the training data. This leads us to higher training time.
  • 17. 16 Perceptron makes weight corrections and starts to improve after each iteration, this is also the reason behind the significant accuracy rate. (C) Mira: Similar to a perceptron classifier, the MIRA classifier also keeps a weight vector wy of each label y. Here too we scan over the data, one instance at a time. When we come to an instance (f , y), we find the label with highest score: y = arg maxy score(f , y ) We compare y to the true label y. If the labels are equal( y = y) , we’ve gotten the instance correct, and we do nothing. Otherwise, we guessed y but we should have guessed y. The difference between mira and perceptron is that in Mira we update the weight vectors of these labels with variable step size: ωy = ωy +τf ωy = ωy −τf Here τ > 0 and is chosen such that it minimizes : minu (1/2) c||(ω )c −ωc ||2 2 subject to the condition that (ω )y f ≥ (ω )y f +1 This is equivalent to minτ||τf ||2 2 subject to τ ≥ (ωy −ωy )f +1 2||f ||2 2 and τ ≥ 0 We can notice that, ωy f ≥ ωy f , so the condition τ ≥ 0 is always true given τ ≥ (ωy −ωy )f +1 2||f ||2 2 Solving the problem we get,
  • 18. 17 τ = (ωy −ωy )f +1 2||f ||2 2 We cap the maximum possible value of τ by a positive constant C, τ = min C,τ = (ωy −ωy )f +1 2||f ||2 2 Conclusion:The accuracy increased from 67% to approx 90% as we increased the training data. Similar to perceptron the accuracy improves as we the iterations, this happens because the weights gets updated.Training time is high when compared to Naive Bayes but is similar to perceptron.