SlideShare une entreprise Scribd logo
1  sur  3
Télécharger pour lire hors ligne
International Journal of Research in Advent Technology, Vol.9, No.1, January 2021
E-ISSN: 2321-9637
Available online at www.ijrat.org
1
doi: 10.32622/ijrat.91202104
Tests for a Single Upper Outlier in a Johnson SB Sample with
Unknown Parameters
Tanuja Sriwastava, Mukti Kant Sukla
Abstract: Outliers are unexpected observations, which
deviate from the majority of observations. Outlier
detection and prediction are challenging tasks, because
outliers are rare by definition. A test statistic for single
upper outlier is proposed and applied to Johnson SB
sample with unknown parameters. The Johnson SB
distribution have four parameters and is extremely
flexible, which means that it can fit a wide range of
distribution shapes. Because of its distributional shapes it
has a variety of applications in many fields. The test
statistic proposed for the case when parameters are
known (Sriwastava, T., 2018) used here for developing
the test statistic when parameters are unknown. Critical
points were calculated for different sample sizes and for
different level of significance. The performance of the
test in the presence of a single upper outlier is
investigated. One numerical example was given for
highlighting the result.
Keywords: Outlier, Johnson SB Distribution, Trimmed
Sample, Critical Points, Simulation.
I. INTRODUCTION
The p.d.f of Johnson SB distribution with location
parameter 𝜉, scale parameter 𝜆 and two shape parameters
𝛾 𝑎𝑛𝑑 𝛿 is given by
𝑓(𝑥; 𝜉, 𝜆, 𝛾 , 𝛿) =
𝛿
√2𝜋
𝜆
(𝜆−(𝑥−𝜉))(𝑥−𝜉)
𝑒𝑥𝑝 [−
1
2
{𝛾 +
𝛿 ln (
𝑥−𝜉
𝜆−(𝑥−𝜉)
)}
2
], (1)
𝜉 ≤ 𝑥 ≤ 𝜉 + 𝜆, 𝛿 > 0, −∞ < 𝛾 < ∞, 𝜆 > 0, − ∞ < 𝜉 <
∞.
This distribution is extremely flexible, which means that
it can fit a wide range of distribution shapes. Because of
its distributional shapes it has a variety of application in
many fields like human exposure (Flynn, 2004), forestry
data (Zang et. al, 2003), rainfall data (Kotteguda, 1987)
and manay more because of its flexible nature. One of the
most important application of this distribution is in the
complex data set like microarray data analysis (Florence
George, 2007) etc.
The detection of outliers in such situations are
considerably important. A number of test statistic have
been proposed for several distributions when their
parameters are known (Barnett & Lewis, 1994).
A test statistic has been proposed for the Johnson SB
distribution when parameters are known by (Sriwastava,
T., 2018) in one of her research paper. But in practice, no
parameter would be known. In such situations, estimates
of all the parameters are used in the construction of a test
statistic for detection of outliers in a sample from a
Johnson SB distribution. Then its critical values and the
corresponding performance study were done by
simulation technique. Since the study is about outlying
observation, the entire sample should not be considered
for the estimation of all the parameters. Hence, the
estimates given by George and Ramachandran (2011) are
used as estimates of all the parameters, using a trimmed
sample obtained after removing the suspected outlying
observation(s).
The estimates of the parameters of this
distribution as given by George & Ramachandran (2011)
using maximum likelihood least square method was
considered are as follows.
𝛾
̂ = −
𝛿 ∑ 𝑔(
𝑥𝑖−𝜉
𝜆
)
𝑛−1
𝑖=1
𝑛−1
= −𝛿𝑔̅ . (2)
𝛿
̂2
=
𝑛−1
∑ [𝑔(
𝑥𝑖−𝜉
𝜆
)]
2
𝑛−1
𝑖=1 −
1
𝑛−1
[∑ 𝑔(
𝑥𝑖−𝜉
𝜆
)
𝑛−1
𝑖=1 ]
2 =
1
𝑣𝑎𝑟(𝑔)
.
(3)
where, 𝑔 (
𝑥−𝜉
𝜆
) = log (
𝑥−𝜉
𝜆−(𝑥−𝜉)
), 𝑔̅ is the mean and
𝑣𝑎𝑟(𝑔) is the variance of the values of g defined here.
The estimates of 𝜆 and 𝜉 were as follows;
𝜆
̂ =
(𝑛−1) ∑ 𝑥𝑖
𝑛−1
𝑖=1 𝑓−1(
𝑧𝑖−𝛾
𝛿
)−∑ 𝑓−1(
𝑧𝑖−𝛾
𝛿
)
𝑛−1
𝑖=1 ∑ 𝑥𝑖
𝑛−1
𝑖=1
(𝑛−1) ∑ [𝑓−1(
𝑧𝑖−𝛾
𝛿
)]
2
𝑛−1
𝑖=1 −[∑ 𝑓−1(
𝑧𝑖−𝛾
𝛿
)
𝑛−1
𝑖=1 ]
2 .(4)
𝜉
̂ = 𝑥̅ − 𝜆∗
𝑚𝑒𝑎𝑛 [𝑓−1
(
𝑧−𝛾
𝛿
)] , (5)
where z=
𝑥−𝜉
𝜆
is a standard normal variate. Hence, the
quantiles of x and the corresponding quantiles of z can be
considered as paired observations. When there were 100
or more x values, the percentiles 1 through 99 were
considered, while for k number of data points of x, where
k is less than 100, 𝑘 − 1 quantiles of x were considered.
Manuscript revised on January 29, 2021 and published on February
10, 2021
Dr. Tanuja Sriwastava, Assistant Professor, Department of Statistics, Sri
Venkateswara College, University of Delhi. Email ID:
tanujastat24@gmail.com,
Dr. Mukti Kanta Sukla, Associate Professor, Department of Statistics,
Sri Venkateswara College, University of Delhi. Email ID:
suklamk@gmail.com
International Journal of Research in Advent Technology, Vol.9, No.1, January 2021
E-ISSN: 2321-9637
Available online at www.ijrat.org
2
doi: 10.32622/ijrat.91202104
These 𝑘 − 1 quantiles of x and the corresponding 𝑘 − 1
quantiles of z were considered as paired observations.
II.PROPOSED OUTLIER DETECTION TEST
STATISTICS
Let 𝑋1, 𝑋2, ⋯ , 𝑋𝑛 be a random sample from a Johnson SB
distribution with 𝛾 and 𝛿 as shape parameters, 𝜆 a scale
parameter and 𝜉 a location parameter and
𝑋(1), 𝑋(2), ⋯ , 𝑋(𝑛) be the corresponding order statistics of
the n observations.
In the paper of (Sriwastava, T. 2018), a test statistic was
proposed for detection of an upper outlier for a sample
from Johnson SB distribution, when all parameters were
assumed to be known. But in all these, if all the
parameters of Johnson SB distribution were assumed to be
unknown, then we propose to use the MLE-least square
estimators of George and Ramachandran (2011) in place
of all the parameters in the above said test statistics.
The test statistic for the case of an upper outlier
(unknown parameters) obtained is given as
𝑊′
={(
𝑋(𝑛)−𝜉
̂
𝑋(𝑛−1)−𝜉
̂) (
𝜆
̂−(𝑋(𝑛−1)−𝜉
̂)
𝜆
̂−(𝑋(𝑛)−𝜉
̂)
)}
𝛿
̂
, (6)
where 𝑋(𝑘) is the kth
order statistic, 𝑘 = 1,2, ⋯ , 𝑛.
2.1 Critical values of the test statistic
To detect an upper outlying observation in a sample from
a Johnson SB distribution, the test statistic 𝑊′
was used.
This test statistic should reject the null hypothesis for
larger value of 𝑊′
. Critical values of the test statistic 𝑊′
were obtained using simulation technique with 10,000
replications for different sample sizes. For this, a random
sample was generated from a Johnson SB distribution and
the estimates of all the parameters were calculated using
equation 2, 3, 4, 5 with this sample. Then the value of the
test statistic was calculated and the whole process was
replicated 10,000 times and the percentile value at 90, 95
and 99 percent was calculated, which is the critical values
at 10, 5 and 1 percent levels of significance respectively.
The critical value table for sample sizes n=3,
10(10)40(20)100, 200, 500, 1000 and at 10, 5 and 1
percent levels of significance are shown in table 1.
Table 1. Critical values 𝑊
𝛼 of the statistic 𝑊′
n
100 α% Level
10% 5% 1%
3 3.90214 4.45028 13.7898
10 3.22234 4.30638 7.82518
20 2.79391 3.66518 6.0721
30 2.54645 3.2718 5.2587
40 2.41443 2.99493 4.93746
60 2.35604 2.90261 4.75126
80 2.30658 2.85917 4.48976
100 2.26234 2.82311 4.30515
200 2.12077 2.59739 3.83966
500 1.96466 2.37174 3.52657
1000 1.91691 2.27917 3.39466
It can be seen from the table that the values are decreases
with increase in sample size and on comparison of these
critical values with that of the one with known
parameters, it can be seen that the values are very close to
each other for sample sizes 10 onwards.
2.2 Numerical Example
For highlighting the utility of the statistic, the following
20 observations were taken from the Census data of India
taken in the year 2011, (in, 000).
1028610, 1045547, 1062388, 1095722, 1112186,
1128521, 1160813, 1176742, 1192506, 1223581,
1238887, 1254019, 1283600, 1298041, 1312240,
1339741, 1352695, 1365302, 1388994, 1399838.
Using the above data for the case when the outlying
observation is from another sample with a shift in all the
parameters. The critical value at 5% level of significance
for considered below is 3.5966. The parameters were
estimated using a sample obtained by leaving out the
largest observation (as that is the suspected outlying
observation). Then the value of test statistic 𝑊′
was
calculated with the estimated values of the parameters
and was found to be 5.99957. On comparing this with the
critical value at 5% level of significance for sample size
20, the null hypothesis gets rejected i.e. the largest
observation of the sample is confirmed as outlying.
III. PERFORMANCE STUDY
The performance study was done using simulation
technique for detection of an upper outlier. A random
sample of size n was generated using R software from a
Johnson SB distribution with location parameter 𝜉(=10),
scale parameter 𝜆(=30), with two shape parameters 𝛾(=1)
and 𝛿(=2) (known). Then a contaminant observation was
introduced in the sample.
For introducing a contaminant observation, another
sample of Johnson SB distribution with a shift (𝑎𝜆),
where 0 < a < 1 in the location parameter was generated.
The largest observation of the original sample was
replaced with the largest observation of the second
sample. Since the estimates of the parameters depend
upon only n-1 largest observations, the largest
observation of the sample need not be removed for
estimation purpose. Thus the values of the estimates of all
the parameters were calculated using MLE least square
International Journal of Research in Advent Technology, Vol.9, No.1, January 2021
E-ISSN: 2321-9637
Available online at www.ijrat.org
3
doi: 10.32622/ijrat.91202104
method. Using these estimated values of the parameters,
the value of the test statistic 𝑊𝑈
′
was calculated at 5%
level of significance. Here, the process was simulated
10,000 times and the number of times the test statistic
falling in the critical region was noted. Simulation study
was carried out for different sample sizes. The
probabilities of rejection of null hypothesis for sample
sizes n = 10(10)30, 60, 100, 200, 500, 1000 and for
different values of ‘a’ are shown in the table 2.
Table 2: Probabilities of identification of the upper
contaminant observation.
a
n
0.033 0.067 0.1 0.133
10 0.4056 0.5249 0.6475 0.7569
20 0.3058 0.4362 0.5843 0.7315
30 0.3242 0.4845 0.6557 0.8001
60 0.2379 0.4095 0.6038 0.7856
100 0.2055 0.3971 0.6323 0.8253
200 0.2899 0.4253 0.7987 0.8541
500 0.3643 0.6603 0.8878 0.9821
1000 0.2401 0.5449 0.8283 0.963
a
n
0.2 0.27 0.3
10 0.9056 0.9753 0.9898
20 0.9166 0.9847 0.9942
30 0.9583 0.9948 0.9985
60 0.9669 0.9964 0.9987
100 0.9781 0.9994 0.9995
200 0.9887 1 1
500 0.9868 1 1
1000 0.9994 1 1
It can be observed from this table that, the test statistic
performs well for higher values of the shift i.e. for a>0.1.
As the values of shift increases, performance of the test
statistic also increases. On comparison with that of the
known parameter case, it can be seen that in this case,
performance is better for higher values of the shift,
whereas in the earlier case, performance was better for
lower values of the shift.
IV. CONCLUSION
It can be concluded from above study that the suggested
test statistic for an upper outlier is performing very well
for higher values of the shift. For the lower shift of values
performance of the test statistic was not satisfactory. The
higher values of shift and large sample sizes the test
statistic 𝑊′
is best for one contaminant observation. The
proposed work for future research work and
implementation includes:
 The generalization of proposed work to multiple
outlier case.
 Apply the proposed outlier detection technique
to variety of application.
 The method used here is may useful for
developing outlier detection test statistic for
complicated distribution like; Johnson SB.
REFERENCES
[1] Barnett V, Lewis T. Outliers in Statistical Data. John Wiley, 1994.
[2] Flynn MR. The 4 parameter lognormal (SB) model of human
exposure. Ann Occup Hyg. 2004; 48:617-22.
[3] George F. Johnson’s system of distribution and Microarray data
analysis. Graduate Theses and Dissertations, University of South
Florida, 2007.
[4] George, F. and Ramachandran, K.M. (2011). Estimation of
parameters of Johnson’s system of distributions. Journal of Modern
Applied Statistical Methods, 10, no. 2.
[5] Johnson NL. Systems of frequency curves generated by methods of
translation. Biometrica. 1949; 58:547-558.
[6] Kottegoda NT. Fitting Johnson SB curve by method of maximum
likelihood to annual maximum daily rainfalls. Water Resour Res.
1987; 23:728-732.
[7] Sriwastava T. An upper outlier detection procedure in a sample
from a Johnson SB distribution with known parameters.
International Journal of Applied Statistics and Mathematics. 2018;
3(2), 194-198.
[8] Zhang L, Packard PC, Liu C. A comparison of estimation methods
for fitting Weibull and Johnson’s SB distributions to mixed spruce-
fir stands in northeastern North America. Can J Forest Res. 2003;
33:1340-1347.
AUTHORS PROFILE
Dr. Tanuja Sriwastava She has completed her M.Sc.
(Statistics) from Banaras Hindu University, Varanasi and
D.Phil in Statistics from University of Allahabad,
Prayagraj. She has Qualified UGC-NET(JRF) and has
published 5 international and national publications. Her
area of interest in research is mainly focused on
Distribution theory and Statistical Inference.
Dr. Mukti Kanta Sukla A committed senior
Associate Professor in the Department of Statistics
with over 25 years of experience in one of the leading
colleges of Delhi University, Sri Venkateswara
College. Focused on research and has 10 prior
international and national publications. Did his Ph.D
from Utkal University, the research focusing on
Stochastic Modelling based on the modelling of the rainfall data of
Mahanadi Delta Region. Has vast teaching experience on courses like
Linear Models, Econometrics, Algebra, Sample Survey Methods etc.
Possessing excellent administrative, verbal communication and
leadership skills along with constructive and effective methods that
promote a stimulating learning environment.

Contenu connexe

Tendances

Advanced Methods of Statistical Analysis used in Animal Breeding.
Advanced Methods of Statistical Analysis used in Animal Breeding.Advanced Methods of Statistical Analysis used in Animal Breeding.
Advanced Methods of Statistical Analysis used in Animal Breeding.DrBarada Mohanty
 
2 way ANOVA(Analysis Of VAriance
2 way ANOVA(Analysis Of VAriance2 way ANOVA(Analysis Of VAriance
2 way ANOVA(Analysis Of VAriancemusadoto
 
Analysis of Variance
Analysis of Variance Analysis of Variance
Analysis of Variance jyothimonc
 
Statr session 19 and 20
Statr session 19 and 20Statr session 19 and 20
Statr session 19 and 20Ruru Chowdhury
 
Analysis of variance ppt @ bec doms
Analysis of variance ppt @ bec domsAnalysis of variance ppt @ bec doms
Analysis of variance ppt @ bec domsBabasab Patil
 
Matt brayley
Matt brayleyMatt brayley
Matt brayleyJJkedst
 
Assessing normality lab 6
Assessing normality lab 6Assessing normality lab 6
Assessing normality lab 6Laura Sandoval
 
Development of a Spatial Path-Analysis Method for Spatial Data Analysis
Development of a Spatial Path-Analysis Method for Spatial Data AnalysisDevelopment of a Spatial Path-Analysis Method for Spatial Data Analysis
Development of a Spatial Path-Analysis Method for Spatial Data AnalysisIJECEIAES
 
Analysis of Variance-ANOVA
Analysis of Variance-ANOVAAnalysis of Variance-ANOVA
Analysis of Variance-ANOVARabin BK
 
Chap15 analysis of variance
Chap15 analysis of varianceChap15 analysis of variance
Chap15 analysis of varianceJudianto Nugroho
 
Anova by Hazilah Mohd Amin
Anova by Hazilah Mohd AminAnova by Hazilah Mohd Amin
Anova by Hazilah Mohd AminHazilahMohd
 
ANOVA TEST by shafeek
ANOVA TEST by shafeekANOVA TEST by shafeek
ANOVA TEST by shafeekShafeek S
 
JSM2013,Proceedings,paper307699_79238,DSweitzer
JSM2013,Proceedings,paper307699_79238,DSweitzerJSM2013,Proceedings,paper307699_79238,DSweitzer
JSM2013,Proceedings,paper307699_79238,DSweitzerDennis Sweitzer
 

Tendances (18)

Advanced Methods of Statistical Analysis used in Animal Breeding.
Advanced Methods of Statistical Analysis used in Animal Breeding.Advanced Methods of Statistical Analysis used in Animal Breeding.
Advanced Methods of Statistical Analysis used in Animal Breeding.
 
2 way ANOVA(Analysis Of VAriance
2 way ANOVA(Analysis Of VAriance2 way ANOVA(Analysis Of VAriance
2 way ANOVA(Analysis Of VAriance
 
Analysis of Variance
Analysis of Variance Analysis of Variance
Analysis of Variance
 
Analysis of variance anova
Analysis of variance anovaAnalysis of variance anova
Analysis of variance anova
 
Statr session 19 and 20
Statr session 19 and 20Statr session 19 and 20
Statr session 19 and 20
 
Analysis of variance ppt @ bec doms
Analysis of variance ppt @ bec domsAnalysis of variance ppt @ bec doms
Analysis of variance ppt @ bec doms
 
Matt brayley
Matt brayleyMatt brayley
Matt brayley
 
Two Way ANOVA
Two Way ANOVATwo Way ANOVA
Two Way ANOVA
 
Assessing normality lab 6
Assessing normality lab 6Assessing normality lab 6
Assessing normality lab 6
 
Chi square test
Chi square testChi square test
Chi square test
 
Student’s t test
Student’s  t testStudent’s  t test
Student’s t test
 
Development of a Spatial Path-Analysis Method for Spatial Data Analysis
Development of a Spatial Path-Analysis Method for Spatial Data AnalysisDevelopment of a Spatial Path-Analysis Method for Spatial Data Analysis
Development of a Spatial Path-Analysis Method for Spatial Data Analysis
 
Analysis of Variance-ANOVA
Analysis of Variance-ANOVAAnalysis of Variance-ANOVA
Analysis of Variance-ANOVA
 
Chap15 analysis of variance
Chap15 analysis of varianceChap15 analysis of variance
Chap15 analysis of variance
 
Factorial Experiments
Factorial ExperimentsFactorial Experiments
Factorial Experiments
 
Anova by Hazilah Mohd Amin
Anova by Hazilah Mohd AminAnova by Hazilah Mohd Amin
Anova by Hazilah Mohd Amin
 
ANOVA TEST by shafeek
ANOVA TEST by shafeekANOVA TEST by shafeek
ANOVA TEST by shafeek
 
JSM2013,Proceedings,paper307699_79238,DSweitzer
JSM2013,Proceedings,paper307699_79238,DSweitzerJSM2013,Proceedings,paper307699_79238,DSweitzer
JSM2013,Proceedings,paper307699_79238,DSweitzer
 

Similaire à 91202104

Efficiency of ratio and regression estimators using double sampling
Efficiency of ratio and regression estimators using double samplingEfficiency of ratio and regression estimators using double sampling
Efficiency of ratio and regression estimators using double samplingAlexander Decker
 
Statistical techniques used in measurement
Statistical techniques used in measurementStatistical techniques used in measurement
Statistical techniques used in measurementShivamKhajuria3
 
A note on estimation of population mean in sample survey using auxiliary info...
A note on estimation of population mean in sample survey using auxiliary info...A note on estimation of population mean in sample survey using auxiliary info...
A note on estimation of population mean in sample survey using auxiliary info...Alexander Decker
 
A Study of Some Tests of Uniformity and Their Performances
A Study of Some Tests of Uniformity and Their PerformancesA Study of Some Tests of Uniformity and Their Performances
A Study of Some Tests of Uniformity and Their PerformancesIOSRJM
 
Sampling_Distribution_stat_of_Mean_New.pptx
Sampling_Distribution_stat_of_Mean_New.pptxSampling_Distribution_stat_of_Mean_New.pptx
Sampling_Distribution_stat_of_Mean_New.pptxRajJirel
 
Statistical inference: Estimation
Statistical inference: EstimationStatistical inference: Estimation
Statistical inference: EstimationParag Shah
 
Trends in Computer Science and Information Technology
Trends in Computer Science and Information TechnologyTrends in Computer Science and Information Technology
Trends in Computer Science and Information Technologypeertechzpublication
 
Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Sneh Kumari
 
Non parametrics tests
Non parametrics testsNon parametrics tests
Non parametrics testsrodrick koome
 
Parameter Optimisation for Automated Feature Point Detection
Parameter Optimisation for Automated Feature Point DetectionParameter Optimisation for Automated Feature Point Detection
Parameter Optimisation for Automated Feature Point DetectionDario Panada
 
Business Research Methods PPT -IV
Business Research Methods PPT -IVBusiness Research Methods PPT -IV
Business Research Methods PPT -IVRavinder Singh
 
Advanced statistics Lesson 1
Advanced statistics Lesson 1Advanced statistics Lesson 1
Advanced statistics Lesson 1Cliffed Echavez
 
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATARESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATAorajjournal
 
tps5e_Ch10_2.ppt
tps5e_Ch10_2.ppttps5e_Ch10_2.ppt
tps5e_Ch10_2.pptDunakanshon
 

Similaire à 91202104 (20)

Efficiency of ratio and regression estimators using double sampling
Efficiency of ratio and regression estimators using double samplingEfficiency of ratio and regression estimators using double sampling
Efficiency of ratio and regression estimators using double sampling
 
Statistical techniques used in measurement
Statistical techniques used in measurementStatistical techniques used in measurement
Statistical techniques used in measurement
 
Austin Statistics
Austin StatisticsAustin Statistics
Austin Statistics
 
A note on estimation of population mean in sample survey using auxiliary info...
A note on estimation of population mean in sample survey using auxiliary info...A note on estimation of population mean in sample survey using auxiliary info...
A note on estimation of population mean in sample survey using auxiliary info...
 
A Study of Some Tests of Uniformity and Their Performances
A Study of Some Tests of Uniformity and Their PerformancesA Study of Some Tests of Uniformity and Their Performances
A Study of Some Tests of Uniformity and Their Performances
 
Sampling_Distribution_stat_of_Mean_New.pptx
Sampling_Distribution_stat_of_Mean_New.pptxSampling_Distribution_stat_of_Mean_New.pptx
Sampling_Distribution_stat_of_Mean_New.pptx
 
Statistical inference: Estimation
Statistical inference: EstimationStatistical inference: Estimation
Statistical inference: Estimation
 
Trends in Computer Science and Information Technology
Trends in Computer Science and Information TechnologyTrends in Computer Science and Information Technology
Trends in Computer Science and Information Technology
 
C054
C054C054
C054
 
Analysis of variance (ANOVA)
Analysis of variance (ANOVA)Analysis of variance (ANOVA)
Analysis of variance (ANOVA)
 
Non parametrics tests
Non parametrics testsNon parametrics tests
Non parametrics tests
 
Anova; analysis of variance
Anova; analysis of varianceAnova; analysis of variance
Anova; analysis of variance
 
Orlu_R N Poster
Orlu_R N PosterOrlu_R N Poster
Orlu_R N Poster
 
Parameter Optimisation for Automated Feature Point Detection
Parameter Optimisation for Automated Feature Point DetectionParameter Optimisation for Automated Feature Point Detection
Parameter Optimisation for Automated Feature Point Detection
 
Business Research Methods PPT -IV
Business Research Methods PPT -IVBusiness Research Methods PPT -IV
Business Research Methods PPT -IV
 
Advanced statistics Lesson 1
Advanced statistics Lesson 1Advanced statistics Lesson 1
Advanced statistics Lesson 1
 
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATARESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
RESIDUALS AND INFLUENCE IN NONLINEAR REGRESSION FOR REPEATED MEASUREMENT DATA
 
Testing of hypothesis
Testing of hypothesisTesting of hypothesis
Testing of hypothesis
 
panel regression.pptx
panel regression.pptxpanel regression.pptx
panel regression.pptx
 
tps5e_Ch10_2.ppt
tps5e_Ch10_2.ppttps5e_Ch10_2.ppt
tps5e_Ch10_2.ppt
 

Plus de IJRAT

96202108
9620210896202108
96202108IJRAT
 
97202107
9720210797202107
97202107IJRAT
 
93202101
9320210193202101
93202101IJRAT
 
92202102
9220210292202102
92202102IJRAT
 
87202003
8720200387202003
87202003IJRAT
 
87202001
8720200187202001
87202001IJRAT
 
86202013
8620201386202013
86202013IJRAT
 
86202008
8620200886202008
86202008IJRAT
 
86202005
8620200586202005
86202005IJRAT
 
86202004
8620200486202004
86202004IJRAT
 
85202026
8520202685202026
85202026IJRAT
 
711201940
711201940711201940
711201940IJRAT
 
711201939
711201939711201939
711201939IJRAT
 
711201935
711201935711201935
711201935IJRAT
 
711201927
711201927711201927
711201927IJRAT
 
711201905
711201905711201905
711201905IJRAT
 
710201947
710201947710201947
710201947IJRAT
 
712201907
712201907712201907
712201907IJRAT
 
712201903
712201903712201903
712201903IJRAT
 
711201903
711201903711201903
711201903IJRAT
 

Plus de IJRAT (20)

96202108
9620210896202108
96202108
 
97202107
9720210797202107
97202107
 
93202101
9320210193202101
93202101
 
92202102
9220210292202102
92202102
 
87202003
8720200387202003
87202003
 
87202001
8720200187202001
87202001
 
86202013
8620201386202013
86202013
 
86202008
8620200886202008
86202008
 
86202005
8620200586202005
86202005
 
86202004
8620200486202004
86202004
 
85202026
8520202685202026
85202026
 
711201940
711201940711201940
711201940
 
711201939
711201939711201939
711201939
 
711201935
711201935711201935
711201935
 
711201927
711201927711201927
711201927
 
711201905
711201905711201905
711201905
 
710201947
710201947710201947
710201947
 
712201907
712201907712201907
712201907
 
712201903
712201903712201903
712201903
 
711201903
711201903711201903
711201903
 

Dernier

Industrial Safety Unit-I SAFETY TERMINOLOGIES
Industrial Safety Unit-I SAFETY TERMINOLOGIESIndustrial Safety Unit-I SAFETY TERMINOLOGIES
Industrial Safety Unit-I SAFETY TERMINOLOGIESNarmatha D
 
Indian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptIndian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptMadan Karki
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm Systemirfanmechengr
 
Industrial Safety Unit-IV workplace health and safety.ppt
Industrial Safety Unit-IV workplace health and safety.pptIndustrial Safety Unit-IV workplace health and safety.ppt
Industrial Safety Unit-IV workplace health and safety.pptNarmatha D
 
Virtual memory management in Operating System
Virtual memory management in Operating SystemVirtual memory management in Operating System
Virtual memory management in Operating SystemRashmi Bhat
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptSAURABHKUMAR892774
 
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncWhy does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncssuser2ae721
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleAlluxio, Inc.
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvLewisJB
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHC Sai Kiran
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort servicejennyeacort
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxKartikeyaDwivedi3
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfROCENODodongVILLACER
 
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgUnit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgsaravananr517913
 
An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...Chandu841456
 

Dernier (20)

Industrial Safety Unit-I SAFETY TERMINOLOGIES
Industrial Safety Unit-I SAFETY TERMINOLOGIESIndustrial Safety Unit-I SAFETY TERMINOLOGIES
Industrial Safety Unit-I SAFETY TERMINOLOGIES
 
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
🔝9953056974🔝!!-YOUNG call girls in Rajendra Nagar Escort rvice Shot 2000 nigh...
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
Indian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.pptIndian Dairy Industry Present Status and.ppt
Indian Dairy Industry Present Status and.ppt
 
Class 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm SystemClass 1 | NFPA 72 | Overview Fire Alarm System
Class 1 | NFPA 72 | Overview Fire Alarm System
 
Industrial Safety Unit-IV workplace health and safety.ppt
Industrial Safety Unit-IV workplace health and safety.pptIndustrial Safety Unit-IV workplace health and safety.ppt
Industrial Safety Unit-IV workplace health and safety.ppt
 
Virtual memory management in Operating System
Virtual memory management in Operating SystemVirtual memory management in Operating System
Virtual memory management in Operating System
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.ppt
 
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsyncWhy does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
Why does (not) Kafka need fsync: Eliminating tail latency spikes caused by fsync
 
Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at Scale
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvv
 
Introduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECHIntroduction to Machine Learning Unit-3 for II MECH
Introduction to Machine Learning Unit-3 for II MECH
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
POWER SYSTEMS-1 Complete notes examples
POWER SYSTEMS-1 Complete notes  examplesPOWER SYSTEMS-1 Complete notes  examples
POWER SYSTEMS-1 Complete notes examples
 
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort serviceGurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
Gurgaon ✡️9711147426✨Call In girls Gurgaon Sector 51 escort service
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptx
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdf
 
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfgUnit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
Unit7-DC_Motors nkkjnsdkfnfcdfknfdgfggfg
 
An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...
 

91202104

  • 1. International Journal of Research in Advent Technology, Vol.9, No.1, January 2021 E-ISSN: 2321-9637 Available online at www.ijrat.org 1 doi: 10.32622/ijrat.91202104 Tests for a Single Upper Outlier in a Johnson SB Sample with Unknown Parameters Tanuja Sriwastava, Mukti Kant Sukla Abstract: Outliers are unexpected observations, which deviate from the majority of observations. Outlier detection and prediction are challenging tasks, because outliers are rare by definition. A test statistic for single upper outlier is proposed and applied to Johnson SB sample with unknown parameters. The Johnson SB distribution have four parameters and is extremely flexible, which means that it can fit a wide range of distribution shapes. Because of its distributional shapes it has a variety of applications in many fields. The test statistic proposed for the case when parameters are known (Sriwastava, T., 2018) used here for developing the test statistic when parameters are unknown. Critical points were calculated for different sample sizes and for different level of significance. The performance of the test in the presence of a single upper outlier is investigated. One numerical example was given for highlighting the result. Keywords: Outlier, Johnson SB Distribution, Trimmed Sample, Critical Points, Simulation. I. INTRODUCTION The p.d.f of Johnson SB distribution with location parameter 𝜉, scale parameter 𝜆 and two shape parameters 𝛾 𝑎𝑛𝑑 𝛿 is given by 𝑓(𝑥; 𝜉, 𝜆, 𝛾 , 𝛿) = 𝛿 √2𝜋 𝜆 (𝜆−(𝑥−𝜉))(𝑥−𝜉) 𝑒𝑥𝑝 [− 1 2 {𝛾 + 𝛿 ln ( 𝑥−𝜉 𝜆−(𝑥−𝜉) )} 2 ], (1) 𝜉 ≤ 𝑥 ≤ 𝜉 + 𝜆, 𝛿 > 0, −∞ < 𝛾 < ∞, 𝜆 > 0, − ∞ < 𝜉 < ∞. This distribution is extremely flexible, which means that it can fit a wide range of distribution shapes. Because of its distributional shapes it has a variety of application in many fields like human exposure (Flynn, 2004), forestry data (Zang et. al, 2003), rainfall data (Kotteguda, 1987) and manay more because of its flexible nature. One of the most important application of this distribution is in the complex data set like microarray data analysis (Florence George, 2007) etc. The detection of outliers in such situations are considerably important. A number of test statistic have been proposed for several distributions when their parameters are known (Barnett & Lewis, 1994). A test statistic has been proposed for the Johnson SB distribution when parameters are known by (Sriwastava, T., 2018) in one of her research paper. But in practice, no parameter would be known. In such situations, estimates of all the parameters are used in the construction of a test statistic for detection of outliers in a sample from a Johnson SB distribution. Then its critical values and the corresponding performance study were done by simulation technique. Since the study is about outlying observation, the entire sample should not be considered for the estimation of all the parameters. Hence, the estimates given by George and Ramachandran (2011) are used as estimates of all the parameters, using a trimmed sample obtained after removing the suspected outlying observation(s). The estimates of the parameters of this distribution as given by George & Ramachandran (2011) using maximum likelihood least square method was considered are as follows. 𝛾 ̂ = − 𝛿 ∑ 𝑔( 𝑥𝑖−𝜉 𝜆 ) 𝑛−1 𝑖=1 𝑛−1 = −𝛿𝑔̅ . (2) 𝛿 ̂2 = 𝑛−1 ∑ [𝑔( 𝑥𝑖−𝜉 𝜆 )] 2 𝑛−1 𝑖=1 − 1 𝑛−1 [∑ 𝑔( 𝑥𝑖−𝜉 𝜆 ) 𝑛−1 𝑖=1 ] 2 = 1 𝑣𝑎𝑟(𝑔) . (3) where, 𝑔 ( 𝑥−𝜉 𝜆 ) = log ( 𝑥−𝜉 𝜆−(𝑥−𝜉) ), 𝑔̅ is the mean and 𝑣𝑎𝑟(𝑔) is the variance of the values of g defined here. The estimates of 𝜆 and 𝜉 were as follows; 𝜆 ̂ = (𝑛−1) ∑ 𝑥𝑖 𝑛−1 𝑖=1 𝑓−1( 𝑧𝑖−𝛾 𝛿 )−∑ 𝑓−1( 𝑧𝑖−𝛾 𝛿 ) 𝑛−1 𝑖=1 ∑ 𝑥𝑖 𝑛−1 𝑖=1 (𝑛−1) ∑ [𝑓−1( 𝑧𝑖−𝛾 𝛿 )] 2 𝑛−1 𝑖=1 −[∑ 𝑓−1( 𝑧𝑖−𝛾 𝛿 ) 𝑛−1 𝑖=1 ] 2 .(4) 𝜉 ̂ = 𝑥̅ − 𝜆∗ 𝑚𝑒𝑎𝑛 [𝑓−1 ( 𝑧−𝛾 𝛿 )] , (5) where z= 𝑥−𝜉 𝜆 is a standard normal variate. Hence, the quantiles of x and the corresponding quantiles of z can be considered as paired observations. When there were 100 or more x values, the percentiles 1 through 99 were considered, while for k number of data points of x, where k is less than 100, 𝑘 − 1 quantiles of x were considered. Manuscript revised on January 29, 2021 and published on February 10, 2021 Dr. Tanuja Sriwastava, Assistant Professor, Department of Statistics, Sri Venkateswara College, University of Delhi. Email ID: tanujastat24@gmail.com, Dr. Mukti Kanta Sukla, Associate Professor, Department of Statistics, Sri Venkateswara College, University of Delhi. Email ID: suklamk@gmail.com
  • 2. International Journal of Research in Advent Technology, Vol.9, No.1, January 2021 E-ISSN: 2321-9637 Available online at www.ijrat.org 2 doi: 10.32622/ijrat.91202104 These 𝑘 − 1 quantiles of x and the corresponding 𝑘 − 1 quantiles of z were considered as paired observations. II.PROPOSED OUTLIER DETECTION TEST STATISTICS Let 𝑋1, 𝑋2, ⋯ , 𝑋𝑛 be a random sample from a Johnson SB distribution with 𝛾 and 𝛿 as shape parameters, 𝜆 a scale parameter and 𝜉 a location parameter and 𝑋(1), 𝑋(2), ⋯ , 𝑋(𝑛) be the corresponding order statistics of the n observations. In the paper of (Sriwastava, T. 2018), a test statistic was proposed for detection of an upper outlier for a sample from Johnson SB distribution, when all parameters were assumed to be known. But in all these, if all the parameters of Johnson SB distribution were assumed to be unknown, then we propose to use the MLE-least square estimators of George and Ramachandran (2011) in place of all the parameters in the above said test statistics. The test statistic for the case of an upper outlier (unknown parameters) obtained is given as 𝑊′ ={( 𝑋(𝑛)−𝜉 ̂ 𝑋(𝑛−1)−𝜉 ̂) ( 𝜆 ̂−(𝑋(𝑛−1)−𝜉 ̂) 𝜆 ̂−(𝑋(𝑛)−𝜉 ̂) )} 𝛿 ̂ , (6) where 𝑋(𝑘) is the kth order statistic, 𝑘 = 1,2, ⋯ , 𝑛. 2.1 Critical values of the test statistic To detect an upper outlying observation in a sample from a Johnson SB distribution, the test statistic 𝑊′ was used. This test statistic should reject the null hypothesis for larger value of 𝑊′ . Critical values of the test statistic 𝑊′ were obtained using simulation technique with 10,000 replications for different sample sizes. For this, a random sample was generated from a Johnson SB distribution and the estimates of all the parameters were calculated using equation 2, 3, 4, 5 with this sample. Then the value of the test statistic was calculated and the whole process was replicated 10,000 times and the percentile value at 90, 95 and 99 percent was calculated, which is the critical values at 10, 5 and 1 percent levels of significance respectively. The critical value table for sample sizes n=3, 10(10)40(20)100, 200, 500, 1000 and at 10, 5 and 1 percent levels of significance are shown in table 1. Table 1. Critical values 𝑊 𝛼 of the statistic 𝑊′ n 100 α% Level 10% 5% 1% 3 3.90214 4.45028 13.7898 10 3.22234 4.30638 7.82518 20 2.79391 3.66518 6.0721 30 2.54645 3.2718 5.2587 40 2.41443 2.99493 4.93746 60 2.35604 2.90261 4.75126 80 2.30658 2.85917 4.48976 100 2.26234 2.82311 4.30515 200 2.12077 2.59739 3.83966 500 1.96466 2.37174 3.52657 1000 1.91691 2.27917 3.39466 It can be seen from the table that the values are decreases with increase in sample size and on comparison of these critical values with that of the one with known parameters, it can be seen that the values are very close to each other for sample sizes 10 onwards. 2.2 Numerical Example For highlighting the utility of the statistic, the following 20 observations were taken from the Census data of India taken in the year 2011, (in, 000). 1028610, 1045547, 1062388, 1095722, 1112186, 1128521, 1160813, 1176742, 1192506, 1223581, 1238887, 1254019, 1283600, 1298041, 1312240, 1339741, 1352695, 1365302, 1388994, 1399838. Using the above data for the case when the outlying observation is from another sample with a shift in all the parameters. The critical value at 5% level of significance for considered below is 3.5966. The parameters were estimated using a sample obtained by leaving out the largest observation (as that is the suspected outlying observation). Then the value of test statistic 𝑊′ was calculated with the estimated values of the parameters and was found to be 5.99957. On comparing this with the critical value at 5% level of significance for sample size 20, the null hypothesis gets rejected i.e. the largest observation of the sample is confirmed as outlying. III. PERFORMANCE STUDY The performance study was done using simulation technique for detection of an upper outlier. A random sample of size n was generated using R software from a Johnson SB distribution with location parameter 𝜉(=10), scale parameter 𝜆(=30), with two shape parameters 𝛾(=1) and 𝛿(=2) (known). Then a contaminant observation was introduced in the sample. For introducing a contaminant observation, another sample of Johnson SB distribution with a shift (𝑎𝜆), where 0 < a < 1 in the location parameter was generated. The largest observation of the original sample was replaced with the largest observation of the second sample. Since the estimates of the parameters depend upon only n-1 largest observations, the largest observation of the sample need not be removed for estimation purpose. Thus the values of the estimates of all the parameters were calculated using MLE least square
  • 3. International Journal of Research in Advent Technology, Vol.9, No.1, January 2021 E-ISSN: 2321-9637 Available online at www.ijrat.org 3 doi: 10.32622/ijrat.91202104 method. Using these estimated values of the parameters, the value of the test statistic 𝑊𝑈 ′ was calculated at 5% level of significance. Here, the process was simulated 10,000 times and the number of times the test statistic falling in the critical region was noted. Simulation study was carried out for different sample sizes. The probabilities of rejection of null hypothesis for sample sizes n = 10(10)30, 60, 100, 200, 500, 1000 and for different values of ‘a’ are shown in the table 2. Table 2: Probabilities of identification of the upper contaminant observation. a n 0.033 0.067 0.1 0.133 10 0.4056 0.5249 0.6475 0.7569 20 0.3058 0.4362 0.5843 0.7315 30 0.3242 0.4845 0.6557 0.8001 60 0.2379 0.4095 0.6038 0.7856 100 0.2055 0.3971 0.6323 0.8253 200 0.2899 0.4253 0.7987 0.8541 500 0.3643 0.6603 0.8878 0.9821 1000 0.2401 0.5449 0.8283 0.963 a n 0.2 0.27 0.3 10 0.9056 0.9753 0.9898 20 0.9166 0.9847 0.9942 30 0.9583 0.9948 0.9985 60 0.9669 0.9964 0.9987 100 0.9781 0.9994 0.9995 200 0.9887 1 1 500 0.9868 1 1 1000 0.9994 1 1 It can be observed from this table that, the test statistic performs well for higher values of the shift i.e. for a>0.1. As the values of shift increases, performance of the test statistic also increases. On comparison with that of the known parameter case, it can be seen that in this case, performance is better for higher values of the shift, whereas in the earlier case, performance was better for lower values of the shift. IV. CONCLUSION It can be concluded from above study that the suggested test statistic for an upper outlier is performing very well for higher values of the shift. For the lower shift of values performance of the test statistic was not satisfactory. The higher values of shift and large sample sizes the test statistic 𝑊′ is best for one contaminant observation. The proposed work for future research work and implementation includes:  The generalization of proposed work to multiple outlier case.  Apply the proposed outlier detection technique to variety of application.  The method used here is may useful for developing outlier detection test statistic for complicated distribution like; Johnson SB. REFERENCES [1] Barnett V, Lewis T. Outliers in Statistical Data. John Wiley, 1994. [2] Flynn MR. The 4 parameter lognormal (SB) model of human exposure. Ann Occup Hyg. 2004; 48:617-22. [3] George F. Johnson’s system of distribution and Microarray data analysis. Graduate Theses and Dissertations, University of South Florida, 2007. [4] George, F. and Ramachandran, K.M. (2011). Estimation of parameters of Johnson’s system of distributions. Journal of Modern Applied Statistical Methods, 10, no. 2. [5] Johnson NL. Systems of frequency curves generated by methods of translation. Biometrica. 1949; 58:547-558. [6] Kottegoda NT. Fitting Johnson SB curve by method of maximum likelihood to annual maximum daily rainfalls. Water Resour Res. 1987; 23:728-732. [7] Sriwastava T. An upper outlier detection procedure in a sample from a Johnson SB distribution with known parameters. International Journal of Applied Statistics and Mathematics. 2018; 3(2), 194-198. [8] Zhang L, Packard PC, Liu C. A comparison of estimation methods for fitting Weibull and Johnson’s SB distributions to mixed spruce- fir stands in northeastern North America. Can J Forest Res. 2003; 33:1340-1347. AUTHORS PROFILE Dr. Tanuja Sriwastava She has completed her M.Sc. (Statistics) from Banaras Hindu University, Varanasi and D.Phil in Statistics from University of Allahabad, Prayagraj. She has Qualified UGC-NET(JRF) and has published 5 international and national publications. Her area of interest in research is mainly focused on Distribution theory and Statistical Inference. Dr. Mukti Kanta Sukla A committed senior Associate Professor in the Department of Statistics with over 25 years of experience in one of the leading colleges of Delhi University, Sri Venkateswara College. Focused on research and has 10 prior international and national publications. Did his Ph.D from Utkal University, the research focusing on Stochastic Modelling based on the modelling of the rainfall data of Mahanadi Delta Region. Has vast teaching experience on courses like Linear Models, Econometrics, Algebra, Sample Survey Methods etc. Possessing excellent administrative, verbal communication and leadership skills along with constructive and effective methods that promote a stimulating learning environment.