SlideShare une entreprise Scribd logo
1  sur  4
Télécharger pour lire hors ligne
IJSRD - International Journal for Scientific Research & Development| Vol. 1, Issue 4, 2013 | ISSN (online): 2321-0613
All rights reserved by www.ijsrd.com 862
Abstract--In today’s digital world, Data sets are increasing
exponentially. Statistical analysis using clustering in
various scientific and engineering applications become
very challenging issue for such large data set. Clustering
on huge data set and its performance are two major factors
demand for optimization. Parallelization is well-known
approach to optimize performance. It has been observed
from recent research work that GPU based parallelization
help to achieve high degree of performance. Hence, this
thesis focuses on optimizing hierarchical clustering
algorithms using parallelization. It covers implementation
of optimized algorithm on various parallel environments
using Open MP on multi-core architecture and using
CUDA on may-core architecture.
I. INTRODUCTION
Clustering is the task of grouping a set of objects in such a
way that objects in the same group (called cluster) are
more similar (in some sense or another) to each other than
to those in other groups (clusters). It is a main task of
explorative data mining, and a common technique for
statistical data analysis used in many fields, including
machine learning, pattern recognition, image analysis,
information retrieval, and bioinformatics.
A "clustering" is essentially a set of such clusters, usually
containing all objects in the data set. Additionally, it may
specify the relationship of the clusters to each other, for
example a hierarchy of clusters embedded in each other.
Clustering can be roughly distinguished in:
1) Hard clustering: each object belongs to a cluster or
not
2) Soft clustering (fuzzy clustering): each object belongs
to each cluster to a certain degree (e.g. a likelihood of
belonging to the cluster)
A. Classification of Clustering
Traditionally clustering techniques are
broadly divided in [1]:
1) Hierarchical methods
2) Partitioning Methods
3) Density Based Algorithms
4) Grid Based Clustering
B. Hierarchical Methods
Hierarchical clustering is a method of cluster
analysis which seeks to build a hierarchy of
clusters. Hierarchical clustering generally falls into
two types:
1) Agglomerative Algorithm
2) Division Algorithm
In hierarchical clustering the data are not partitioned into
a particular cluster in a single step. Instead, a series of
partitions takes place, which may run from a single
cluster containing all objects to n clusters each containing
a single object. Hierarchical Clustering is subdivided into
agglomerative methods, which proceed by series of
fusions of the n objects into groups, and divisive
methods, which separate n objects successively into finer
groupings. Hierarchical clustering may be represented by
a two dimensional diagram known as dendrogram which
illustrates the fusions or divisions made at each
successive stage of analysis.
C. Partitioning Methods
The partitioning methods generally result in a set of M
clusters, each object belonging to one cluster. Each cluster
may be represented by a centroid or a cluster
representative; this is some sort of summary description of
all the objects contained in a cluster. The precise form of
this description will depend on the type of the object
which is being clustered. In case where real-valued data is
available, the arithmetic mean of the attribute vectors for
all objects within a cluster provides an appropriate
representative; alternative types of centroid may be
required in other cases, e.g., a cluster of documents can be
represented by a list of those keywords that occur in some
minimum number of documents within a cluster. If the
number of the clusters is large, the centroids can be
further clustered to produces hierarchy within a dataset.
D. Density-Based Algorithms
Density-based algorithms are capable of discovering
clusters of arbitrary shapes. Also this provides a natural
protection against outliers. These algorithms group
objects according to specific density objective functions.
Density is usually defined as the number of objects in a
particular neighborhood of a data objects. In these
approaches a given cluster continues growing as long as
the number of objects in the neighborhood exceeds some
parameter.
E. Grid Based Clustering
These focus on spatial data i.e. the data that model the
geometric structure of objects in the space, their
relationships, properties and operations. This technique
quantizes the data set into a no. of cells and then work with
objects belonging to these cells. They do not relocate
points but ratter builds several hierarchical levels of groups
of objects. The merging of grids and consequently clusters,
does not depend on a distance measure .It is determined by
a predefined parameter.
1) Mode of Performance Optimization
Performance optimization is foremost objective when
we are performing many computations on processor.
Performance Optimization of Clustering On GPU
Vijal D. Patel1
Prof. Sumitra Menaria2
1, 2
C.S.E. Dept., Parul Institute of Engineering & Technology
Performance Optimization of Clustering on GPU
(IJSRD/Vol. 1/Issue 4/2013/0014)
All rights reserved by www.ijsrd.com 863
So following techniques are used for performance
optimization:
1) Efficient Resources (Devices )
2) Optimized computing Algorithm
3) Best utilization of resources with less cost
(Parallel processing)
This paper proposes a mechanism for High Performance
Computing.
II. GPU ARCHITECTURE
GPU (Graphics Processing Unit) have been of large
interest in last decade to achieve high performance
computing through massive data parallelism model. GPU
has hundreds of core processors. GPUs are designed for
graphics. GPU is used to process similar operation on
independent vertices & pixels in parallel. GPU is more
suitable for stream computations like SIMD (single
Instruction multiple data) programming model. For higher
efficiency, GPU processes many data elements in parallel
with same program module.
Last decade, to satisfy demand of high performance
computing a tremendous rise in computing power has been
observed. Although CPU has been constantly increasing
their performance, the growth has been over shadowed by
the improvement in performance of GPU. This trend has
led to draw the interest of researchers with high
computational requirements to look for GPUs as a possible
solution. This has led to the emergence of a new term
GPGPU (General Purpose computing using Graphical
Processing Units).
GPGPU is defined as General Purpose computation using
GPU and graphics API in applications other than 3D
graphics. In order to achieve higher efficiency, GPGPU is
designed as a combination between hardware components
and software that allows the use of a traditional GPU
(Graphics processor) to perform non-graphical computing
tasks with higher processing power. [2]
Fig. 1 describes GPU architecture. Each GPU has number
of Streaming Multiprocessors(SM) i.e.30 SMs and each
streaming multiprocessor has eight streaming core
processor. This architecture has characteristics to perform
same operation on collection of records (Data parallelism).
Hence, GPU uses streaming processor as core processor to
achieve massive data parallelism. Each NVIDIA GPU has
a hundreds of parallel cores and within each core has
floating point unit, logic unit (add, sub, mul), move and
compare unit, and a branch unit.
Figure (1): GPU Architecture [2]
Hence, GPU uses streaming processor as core processor to
achieve massive data parallelism. Each NVIDIA GPU has
a hundreds of parallel cores and within each core has
floating point unit, logic unit (add, sub, mul), move and
compare unit, and a branch unit.
Each core is managed by a thread manager. A thread
manager can spawn and manage hundreds of threads per
core. A very important feature is that it uses lightweight
threads that have little creation overhead and zero-
overhead scheduling. The GPU is highly parallel compute
coprocessor that has its own device memory with high
bandwidth.
III. CUDA
Compute Unified Device Architecture (CUDA) is a
Programming model and the APIs required for performing
computation tasks on NVIDIA GPUs. CUDA is a scalable
highly parallel and multithreaded API to execute a
program on any number of processor without recompiling.
CUDA allow a program written for a platform to be
executed in other platform without recompilation. CUDA
provides efficient programming environment for regular
applications. CUDA is designed to support many
programming languages as shown in Fig.2. It provides
extension to programming languages for performing
computation on NVIDIA GPUs. CUDA C is extension to
C language with CUDA API to operate on NVIDIA GPU.
It allows a programmer to write a program for one thread,
and it instantiate for multiple parallel threads.
Figure (2): CUDA Interface [2]
A. CUDA Programming Execution model
Figure (3): CUDA execution model [2]
Programming model provided for designing and executing
general purpose computation kernel on NVIDIA GPU is
named as Compute Unified Device Architecture
(CUDA).CUDA programming modal is available for most
known operating systems Windows, and Linux. CUDA
programming modal allows programmer to use GPU for
parallel programming without knowledge of graphics.
Fig.3 describes CUDA computational model to be
Performance Optimization of Clustering on GPU
(IJSRD/Vol. 1/Issue 4/2013/0014)
All rights reserved by www.ijsrd.com 864
executed on CPU and GPU both. A CUDA program is
sequential host program and one or many kernel code as
shown in Fig.3.Serial code will be executed by a CPU
thread, and Kernel code will be executed by GPU threads
grouped in a block.
Kernel here specifies portion of computational code to be
executed on NVIDIA GPU device. CUDA kernel executes
on a grid of thread blocks, where a thread block is a group
of threads that work in SIMT(Single Instruction Multiple
Thread) model. Threads of each block execute cooperates
each other using shared memory and synchronizing its
execution. Threads from different blocks operate
independently. A grid can be of 1 or 2 dimensions, and a
thread block can be of 1, 2, or 3 dimension as shown in
Fig.3. A grid can have up to 65535 blocks of threads in
each dimension and a thread block can support 768 or
1024 threads depending on GPU architecture. Each thread
on a block and a block on grid has a unique identification.
IV. MCL ALGORITHM
MCL uses two simple algebraic operations, expansion and
inflation, on the stochastic (Markov) matrix associated
with a graph. The Markov matrix M associated with a
graph G is defined by normalizing all columns of the
adjacency matrix of G. The clustering process simulates
random walks (or flow) within the graph using expansion
operations, and then strengthens the flow where it is
already strong and weakens it where it is weak using
inflation operations. By continuously alternating these two
processes, the underlying structure of the graph gradually
becomes apparent, and there is
Converges to a result with regions with strong internal
flow (clusters) separated by boundaries within which flow
is absent.
1. // G is a graph, matrix M and a real
number r > 1
2. // columns of M with power coefficient r is
written Γr(M), and Γr is called the inflation
operator with power coefficient r
3. // Γr(Mij) = Mijr / Σr,j(M)
4. add loops to G
5. set Γ to some value
6. set M_1 to be the matrix of random walks on
G
7. while (change)
8. {
9. M_2 = M_1 * M_1 # expansion
10. M_1 = Γ(M_2) # inflation
11. change = difference(M_1, M_2)
12. }
13. set CLUSTERING as the components of
M_1
CUDA MCL Algorithm
Listing (1): MCL Algorithm [12]
V. CLUSTERING RESULTS AND ANALYSIS
In Fig 4 some of the results of our clusters analysis are
shown, comparing number of clusters against cluster sizes,
for the clusters output from data sets PPI1 (Bio GRID)
Figure (4): Comparison of MCL Execution time for SpMv
using ELLAPCK-R format
Figure (5): Speedup of MCL Execution time for Sparse-
matrix using ELLAPCK-R format
Performance Optimization of Clustering on GPU
(IJSRD/Vol. 1/Issue 4/2013/0014)
All rights reserved by www.ijsrd.com 865
Figure (6): Comparison of MCL Execution time for SpMv
matrix using ELLPACK-R format on GPU without
transfer of data
Figure (7): Comparison of MCL Execution time on CPU
and GPU using ELLPACK-R Format
VI. CONCLUSION AND FUTURE WORK
1) As it has been Clustering Algorithm giving best
performance on small data set so we will implement it
on large data set and different GPU platform and
optimize the Performance.
2) After Observation Parallel Computing is most
important mode of Performance Optimization.
3) For high Performance we will implement Many
Clustering Algorithm on Different
4) Platform of GPU and find out which platform is best
according to Application.
REFERENCES
[1] Pradeep Rai, Shubha Singh, “A Survey of Clustering
Techniques” in proceedings of the International
journal of Computer Applications (0975-8887)
Volume 7-No.12, October 2010.
[2] NVIDIA Corporation, NVIDIA Programming Guide,
Version 3.1.1,July 2010.
[3] Osama Abu Abbas, “Comparison Between Data
Clustering Algorithm” in proceedings of the
International Arab Journal of Information Technology,
Vol. 5, No. 3, July 2008.
[4] A.K. Jain, M.N. Murty, P.J. Flynn, “Data Clustering:
A Review” ACM computing Surveys, Vol. 31, No. 3,
September 1999.
[5] Wenbin Fang, Milan Lu, Xiangye Xiao, Bingsheng He,
Qiong Luo, “Frequent Itemset Mining on Graphics
Processors” in proceeding of the Fifth International
Workshop on Data management on New Hardware,
June 28, 2009.
[6] Balaji Dhanasekaran, Norman Rubin, “A new method
for GPU based Irregular Reductions and Its
Application to K-Means Clustering” in proceedings of
the International Conference on GPGPU-4, March
2011.
[7] Feng Cao, A K.H. Tung, A Zhou, “Scalable Clustering
Using Graphics Processors” School of Computing,
National University of Singapore, 2004.
[8] Ren Wu, Bin Zhang, Meichun Hsu, “Clustering
Billions of Data Points Using GPUs” UCHPC-
MAW’09, May 2009, Ischia, Italy.
[9] Wenjing Ma, Gagan Agrawal, “A Translation System
for Enabling Data Mining Applications On GPUs”,
ICS’09, June 2009, New York, USA, pp. 400-409.
[10] Hanaa M. Hussain, K Benkrid, A Erdogan, H Seker,
“Highly Parameterized K-means Clustering on FPGAs:
Comparative Results with GPPs and GPUs”, in
proceeding of the International Conference on
Reconfigurable Computing and FPGAs, IEEE 2011.
[11] Dar-Jen Chang, M Kantardzic, M Ouyang, “
Hierarchical clustering with CUDA/GPU”, UCHPC-
MAW, 2010.
[12] Alhadi Bustamam, Kevin Burrage, Nicholas A.
Hamilton, “Fast Parallel Markov Clustering in
Bioinformatics Using Massively Parallel Computing
on GPU with CUDA and ELLPACK-R Sparse
Format”, in proceeding of the IEEE/ACM
TRANSACTIONS ON COMPUTATIONAL
BIOLOGY AND BIOINFORMATICS, VOL. 9, NO.
3, MAY/JUNE 2012.

Contenu connexe

Tendances

International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Parallel k nn on gpu architecture using opencl
Parallel k nn on gpu architecture using openclParallel k nn on gpu architecture using opencl
Parallel k nn on gpu architecture using opencleSAT Publishing House
 
Distributed Framework for Data Mining As a Service on Private Cloud
Distributed Framework for Data Mining As a Service on Private CloudDistributed Framework for Data Mining As a Service on Private Cloud
Distributed Framework for Data Mining As a Service on Private CloudIJERA Editor
 
Dynamically Partitioning Big Data Using Virtual Machine Mapping
Dynamically Partitioning Big Data Using Virtual Machine MappingDynamically Partitioning Big Data Using Virtual Machine Mapping
Dynamically Partitioning Big Data Using Virtual Machine MappingAM Publications
 
Improving of artifical neural networks performance by using gpu's a survey
Improving of artifical neural networks performance by using gpu's  a surveyImproving of artifical neural networks performance by using gpu's  a survey
Improving of artifical neural networks performance by using gpu's a surveycsandit
 
IMPROVING OF ARTIFICIAL NEURAL NETWORKS PERFORMANCE BY USING GPU’S: A SURVEY
IMPROVING OF ARTIFICIAL NEURAL NETWORKS PERFORMANCE BY USING GPU’S: A SURVEYIMPROVING OF ARTIFICIAL NEURAL NETWORKS PERFORMANCE BY USING GPU’S: A SURVEY
IMPROVING OF ARTIFICIAL NEURAL NETWORKS PERFORMANCE BY USING GPU’S: A SURVEYcsandit
 
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...cscpconf
 
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link PredictionMemory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link Predictionmiyurud
 
Enhancement of Map Function Image Processing System Using DHRF Algorithm on B...
Enhancement of Map Function Image Processing System Using DHRF Algorithm on B...Enhancement of Map Function Image Processing System Using DHRF Algorithm on B...
Enhancement of Map Function Image Processing System Using DHRF Algorithm on B...AM Publications
 
IRJET- Review of Existing Methods in K-Means Clustering Algorithm
IRJET- Review of Existing Methods in K-Means Clustering AlgorithmIRJET- Review of Existing Methods in K-Means Clustering Algorithm
IRJET- Review of Existing Methods in K-Means Clustering AlgorithmIRJET Journal
 
Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147Editor IJARCET
 

Tendances (16)

International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
J41046368
J41046368J41046368
J41046368
 
Eg4301808811
Eg4301808811Eg4301808811
Eg4301808811
 
Parallel k nn on gpu architecture using opencl
Parallel k nn on gpu architecture using openclParallel k nn on gpu architecture using opencl
Parallel k nn on gpu architecture using opencl
 
Distributed Framework for Data Mining As a Service on Private Cloud
Distributed Framework for Data Mining As a Service on Private CloudDistributed Framework for Data Mining As a Service on Private Cloud
Distributed Framework for Data Mining As a Service on Private Cloud
 
Dynamically Partitioning Big Data Using Virtual Machine Mapping
Dynamically Partitioning Big Data Using Virtual Machine MappingDynamically Partitioning Big Data Using Virtual Machine Mapping
Dynamically Partitioning Big Data Using Virtual Machine Mapping
 
Ijetcas14 316
Ijetcas14 316Ijetcas14 316
Ijetcas14 316
 
I017425763
I017425763I017425763
I017425763
 
Improving of artifical neural networks performance by using gpu's a survey
Improving of artifical neural networks performance by using gpu's  a surveyImproving of artifical neural networks performance by using gpu's  a survey
Improving of artifical neural networks performance by using gpu's a survey
 
IMPROVING OF ARTIFICIAL NEURAL NETWORKS PERFORMANCE BY USING GPU’S: A SURVEY
IMPROVING OF ARTIFICIAL NEURAL NETWORKS PERFORMANCE BY USING GPU’S: A SURVEYIMPROVING OF ARTIFICIAL NEURAL NETWORKS PERFORMANCE BY USING GPU’S: A SURVEY
IMPROVING OF ARTIFICIAL NEURAL NETWORKS PERFORMANCE BY USING GPU’S: A SURVEY
 
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
TOWARDS REDUCTION OF DATA FLOW IN A DISTRIBUTED NETWORK USING PRINCIPAL COMPO...
 
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link PredictionMemory Efficient Graph Convolutional Network based Distributed Link Prediction
Memory Efficient Graph Convolutional Network based Distributed Link Prediction
 
20 26
20 26 20 26
20 26
 
Enhancement of Map Function Image Processing System Using DHRF Algorithm on B...
Enhancement of Map Function Image Processing System Using DHRF Algorithm on B...Enhancement of Map Function Image Processing System Using DHRF Algorithm on B...
Enhancement of Map Function Image Processing System Using DHRF Algorithm on B...
 
IRJET- Review of Existing Methods in K-Means Clustering Algorithm
IRJET- Review of Existing Methods in K-Means Clustering AlgorithmIRJET- Review of Existing Methods in K-Means Clustering Algorithm
IRJET- Review of Existing Methods in K-Means Clustering Algorithm
 
Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147
 

En vedette

Energy Efficient LEACH protocol for Wireless Sensor Network (I-LEACH)
Energy Efficient LEACH protocol for Wireless Sensor Network (I-LEACH)Energy Efficient LEACH protocol for Wireless Sensor Network (I-LEACH)
Energy Efficient LEACH protocol for Wireless Sensor Network (I-LEACH)ijsrd.com
 
GreenSQL Security
 GreenSQL Security GreenSQL Security
GreenSQL Securityijsrd.com
 
Exhaust analysis of four stroke single cylinder diesel engine using copper ba...
Exhaust analysis of four stroke single cylinder diesel engine using copper ba...Exhaust analysis of four stroke single cylinder diesel engine using copper ba...
Exhaust analysis of four stroke single cylinder diesel engine using copper ba...ijsrd.com
 
Wireless Sensor Network: Topology Issues
Wireless Sensor Network: Topology IssuesWireless Sensor Network: Topology Issues
Wireless Sensor Network: Topology Issuesijsrd.com
 
Analysis of Parking Demand and Suggesting Alternative for Parking (A Case Stu...
Analysis of Parking Demand and Suggesting Alternative for Parking (A Case Stu...Analysis of Parking Demand and Suggesting Alternative for Parking (A Case Stu...
Analysis of Parking Demand and Suggesting Alternative for Parking (A Case Stu...ijsrd.com
 
Process parameter optimization of SLM process and application of Taguchi appr...
Process parameter optimization of SLM process and application of Taguchi appr...Process parameter optimization of SLM process and application of Taguchi appr...
Process parameter optimization of SLM process and application of Taguchi appr...ijsrd.com
 
Handwritten Script Recognition
Handwritten Script RecognitionHandwritten Script Recognition
Handwritten Script Recognitionijsrd.com
 
Garbage Management using Android Smartphone
Garbage Management using Android SmartphoneGarbage Management using Android Smartphone
Garbage Management using Android Smartphoneijsrd.com
 
Analysis of SEPIC for PV-Applications using PI Controller and Current Mode Co...
Analysis of SEPIC for PV-Applications using PI Controller and Current Mode Co...Analysis of SEPIC for PV-Applications using PI Controller and Current Mode Co...
Analysis of SEPIC for PV-Applications using PI Controller and Current Mode Co...ijsrd.com
 
Waste Minimization and Cost Reduction in Process Industries
Waste Minimization and Cost Reduction in Process IndustriesWaste Minimization and Cost Reduction in Process Industries
Waste Minimization and Cost Reduction in Process Industriesijsrd.com
 
Performance, Optimization and CFD Analysis of Submersible Pump Impeller
Performance, Optimization and CFD Analysis of Submersible Pump ImpellerPerformance, Optimization and CFD Analysis of Submersible Pump Impeller
Performance, Optimization and CFD Analysis of Submersible Pump Impellerijsrd.com
 
Material Removal Rate, Tool Wear Rate and Surface Roughness Analysis of EDM P...
Material Removal Rate, Tool Wear Rate and Surface Roughness Analysis of EDM P...Material Removal Rate, Tool Wear Rate and Surface Roughness Analysis of EDM P...
Material Removal Rate, Tool Wear Rate and Surface Roughness Analysis of EDM P...ijsrd.com
 
Enhancing the Performance & Emission for the Blend of Diesel & Pyrolysis oil ...
Enhancing the Performance & Emission for the Blend of Diesel & Pyrolysis oil ...Enhancing the Performance & Emission for the Blend of Diesel & Pyrolysis oil ...
Enhancing the Performance & Emission for the Blend of Diesel & Pyrolysis oil ...ijsrd.com
 
VRaaS [Virtual Reality as a Service]: Integrated architecture for VR Applicat...
VRaaS [Virtual Reality as a Service]: Integrated architecture for VR Applicat...VRaaS [Virtual Reality as a Service]: Integrated architecture for VR Applicat...
VRaaS [Virtual Reality as a Service]: Integrated architecture for VR Applicat...ijsrd.com
 
Analysis of VoIP Forensics with Digital Evidence Procedure
Analysis of VoIP Forensics with Digital Evidence ProcedureAnalysis of VoIP Forensics with Digital Evidence Procedure
Analysis of VoIP Forensics with Digital Evidence Procedureijsrd.com
 
Voltage Stability & Power Quality Assessment of Distributed Generation Based ...
Voltage Stability & Power Quality Assessment of Distributed Generation Based ...Voltage Stability & Power Quality Assessment of Distributed Generation Based ...
Voltage Stability & Power Quality Assessment of Distributed Generation Based ...ijsrd.com
 
Real-Time Monitoring and Control System for Industry
Real-Time Monitoring and Control System for IndustryReal-Time Monitoring and Control System for Industry
Real-Time Monitoring and Control System for Industryijsrd.com
 

En vedette (17)

Energy Efficient LEACH protocol for Wireless Sensor Network (I-LEACH)
Energy Efficient LEACH protocol for Wireless Sensor Network (I-LEACH)Energy Efficient LEACH protocol for Wireless Sensor Network (I-LEACH)
Energy Efficient LEACH protocol for Wireless Sensor Network (I-LEACH)
 
GreenSQL Security
 GreenSQL Security GreenSQL Security
GreenSQL Security
 
Exhaust analysis of four stroke single cylinder diesel engine using copper ba...
Exhaust analysis of four stroke single cylinder diesel engine using copper ba...Exhaust analysis of four stroke single cylinder diesel engine using copper ba...
Exhaust analysis of four stroke single cylinder diesel engine using copper ba...
 
Wireless Sensor Network: Topology Issues
Wireless Sensor Network: Topology IssuesWireless Sensor Network: Topology Issues
Wireless Sensor Network: Topology Issues
 
Analysis of Parking Demand and Suggesting Alternative for Parking (A Case Stu...
Analysis of Parking Demand and Suggesting Alternative for Parking (A Case Stu...Analysis of Parking Demand and Suggesting Alternative for Parking (A Case Stu...
Analysis of Parking Demand and Suggesting Alternative for Parking (A Case Stu...
 
Process parameter optimization of SLM process and application of Taguchi appr...
Process parameter optimization of SLM process and application of Taguchi appr...Process parameter optimization of SLM process and application of Taguchi appr...
Process parameter optimization of SLM process and application of Taguchi appr...
 
Handwritten Script Recognition
Handwritten Script RecognitionHandwritten Script Recognition
Handwritten Script Recognition
 
Garbage Management using Android Smartphone
Garbage Management using Android SmartphoneGarbage Management using Android Smartphone
Garbage Management using Android Smartphone
 
Analysis of SEPIC for PV-Applications using PI Controller and Current Mode Co...
Analysis of SEPIC for PV-Applications using PI Controller and Current Mode Co...Analysis of SEPIC for PV-Applications using PI Controller and Current Mode Co...
Analysis of SEPIC for PV-Applications using PI Controller and Current Mode Co...
 
Waste Minimization and Cost Reduction in Process Industries
Waste Minimization and Cost Reduction in Process IndustriesWaste Minimization and Cost Reduction in Process Industries
Waste Minimization and Cost Reduction in Process Industries
 
Performance, Optimization and CFD Analysis of Submersible Pump Impeller
Performance, Optimization and CFD Analysis of Submersible Pump ImpellerPerformance, Optimization and CFD Analysis of Submersible Pump Impeller
Performance, Optimization and CFD Analysis of Submersible Pump Impeller
 
Material Removal Rate, Tool Wear Rate and Surface Roughness Analysis of EDM P...
Material Removal Rate, Tool Wear Rate and Surface Roughness Analysis of EDM P...Material Removal Rate, Tool Wear Rate and Surface Roughness Analysis of EDM P...
Material Removal Rate, Tool Wear Rate and Surface Roughness Analysis of EDM P...
 
Enhancing the Performance & Emission for the Blend of Diesel & Pyrolysis oil ...
Enhancing the Performance & Emission for the Blend of Diesel & Pyrolysis oil ...Enhancing the Performance & Emission for the Blend of Diesel & Pyrolysis oil ...
Enhancing the Performance & Emission for the Blend of Diesel & Pyrolysis oil ...
 
VRaaS [Virtual Reality as a Service]: Integrated architecture for VR Applicat...
VRaaS [Virtual Reality as a Service]: Integrated architecture for VR Applicat...VRaaS [Virtual Reality as a Service]: Integrated architecture for VR Applicat...
VRaaS [Virtual Reality as a Service]: Integrated architecture for VR Applicat...
 
Analysis of VoIP Forensics with Digital Evidence Procedure
Analysis of VoIP Forensics with Digital Evidence ProcedureAnalysis of VoIP Forensics with Digital Evidence Procedure
Analysis of VoIP Forensics with Digital Evidence Procedure
 
Voltage Stability & Power Quality Assessment of Distributed Generation Based ...
Voltage Stability & Power Quality Assessment of Distributed Generation Based ...Voltage Stability & Power Quality Assessment of Distributed Generation Based ...
Voltage Stability & Power Quality Assessment of Distributed Generation Based ...
 
Real-Time Monitoring and Control System for Industry
Real-Time Monitoring and Control System for IndustryReal-Time Monitoring and Control System for Industry
Real-Time Monitoring and Control System for Industry
 

Similaire à Performance Optimization of Clustering On GPU

SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY cscpconf
 
Effective Sparse Matrix Representation for the GPU Architectures
Effective Sparse Matrix Representation for the GPU ArchitecturesEffective Sparse Matrix Representation for the GPU Architectures
Effective Sparse Matrix Representation for the GPU ArchitecturesIJCSEA Journal
 
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONSA SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONScseij
 
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Editor IJARCET
 
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Editor IJARCET
 
Image Processing Application on Graphics processors
Image Processing Application on Graphics processorsImage Processing Application on Graphics processors
Image Processing Application on Graphics processorsCSCJournals
 
Accelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous PlatformsAccelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous PlatformsIJMER
 
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...IOSR Journals
 
Parallel algorithms for multi-source graph traversal and its applications
Parallel algorithms for multi-source graph traversal and its applicationsParallel algorithms for multi-source graph traversal and its applications
Parallel algorithms for multi-source graph traversal and its applicationsSubhajit Sahu
 
MapReduce: Distributed Computing for Machine Learning
MapReduce: Distributed Computing for Machine LearningMapReduce: Distributed Computing for Machine Learning
MapReduce: Distributed Computing for Machine Learningbutest
 
B036407011
B036407011B036407011
B036407011theijes
 
Accelerated seam carving using cuda
Accelerated seam carving using cudaAccelerated seam carving using cuda
Accelerated seam carving using cudaeSAT Journals
 
IRJET- Enhanced Density Based Method for Clustering Data Stream
IRJET-  	  Enhanced Density Based Method for Clustering Data StreamIRJET-  	  Enhanced Density Based Method for Clustering Data Stream
IRJET- Enhanced Density Based Method for Clustering Data StreamIRJET Journal
 
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...IJCNCJournal
 
High Performance Medical Reconstruction Using Stream Programming Paradigms
High Performance Medical Reconstruction Using Stream Programming ParadigmsHigh Performance Medical Reconstruction Using Stream Programming Paradigms
High Performance Medical Reconstruction Using Stream Programming ParadigmsQuEST Global (erstwhile NeST Software)
 
IRJET- Latin Square Computation of Order-3 using Open CL
IRJET- Latin Square Computation of Order-3 using Open CLIRJET- Latin Square Computation of Order-3 using Open CL
IRJET- Latin Square Computation of Order-3 using Open CLIRJET Journal
 
A frame work for clustering time evolving data
A frame work for clustering time evolving dataA frame work for clustering time evolving data
A frame work for clustering time evolving dataiaemedu
 

Similaire à Performance Optimization of Clustering On GPU (20)

SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
SPEED-UP IMPROVEMENT USING PARALLEL APPROACH IN IMAGE STEGANOGRAPHY
 
Effective Sparse Matrix Representation for the GPU Architectures
Effective Sparse Matrix Representation for the GPU ArchitecturesEffective Sparse Matrix Representation for the GPU Architectures
Effective Sparse Matrix Representation for the GPU Architectures
 
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONSA SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
A SURVEY ON GPU SYSTEM CONSIDERING ITS PERFORMANCE ON DIFFERENT APPLICATIONS
 
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045
 
Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045Volume 2-issue-6-2040-2045
Volume 2-issue-6-2040-2045
 
Image Processing Application on Graphics processors
Image Processing Application on Graphics processorsImage Processing Application on Graphics processors
Image Processing Application on Graphics processors
 
Accelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous PlatformsAccelerating Real Time Applications on Heterogeneous Platforms
Accelerating Real Time Applications on Heterogeneous Platforms
 
GRID COMPUTING
GRID COMPUTINGGRID COMPUTING
GRID COMPUTING
 
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...
K Means Clustering Algorithm for Partitioning Data Sets Evaluated From Horizo...
 
Parallel algorithms for multi-source graph traversal and its applications
Parallel algorithms for multi-source graph traversal and its applicationsParallel algorithms for multi-source graph traversal and its applications
Parallel algorithms for multi-source graph traversal and its applications
 
MapReduce: Distributed Computing for Machine Learning
MapReduce: Distributed Computing for Machine LearningMapReduce: Distributed Computing for Machine Learning
MapReduce: Distributed Computing for Machine Learning
 
B036407011
B036407011B036407011
B036407011
 
Accelerated seam carving using cuda
Accelerated seam carving using cudaAccelerated seam carving using cuda
Accelerated seam carving using cuda
 
Accelerated seam carving using cuda
Accelerated seam carving using cudaAccelerated seam carving using cuda
Accelerated seam carving using cuda
 
IRJET- Enhanced Density Based Method for Clustering Data Stream
IRJET-  	  Enhanced Density Based Method for Clustering Data StreamIRJET-  	  Enhanced Density Based Method for Clustering Data Stream
IRJET- Enhanced Density Based Method for Clustering Data Stream
 
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
 
High Performance Medical Reconstruction Using Stream Programming Paradigms
High Performance Medical Reconstruction Using Stream Programming ParadigmsHigh Performance Medical Reconstruction Using Stream Programming Paradigms
High Performance Medical Reconstruction Using Stream Programming Paradigms
 
Ultra Fast SOM using CUDA
Ultra Fast SOM using CUDAUltra Fast SOM using CUDA
Ultra Fast SOM using CUDA
 
IRJET- Latin Square Computation of Order-3 using Open CL
IRJET- Latin Square Computation of Order-3 using Open CLIRJET- Latin Square Computation of Order-3 using Open CL
IRJET- Latin Square Computation of Order-3 using Open CL
 
A frame work for clustering time evolving data
A frame work for clustering time evolving dataA frame work for clustering time evolving data
A frame work for clustering time evolving data
 

Plus de ijsrd.com

IoT Enabled Smart Grid
IoT Enabled Smart GridIoT Enabled Smart Grid
IoT Enabled Smart Gridijsrd.com
 
A Survey Report on : Security & Challenges in Internet of Things
A Survey Report on : Security & Challenges in Internet of ThingsA Survey Report on : Security & Challenges in Internet of Things
A Survey Report on : Security & Challenges in Internet of Thingsijsrd.com
 
IoT for Everyday Life
IoT for Everyday LifeIoT for Everyday Life
IoT for Everyday Lifeijsrd.com
 
Study on Issues in Managing and Protecting Data of IOT
Study on Issues in Managing and Protecting Data of IOTStudy on Issues in Managing and Protecting Data of IOT
Study on Issues in Managing and Protecting Data of IOTijsrd.com
 
Interactive Technologies for Improving Quality of Education to Build Collabor...
Interactive Technologies for Improving Quality of Education to Build Collabor...Interactive Technologies for Improving Quality of Education to Build Collabor...
Interactive Technologies for Improving Quality of Education to Build Collabor...ijsrd.com
 
Internet of Things - Paradigm Shift of Future Internet Application for Specia...
Internet of Things - Paradigm Shift of Future Internet Application for Specia...Internet of Things - Paradigm Shift of Future Internet Application for Specia...
Internet of Things - Paradigm Shift of Future Internet Application for Specia...ijsrd.com
 
A Study of the Adverse Effects of IoT on Student's Life
A Study of the Adverse Effects of IoT on Student's LifeA Study of the Adverse Effects of IoT on Student's Life
A Study of the Adverse Effects of IoT on Student's Lifeijsrd.com
 
Pedagogy for Effective use of ICT in English Language Learning
Pedagogy for Effective use of ICT in English Language LearningPedagogy for Effective use of ICT in English Language Learning
Pedagogy for Effective use of ICT in English Language Learningijsrd.com
 
Virtual Eye - Smart Traffic Navigation System
Virtual Eye - Smart Traffic Navigation SystemVirtual Eye - Smart Traffic Navigation System
Virtual Eye - Smart Traffic Navigation Systemijsrd.com
 
Ontological Model of Educational Programs in Computer Science (Bachelor and M...
Ontological Model of Educational Programs in Computer Science (Bachelor and M...Ontological Model of Educational Programs in Computer Science (Bachelor and M...
Ontological Model of Educational Programs in Computer Science (Bachelor and M...ijsrd.com
 
Understanding IoT Management for Smart Refrigerator
Understanding IoT Management for Smart RefrigeratorUnderstanding IoT Management for Smart Refrigerator
Understanding IoT Management for Smart Refrigeratorijsrd.com
 
DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...
DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...
DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...ijsrd.com
 
A Review: Microwave Energy for materials processing
A Review: Microwave Energy for materials processingA Review: Microwave Energy for materials processing
A Review: Microwave Energy for materials processingijsrd.com
 
Web Usage Mining: A Survey on User's Navigation Pattern from Web Logs
Web Usage Mining: A Survey on User's Navigation Pattern from Web LogsWeb Usage Mining: A Survey on User's Navigation Pattern from Web Logs
Web Usage Mining: A Survey on User's Navigation Pattern from Web Logsijsrd.com
 
APPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEM
APPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEMAPPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEM
APPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEMijsrd.com
 
Making model of dual axis solar tracking with Maximum Power Point Tracking
Making model of dual axis solar tracking with Maximum Power Point TrackingMaking model of dual axis solar tracking with Maximum Power Point Tracking
Making model of dual axis solar tracking with Maximum Power Point Trackingijsrd.com
 
A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...
A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...
A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...ijsrd.com
 
Study and Review on Various Current Comparators
Study and Review on Various Current ComparatorsStudy and Review on Various Current Comparators
Study and Review on Various Current Comparatorsijsrd.com
 
Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...
Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...
Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...ijsrd.com
 
Defending Reactive Jammers in WSN using a Trigger Identification Service.
Defending Reactive Jammers in WSN using a Trigger Identification Service.Defending Reactive Jammers in WSN using a Trigger Identification Service.
Defending Reactive Jammers in WSN using a Trigger Identification Service.ijsrd.com
 

Plus de ijsrd.com (20)

IoT Enabled Smart Grid
IoT Enabled Smart GridIoT Enabled Smart Grid
IoT Enabled Smart Grid
 
A Survey Report on : Security & Challenges in Internet of Things
A Survey Report on : Security & Challenges in Internet of ThingsA Survey Report on : Security & Challenges in Internet of Things
A Survey Report on : Security & Challenges in Internet of Things
 
IoT for Everyday Life
IoT for Everyday LifeIoT for Everyday Life
IoT for Everyday Life
 
Study on Issues in Managing and Protecting Data of IOT
Study on Issues in Managing and Protecting Data of IOTStudy on Issues in Managing and Protecting Data of IOT
Study on Issues in Managing and Protecting Data of IOT
 
Interactive Technologies for Improving Quality of Education to Build Collabor...
Interactive Technologies for Improving Quality of Education to Build Collabor...Interactive Technologies for Improving Quality of Education to Build Collabor...
Interactive Technologies for Improving Quality of Education to Build Collabor...
 
Internet of Things - Paradigm Shift of Future Internet Application for Specia...
Internet of Things - Paradigm Shift of Future Internet Application for Specia...Internet of Things - Paradigm Shift of Future Internet Application for Specia...
Internet of Things - Paradigm Shift of Future Internet Application for Specia...
 
A Study of the Adverse Effects of IoT on Student's Life
A Study of the Adverse Effects of IoT on Student's LifeA Study of the Adverse Effects of IoT on Student's Life
A Study of the Adverse Effects of IoT on Student's Life
 
Pedagogy for Effective use of ICT in English Language Learning
Pedagogy for Effective use of ICT in English Language LearningPedagogy for Effective use of ICT in English Language Learning
Pedagogy for Effective use of ICT in English Language Learning
 
Virtual Eye - Smart Traffic Navigation System
Virtual Eye - Smart Traffic Navigation SystemVirtual Eye - Smart Traffic Navigation System
Virtual Eye - Smart Traffic Navigation System
 
Ontological Model of Educational Programs in Computer Science (Bachelor and M...
Ontological Model of Educational Programs in Computer Science (Bachelor and M...Ontological Model of Educational Programs in Computer Science (Bachelor and M...
Ontological Model of Educational Programs in Computer Science (Bachelor and M...
 
Understanding IoT Management for Smart Refrigerator
Understanding IoT Management for Smart RefrigeratorUnderstanding IoT Management for Smart Refrigerator
Understanding IoT Management for Smart Refrigerator
 
DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...
DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...
DESIGN AND ANALYSIS OF DOUBLE WISHBONE SUSPENSION SYSTEM USING FINITE ELEMENT...
 
A Review: Microwave Energy for materials processing
A Review: Microwave Energy for materials processingA Review: Microwave Energy for materials processing
A Review: Microwave Energy for materials processing
 
Web Usage Mining: A Survey on User's Navigation Pattern from Web Logs
Web Usage Mining: A Survey on User's Navigation Pattern from Web LogsWeb Usage Mining: A Survey on User's Navigation Pattern from Web Logs
Web Usage Mining: A Survey on User's Navigation Pattern from Web Logs
 
APPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEM
APPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEMAPPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEM
APPLICATION OF STATCOM to IMPROVED DYNAMIC PERFORMANCE OF POWER SYSTEM
 
Making model of dual axis solar tracking with Maximum Power Point Tracking
Making model of dual axis solar tracking with Maximum Power Point TrackingMaking model of dual axis solar tracking with Maximum Power Point Tracking
Making model of dual axis solar tracking with Maximum Power Point Tracking
 
A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...
A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...
A REVIEW PAPER ON PERFORMANCE AND EMISSION TEST OF 4 STROKE DIESEL ENGINE USI...
 
Study and Review on Various Current Comparators
Study and Review on Various Current ComparatorsStudy and Review on Various Current Comparators
Study and Review on Various Current Comparators
 
Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...
Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...
Reducing Silicon Real Estate and Switching Activity Using Low Power Test Patt...
 
Defending Reactive Jammers in WSN using a Trigger Identification Service.
Defending Reactive Jammers in WSN using a Trigger Identification Service.Defending Reactive Jammers in WSN using a Trigger Identification Service.
Defending Reactive Jammers in WSN using a Trigger Identification Service.
 

Dernier

Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdfKamal Acharya
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escortsranjana rawat
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 

Dernier (20)

(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Online banking management system project.pdf
Online banking management system project.pdfOnline banking management system project.pdf
Online banking management system project.pdf
 
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Isha Call 7001035870 Meet With Nagpur Escorts
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 

Performance Optimization of Clustering On GPU

  • 1. IJSRD - International Journal for Scientific Research & Development| Vol. 1, Issue 4, 2013 | ISSN (online): 2321-0613 All rights reserved by www.ijsrd.com 862 Abstract--In today’s digital world, Data sets are increasing exponentially. Statistical analysis using clustering in various scientific and engineering applications become very challenging issue for such large data set. Clustering on huge data set and its performance are two major factors demand for optimization. Parallelization is well-known approach to optimize performance. It has been observed from recent research work that GPU based parallelization help to achieve high degree of performance. Hence, this thesis focuses on optimizing hierarchical clustering algorithms using parallelization. It covers implementation of optimized algorithm on various parallel environments using Open MP on multi-core architecture and using CUDA on may-core architecture. I. INTRODUCTION Clustering is the task of grouping a set of objects in such a way that objects in the same group (called cluster) are more similar (in some sense or another) to each other than to those in other groups (clusters). It is a main task of explorative data mining, and a common technique for statistical data analysis used in many fields, including machine learning, pattern recognition, image analysis, information retrieval, and bioinformatics. A "clustering" is essentially a set of such clusters, usually containing all objects in the data set. Additionally, it may specify the relationship of the clusters to each other, for example a hierarchy of clusters embedded in each other. Clustering can be roughly distinguished in: 1) Hard clustering: each object belongs to a cluster or not 2) Soft clustering (fuzzy clustering): each object belongs to each cluster to a certain degree (e.g. a likelihood of belonging to the cluster) A. Classification of Clustering Traditionally clustering techniques are broadly divided in [1]: 1) Hierarchical methods 2) Partitioning Methods 3) Density Based Algorithms 4) Grid Based Clustering B. Hierarchical Methods Hierarchical clustering is a method of cluster analysis which seeks to build a hierarchy of clusters. Hierarchical clustering generally falls into two types: 1) Agglomerative Algorithm 2) Division Algorithm In hierarchical clustering the data are not partitioned into a particular cluster in a single step. Instead, a series of partitions takes place, which may run from a single cluster containing all objects to n clusters each containing a single object. Hierarchical Clustering is subdivided into agglomerative methods, which proceed by series of fusions of the n objects into groups, and divisive methods, which separate n objects successively into finer groupings. Hierarchical clustering may be represented by a two dimensional diagram known as dendrogram which illustrates the fusions or divisions made at each successive stage of analysis. C. Partitioning Methods The partitioning methods generally result in a set of M clusters, each object belonging to one cluster. Each cluster may be represented by a centroid or a cluster representative; this is some sort of summary description of all the objects contained in a cluster. The precise form of this description will depend on the type of the object which is being clustered. In case where real-valued data is available, the arithmetic mean of the attribute vectors for all objects within a cluster provides an appropriate representative; alternative types of centroid may be required in other cases, e.g., a cluster of documents can be represented by a list of those keywords that occur in some minimum number of documents within a cluster. If the number of the clusters is large, the centroids can be further clustered to produces hierarchy within a dataset. D. Density-Based Algorithms Density-based algorithms are capable of discovering clusters of arbitrary shapes. Also this provides a natural protection against outliers. These algorithms group objects according to specific density objective functions. Density is usually defined as the number of objects in a particular neighborhood of a data objects. In these approaches a given cluster continues growing as long as the number of objects in the neighborhood exceeds some parameter. E. Grid Based Clustering These focus on spatial data i.e. the data that model the geometric structure of objects in the space, their relationships, properties and operations. This technique quantizes the data set into a no. of cells and then work with objects belonging to these cells. They do not relocate points but ratter builds several hierarchical levels of groups of objects. The merging of grids and consequently clusters, does not depend on a distance measure .It is determined by a predefined parameter. 1) Mode of Performance Optimization Performance optimization is foremost objective when we are performing many computations on processor. Performance Optimization of Clustering On GPU Vijal D. Patel1 Prof. Sumitra Menaria2 1, 2 C.S.E. Dept., Parul Institute of Engineering & Technology
  • 2. Performance Optimization of Clustering on GPU (IJSRD/Vol. 1/Issue 4/2013/0014) All rights reserved by www.ijsrd.com 863 So following techniques are used for performance optimization: 1) Efficient Resources (Devices ) 2) Optimized computing Algorithm 3) Best utilization of resources with less cost (Parallel processing) This paper proposes a mechanism for High Performance Computing. II. GPU ARCHITECTURE GPU (Graphics Processing Unit) have been of large interest in last decade to achieve high performance computing through massive data parallelism model. GPU has hundreds of core processors. GPUs are designed for graphics. GPU is used to process similar operation on independent vertices & pixels in parallel. GPU is more suitable for stream computations like SIMD (single Instruction multiple data) programming model. For higher efficiency, GPU processes many data elements in parallel with same program module. Last decade, to satisfy demand of high performance computing a tremendous rise in computing power has been observed. Although CPU has been constantly increasing their performance, the growth has been over shadowed by the improvement in performance of GPU. This trend has led to draw the interest of researchers with high computational requirements to look for GPUs as a possible solution. This has led to the emergence of a new term GPGPU (General Purpose computing using Graphical Processing Units). GPGPU is defined as General Purpose computation using GPU and graphics API in applications other than 3D graphics. In order to achieve higher efficiency, GPGPU is designed as a combination between hardware components and software that allows the use of a traditional GPU (Graphics processor) to perform non-graphical computing tasks with higher processing power. [2] Fig. 1 describes GPU architecture. Each GPU has number of Streaming Multiprocessors(SM) i.e.30 SMs and each streaming multiprocessor has eight streaming core processor. This architecture has characteristics to perform same operation on collection of records (Data parallelism). Hence, GPU uses streaming processor as core processor to achieve massive data parallelism. Each NVIDIA GPU has a hundreds of parallel cores and within each core has floating point unit, logic unit (add, sub, mul), move and compare unit, and a branch unit. Figure (1): GPU Architecture [2] Hence, GPU uses streaming processor as core processor to achieve massive data parallelism. Each NVIDIA GPU has a hundreds of parallel cores and within each core has floating point unit, logic unit (add, sub, mul), move and compare unit, and a branch unit. Each core is managed by a thread manager. A thread manager can spawn and manage hundreds of threads per core. A very important feature is that it uses lightweight threads that have little creation overhead and zero- overhead scheduling. The GPU is highly parallel compute coprocessor that has its own device memory with high bandwidth. III. CUDA Compute Unified Device Architecture (CUDA) is a Programming model and the APIs required for performing computation tasks on NVIDIA GPUs. CUDA is a scalable highly parallel and multithreaded API to execute a program on any number of processor without recompiling. CUDA allow a program written for a platform to be executed in other platform without recompilation. CUDA provides efficient programming environment for regular applications. CUDA is designed to support many programming languages as shown in Fig.2. It provides extension to programming languages for performing computation on NVIDIA GPUs. CUDA C is extension to C language with CUDA API to operate on NVIDIA GPU. It allows a programmer to write a program for one thread, and it instantiate for multiple parallel threads. Figure (2): CUDA Interface [2] A. CUDA Programming Execution model Figure (3): CUDA execution model [2] Programming model provided for designing and executing general purpose computation kernel on NVIDIA GPU is named as Compute Unified Device Architecture (CUDA).CUDA programming modal is available for most known operating systems Windows, and Linux. CUDA programming modal allows programmer to use GPU for parallel programming without knowledge of graphics. Fig.3 describes CUDA computational model to be
  • 3. Performance Optimization of Clustering on GPU (IJSRD/Vol. 1/Issue 4/2013/0014) All rights reserved by www.ijsrd.com 864 executed on CPU and GPU both. A CUDA program is sequential host program and one or many kernel code as shown in Fig.3.Serial code will be executed by a CPU thread, and Kernel code will be executed by GPU threads grouped in a block. Kernel here specifies portion of computational code to be executed on NVIDIA GPU device. CUDA kernel executes on a grid of thread blocks, where a thread block is a group of threads that work in SIMT(Single Instruction Multiple Thread) model. Threads of each block execute cooperates each other using shared memory and synchronizing its execution. Threads from different blocks operate independently. A grid can be of 1 or 2 dimensions, and a thread block can be of 1, 2, or 3 dimension as shown in Fig.3. A grid can have up to 65535 blocks of threads in each dimension and a thread block can support 768 or 1024 threads depending on GPU architecture. Each thread on a block and a block on grid has a unique identification. IV. MCL ALGORITHM MCL uses two simple algebraic operations, expansion and inflation, on the stochastic (Markov) matrix associated with a graph. The Markov matrix M associated with a graph G is defined by normalizing all columns of the adjacency matrix of G. The clustering process simulates random walks (or flow) within the graph using expansion operations, and then strengthens the flow where it is already strong and weakens it where it is weak using inflation operations. By continuously alternating these two processes, the underlying structure of the graph gradually becomes apparent, and there is Converges to a result with regions with strong internal flow (clusters) separated by boundaries within which flow is absent. 1. // G is a graph, matrix M and a real number r > 1 2. // columns of M with power coefficient r is written Γr(M), and Γr is called the inflation operator with power coefficient r 3. // Γr(Mij) = Mijr / Σr,j(M) 4. add loops to G 5. set Γ to some value 6. set M_1 to be the matrix of random walks on G 7. while (change) 8. { 9. M_2 = M_1 * M_1 # expansion 10. M_1 = Γ(M_2) # inflation 11. change = difference(M_1, M_2) 12. } 13. set CLUSTERING as the components of M_1 CUDA MCL Algorithm Listing (1): MCL Algorithm [12] V. CLUSTERING RESULTS AND ANALYSIS In Fig 4 some of the results of our clusters analysis are shown, comparing number of clusters against cluster sizes, for the clusters output from data sets PPI1 (Bio GRID) Figure (4): Comparison of MCL Execution time for SpMv using ELLAPCK-R format Figure (5): Speedup of MCL Execution time for Sparse- matrix using ELLAPCK-R format
  • 4. Performance Optimization of Clustering on GPU (IJSRD/Vol. 1/Issue 4/2013/0014) All rights reserved by www.ijsrd.com 865 Figure (6): Comparison of MCL Execution time for SpMv matrix using ELLPACK-R format on GPU without transfer of data Figure (7): Comparison of MCL Execution time on CPU and GPU using ELLPACK-R Format VI. CONCLUSION AND FUTURE WORK 1) As it has been Clustering Algorithm giving best performance on small data set so we will implement it on large data set and different GPU platform and optimize the Performance. 2) After Observation Parallel Computing is most important mode of Performance Optimization. 3) For high Performance we will implement Many Clustering Algorithm on Different 4) Platform of GPU and find out which platform is best according to Application. REFERENCES [1] Pradeep Rai, Shubha Singh, “A Survey of Clustering Techniques” in proceedings of the International journal of Computer Applications (0975-8887) Volume 7-No.12, October 2010. [2] NVIDIA Corporation, NVIDIA Programming Guide, Version 3.1.1,July 2010. [3] Osama Abu Abbas, “Comparison Between Data Clustering Algorithm” in proceedings of the International Arab Journal of Information Technology, Vol. 5, No. 3, July 2008. [4] A.K. Jain, M.N. Murty, P.J. Flynn, “Data Clustering: A Review” ACM computing Surveys, Vol. 31, No. 3, September 1999. [5] Wenbin Fang, Milan Lu, Xiangye Xiao, Bingsheng He, Qiong Luo, “Frequent Itemset Mining on Graphics Processors” in proceeding of the Fifth International Workshop on Data management on New Hardware, June 28, 2009. [6] Balaji Dhanasekaran, Norman Rubin, “A new method for GPU based Irregular Reductions and Its Application to K-Means Clustering” in proceedings of the International Conference on GPGPU-4, March 2011. [7] Feng Cao, A K.H. Tung, A Zhou, “Scalable Clustering Using Graphics Processors” School of Computing, National University of Singapore, 2004. [8] Ren Wu, Bin Zhang, Meichun Hsu, “Clustering Billions of Data Points Using GPUs” UCHPC- MAW’09, May 2009, Ischia, Italy. [9] Wenjing Ma, Gagan Agrawal, “A Translation System for Enabling Data Mining Applications On GPUs”, ICS’09, June 2009, New York, USA, pp. 400-409. [10] Hanaa M. Hussain, K Benkrid, A Erdogan, H Seker, “Highly Parameterized K-means Clustering on FPGAs: Comparative Results with GPPs and GPUs”, in proceeding of the International Conference on Reconfigurable Computing and FPGAs, IEEE 2011. [11] Dar-Jen Chang, M Kantardzic, M Ouyang, “ Hierarchical clustering with CUDA/GPU”, UCHPC- MAW, 2010. [12] Alhadi Bustamam, Kevin Burrage, Nicholas A. Hamilton, “Fast Parallel Markov Clustering in Bioinformatics Using Massively Parallel Computing on GPU with CUDA and ELLPACK-R Sparse Format”, in proceeding of the IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, VOL. 9, NO. 3, MAY/JUNE 2012.