SlideShare une entreprise Scribd logo
1  sur  3
Télécharger pour lire hors ligne
IOSR Journal of Computer Engineering (IOSR-JCE)
e-ISSN: 2278-0661, p- ISSN: 2278-8727Volume 10, Issue 4 (Mar. - Apr. 2013), PP 38-40
www.iosrjournals.org
www.iosrjournals.org 38 | Page
A Study on Optimization of Top-k Queries in Relational
Databases
Neha Singh, P.K. Pandey*
Department of Computer Science, A.P.S. University, Rewa (M.P.)
* Department of Physics, Govt. Science College, Rewa (M.P.)
Abstract: Efficient execution of top-k queries is increasingly becoming a major challenge for relational
database technology. Top-k query has emerged as a key requirement in modern application. In these
applications, efficient and adaptive evaluation of top-k queries is an integral part of application semantics. In
this paper we discuss a query optimization framework of top-k queries that fully integrates rank-join operators.
Ranking queries produce results that are ordered on some computed score. A key property of top-k queries is
that, users are interested only in the first k results and not in total ranking of all query results. This property
directly impacts the optimization of top-k queries by optimizing for the first k results. Traditionally, most real
world database systems offer the feature of first n row optimization.
I. Introduction
The increasing importance of top-k queries warrants an efficient support of ranking in the relational
database management system. Emerging applications that depend on ranking queries warrant efficient support
of ranking in database management system. Integration of database and information retrieval technologies ahs
been an active research topic recently [1]. Top-k queries are dominant in many emerging applications as web
databases, multimedia databases, middlewares, datamining etc. Supporting ranking gives database systems the
ability to efficiently answer Information Retrieval queries. A top-k query is an important type of query that
allows for supporting IR applications on top of database systems. Database system provide guarantee of
efficient data handling with solid integrity and consistency. An important subject in this connection is
determining the suitable size of inputs in relation N for answering to the rank ware query so that in this manner
top-k query answers are gotten. It is vital in information integration with large scale to select top-k rank aware
from multiple sources and it has also basic role for minimizing transfer cost because as the size of relations
become smaller, transfer cost become less [6]. The algorithm that is used for answering to the ranking queries is
estimating input sizes by means of statistical relations, monotony hypothesis and random variables [2, 3]. The
other new innovation provides a systematic and principled framework to support efficient evaluations of top-k
queries in relational database systems by extending relational algebra and query optimization [4]. In this
connection one another solution is improving the join and using the ripple join thath minimize the time in order
to get estimation with relatively acceptable precision for query results. The main idea of ripple join is join
algorithm aware the suggested rank for supporting join queries with top-k relational database [5]. In this paper
we study the optimization of top-k queries.
II. Relational Database
A Relational database management system (RDBMS) is a database management system (DBMS) that
is based on the relational model as introduced by E. F. Codd. A relational database of data items organized as a
set of formally described relations (also known as table) from which data can be accessed or reassembled in
many different ways without having to reorganize the database relations.
III. Top-K Queries
The top-k query is define as- Given a database D of m objects, each of which is characterized by n
attributes, a scoring function f, according to which we rank the objects in D, and the number of expected results
k. Then a top-k query Q returns the k objects with the highest rank in f. In Top-k query, query define on n
attribute a1, a2 ,…, an and relation M in the form of R1 ,R2 ,…,RM that each ai (i=1:n) belongs to one relation
Rj (j=1:M). Each of the attributes has special domain in comparison with their kind. According to the query, a
series of attributes of these relations are applied for projection, a series of attributes of these relations are used
for restriction and join. In the rank aware queries there is apart for ranking that some of relations attribute are
presented in the form of a ranking relation which is called ranking function. Ranking function f is formed in the
form of attribute n' that is n' <=n. A theory for ranking function f is this: ranking function changes in comparison
with all relations are monotonic. In addition to this, the number of suitable answers in rank aware queries is
determined too that is just Top k. Consider a set of relations R1 to Rm. Each tuple in Ri is associated with some
A Study on Optimization of Top-k Queries in Relational Databases
www.iosrjournals.org 39 | Page
score that gives it a rank within Ri. The top-k join query joins R1 to Rm and produces the results ranked on a
total score. The total score is computed according to some function, say F that combines individual scores. Note
that the score attached with each relation can be the value of one attribute or a value computed using a predicate
on a subset of its attributes. A possible SQL-like notation for expressing a top-k join query is as follows:
SELECT *
FROM R1, R2, . . . ,Rm
WHERE join condition (R1, R2, . . . ,Rm)
ORDER BY F (R1.score, R2.score. . . Rm.score)
LIMIT k;
Where LIMIT limits the number of results reported to the user.
IV. Query Optimization
Query optimization is necessary for high level relational queries to provide an opportunity for the
database system to systematically evaluate alternative query execution strategies and to choose an optimal
strategy. Query optimization is a function of many relational database management systems in which
multiple query plans for satisfying a query are examined and a good query plan is identified. The function of
query optimization is to select an efficient execution strategy. The figure 1 shows the query optimization process
which generates the best query execution plan. For a given query, the search space can be defined as the set of
equivalent operator tress that can be produced using transformation rules.
Input Query
Best QEP
figure 1: Query Optimization Process
V. Optimization of Top-k Queries
Optimization of top-k queries (also known as Rank aware query optimization) in relational database
has contributed to the existing system for the query operations significantly. In this section we discuss a rank
aware query optimization framework that fully integrates rank-join operators into relational query engine and
how to extend the traditional query optimization to handle the new rank join operators. The two major task are
includes while integrating the new rank-join operators in query optimizer- enlarging the space of possible plans
to include those plans that use rank join operators as a possible join alternative and providing a costing
mechanism for the new operators to help the optimizer prune expensive plans in favor of more general cheaper
plans.
The first task, enlarging the plan space is achieved by extending the enumeration algorithm to produce
new execution plans. Now we exhibit how to extend the enumeration space to generate top-k query execution
plans. Top-k plans will integrate the rank-join operators into general execution plans. For a query with n input
relations, R1 to Rn, assume there exists a ranking function f(s1,s2,…sn), where si is score expression on relation
Ri. For two sets of input relations, P and Q, we extend the space of plans that join P and Q to include rank join
plans by make suitable for following:
- Join Eligibility: P and Q are rank-join eligible if all the following apply:
1. There is a join condition that relates at least one input relation in P to an input relation in Q.
Search space
generation
Transformation
rules
Equivalent QEP
Search
strategy
Cost model
A Study on Optimization of Top-k Queries in Relational Databases
www.iosrjournals.org 40 | Page
2. f can be expressed as f(f1(SP); f2(SQ); f3(SO)), where f1, f2 and f3 are three scoring functions, SP are the
score expressions on the relations in P, SQ are the score expressions on the relations in Q, and SO are the score
expressions on the rest of the input relations.
3. There is at least one plan that accesses P and/or Q ordered on SP and/or SQ, respectively.
-Join Choices: Rank-join can have several implementations as physical join operators. For each rank-join
between P and Q, plans can be generated for each join implementation.
-Join Order: For symmetric rank-join operators there is no distinction between outer and inner relations. For the
nested-loops implementation, a different plan can be generated by switching the inner and the outer relations.
Now we discuss the second task, providing a costing mechanism for the new operators to help the
optimizer prune expensive plans in favor of more general cheaper plans. In real-world query optimizers, the cost
model for different query operators is quite complex and depends on many parameters. Parameters include input
cardinality, available buffers, type of access paths etc. Although cost models can be very complex, a key
ingredient of accurate estimation is the accuracy of estimating the size of intermediate results. In traditional join
operators, the input cardinalities are independent of the operator itself and only depend on the input subplan.
Mostly, the output cardinality depends only on the size of the inputs and the selectivity of the logical operation.
On the other hand, since a rank-join operator does not consume all of its inputs, the actual input size depends on
the operator itself and how the operator decides that it has seen enough information from the inputs to generate
the top k results. Hence, the input cardinality depends on the number of ranked join results requested from that
operator. Therefore, the cost of a rank-join operator depends on the following:
- The number of required results k and how k is propagated in the pipeline
- The number of tuples from inputs that contain enough information for the operator to report the required
number of answers, k.
- The selectivity of the join operation affects the number of tuples propagated from the inputs to higher
operators through the join operation. Hence, the join selectivity affects the number of input tuples required by
the rank-join operator to produce ranked results.
There are two ways to produce plans that join two sets of input relations, P and Q, and produce ranked
results- by using rank-join operators to join L and R subplans or by gluing a sort operator on the cheapest join
plan that joins P and Q without preserving the required order. The challenge is in comparing two plans when
one or both of them are rank-join plans. For all rank-join plans, the cost of the plan depends on k and the join
selectivity s. Since these two parameters are the same for all plans, the pruning among these plans follows the
same mechanism as in traditional cost based pruning. For example, pruning a rank-join plan in favor of another
rank-join plan depends on the input cardinality of the relations, the cost of the join method, the access paths, and
the statistics available on the input scores. We assume the availability of an estimate of the join selectivity,
which is the same for both sort-plans and rank-join plans.
VI. Conclusion
Query optimization is essential for large information retrieval system. In this paper we describe the top-
k queries and general query optimization method for to provide an opportunity for the database system to
systematically evaluate alternative query execution strategies and to choose an optimal strategy. We also
discussed optimization of top-k queries with a framework for integrating rank-join operators in real-world query
optimizers. Our framework was based on two main steps- first, extend the enumeration phase of the query
optimizer to generate rank-aware plans. The extension was achieved by providing rank-join operators as
possible join choices and by defining ranking expressions as a new physical plan property.
References
[1] S. Chaudhury, R. Ramakrishnan and G. Weikum (2005), Integrating db and ir technologies, CIDR.
[2] Ilyas, I.F. and W.G. Aref (2006), Adaptive Rank aware Query Optimization in Relational Databases, ACM Transactions on
Database Systems.
[3] Ilyas, I.F. and R. Shah (2004), Rank-aware Query Optimization,SIGMOD, June13-18, Paris, France.
[4] Ilyas, I.F. and C. Li, 2005. Ranksql: Query algebra and optimization for relational top-k queries.SIGMOD 2005, June 1416,
Baltimore, Maryland, USA.
[5] Ilyas, I.F. and W.G. Aref, 2003. Supporting Top-k Join Queries in Relational Databases. Proceedings of the 29th VLDB Conference,
Berlin, Germany.
[6] M.A. Kashem, A S Chowdhury, R Deb, M Jahan (2010), Query Optimization on Relational Databases for Supporting Top-k Query
Processing techniques, JCIT

Contenu connexe

Tendances

Enhancing three variants of harmony search algorithm for continuous optimizat...
Enhancing three variants of harmony search algorithm for continuous optimizat...Enhancing three variants of harmony search algorithm for continuous optimizat...
Enhancing three variants of harmony search algorithm for continuous optimizat...IJECEIAES
 
A COMPARISON BETWEEN SWARM INTELLIGENCE ALGORITHMS FOR ROUTING PROBLEMS
A COMPARISON BETWEEN SWARM INTELLIGENCE ALGORITHMS FOR ROUTING PROBLEMSA COMPARISON BETWEEN SWARM INTELLIGENCE ALGORITHMS FOR ROUTING PROBLEMS
A COMPARISON BETWEEN SWARM INTELLIGENCE ALGORITHMS FOR ROUTING PROBLEMSecij
 
Query processing and Query Optimization
Query processing and Query OptimizationQuery processing and Query Optimization
Query processing and Query OptimizationNiraj Gandha
 
Bit Error Rate Analysis in Multicast Multiple Input Multiple Output Systems
Bit Error Rate Analysis in Multicast Multiple Input Multiple Output SystemsBit Error Rate Analysis in Multicast Multiple Input Multiple Output Systems
Bit Error Rate Analysis in Multicast Multiple Input Multiple Output Systemsrahulmonikasharma
 
Survey on Location Based Recommendation System Using POI
Survey on Location Based Recommendation System Using POISurvey on Location Based Recommendation System Using POI
Survey on Location Based Recommendation System Using POIIRJET Journal
 
On the Performance of the Pareto Set Pursuing (PSP) Method for Mixed-Variable...
On the Performance of the Pareto Set Pursuing (PSP) Method for Mixed-Variable...On the Performance of the Pareto Set Pursuing (PSP) Method for Mixed-Variable...
On the Performance of the Pareto Set Pursuing (PSP) Method for Mixed-Variable...Amir Ziai
 
Modeling Search Computing Applications
Modeling Search Computing ApplicationsModeling Search Computing Applications
Modeling Search Computing ApplicationsMarco Brambilla
 
Comparison of search algorithms in Javanese-Indonesian dictionary application
Comparison of search algorithms in Javanese-Indonesian dictionary applicationComparison of search algorithms in Javanese-Indonesian dictionary application
Comparison of search algorithms in Javanese-Indonesian dictionary applicationTELKOMNIKA JOURNAL
 
Analysis of signal transition
Analysis of signal transitionAnalysis of signal transition
Analysis of signal transitioncsandit
 
Masters Thesis Defense Talk
Masters Thesis Defense TalkMasters Thesis Defense Talk
Masters Thesis Defense TalkRavi Gummadi
 
PPT_ADML_PGM_KnowledgeSharing_9JULY2015_v1
PPT_ADML_PGM_KnowledgeSharing_9JULY2015_v1PPT_ADML_PGM_KnowledgeSharing_9JULY2015_v1
PPT_ADML_PGM_KnowledgeSharing_9JULY2015_v1Shweta Sood
 

Tendances (12)

Enhancing three variants of harmony search algorithm for continuous optimizat...
Enhancing three variants of harmony search algorithm for continuous optimizat...Enhancing three variants of harmony search algorithm for continuous optimizat...
Enhancing three variants of harmony search algorithm for continuous optimizat...
 
A COMPARISON BETWEEN SWARM INTELLIGENCE ALGORITHMS FOR ROUTING PROBLEMS
A COMPARISON BETWEEN SWARM INTELLIGENCE ALGORITHMS FOR ROUTING PROBLEMSA COMPARISON BETWEEN SWARM INTELLIGENCE ALGORITHMS FOR ROUTING PROBLEMS
A COMPARISON BETWEEN SWARM INTELLIGENCE ALGORITHMS FOR ROUTING PROBLEMS
 
Query processing and Query Optimization
Query processing and Query OptimizationQuery processing and Query Optimization
Query processing and Query Optimization
 
I AM SAM web app
I AM SAM web appI AM SAM web app
I AM SAM web app
 
Bit Error Rate Analysis in Multicast Multiple Input Multiple Output Systems
Bit Error Rate Analysis in Multicast Multiple Input Multiple Output SystemsBit Error Rate Analysis in Multicast Multiple Input Multiple Output Systems
Bit Error Rate Analysis in Multicast Multiple Input Multiple Output Systems
 
Survey on Location Based Recommendation System Using POI
Survey on Location Based Recommendation System Using POISurvey on Location Based Recommendation System Using POI
Survey on Location Based Recommendation System Using POI
 
On the Performance of the Pareto Set Pursuing (PSP) Method for Mixed-Variable...
On the Performance of the Pareto Set Pursuing (PSP) Method for Mixed-Variable...On the Performance of the Pareto Set Pursuing (PSP) Method for Mixed-Variable...
On the Performance of the Pareto Set Pursuing (PSP) Method for Mixed-Variable...
 
Modeling Search Computing Applications
Modeling Search Computing ApplicationsModeling Search Computing Applications
Modeling Search Computing Applications
 
Comparison of search algorithms in Javanese-Indonesian dictionary application
Comparison of search algorithms in Javanese-Indonesian dictionary applicationComparison of search algorithms in Javanese-Indonesian dictionary application
Comparison of search algorithms in Javanese-Indonesian dictionary application
 
Analysis of signal transition
Analysis of signal transitionAnalysis of signal transition
Analysis of signal transition
 
Masters Thesis Defense Talk
Masters Thesis Defense TalkMasters Thesis Defense Talk
Masters Thesis Defense Talk
 
PPT_ADML_PGM_KnowledgeSharing_9JULY2015_v1
PPT_ADML_PGM_KnowledgeSharing_9JULY2015_v1PPT_ADML_PGM_KnowledgeSharing_9JULY2015_v1
PPT_ADML_PGM_KnowledgeSharing_9JULY2015_v1
 

En vedette

Bridging the Gap Between Industry and Higher Education Demands on Electronic ...
Bridging the Gap Between Industry and Higher Education Demands on Electronic ...Bridging the Gap Between Industry and Higher Education Demands on Electronic ...
Bridging the Gap Between Industry and Higher Education Demands on Electronic ...IOSR Journals
 
Concepts and Derivatives of Web Services
Concepts and Derivatives of Web ServicesConcepts and Derivatives of Web Services
Concepts and Derivatives of Web ServicesIOSR Journals
 
Security Threat Solution over Single Cloud To Multi-Cloud Using DepSky Model
Security Threat Solution over Single Cloud To Multi-Cloud Using DepSky ModelSecurity Threat Solution over Single Cloud To Multi-Cloud Using DepSky Model
Security Threat Solution over Single Cloud To Multi-Cloud Using DepSky ModelIOSR Journals
 
Discovering Frequent Patterns with New Mining Procedure
Discovering Frequent Patterns with New Mining ProcedureDiscovering Frequent Patterns with New Mining Procedure
Discovering Frequent Patterns with New Mining ProcedureIOSR Journals
 
SQl Injection Protector for Authentication in Distributed Applications
SQl Injection Protector for Authentication in Distributed ApplicationsSQl Injection Protector for Authentication in Distributed Applications
SQl Injection Protector for Authentication in Distributed ApplicationsIOSR Journals
 
Routing Algorithm for Heterogeneous Wireless Network in Vanet
Routing Algorithm for Heterogeneous Wireless Network in VanetRouting Algorithm for Heterogeneous Wireless Network in Vanet
Routing Algorithm for Heterogeneous Wireless Network in VanetIOSR Journals
 
Assessing Buffering with Scheduling Schemes in a QoS Internet Router
Assessing Buffering with Scheduling Schemes in a QoS Internet RouterAssessing Buffering with Scheduling Schemes in a QoS Internet Router
Assessing Buffering with Scheduling Schemes in a QoS Internet RouterIOSR Journals
 
Multipath Routing Protocol by Breadth First Search Algorithm in Wireless Mesh...
Multipath Routing Protocol by Breadth First Search Algorithm in Wireless Mesh...Multipath Routing Protocol by Breadth First Search Algorithm in Wireless Mesh...
Multipath Routing Protocol by Breadth First Search Algorithm in Wireless Mesh...IOSR Journals
 
Real Time Services for Cloud Computing Enabled Vehicle Networks
Real Time Services for Cloud Computing Enabled Vehicle NetworksReal Time Services for Cloud Computing Enabled Vehicle Networks
Real Time Services for Cloud Computing Enabled Vehicle NetworksIOSR Journals
 
Quality Assurance Standards and Survey of IT Industries
Quality Assurance Standards and Survey of IT IndustriesQuality Assurance Standards and Survey of IT Industries
Quality Assurance Standards and Survey of IT IndustriesIOSR Journals
 
Privacy and Integrity Preserving in Cloud Storage Devices
Privacy and Integrity Preserving in Cloud Storage DevicesPrivacy and Integrity Preserving in Cloud Storage Devices
Privacy and Integrity Preserving in Cloud Storage DevicesIOSR Journals
 
A Survey on Applications of Neural Networks and Genetic Algorithms in Fault D...
A Survey on Applications of Neural Networks and Genetic Algorithms in Fault D...A Survey on Applications of Neural Networks and Genetic Algorithms in Fault D...
A Survey on Applications of Neural Networks and Genetic Algorithms in Fault D...IOSR Journals
 
Segmentation of Overlapped and Touching Human Chromosome images
Segmentation of Overlapped and Touching Human Chromosome imagesSegmentation of Overlapped and Touching Human Chromosome images
Segmentation of Overlapped and Touching Human Chromosome imagesIOSR Journals
 
Analysis of service-oriented traffic classification with imperfect traffic cl...
Analysis of service-oriented traffic classification with imperfect traffic cl...Analysis of service-oriented traffic classification with imperfect traffic cl...
Analysis of service-oriented traffic classification with imperfect traffic cl...IOSR Journals
 

En vedette (20)

M01312106112
M01312106112M01312106112
M01312106112
 
F013114350
F013114350F013114350
F013114350
 
Bridging the Gap Between Industry and Higher Education Demands on Electronic ...
Bridging the Gap Between Industry and Higher Education Demands on Electronic ...Bridging the Gap Between Industry and Higher Education Demands on Electronic ...
Bridging the Gap Between Industry and Higher Education Demands on Electronic ...
 
N0171595105
N0171595105N0171595105
N0171595105
 
Concepts and Derivatives of Web Services
Concepts and Derivatives of Web ServicesConcepts and Derivatives of Web Services
Concepts and Derivatives of Web Services
 
L01117074
L01117074L01117074
L01117074
 
Security Threat Solution over Single Cloud To Multi-Cloud Using DepSky Model
Security Threat Solution over Single Cloud To Multi-Cloud Using DepSky ModelSecurity Threat Solution over Single Cloud To Multi-Cloud Using DepSky Model
Security Threat Solution over Single Cloud To Multi-Cloud Using DepSky Model
 
Discovering Frequent Patterns with New Mining Procedure
Discovering Frequent Patterns with New Mining ProcedureDiscovering Frequent Patterns with New Mining Procedure
Discovering Frequent Patterns with New Mining Procedure
 
SQl Injection Protector for Authentication in Distributed Applications
SQl Injection Protector for Authentication in Distributed ApplicationsSQl Injection Protector for Authentication in Distributed Applications
SQl Injection Protector for Authentication in Distributed Applications
 
Routing Algorithm for Heterogeneous Wireless Network in Vanet
Routing Algorithm for Heterogeneous Wireless Network in VanetRouting Algorithm for Heterogeneous Wireless Network in Vanet
Routing Algorithm for Heterogeneous Wireless Network in Vanet
 
Assessing Buffering with Scheduling Schemes in a QoS Internet Router
Assessing Buffering with Scheduling Schemes in a QoS Internet RouterAssessing Buffering with Scheduling Schemes in a QoS Internet Router
Assessing Buffering with Scheduling Schemes in a QoS Internet Router
 
Multipath Routing Protocol by Breadth First Search Algorithm in Wireless Mesh...
Multipath Routing Protocol by Breadth First Search Algorithm in Wireless Mesh...Multipath Routing Protocol by Breadth First Search Algorithm in Wireless Mesh...
Multipath Routing Protocol by Breadth First Search Algorithm in Wireless Mesh...
 
Real Time Services for Cloud Computing Enabled Vehicle Networks
Real Time Services for Cloud Computing Enabled Vehicle NetworksReal Time Services for Cloud Computing Enabled Vehicle Networks
Real Time Services for Cloud Computing Enabled Vehicle Networks
 
L0126598103
L0126598103L0126598103
L0126598103
 
Quality Assurance Standards and Survey of IT Industries
Quality Assurance Standards and Survey of IT IndustriesQuality Assurance Standards and Survey of IT Industries
Quality Assurance Standards and Survey of IT Industries
 
Privacy and Integrity Preserving in Cloud Storage Devices
Privacy and Integrity Preserving in Cloud Storage DevicesPrivacy and Integrity Preserving in Cloud Storage Devices
Privacy and Integrity Preserving in Cloud Storage Devices
 
I0426778
I0426778I0426778
I0426778
 
A Survey on Applications of Neural Networks and Genetic Algorithms in Fault D...
A Survey on Applications of Neural Networks and Genetic Algorithms in Fault D...A Survey on Applications of Neural Networks and Genetic Algorithms in Fault D...
A Survey on Applications of Neural Networks and Genetic Algorithms in Fault D...
 
Segmentation of Overlapped and Touching Human Chromosome images
Segmentation of Overlapped and Touching Human Chromosome imagesSegmentation of Overlapped and Touching Human Chromosome images
Segmentation of Overlapped and Touching Human Chromosome images
 
Analysis of service-oriented traffic classification with imperfect traffic cl...
Analysis of service-oriented traffic classification with imperfect traffic cl...Analysis of service-oriented traffic classification with imperfect traffic cl...
Analysis of service-oriented traffic classification with imperfect traffic cl...
 

Similaire à A Study on Optimization of Top-k Queries in Relational Databases

probabilistic ranking
probabilistic rankingprobabilistic ranking
probabilistic rankingFELIX75
 
E132833
E132833E132833
E132833irjes
 
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAEFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAcsandit
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentIJERD Editor
 
Efficient Filtering Algorithms for Location- Aware Publish/subscribe
Efficient Filtering Algorithms for Location- Aware Publish/subscribeEfficient Filtering Algorithms for Location- Aware Publish/subscribe
Efficient Filtering Algorithms for Location- Aware Publish/subscribeIJSRD
 
IRJET- Online Course Recommendation System
IRJET- Online Course Recommendation SystemIRJET- Online Course Recommendation System
IRJET- Online Course Recommendation SystemIRJET Journal
 
IRJET- Boosting Response Aware Model-Based Collaborative Filtering
IRJET- Boosting Response Aware Model-Based Collaborative FilteringIRJET- Boosting Response Aware Model-Based Collaborative Filtering
IRJET- Boosting Response Aware Model-Based Collaborative FilteringIRJET Journal
 
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...IRJET Journal
 
A Study of Efficiency Improvements Technique for K-Means Algorithm
A Study of Efficiency Improvements Technique for K-Means AlgorithmA Study of Efficiency Improvements Technique for K-Means Algorithm
A Study of Efficiency Improvements Technique for K-Means AlgorithmIRJET Journal
 
Fast top k path-based relevance query on massive graphs
Fast top k path-based relevance query on massive graphsFast top k path-based relevance query on massive graphs
Fast top k path-based relevance query on massive graphsieeechennai
 
SHAHBAZ_TECHNICAL_SEMINAR.docx
SHAHBAZ_TECHNICAL_SEMINAR.docxSHAHBAZ_TECHNICAL_SEMINAR.docx
SHAHBAZ_TECHNICAL_SEMINAR.docxShahbazKhan77289
 
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...IOSR Journals
 
SIZE ESTIMATION OF OLAP SYSTEMS
SIZE ESTIMATION OF OLAP SYSTEMSSIZE ESTIMATION OF OLAP SYSTEMS
SIZE ESTIMATION OF OLAP SYSTEMScscpconf
 
Size estimation of olap systems
Size estimation of olap systemsSize estimation of olap systems
Size estimation of olap systemscsandit
 
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGESCASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGESIRJET Journal
 
Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...
Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...
Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...IRJET Journal
 

Similaire à A Study on Optimization of Top-k Queries in Relational Databases (20)

pdf
pdfpdf
pdf
 
probabilistic ranking
probabilistic rankingprobabilistic ranking
probabilistic ranking
 
E132833
E132833E132833
E132833
 
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATAEFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
EFFICIENTLY PROCESSING OF TOP-K TYPICALITY QUERY FOR STRUCTURED DATA
 
Apsec 2014 Presentation
Apsec 2014 PresentationApsec 2014 Presentation
Apsec 2014 Presentation
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
H1076875
H1076875H1076875
H1076875
 
Efficient Filtering Algorithms for Location- Aware Publish/subscribe
Efficient Filtering Algorithms for Location- Aware Publish/subscribeEfficient Filtering Algorithms for Location- Aware Publish/subscribe
Efficient Filtering Algorithms for Location- Aware Publish/subscribe
 
IRJET- Online Course Recommendation System
IRJET- Online Course Recommendation SystemIRJET- Online Course Recommendation System
IRJET- Online Course Recommendation System
 
IRJET- Boosting Response Aware Model-Based Collaborative Filtering
IRJET- Boosting Response Aware Model-Based Collaborative FilteringIRJET- Boosting Response Aware Model-Based Collaborative Filtering
IRJET- Boosting Response Aware Model-Based Collaborative Filtering
 
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
A WEB BASED APPLICATION FOR RESUME PARSER USING NATURAL LANGUAGE PROCESSING T...
 
A Study of Efficiency Improvements Technique for K-Means Algorithm
A Study of Efficiency Improvements Technique for K-Means AlgorithmA Study of Efficiency Improvements Technique for K-Means Algorithm
A Study of Efficiency Improvements Technique for K-Means Algorithm
 
Fast top k path-based relevance query on massive graphs
Fast top k path-based relevance query on massive graphsFast top k path-based relevance query on massive graphs
Fast top k path-based relevance query on massive graphs
 
SHAHBAZ_TECHNICAL_SEMINAR.docx
SHAHBAZ_TECHNICAL_SEMINAR.docxSHAHBAZ_TECHNICAL_SEMINAR.docx
SHAHBAZ_TECHNICAL_SEMINAR.docx
 
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
Impulsion of Mining Paradigm with Density Based Clustering of Multi Dimension...
 
SIZE ESTIMATION OF OLAP SYSTEMS
SIZE ESTIMATION OF OLAP SYSTEMSSIZE ESTIMATION OF OLAP SYSTEMS
SIZE ESTIMATION OF OLAP SYSTEMS
 
Size estimation of olap systems
Size estimation of olap systemsSize estimation of olap systems
Size estimation of olap systems
 
mlsys_portrait
mlsys_portraitmlsys_portrait
mlsys_portrait
 
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGESCASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
 
Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...
Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...
Mining Query Log to Suggest Competitive Keyphrases for Sponsored Search Via I...
 

Plus de IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
D011132635
D011132635D011132635
D011132635
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 

Dernier

The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...ranjana rawat
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxAsutosh Ranjan
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130Suhani Kapoor
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINESIVASHANKAR N
 

Dernier (20)

The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
★ CALL US 9953330565 ( HOT Young Call Girls In Badarpur delhi NCR
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
9953056974 Call Girls In South Ex, Escorts (Delhi) NCR.pdf
 
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
VIP Call Girls Service Hitech City Hyderabad Call +91-8250192130
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINEMANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
MANUFACTURING PROCESS-II UNIT-2 LATHE MACHINE
 

A Study on Optimization of Top-k Queries in Relational Databases

  • 1. IOSR Journal of Computer Engineering (IOSR-JCE) e-ISSN: 2278-0661, p- ISSN: 2278-8727Volume 10, Issue 4 (Mar. - Apr. 2013), PP 38-40 www.iosrjournals.org www.iosrjournals.org 38 | Page A Study on Optimization of Top-k Queries in Relational Databases Neha Singh, P.K. Pandey* Department of Computer Science, A.P.S. University, Rewa (M.P.) * Department of Physics, Govt. Science College, Rewa (M.P.) Abstract: Efficient execution of top-k queries is increasingly becoming a major challenge for relational database technology. Top-k query has emerged as a key requirement in modern application. In these applications, efficient and adaptive evaluation of top-k queries is an integral part of application semantics. In this paper we discuss a query optimization framework of top-k queries that fully integrates rank-join operators. Ranking queries produce results that are ordered on some computed score. A key property of top-k queries is that, users are interested only in the first k results and not in total ranking of all query results. This property directly impacts the optimization of top-k queries by optimizing for the first k results. Traditionally, most real world database systems offer the feature of first n row optimization. I. Introduction The increasing importance of top-k queries warrants an efficient support of ranking in the relational database management system. Emerging applications that depend on ranking queries warrant efficient support of ranking in database management system. Integration of database and information retrieval technologies ahs been an active research topic recently [1]. Top-k queries are dominant in many emerging applications as web databases, multimedia databases, middlewares, datamining etc. Supporting ranking gives database systems the ability to efficiently answer Information Retrieval queries. A top-k query is an important type of query that allows for supporting IR applications on top of database systems. Database system provide guarantee of efficient data handling with solid integrity and consistency. An important subject in this connection is determining the suitable size of inputs in relation N for answering to the rank ware query so that in this manner top-k query answers are gotten. It is vital in information integration with large scale to select top-k rank aware from multiple sources and it has also basic role for minimizing transfer cost because as the size of relations become smaller, transfer cost become less [6]. The algorithm that is used for answering to the ranking queries is estimating input sizes by means of statistical relations, monotony hypothesis and random variables [2, 3]. The other new innovation provides a systematic and principled framework to support efficient evaluations of top-k queries in relational database systems by extending relational algebra and query optimization [4]. In this connection one another solution is improving the join and using the ripple join thath minimize the time in order to get estimation with relatively acceptable precision for query results. The main idea of ripple join is join algorithm aware the suggested rank for supporting join queries with top-k relational database [5]. In this paper we study the optimization of top-k queries. II. Relational Database A Relational database management system (RDBMS) is a database management system (DBMS) that is based on the relational model as introduced by E. F. Codd. A relational database of data items organized as a set of formally described relations (also known as table) from which data can be accessed or reassembled in many different ways without having to reorganize the database relations. III. Top-K Queries The top-k query is define as- Given a database D of m objects, each of which is characterized by n attributes, a scoring function f, according to which we rank the objects in D, and the number of expected results k. Then a top-k query Q returns the k objects with the highest rank in f. In Top-k query, query define on n attribute a1, a2 ,…, an and relation M in the form of R1 ,R2 ,…,RM that each ai (i=1:n) belongs to one relation Rj (j=1:M). Each of the attributes has special domain in comparison with their kind. According to the query, a series of attributes of these relations are applied for projection, a series of attributes of these relations are used for restriction and join. In the rank aware queries there is apart for ranking that some of relations attribute are presented in the form of a ranking relation which is called ranking function. Ranking function f is formed in the form of attribute n' that is n' <=n. A theory for ranking function f is this: ranking function changes in comparison with all relations are monotonic. In addition to this, the number of suitable answers in rank aware queries is determined too that is just Top k. Consider a set of relations R1 to Rm. Each tuple in Ri is associated with some
  • 2. A Study on Optimization of Top-k Queries in Relational Databases www.iosrjournals.org 39 | Page score that gives it a rank within Ri. The top-k join query joins R1 to Rm and produces the results ranked on a total score. The total score is computed according to some function, say F that combines individual scores. Note that the score attached with each relation can be the value of one attribute or a value computed using a predicate on a subset of its attributes. A possible SQL-like notation for expressing a top-k join query is as follows: SELECT * FROM R1, R2, . . . ,Rm WHERE join condition (R1, R2, . . . ,Rm) ORDER BY F (R1.score, R2.score. . . Rm.score) LIMIT k; Where LIMIT limits the number of results reported to the user. IV. Query Optimization Query optimization is necessary for high level relational queries to provide an opportunity for the database system to systematically evaluate alternative query execution strategies and to choose an optimal strategy. Query optimization is a function of many relational database management systems in which multiple query plans for satisfying a query are examined and a good query plan is identified. The function of query optimization is to select an efficient execution strategy. The figure 1 shows the query optimization process which generates the best query execution plan. For a given query, the search space can be defined as the set of equivalent operator tress that can be produced using transformation rules. Input Query Best QEP figure 1: Query Optimization Process V. Optimization of Top-k Queries Optimization of top-k queries (also known as Rank aware query optimization) in relational database has contributed to the existing system for the query operations significantly. In this section we discuss a rank aware query optimization framework that fully integrates rank-join operators into relational query engine and how to extend the traditional query optimization to handle the new rank join operators. The two major task are includes while integrating the new rank-join operators in query optimizer- enlarging the space of possible plans to include those plans that use rank join operators as a possible join alternative and providing a costing mechanism for the new operators to help the optimizer prune expensive plans in favor of more general cheaper plans. The first task, enlarging the plan space is achieved by extending the enumeration algorithm to produce new execution plans. Now we exhibit how to extend the enumeration space to generate top-k query execution plans. Top-k plans will integrate the rank-join operators into general execution plans. For a query with n input relations, R1 to Rn, assume there exists a ranking function f(s1,s2,…sn), where si is score expression on relation Ri. For two sets of input relations, P and Q, we extend the space of plans that join P and Q to include rank join plans by make suitable for following: - Join Eligibility: P and Q are rank-join eligible if all the following apply: 1. There is a join condition that relates at least one input relation in P to an input relation in Q. Search space generation Transformation rules Equivalent QEP Search strategy Cost model
  • 3. A Study on Optimization of Top-k Queries in Relational Databases www.iosrjournals.org 40 | Page 2. f can be expressed as f(f1(SP); f2(SQ); f3(SO)), where f1, f2 and f3 are three scoring functions, SP are the score expressions on the relations in P, SQ are the score expressions on the relations in Q, and SO are the score expressions on the rest of the input relations. 3. There is at least one plan that accesses P and/or Q ordered on SP and/or SQ, respectively. -Join Choices: Rank-join can have several implementations as physical join operators. For each rank-join between P and Q, plans can be generated for each join implementation. -Join Order: For symmetric rank-join operators there is no distinction between outer and inner relations. For the nested-loops implementation, a different plan can be generated by switching the inner and the outer relations. Now we discuss the second task, providing a costing mechanism for the new operators to help the optimizer prune expensive plans in favor of more general cheaper plans. In real-world query optimizers, the cost model for different query operators is quite complex and depends on many parameters. Parameters include input cardinality, available buffers, type of access paths etc. Although cost models can be very complex, a key ingredient of accurate estimation is the accuracy of estimating the size of intermediate results. In traditional join operators, the input cardinalities are independent of the operator itself and only depend on the input subplan. Mostly, the output cardinality depends only on the size of the inputs and the selectivity of the logical operation. On the other hand, since a rank-join operator does not consume all of its inputs, the actual input size depends on the operator itself and how the operator decides that it has seen enough information from the inputs to generate the top k results. Hence, the input cardinality depends on the number of ranked join results requested from that operator. Therefore, the cost of a rank-join operator depends on the following: - The number of required results k and how k is propagated in the pipeline - The number of tuples from inputs that contain enough information for the operator to report the required number of answers, k. - The selectivity of the join operation affects the number of tuples propagated from the inputs to higher operators through the join operation. Hence, the join selectivity affects the number of input tuples required by the rank-join operator to produce ranked results. There are two ways to produce plans that join two sets of input relations, P and Q, and produce ranked results- by using rank-join operators to join L and R subplans or by gluing a sort operator on the cheapest join plan that joins P and Q without preserving the required order. The challenge is in comparing two plans when one or both of them are rank-join plans. For all rank-join plans, the cost of the plan depends on k and the join selectivity s. Since these two parameters are the same for all plans, the pruning among these plans follows the same mechanism as in traditional cost based pruning. For example, pruning a rank-join plan in favor of another rank-join plan depends on the input cardinality of the relations, the cost of the join method, the access paths, and the statistics available on the input scores. We assume the availability of an estimate of the join selectivity, which is the same for both sort-plans and rank-join plans. VI. Conclusion Query optimization is essential for large information retrieval system. In this paper we describe the top- k queries and general query optimization method for to provide an opportunity for the database system to systematically evaluate alternative query execution strategies and to choose an optimal strategy. We also discussed optimization of top-k queries with a framework for integrating rank-join operators in real-world query optimizers. Our framework was based on two main steps- first, extend the enumeration phase of the query optimizer to generate rank-aware plans. The extension was achieved by providing rank-join operators as possible join choices and by defining ranking expressions as a new physical plan property. References [1] S. Chaudhury, R. Ramakrishnan and G. Weikum (2005), Integrating db and ir technologies, CIDR. [2] Ilyas, I.F. and W.G. Aref (2006), Adaptive Rank aware Query Optimization in Relational Databases, ACM Transactions on Database Systems. [3] Ilyas, I.F. and R. Shah (2004), Rank-aware Query Optimization,SIGMOD, June13-18, Paris, France. [4] Ilyas, I.F. and C. Li, 2005. Ranksql: Query algebra and optimization for relational top-k queries.SIGMOD 2005, June 1416, Baltimore, Maryland, USA. [5] Ilyas, I.F. and W.G. Aref, 2003. Supporting Top-k Join Queries in Relational Databases. Proceedings of the 29th VLDB Conference, Berlin, Germany. [6] M.A. Kashem, A S Chowdhury, R Deb, M Jahan (2010), Query Optimization on Relational Databases for Supporting Top-k Query Processing techniques, JCIT