SlideShare a Scribd company logo
1 of 5
Download to read offline
IOSR Journal of Computer Engineering (IOSRJCE)
ISSN: 2278-0661 Volume 3, Issue 6 (Sep-Oct. 2012), PP 43-47
www.iosrjournals.org
www.iosrjournals.org 43 | Page
Mining Negative Association Rules
Tushar Mani
Ph.D. (CSE), NIMS University, Jaipur (Rajasthan), India
Abstract: Association rule mining is a data mining task that discovers associations among items in a
transactional database. Typical association rules consider only items enumerated in transactions. Such rules
are referred to as positive association rules. Negative association rules also consider the same items, but in
addition consider negated items (i.e. absent from transactions). Negative association rules are useful in market-
basket analysis to identify products that conflict with each other or products that complement each other. They
are also very convenient for associative classifiers, classifiers that build their classification model based on
association rules. Many other applications would benefit from negative association rules if it was not for the
expensive process to discover them. Indeed, mining for such rules necessitates the examination of an
exponentially large search space. In this paper, we propose an algorithm that mines negative association rules
by using conviction measure which does not require extra database scans.
I. Introduction
Association rule mining is a data mining task that discovers relationships among items in a
transactional database. Association rules have been extensively studied in the literature for their usefulness in
many application domains such as recommender systems, diagnosis decisions support, telecommunication,
intrusion detection, etc. The efficient discovery of such rules has been a major focus in the data mining research
community. From the original apriori algorithm [1] there have been a remarkable number of variants and
improvements of association rule mining algorithms [2].
Association rule analysis is the task of discovering association rules that occur frequently in a given
data set. A typical example of association rule mining application is the market basket analysis. In this process,
the behaviour of the customers is studied when buying different products in a shopping store. The discovery of
interesting patterns in this collection of data can lead to important marketing and management strategic
decisions. For instance, if a customer buys bread, what is the probability that he/she buys milk as well?
Depending on the probability of such an association, marketing personnel can develop better planning of the
shelf space in the store or can base their discount strategies on such associations/correlations found in the data.
All the traditional association rule mining algorithms were developed to find positive associations between
items. By positive associations we refer to associations between items existing in transactions (i.e. items
bought). What about associations of the type: “customers that buy Coke do not buy Pepsi” or “customers that
buy juice do not buy bottled water”? In addition to the positive associations, the negative association can
provide valuable information, in devising marketing strategies. Interestingly, very few have focused on negative
association rules due to the difficulty in discovering these rules.
Although some researchers pointed out the importance of negative associations [3], only few groups of
researchers [4], [5], [6] proposed an algorithm to mine these types of associations. This not only illustrates the
novelty of negative association rules, but also the challenge in discovering them.
II. Basic Concept and terminology
This section introduces basic concept of association rules and some related work on negative association rules.
2.1 Association Rules
Formally, association rules are defined as follows: Let I = {i1, i2,…,in} be a set of items. Let D be a set
of transactions, where each transaction T is a set of items such that T ⊆I. Each transaction is associated with a
unique identifier TID. A transaction T is said to contain X, a set of items in I, if X ⊆T. An association rule is
an implication of the form “X ->Y”, where X ⊆I; Y ⊆I, and X ∩ Y = Φ. The rule X ->Y has support s in
the transaction set D if s% of the transactions in D contain X U Y. In other words, the support of the rule is the
probability that X and Y hold together among all the possible presented cases. It is said that the rule X ->Y
holds in the transaction set D with confidence c if c% of transactions in D that contain X also contain Y. In other
words, the confidence of the rule is the conditional probability that the consequent Y is true under the condition
of the antecedent X. The problem of discovering all association rules from a set of transactions D consists of
generating the rules that have a support and confidence greater than given thresholds. These rules are called
strong rules, and the framework is known as the support-confidence framework for association rule mining.
Mining Negative Association Rules
www.iosrjournals.org 44 | Page
A negative association rule is an implication of the form X ->┐Y (or ┐ X ->Y or ┐ X ->┐ Y), where X
⊆I, Y ⊆I and X ∩ Y = Φ (Note that although rule in the form of ┐ X ->┐ Y contains negative elements, it
is equivalent to a positive association rule in the form of Y->X. Therefore it is not considered as a negative
association rule.) In contrast to positive rules, a negative rule encapsulates relationship between the occurrences
of one set of items with the absence of the other set of items. The rule X ->┐ Y has support s % in the data
sets, if s % of transactions in T contain itemset X while do not contain itemset Y. The support of a negative
association rule, supp( X ->┐Y), is the frequency of occurrence of transactions with item set X in the absence
of item set Y. Let U be the set of transactions that contain all items in X. The rule X ->┐ Y holds in the given
data set (database) with confidence c %, if c% of transactions in U do not contain item set Y.
Confidence of negative association rule, conf ( X ->┐ Y), can be calculated with P( X ┐ Y )/P(X), where P(.) is
the probability function. The support and confidence of itemsets are calculated during iterations. However, it is
difficult to count the support and confidence of non-existing items in transactions. To avoid counting them
directly, we can compute the measures through those of positive rules.
III. Related Work In Negative Association Rule Mining
A new idea to mine strong negative rules presented in [15]. They combine positive frequent itemsets[8]
with domain knowledge in the form of taxonomy to mine negative associations. However, their algorithm is
hard to generalize since it is domain dependent and requires a predefined taxonomy. Finding negative itemsets
involve following steps: (1) first find all the generalized large itemsets in the data (i.e., itemsets at all levels in
the taxonomy whose support is greater than the user specified minimum support) (2) next identify the candidate
negative itemsets based on the large itemsets and the taxonomy and assign them expected support. (3) in the last
step, count the actual support for the candidate itemsets and retain only the negative itemsets .The interest
measure RI of negative association rule X ┐Y, as follows RI=(E[support( X U Y )]-support( X U
Y))/support(X) Where E[support(X)] is the expected support of an itemset X.
A new measure called mininterest, (the argument is that a rule A B is of interest only if supp ( A U
B) - supp(A) supp(B) ≥ mininterest) added on top of the support-confidence framework[17]. They consider the
itemsets (positive or negative) that exceed minimum support and minimum interest thresholds as itemsets of
interest. Although, [17] introduces the “mininterest” parameter, the authors do not discuss how to set it and what
would be the impact on the results when changing this parameter.
A novel approach has proposed in [16]. In this, mining both positive and negative association rules of
interest can be decomposed into the following two sub problems, (1) generate the set of frequent itemsets [8] of
interest (PL) and the set of infrequent itemsets of interest (NL) (2) extract positive rules of the form A=>B in
PL, and negative rules of the forms A ┐ B, ┐ AB and┐ A ┐ B in NL. To generate PL, NL and
negative association rules they developed three functions namely, fipi( ), iipis() and CPIR( ).
The most common frame-work in the association rule generation is the “Support-Confidence” one. In
[14], authors considered another frame-work called correlation analysis that adds to the support-confidence. In
this paper, they combined the two phases (mining frequent itemsets[8] and generating strong association rules)
and generated the relevant rules while analyzing the correlations within each candidate itemset. This avoids
evaluating item combinations redundantly. Indeed, for each generated candidate itemset, they computed all
possible combinations of items to analyze their correlations. At the end, they keep only those rules generated
from item combinations with strong correlation. If the correlation is positive, a positive rule is discovered. If the
correlation is negative, two negative rules are discovered. The negative rules produced are of the form X ┐Y
or ┐ X Y which the authors term as “confined negative association rules”. Here the entire antecedent or
consequent is either a conjunction of negated attributes or a conjunction of non-negated attributes.
An innovative approach has proposed in [13]. In this generating positive and negative association rules
consists of four steps: (1) Generate all positive frequent itemsets L ( P1 ) (ii) for all itemsets I in L( P1 ),
generate negative frequent itemsets of the form ┐ ( I1 I2 ) (iii) Generate all negative frequent itemsets ┐ I1 ┐I2
(iv) Generate all negative frequent itemsets I1 ┐ I2 and (v) Generate all valid positive and negative association
rules . Authors generated negative rules without adding additional interesting measure(s) to support-confidence
frame work.
A new and different approach has been proposed in [7]. This is simple but effective. It is not using any
additional interesting measures and additional database scans. In this approach, it is finding negative itemsets by
replacing a literal in a candidate itemset by its corresponding negated item. If a candidate itemset contains 3
items then it will produce corresponding 3 negative itemsets one for each literal.
IV. Discovering Negative Association Rules
The most common framework in the association rules generation is the “support-confidence” one.
Although these two parameters allow the pruning of many associations that are discovered in data, there are
cases when many uninteresting rules may be produced. In this paper we consider another interesting measure
Mining Negative Association Rules
www.iosrjournals.org 45 | Page
called conviction that adds to the support- confidence framework. Next section introduces the measure
conviction.
 The conviction of a rule is defined as:
conv(X=>Y) can be interpreted as the ratio of the expected frequency that X occurs without Y (that is X=>┐Y)
if X and Y were independent divided by the observed frequency of incorrect predictions. The range of
conviction is 0 to ∞
A. Algorithm MPNAR
In this section we propose and explain our algorithm.
Algorithm: Mining Negative Association Rules
Input: TDB-Transactional Database
mins -minimum support
minc-minimum confidence
Output: Negative Association Rules
Method:
1. NARФ /* initially NAR is empty */
2. Scan the database and find the set of frequent 1.itemsets(F1)
3. for (k=2;Fk-1!=Φ; k++)
4. {
5. Ck= Fk⋈-1 Fk-1 /* generates candidates itemsets */
6. // Prune using Apriori Property
7. for each i ε Ck, any subset of i is not in Fk-1 then Ck = CK - { i }
8. for each i ε Ck /* perform database scanning to find support */
9. {
10. s = Support( i);
11. for each A,B (A U B= i )
12. {
13. if ( Supp(X ┐Y) ≥mins && Conviction(X┐Y) ≤2.0) /* produces NAR of the form X┐Y */
14. NARNAR U { X┐Y) /* produces NAR of the form ┐X Y */
15. if ( Supp(┐XY) ≥ mins && Conviction(┐XY) ≤2.0) then
16. NAR NAR U { ┐X Y }
17. }
18. }
19. }
 Support ( ┐X ) = 1-support(A)
 Support( X∪┐Y ) = support(X) – support(X ∪ Y)
 Support( ┐X ∪ Y ) = support(Y) – support(X ∪ Y)
 Support (┐X ∪┐Y ) = 1 – support - support(X) – support(Y) + support(X ∪ Y)
 The generation of positive rules continues without
V. Experimental Results
We tested our algorithm with [14]. We consider a transactional database contains many transactions.
We tested our algorithm with reference [14] with different minimum supports and minimum confidences. Our
algorithm is performing well than one in [14].
Conv( X => Y ) =
1- supp(Y)
1-conf(X=>Y)
Mining Negative Association Rules
www.iosrjournals.org 46 | Page
Fig 1:minimum support =30% and different minimum confidences
Fig 2: minimum support =40% and different minimum confidences
Fig 3: minimum support =50% and different minimum confidences
Fig 4: minimum confidence =60% and different minimum supports
Fig 5: minimum confidence =70% and different minimum supports
Mining Negative Association Rules
www.iosrjournals.org 47 | Page
Fig 6: minimum confidence =80% and different minimum supports
VI. Conclusion And Future Work
In this paper we introduced a new algorithm to generate both positive and negative association rules.
Our method adds the conviction to the support-confidence framework to generate stronger positive and negative
rules. We compared our algorithm with [14] on a real dataset. We discussed their performances on a
transactional database and analyzed experimental results. The results prove that our algorithm can perform
better than one in [14].
In future we wish to conduct experiments on some other real datasets and compare the performance of
our algorithm with other related algorithms such as reference [7] and reference [16].
References
[1] Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proc. of SIGMOD.
(1993) 207–216
[2] Goethals, B., Zaki, M., eds.: FIMI’03: Workshop on Frequent Itemset Mining Implementations. Volume 90 of CEUR Workshop
Proceedings series. (2003) http://CEUR-WS.org/Vol-90/.
[3] S.Brin, R. Motwani, and C.Silverstein. Beyond market baskets: Generalizing association rules to correlations. In ACM
SIGMOD,Tucson, Arizona, 1997.
[4] Wu, X., Zhang, C., Zhang, S.: Mining both positive and negative association rules. In: Proc. of ICML. (2002) 658–665
[5] Teng, W., Hsieh, M., Chen, M.: On the mining of substitution rules for statistically dependent items. In: Proc. of ICDM. (2002)
442–449
[6] Savasere, A., Omiecinski, E., Navathe, S.: Mining for strong negative associations in a large database of customer transactions. In:
Proc. of ICDE. (1998) 494–502
[7] B.Ramasubbareddy, A.Govardhan, and A.Ramamohanreddy. Mining Positive and Negative Association Rules, IEEE ICSE 2010,
Hefeai, China, August 2010.
[8] Goethals, B., Zaki, M., eds.: FIMI’03: Workshop on Frequent Itemset Mining Implementations. Volume 90 of CEUR Workshop
Proceedings series. (2003) http://CEUR-WS.org/Vol-90/.
[9] Teng, W., Hsieh, M., Chen, M.: On the mining of substitution rules for statistically dependent items. In: Proc. of ICDM. (2002)
442–449
[10] Tan, P., Kumar, V.: Interestingness measures for association patterns: A perspective.In: Proc. of Workshop on Postprocessing in
Machine Learning and Data Mining. (2000)
[11] Gourab Kundu, Md. Monirul Islam, Sirajum Munir, Md. Faizul Bari ACN: An Associative Classifier with Negative Rules 11th
IEEE
International Conference on Computational Science and Engineering, 2008.
[12] Brin,S., Motwani,R. and Silverstein,C., “ Beyond Market Baskets: Generalizing Association Rules to Correlations,” Proc. ACM
SIGMOD Conf., pp.265-276, May 1997.
[13] Chris Cornelis, peng Yan, Xing Zhang, Guoqing Chen: Mining Positive and Negative Association Rules from Large Databases ,
IEEE conference 2006.
[14] M.L. Antonie and O.R. Za¨ıane, ”Mining Positive and Negative Association Rules: an Approach for Confined Rules”, Proc. Intl.
Conf. on Principles and Practice of Knowledge Discovery in Databases, 2004, pp 27–38.
[15] Savasere, A., Omiecinski,E., Navathe, S.: Mining for Strong negative associations in a large data base of customer transactions. In:
Proc. of ICDE. (1998) 494- 502..
[16] Wu, X., Zhang, C., Zhang, S.: efficient mining both positive and negative association rules. ACM Transactions on Information
Systems, Vol. 22, No.3, July 2004,Pages 381-405.
[17] Wu, X., Zhang, C., Zhang, S.: Mining both positive and negative association rules.In: Proc. of ICML. (2002) 658–665
[18] Yuan,X., Buckles, B.,Yuan, Z.,Zhang, J.:Mining Negative Association Rules. In: Proc. of ISCC. (2002) 623-629.
[19] Honglei Zhu, Zhigang Xu: An Effective Algorithm for Mining Positive and Negative Association Rules. International Conference on
Computer Science and Software Engineering 2008.

More Related Content

Viewers also liked

Performance evaluation of Hard and Soft Wimax by using PGP and PKM protocols ...
Performance evaluation of Hard and Soft Wimax by using PGP and PKM protocols ...Performance evaluation of Hard and Soft Wimax by using PGP and PKM protocols ...
Performance evaluation of Hard and Soft Wimax by using PGP and PKM protocols ...IOSR Journals
 
Incidence of Methicillin-Resistant Staphylococcus aureus (MRSA) In a Small Po...
Incidence of Methicillin-Resistant Staphylococcus aureus (MRSA) In a Small Po...Incidence of Methicillin-Resistant Staphylococcus aureus (MRSA) In a Small Po...
Incidence of Methicillin-Resistant Staphylococcus aureus (MRSA) In a Small Po...IOSR Journals
 
причастие как часть речи урок 1
причастие как часть речи  урок 1причастие как часть речи  урок 1
причастие как часть речи урок 1Natalya Dyrda
 
Effect of Age of Spawned Catfish (Clarias Gariepinus) Broodstock on Quantity ...
Effect of Age of Spawned Catfish (Clarias Gariepinus) Broodstock on Quantity ...Effect of Age of Spawned Catfish (Clarias Gariepinus) Broodstock on Quantity ...
Effect of Age of Spawned Catfish (Clarias Gariepinus) Broodstock on Quantity ...IOSR Journals
 
Why U.S. Bank Lost Its Case against Ibanez on a Foreclosed Property
Why U.S. Bank Lost Its Case against Ibanez on a Foreclosed Property Why U.S. Bank Lost Its Case against Ibanez on a Foreclosed Property
Why U.S. Bank Lost Its Case against Ibanez on a Foreclosed Property David Zak Lawyer
 
Comparison Study of Lossless Data Compression Algorithms for Text Data
Comparison Study of Lossless Data Compression Algorithms for Text DataComparison Study of Lossless Data Compression Algorithms for Text Data
Comparison Study of Lossless Data Compression Algorithms for Text DataIOSR Journals
 
Automation Hackathon 4
Automation Hackathon 4Automation Hackathon 4
Automation Hackathon 4Maksym Romanov
 
Reduce the False Positive and False Negative from Real Traffic with Intrusion...
Reduce the False Positive and False Negative from Real Traffic with Intrusion...Reduce the False Positive and False Negative from Real Traffic with Intrusion...
Reduce the False Positive and False Negative from Real Traffic with Intrusion...IOSR Journals
 
Diabetes - How sweet are you
Diabetes - How sweet are youDiabetes - How sweet are you
Diabetes - How sweet are youVASS Yukon
 
Presentazione dogsinthecity
Presentazione dogsinthecityPresentazione dogsinthecity
Presentazione dogsinthecitylucaminetti
 
Incidence of Equine Hoof Derangements in Malaysian Horse Population
Incidence of Equine Hoof Derangements in Malaysian Horse PopulationIncidence of Equine Hoof Derangements in Malaysian Horse Population
Incidence of Equine Hoof Derangements in Malaysian Horse PopulationIOSR Journals
 
Efficiency of Prediction Algorithms for Mining Biological Databases
Efficiency of Prediction Algorithms for Mining Biological  DatabasesEfficiency of Prediction Algorithms for Mining Biological  Databases
Efficiency of Prediction Algorithms for Mining Biological DatabasesIOSR Journals
 

Viewers also liked (20)

Cosiall
CosiallCosiall
Cosiall
 
Performance evaluation of Hard and Soft Wimax by using PGP and PKM protocols ...
Performance evaluation of Hard and Soft Wimax by using PGP and PKM protocols ...Performance evaluation of Hard and Soft Wimax by using PGP and PKM protocols ...
Performance evaluation of Hard and Soft Wimax by using PGP and PKM protocols ...
 
J0535964
J0535964J0535964
J0535964
 
Incidence of Methicillin-Resistant Staphylococcus aureus (MRSA) In a Small Po...
Incidence of Methicillin-Resistant Staphylococcus aureus (MRSA) In a Small Po...Incidence of Methicillin-Resistant Staphylococcus aureus (MRSA) In a Small Po...
Incidence of Methicillin-Resistant Staphylococcus aureus (MRSA) In a Small Po...
 
причастие как часть речи урок 1
причастие как часть речи  урок 1причастие как часть речи  урок 1
причастие как часть речи урок 1
 
A0510108
A0510108A0510108
A0510108
 
Films cool q3
Films cool q3Films cool q3
Films cool q3
 
Effect of Age of Spawned Catfish (Clarias Gariepinus) Broodstock on Quantity ...
Effect of Age of Spawned Catfish (Clarias Gariepinus) Broodstock on Quantity ...Effect of Age of Spawned Catfish (Clarias Gariepinus) Broodstock on Quantity ...
Effect of Age of Spawned Catfish (Clarias Gariepinus) Broodstock on Quantity ...
 
Why U.S. Bank Lost Its Case against Ibanez on a Foreclosed Property
Why U.S. Bank Lost Its Case against Ibanez on a Foreclosed Property Why U.S. Bank Lost Its Case against Ibanez on a Foreclosed Property
Why U.S. Bank Lost Its Case against Ibanez on a Foreclosed Property
 
Comparison Study of Lossless Data Compression Algorithms for Text Data
Comparison Study of Lossless Data Compression Algorithms for Text DataComparison Study of Lossless Data Compression Algorithms for Text Data
Comparison Study of Lossless Data Compression Algorithms for Text Data
 
Automation Hackathon 4
Automation Hackathon 4Automation Hackathon 4
Automation Hackathon 4
 
A0710113
A0710113A0710113
A0710113
 
Reduce the False Positive and False Negative from Real Traffic with Intrusion...
Reduce the False Positive and False Negative from Real Traffic with Intrusion...Reduce the False Positive and False Negative from Real Traffic with Intrusion...
Reduce the False Positive and False Negative from Real Traffic with Intrusion...
 
Diabetes - How sweet are you
Diabetes - How sweet are youDiabetes - How sweet are you
Diabetes - How sweet are you
 
Physical education 2
Physical education 2Physical education 2
Physical education 2
 
Presentazione dogsinthecity
Presentazione dogsinthecityPresentazione dogsinthecity
Presentazione dogsinthecity
 
Incidence of Equine Hoof Derangements in Malaysian Horse Population
Incidence of Equine Hoof Derangements in Malaysian Horse PopulationIncidence of Equine Hoof Derangements in Malaysian Horse Population
Incidence of Equine Hoof Derangements in Malaysian Horse Population
 
Needle stick
Needle stickNeedle stick
Needle stick
 
test
testtest
test
 
Efficiency of Prediction Algorithms for Mining Biological Databases
Efficiency of Prediction Algorithms for Mining Biological  DatabasesEfficiency of Prediction Algorithms for Mining Biological  Databases
Efficiency of Prediction Algorithms for Mining Biological Databases
 

Similar to Mining Negative Association Rules

Classification based on Positive and Negative Association Rules
Classification based on Positive and Negative Association RulesClassification based on Positive and Negative Association Rules
Classification based on Positive and Negative Association RulesWaqas Tariq
 
Association rule mining and Apriori algorithm
Association rule mining and Apriori algorithmAssociation rule mining and Apriori algorithm
Association rule mining and Apriori algorithmhina firdaus
 
5Association AnalysisBasic Concepts an.docx
 5Association AnalysisBasic Concepts an.docx 5Association AnalysisBasic Concepts an.docx
5Association AnalysisBasic Concepts an.docxShiraPrater50
 
IRJET- Minning Frequent Patterns,Associations and Correlations
IRJET-  	  Minning Frequent Patterns,Associations and CorrelationsIRJET-  	  Minning Frequent Patterns,Associations and Correlations
IRJET- Minning Frequent Patterns,Associations and CorrelationsIRJET Journal
 
Association Rule (Data Mining) - Frequent Itemset Generation, Closed Frequent...
Association Rule (Data Mining) - Frequent Itemset Generation, Closed Frequent...Association Rule (Data Mining) - Frequent Itemset Generation, Closed Frequent...
Association Rule (Data Mining) - Frequent Itemset Generation, Closed Frequent...National Cheng Kung University
 
Data Science - Part VI - Market Basket and Product Recommendation Engines
Data Science - Part VI - Market Basket and Product Recommendation EnginesData Science - Part VI - Market Basket and Product Recommendation Engines
Data Science - Part VI - Market Basket and Product Recommendation EnginesDerek Kane
 
An apriori based algorithm to mine association rules with inter itemset distance
An apriori based algorithm to mine association rules with inter itemset distanceAn apriori based algorithm to mine association rules with inter itemset distance
An apriori based algorithm to mine association rules with inter itemset distanceIJDKP
 
Top Down Approach to find Maximal Frequent Item Sets using Subset Creation
Top Down Approach to find Maximal Frequent Item Sets using Subset CreationTop Down Approach to find Maximal Frequent Item Sets using Subset Creation
Top Down Approach to find Maximal Frequent Item Sets using Subset Creationcscpconf
 
Re-mining item associations: Methodology and a case study in apparel retailing
Re-mining item associations: Methodology and a case study in apparel retailingRe-mining item associations: Methodology and a case study in apparel retailing
Re-mining item associations: Methodology and a case study in apparel retailingGurdal Ertek
 
Introduction To Multilevel Association Rule And Its Methods
Introduction To Multilevel Association Rule And Its MethodsIntroduction To Multilevel Association Rule And Its Methods
Introduction To Multilevel Association Rule And Its MethodsIJSRD
 
Business intelligence
Business intelligenceBusiness intelligence
Business intelligenceGeo Marian
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Scienceresearchinventy
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Scienceresearchinventy
 
Pattern Discovery Using Apriori and Ch-Search Algorithm
 Pattern Discovery Using Apriori and Ch-Search Algorithm Pattern Discovery Using Apriori and Ch-Search Algorithm
Pattern Discovery Using Apriori and Ch-Search Algorithmijceronline
 
A literature review of modern association rule mining techniques
A literature review of modern association rule mining techniquesA literature review of modern association rule mining techniques
A literature review of modern association rule mining techniquesijctet
 
A NEW ASSOCIATION RULE MINING BASED ON FREQUENT ITEM SET
A NEW ASSOCIATION RULE MINING BASED  ON FREQUENT ITEM SETA NEW ASSOCIATION RULE MINING BASED  ON FREQUENT ITEM SET
A NEW ASSOCIATION RULE MINING BASED ON FREQUENT ITEM SETcscpconf
 

Similar to Mining Negative Association Rules (20)

Classification based on Positive and Negative Association Rules
Classification based on Positive and Negative Association RulesClassification based on Positive and Negative Association Rules
Classification based on Positive and Negative Association Rules
 
Association rule mining and Apriori algorithm
Association rule mining and Apriori algorithmAssociation rule mining and Apriori algorithm
Association rule mining and Apriori algorithm
 
5Association AnalysisBasic Concepts an.docx
 5Association AnalysisBasic Concepts an.docx 5Association AnalysisBasic Concepts an.docx
5Association AnalysisBasic Concepts an.docx
 
IRJET- Minning Frequent Patterns,Associations and Correlations
IRJET-  	  Minning Frequent Patterns,Associations and CorrelationsIRJET-  	  Minning Frequent Patterns,Associations and Correlations
IRJET- Minning Frequent Patterns,Associations and Correlations
 
I43055257
I43055257I43055257
I43055257
 
Association Rule (Data Mining) - Frequent Itemset Generation, Closed Frequent...
Association Rule (Data Mining) - Frequent Itemset Generation, Closed Frequent...Association Rule (Data Mining) - Frequent Itemset Generation, Closed Frequent...
Association Rule (Data Mining) - Frequent Itemset Generation, Closed Frequent...
 
Data Science - Part VI - Market Basket and Product Recommendation Engines
Data Science - Part VI - Market Basket and Product Recommendation EnginesData Science - Part VI - Market Basket and Product Recommendation Engines
Data Science - Part VI - Market Basket and Product Recommendation Engines
 
An apriori based algorithm to mine association rules with inter itemset distance
An apriori based algorithm to mine association rules with inter itemset distanceAn apriori based algorithm to mine association rules with inter itemset distance
An apriori based algorithm to mine association rules with inter itemset distance
 
Top Down Approach to find Maximal Frequent Item Sets using Subset Creation
Top Down Approach to find Maximal Frequent Item Sets using Subset CreationTop Down Approach to find Maximal Frequent Item Sets using Subset Creation
Top Down Approach to find Maximal Frequent Item Sets using Subset Creation
 
Re-mining item associations: Methodology and a case study in apparel retailing
Re-mining item associations: Methodology and a case study in apparel retailingRe-mining item associations: Methodology and a case study in apparel retailing
Re-mining item associations: Methodology and a case study in apparel retailing
 
F1803033843
F1803033843F1803033843
F1803033843
 
Effectiveness of ERules in Generating Non Redundant Rule Sets in Pharmacy Dat...
Effectiveness of ERules in Generating Non Redundant Rule Sets in Pharmacy Dat...Effectiveness of ERules in Generating Non Redundant Rule Sets in Pharmacy Dat...
Effectiveness of ERules in Generating Non Redundant Rule Sets in Pharmacy Dat...
 
Introduction To Multilevel Association Rule And Its Methods
Introduction To Multilevel Association Rule And Its MethodsIntroduction To Multilevel Association Rule And Its Methods
Introduction To Multilevel Association Rule And Its Methods
 
Business intelligence
Business intelligenceBusiness intelligence
Business intelligence
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
Pattern Discovery Using Apriori and Ch-Search Algorithm
 Pattern Discovery Using Apriori and Ch-Search Algorithm Pattern Discovery Using Apriori and Ch-Search Algorithm
Pattern Discovery Using Apriori and Ch-Search Algorithm
 
A literature review of modern association rule mining techniques
A literature review of modern association rule mining techniquesA literature review of modern association rule mining techniques
A literature review of modern association rule mining techniques
 
Assignment #3 10.19.14
Assignment #3 10.19.14Assignment #3 10.19.14
Assignment #3 10.19.14
 
A NEW ASSOCIATION RULE MINING BASED ON FREQUENT ITEM SET
A NEW ASSOCIATION RULE MINING BASED  ON FREQUENT ITEM SETA NEW ASSOCIATION RULE MINING BASED  ON FREQUENT ITEM SET
A NEW ASSOCIATION RULE MINING BASED ON FREQUENT ITEM SET
 

More from IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
D011132635
D011132635D011132635
D011132635
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 

Recently uploaded

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 

Recently uploaded (20)

Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 

Mining Negative Association Rules

  • 1. IOSR Journal of Computer Engineering (IOSRJCE) ISSN: 2278-0661 Volume 3, Issue 6 (Sep-Oct. 2012), PP 43-47 www.iosrjournals.org www.iosrjournals.org 43 | Page Mining Negative Association Rules Tushar Mani Ph.D. (CSE), NIMS University, Jaipur (Rajasthan), India Abstract: Association rule mining is a data mining task that discovers associations among items in a transactional database. Typical association rules consider only items enumerated in transactions. Such rules are referred to as positive association rules. Negative association rules also consider the same items, but in addition consider negated items (i.e. absent from transactions). Negative association rules are useful in market- basket analysis to identify products that conflict with each other or products that complement each other. They are also very convenient for associative classifiers, classifiers that build their classification model based on association rules. Many other applications would benefit from negative association rules if it was not for the expensive process to discover them. Indeed, mining for such rules necessitates the examination of an exponentially large search space. In this paper, we propose an algorithm that mines negative association rules by using conviction measure which does not require extra database scans. I. Introduction Association rule mining is a data mining task that discovers relationships among items in a transactional database. Association rules have been extensively studied in the literature for their usefulness in many application domains such as recommender systems, diagnosis decisions support, telecommunication, intrusion detection, etc. The efficient discovery of such rules has been a major focus in the data mining research community. From the original apriori algorithm [1] there have been a remarkable number of variants and improvements of association rule mining algorithms [2]. Association rule analysis is the task of discovering association rules that occur frequently in a given data set. A typical example of association rule mining application is the market basket analysis. In this process, the behaviour of the customers is studied when buying different products in a shopping store. The discovery of interesting patterns in this collection of data can lead to important marketing and management strategic decisions. For instance, if a customer buys bread, what is the probability that he/she buys milk as well? Depending on the probability of such an association, marketing personnel can develop better planning of the shelf space in the store or can base their discount strategies on such associations/correlations found in the data. All the traditional association rule mining algorithms were developed to find positive associations between items. By positive associations we refer to associations between items existing in transactions (i.e. items bought). What about associations of the type: “customers that buy Coke do not buy Pepsi” or “customers that buy juice do not buy bottled water”? In addition to the positive associations, the negative association can provide valuable information, in devising marketing strategies. Interestingly, very few have focused on negative association rules due to the difficulty in discovering these rules. Although some researchers pointed out the importance of negative associations [3], only few groups of researchers [4], [5], [6] proposed an algorithm to mine these types of associations. This not only illustrates the novelty of negative association rules, but also the challenge in discovering them. II. Basic Concept and terminology This section introduces basic concept of association rules and some related work on negative association rules. 2.1 Association Rules Formally, association rules are defined as follows: Let I = {i1, i2,…,in} be a set of items. Let D be a set of transactions, where each transaction T is a set of items such that T ⊆I. Each transaction is associated with a unique identifier TID. A transaction T is said to contain X, a set of items in I, if X ⊆T. An association rule is an implication of the form “X ->Y”, where X ⊆I; Y ⊆I, and X ∩ Y = Φ. The rule X ->Y has support s in the transaction set D if s% of the transactions in D contain X U Y. In other words, the support of the rule is the probability that X and Y hold together among all the possible presented cases. It is said that the rule X ->Y holds in the transaction set D with confidence c if c% of transactions in D that contain X also contain Y. In other words, the confidence of the rule is the conditional probability that the consequent Y is true under the condition of the antecedent X. The problem of discovering all association rules from a set of transactions D consists of generating the rules that have a support and confidence greater than given thresholds. These rules are called strong rules, and the framework is known as the support-confidence framework for association rule mining.
  • 2. Mining Negative Association Rules www.iosrjournals.org 44 | Page A negative association rule is an implication of the form X ->┐Y (or ┐ X ->Y or ┐ X ->┐ Y), where X ⊆I, Y ⊆I and X ∩ Y = Φ (Note that although rule in the form of ┐ X ->┐ Y contains negative elements, it is equivalent to a positive association rule in the form of Y->X. Therefore it is not considered as a negative association rule.) In contrast to positive rules, a negative rule encapsulates relationship between the occurrences of one set of items with the absence of the other set of items. The rule X ->┐ Y has support s % in the data sets, if s % of transactions in T contain itemset X while do not contain itemset Y. The support of a negative association rule, supp( X ->┐Y), is the frequency of occurrence of transactions with item set X in the absence of item set Y. Let U be the set of transactions that contain all items in X. The rule X ->┐ Y holds in the given data set (database) with confidence c %, if c% of transactions in U do not contain item set Y. Confidence of negative association rule, conf ( X ->┐ Y), can be calculated with P( X ┐ Y )/P(X), where P(.) is the probability function. The support and confidence of itemsets are calculated during iterations. However, it is difficult to count the support and confidence of non-existing items in transactions. To avoid counting them directly, we can compute the measures through those of positive rules. III. Related Work In Negative Association Rule Mining A new idea to mine strong negative rules presented in [15]. They combine positive frequent itemsets[8] with domain knowledge in the form of taxonomy to mine negative associations. However, their algorithm is hard to generalize since it is domain dependent and requires a predefined taxonomy. Finding negative itemsets involve following steps: (1) first find all the generalized large itemsets in the data (i.e., itemsets at all levels in the taxonomy whose support is greater than the user specified minimum support) (2) next identify the candidate negative itemsets based on the large itemsets and the taxonomy and assign them expected support. (3) in the last step, count the actual support for the candidate itemsets and retain only the negative itemsets .The interest measure RI of negative association rule X ┐Y, as follows RI=(E[support( X U Y )]-support( X U Y))/support(X) Where E[support(X)] is the expected support of an itemset X. A new measure called mininterest, (the argument is that a rule A B is of interest only if supp ( A U B) - supp(A) supp(B) ≥ mininterest) added on top of the support-confidence framework[17]. They consider the itemsets (positive or negative) that exceed minimum support and minimum interest thresholds as itemsets of interest. Although, [17] introduces the “mininterest” parameter, the authors do not discuss how to set it and what would be the impact on the results when changing this parameter. A novel approach has proposed in [16]. In this, mining both positive and negative association rules of interest can be decomposed into the following two sub problems, (1) generate the set of frequent itemsets [8] of interest (PL) and the set of infrequent itemsets of interest (NL) (2) extract positive rules of the form A=>B in PL, and negative rules of the forms A ┐ B, ┐ AB and┐ A ┐ B in NL. To generate PL, NL and negative association rules they developed three functions namely, fipi( ), iipis() and CPIR( ). The most common frame-work in the association rule generation is the “Support-Confidence” one. In [14], authors considered another frame-work called correlation analysis that adds to the support-confidence. In this paper, they combined the two phases (mining frequent itemsets[8] and generating strong association rules) and generated the relevant rules while analyzing the correlations within each candidate itemset. This avoids evaluating item combinations redundantly. Indeed, for each generated candidate itemset, they computed all possible combinations of items to analyze their correlations. At the end, they keep only those rules generated from item combinations with strong correlation. If the correlation is positive, a positive rule is discovered. If the correlation is negative, two negative rules are discovered. The negative rules produced are of the form X ┐Y or ┐ X Y which the authors term as “confined negative association rules”. Here the entire antecedent or consequent is either a conjunction of negated attributes or a conjunction of non-negated attributes. An innovative approach has proposed in [13]. In this generating positive and negative association rules consists of four steps: (1) Generate all positive frequent itemsets L ( P1 ) (ii) for all itemsets I in L( P1 ), generate negative frequent itemsets of the form ┐ ( I1 I2 ) (iii) Generate all negative frequent itemsets ┐ I1 ┐I2 (iv) Generate all negative frequent itemsets I1 ┐ I2 and (v) Generate all valid positive and negative association rules . Authors generated negative rules without adding additional interesting measure(s) to support-confidence frame work. A new and different approach has been proposed in [7]. This is simple but effective. It is not using any additional interesting measures and additional database scans. In this approach, it is finding negative itemsets by replacing a literal in a candidate itemset by its corresponding negated item. If a candidate itemset contains 3 items then it will produce corresponding 3 negative itemsets one for each literal. IV. Discovering Negative Association Rules The most common framework in the association rules generation is the “support-confidence” one. Although these two parameters allow the pruning of many associations that are discovered in data, there are cases when many uninteresting rules may be produced. In this paper we consider another interesting measure
  • 3. Mining Negative Association Rules www.iosrjournals.org 45 | Page called conviction that adds to the support- confidence framework. Next section introduces the measure conviction.  The conviction of a rule is defined as: conv(X=>Y) can be interpreted as the ratio of the expected frequency that X occurs without Y (that is X=>┐Y) if X and Y were independent divided by the observed frequency of incorrect predictions. The range of conviction is 0 to ∞ A. Algorithm MPNAR In this section we propose and explain our algorithm. Algorithm: Mining Negative Association Rules Input: TDB-Transactional Database mins -minimum support minc-minimum confidence Output: Negative Association Rules Method: 1. NARФ /* initially NAR is empty */ 2. Scan the database and find the set of frequent 1.itemsets(F1) 3. for (k=2;Fk-1!=Φ; k++) 4. { 5. Ck= Fk⋈-1 Fk-1 /* generates candidates itemsets */ 6. // Prune using Apriori Property 7. for each i ε Ck, any subset of i is not in Fk-1 then Ck = CK - { i } 8. for each i ε Ck /* perform database scanning to find support */ 9. { 10. s = Support( i); 11. for each A,B (A U B= i ) 12. { 13. if ( Supp(X ┐Y) ≥mins && Conviction(X┐Y) ≤2.0) /* produces NAR of the form X┐Y */ 14. NARNAR U { X┐Y) /* produces NAR of the form ┐X Y */ 15. if ( Supp(┐XY) ≥ mins && Conviction(┐XY) ≤2.0) then 16. NAR NAR U { ┐X Y } 17. } 18. } 19. }  Support ( ┐X ) = 1-support(A)  Support( X∪┐Y ) = support(X) – support(X ∪ Y)  Support( ┐X ∪ Y ) = support(Y) – support(X ∪ Y)  Support (┐X ∪┐Y ) = 1 – support - support(X) – support(Y) + support(X ∪ Y)  The generation of positive rules continues without V. Experimental Results We tested our algorithm with [14]. We consider a transactional database contains many transactions. We tested our algorithm with reference [14] with different minimum supports and minimum confidences. Our algorithm is performing well than one in [14]. Conv( X => Y ) = 1- supp(Y) 1-conf(X=>Y)
  • 4. Mining Negative Association Rules www.iosrjournals.org 46 | Page Fig 1:minimum support =30% and different minimum confidences Fig 2: minimum support =40% and different minimum confidences Fig 3: minimum support =50% and different minimum confidences Fig 4: minimum confidence =60% and different minimum supports Fig 5: minimum confidence =70% and different minimum supports
  • 5. Mining Negative Association Rules www.iosrjournals.org 47 | Page Fig 6: minimum confidence =80% and different minimum supports VI. Conclusion And Future Work In this paper we introduced a new algorithm to generate both positive and negative association rules. Our method adds the conviction to the support-confidence framework to generate stronger positive and negative rules. We compared our algorithm with [14] on a real dataset. We discussed their performances on a transactional database and analyzed experimental results. The results prove that our algorithm can perform better than one in [14]. In future we wish to conduct experiments on some other real datasets and compare the performance of our algorithm with other related algorithms such as reference [7] and reference [16]. References [1] Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Proc. of SIGMOD. (1993) 207–216 [2] Goethals, B., Zaki, M., eds.: FIMI’03: Workshop on Frequent Itemset Mining Implementations. Volume 90 of CEUR Workshop Proceedings series. (2003) http://CEUR-WS.org/Vol-90/. [3] S.Brin, R. Motwani, and C.Silverstein. Beyond market baskets: Generalizing association rules to correlations. In ACM SIGMOD,Tucson, Arizona, 1997. [4] Wu, X., Zhang, C., Zhang, S.: Mining both positive and negative association rules. In: Proc. of ICML. (2002) 658–665 [5] Teng, W., Hsieh, M., Chen, M.: On the mining of substitution rules for statistically dependent items. In: Proc. of ICDM. (2002) 442–449 [6] Savasere, A., Omiecinski, E., Navathe, S.: Mining for strong negative associations in a large database of customer transactions. In: Proc. of ICDE. (1998) 494–502 [7] B.Ramasubbareddy, A.Govardhan, and A.Ramamohanreddy. Mining Positive and Negative Association Rules, IEEE ICSE 2010, Hefeai, China, August 2010. [8] Goethals, B., Zaki, M., eds.: FIMI’03: Workshop on Frequent Itemset Mining Implementations. Volume 90 of CEUR Workshop Proceedings series. (2003) http://CEUR-WS.org/Vol-90/. [9] Teng, W., Hsieh, M., Chen, M.: On the mining of substitution rules for statistically dependent items. In: Proc. of ICDM. (2002) 442–449 [10] Tan, P., Kumar, V.: Interestingness measures for association patterns: A perspective.In: Proc. of Workshop on Postprocessing in Machine Learning and Data Mining. (2000) [11] Gourab Kundu, Md. Monirul Islam, Sirajum Munir, Md. Faizul Bari ACN: An Associative Classifier with Negative Rules 11th IEEE International Conference on Computational Science and Engineering, 2008. [12] Brin,S., Motwani,R. and Silverstein,C., “ Beyond Market Baskets: Generalizing Association Rules to Correlations,” Proc. ACM SIGMOD Conf., pp.265-276, May 1997. [13] Chris Cornelis, peng Yan, Xing Zhang, Guoqing Chen: Mining Positive and Negative Association Rules from Large Databases , IEEE conference 2006. [14] M.L. Antonie and O.R. Za¨ıane, ”Mining Positive and Negative Association Rules: an Approach for Confined Rules”, Proc. Intl. Conf. on Principles and Practice of Knowledge Discovery in Databases, 2004, pp 27–38. [15] Savasere, A., Omiecinski,E., Navathe, S.: Mining for Strong negative associations in a large data base of customer transactions. In: Proc. of ICDE. (1998) 494- 502.. [16] Wu, X., Zhang, C., Zhang, S.: efficient mining both positive and negative association rules. ACM Transactions on Information Systems, Vol. 22, No.3, July 2004,Pages 381-405. [17] Wu, X., Zhang, C., Zhang, S.: Mining both positive and negative association rules.In: Proc. of ICML. (2002) 658–665 [18] Yuan,X., Buckles, B.,Yuan, Z.,Zhang, J.:Mining Negative Association Rules. In: Proc. of ISCC. (2002) 623-629. [19] Honglei Zhu, Zhigang Xu: An Effective Algorithm for Mining Positive and Negative Association Rules. International Conference on Computer Science and Software Engineering 2008.