SlideShare une entreprise Scribd logo
1  sur  11
Télécharger pour lire hors ligne
Ertek, G. and Demiriz, A. (2006). "A framework for visualizing association mining results.”
Lecture Notes in Computer Science, vol: 4263, pp. 593-602.

Note: This is the final draft version of this paper. Please cite this paper (or this final draft) as
above. You can download this final draft from http://research.sabanciuniv.edu.




                               A Framework for Visualizing
                                Association Mining Results


                                    Gürdal Ertek1 and Ayhan Demiriz2


                                          1   Sabanci University
                            Faculty of Engineering and Natural Sciences
                               Orhanli, Tuzla, 34956, Istanbul, Turkey


                               2   Department of Industrial Engineering
                                           Sakarya University
                                         54187, Sakarya, Turkey
A Framework for Visualizing
                 Association Mining Results

                       G¨rdal Ertek1 and Ayhan Demiriz2
                        u
                                1
                                  Sabanci University
                    Faculty of Engineering and Natural Sciences
                      Orhanli, Tuzla, 34956, Istanbul, Turkey
                             ertekg@sabanciuniv.edu
                      2
                        Department of Industrial Engineering
                                 Sakarya University
                               54187, Sakarya, Turkey
                                ademiriz@gmail.com



      Abstract. Association mining is one of the most used data mining tech-
      niques due to interpretable and actionable results. In this study we pro-
      pose a framework to visualize the association mining results, specifically
      frequent itemsets and association rules, as graphs. We demonstrate the
      applicability and usefulness of our approach through a Market Basket
      Analysis (MBA) case study where we visually explore the data mining
      results for a supermarket data set. In this case study we derive several
      interesting insights regarding the relationships among the items and sug-
      gest how they can be used as basis for decision making in retailing.


1   Introduction
Association mining is an increasingly used data mining and business tool among
practitioners and business analysts [7]. Interpretable and actionable results of
association mining can be considered as the major reasons for the popularity
of this type of data mining tools. Association rules can be classified based on
several criteria, as outlined in [11]. In this paper, we focus on single-dimensional,
single-level boolean association rules, in the context of market basket analysis.
    By utilizing efficient algorithms such as Apriori [2] to analyze very large trans-
actional data -frequently from transactional sales data- will result in a large set
of association rules. Commonly, finding the association rules from very large
data sets is considered and emphasized as the most challenging step in associa-
tion mining. Often, results are presented in a text (or table) format with some
querying and sorting functionalities. The rules include “if” clauses by default.
The structure of such rules is as follows: “If the customer purchases Item A, then
with probability C% he/she will buy Item B.”
    This probability, C%, is referred to as confidence level. More formally, the
confidence level can be computed as follows: C = f requency(A∩B) , where A ∩ B
                                                         f requency(A)
refers to the transactions that have both Item A and Item B. Confidence level
is also equivalent to the conditional probability of having Item B given Item A.
2       G¨ rdal Ertek and Ayhan Demiriz
         u

Another important statistic in association mining is the support level. Support
level is basically equal to the fraction of the transactions that have both Item A
and Item B. Thus the support level S is computed as follows: S = f requency(A∩B)
                                                                           T
where T is equal to the total number of the transactions. Left and right hand
sides of the rule are called antecedent and consequent of the rule respectively.
     There exists an extensive literature where a multitude of interestingness mea-
sures for association rules are suggested [22] and efficient data mining algorithms
are presented for deriving these measures. However, there is considerably less
work that focuses on the interpretation of the association mining results.
     Information visualization, a growing field of research in computer science [6,
8, 13], investigates ways of visually representing multi-dimensional data with the
purpose of knowledge discovery. The significance and the impact of information
visualization is reflected by the development and availability of highly user-
friendly and successful software tools such as Miner3D [18], Spotfire [21], Advizor
[1], DBMiner [11], and IBM Intelligent Miner Visualization [15].
     Our motivation for this study stems from the idea that visualizing the results
of association mining can help end-users significantly by enabling them to derive
previously unknown insights. We provide a framework that is easy to implement
(since it simply merges two existing fields of computer science) and that provides
a flexible and human-centered way of discovering insights.
     In spite of successful visualization tools mentioned above, the visualization
of the association mining results in particular is somewhat a fertile field of study.
Some of the studies done in this field are summarized in the next section. We then
introduce our proposed framework in Section 3. We explain our implementation
in Section 4. We report our findings from the case study in Section 5. We then
conclude with future research directions in Section 6.


2   Literature Review

The visualization of association mining results has attracted attention recently,
due to the invention of information visualization schemes such as parallel coor-
dinate plots (||-coords). Here we summarize some of the studies that we believe
are the most interesting.
    Hofmann et al. [14] elegantly visualize association rules through Mosaic plots
and Double Decker plots. While Mosaic plots allow display of association rules
with two items in the antecedent, Double Decker plots enable visualization of
association rules with more items in the antecedent. The interestingness mea-
sure of “differences of confidence” can be directly seen in Double Decker plots.
Discovering intersection and sequential structures are also discussed.
    Kopanakis and Theodoulidis [17] present several visualization schemes for
displaying and exploring association rules, including bar charts, grid form mod-
els, and ||-coords. In their extensive work they also discuss a similarity based
approach for the layout of multiple association rules on the screen.
    Two popular approaches for visualizing association rules are summarized in
[23]: The first, matrix based approach, maps items and itemsets in the antecedent
A Framework for Visualizing Association Mining Results        3

and consequent to the X and Y axes respectively. Wong et al. [23] bring an
alternative to this approach by mapping rules -instead of items- to the X axis.
In the second, directed graph approach, the items and rules are mapped to
nodes and edges of a directed graph respectively. Our framework departs from
the second approach since we map both the items and the rules to nodes.


3   Proposed Framework

We propose a graph-based framework to visualize and interpret the results of
well-known association mining algorithms as directed graphs. In our visualiza-
tions, the items, the itemsets, and the association rules are all represented as
nodes. Edges represent the links between the items and the itemsets or associa-
tions.
    In visualizing frequent itemsets (Figure 1) the nodes that represent the items
are shown with no color, whereas the nodes that represent the itemsets are col-
ored reflecting the cardinality of the itemsets. For example, in our case study the
lightest shade of blue denotes an itemset with two items and the darkest shade
of blue denotes an itemset with four items. The sizes (the areas) of the nodes
show the support levels. The directed edges symbolize which items constitute a
given frequent itemset.
    In visualizing association rules (Figure 4), the items are again represented by
nodes without coloring, and the rules are shown by colored nodes. The node sizes
(the areas) again show the support levels, but this time the node colors show the
confidence levels. In our case study, the confidence levels are shown in a linearly
mapped yellow-red color spectrum with the yellow representing the lowest and
the red representing the highest confidence levels. The directed edges are color-
coded depending on whether they are incoming or outgoing edges. Incoming
edges of the rule nodes are shown in grey and outgoing are shown in black. For
example, in Figure 4 the association rule A01 is indeed (Item 110 ⇒ Item 38).
    The main idea in our framework is to exploit already existing graph drawing
algorithms [3] and software in the information visualization literature [12] for vi-
sualization of association mining results which are generated by already existing
algorithms and software in the data mining literature [11].


4   Steps in Implementing the Framework

To demonstrate our framework we have carried out a case study using a real
word data set from retail industry which we describe in the next section. In this
section, we briefly outline the steps in implementing our framework.
    The first step is to run an efficient implementation of the Apriori algorithm:
We have selected to use the application developed by C. Borgelt which is available
on the internet [4] and is well documented.
    To generate both the frequent itemsets and association rules, it is required to
run Borgelt’s application twice because this particular application is capable of
4       G¨ rdal Ertek and Ayhan Demiriz
         u

generating either the frequent itemsets or the association rules. One important
issue that we paid attention to was using the right options while computing the
support levels of the association rules. The default computation of the support
level in Borgelt’s application is different from the original definition by Agrawal
and Srikant [2]. The option “-o” is included in command line to adhere to the
original definition which we have defined in the Introduction section.
    The second step is to translate the results of the Apriori algorithm into a
graph specification. In our case study, we carried out this step by converting the
support and confidence levels to corresponding node diameters and colors in a
spreadsheet.
    We then created the graph objects based on the calculated specifications in
the previous step. There are a multitude of tools available on the internet for
graph visualization [19]. We have selected the yEd Graph Editor [24] for drawing
our initial graph and generating visually interpretable graph layouts. yEd im-
plements several types of graph drawing algorithms including those that create
hierarchical, organic, orthogonal, and circular layouts. For interested readers, the
detailed information on the algorithms and explanations of the various settings
can be found under the program’s Help menu.
    The final step is to run the available graph layout algorithms and try to
visually discover interesting and actionable insights. Our case study revealed
that different layout algorithms should be selected depending on whether one is
visualizing frequent itemsets or association rules, and on the underlying purpose
of the analysis e.g. catalog design, shelf layout, and promotional pricing.


5     Case Study: Market Basket Analysis

The benchmark data set used in this study is provided at [10] and initially
analyzed in [5] for assortment planning purposes. The data set is composed
of 88,163 transactions and was collected at a Belgian supermarket. There are
16,470 unique items in it. The data set lists only the composition of transactions;
different visits by the same customer cannot be identified and the monetary value
of the transactions are omitted.
    In this section, we present our findings through visual analysis of frequent
itemsets and association rules. Frequent itemset graphs generated by yEd pro-
vided us with guidelines for catalog design and supermarket shelf design. Asso-
ciation rule graph supplied us with inherent relationships between the items and
enabled development of promotional pricing strategies.


5.1   Visualizing Frequent Itemsets

Figure 1 depicts the results of the Apriori algorithm that generated the frequent
itemsets at support level of 2% for the data set. This graph was drawn by
selecting the Classic Organic Layout in yEd. Items that belong to similar frequent
itemsets and have high support levels are placed in close proximity of each other.
From Figure 1, one can easily notice the Items 39, 48, 32, 41, and 38 have very
A Framework for Visualizing Association Mining Results                                                                                                                                                                                                  5


                                                                                                                                                                                           




                                                                                                                               )




                                                                                                                                                                           )
                                                                                                                                                                                                      )
                                                                                                                                                                                                                                            )



                                                                                  )


                                                                                                                                                                                                                 )


                                                                                                                                                                                                                                                    )
                                                                                                                                                                                              )                  
                                                                                )                                                                                                                                             

                                                                                                                                                                                                                                                       )
                                                                                                                                                                                 )
                                                                                                                                                             )

                                                                                                                              )
                                                                                                                                                                                             )
                                                                                                                                                                                                                                
                                                                                                                                                                                                                                                                                )

                                                                                                                                                                                                                  )

                                                                                                                                                                         )
                                                                                                                     )
                                                                                                                                                  )

                                                                                                                                                                                                            )
                                                                                                                                                                                       )

                                                                                                                                                                     
                                                                                              )                                                                                                                                )
                                                                                                                                                                                                                        )



                                                                                                                                                                                                                                                                                  )




                                                                                                      )                                                                                           )
                                                                                                                                                                                                                                )
                                                                                                                                                                                                                                                             )

                                                                                                                                                                                                                         )




                                                                                                                 )
                                                                                                                                                                                                                                                               
                                                                                                                                                                                                                        )


                                                                                                                                                                                              )                                      
                                                                                                                                         )



                                                                                                                                                                   )




                                                                                                     


                                                                                                                                                                   




   Fig. 1. Visualization of the frequent itemsets through a Classic Organic Layout


                                                                                                                                                                                                                                                       )

                                                                                                                                                                                                                                                                                               )
                                                                                                                                                                                                                                                                                  )


                                                                                                                                                                                                                                                                                             )




                                        )
                                                                                                                                                                                                                                                                          )                  
                                                                                                                                                                                                                                                                                                           
                                                                    )
                                                                                               )
                                                                                                                                                                                                                                                             )
                                                                                                                                      )



               )                                                                                                                                                                                                                      )
                                                                                                          )                                                                                                                                                                                            
                                                                                                                                                                                                                                                                         )
                                                                                       )                  
                                                                                                                                                                                                                                                                                              )
              )                                                                                                              
                                                                                                                                                                                                                                                   )
        
                                                                          )
                                                         )

                                       )
                                                                                      )
                                                                                                                                                                                                                                                                                      )
                                                                                                                                                                                                                               )                                 )
                                                                                                           )


        
                                                                )                                                                                                                                                                               
                                 )
                                              )                                                                                                                                                                                                                                                          )
                                                                                                     )
                                                                                )

                                                                                                                                                                                                                                                                                                  )
                     )                                                                                                        )
                                                                                                                 )




                                                                                                                                                                                                                                                                                )
                           )                                                               )
                                                                                                                               )
                                                                                                                                                                                                                                                                                                           )

                                                                                                                  )
                                                                                                                                                                                                                                                                                                     )




Fig. 2. Querying the visualization of the frequent itemsets (a) Selecting a single item.
(b) Selecting three items together.
6      G¨ rdal Ertek and Ayhan Demiriz
        u

high support levels, and form frequent itemsets (with up to four elements) among
themselves. On the upper corner of the figure, there is a set of items which meet
the minimum support level condition, but have no significant associations with
any other item.
    Besides identifying the most significant and interdependent items, one can
also see items that are independent from all but one of the items. For example,
Items 310, 237, and 225 are independent from all items but Item 39. This phe-
nomenon enables us to visually cluster (group) the items into sets that are fairly
independent of each other. In the context of retailing, catalog design requires
selecting sets of items that would be displayed on each page of a catalog. From
Figure 1, one can easily determine that Items 310, 237, and 225 could be on the
same catalog page as Item 39. One might suggest putting Items 39, 48, 32, 41,
and 38 on the same page since these form frequent itemsets with high confidence
levels. However, this would result in a catalog with only one page of popular
items and many pages of much less popular items. We suggest that the popu-
lar items be placed on different pages so that they serve as attractors, drawing
attention to less popular items related to them.
    Another type of insight that can be derived from Figure 1 is the identifica-
tion of items which form frequent itemsets with the same item(s) but do not
form any frequent itemsets with each other. For example, Items 65 and 89 each
independently form frequent itemsets with Items 39 and 48, but do not form
frequent itemsets with one another. These items may be substitute items and
their relationship deserves further investigation.
    The frequent itemset visualization can be enhanced by incorporating inter-
active visual querying capabilities. One such capability could be that once an
item is selected, the items and the frequent itemsets associated with it are high-
lighted. This concept is illustrated in Figure 2 (a) where Item 38 is assumed
to be selected. It can be seen that Item 38 plays a very influential role since it
forms frequent itemsets with seven of the 13 items. When two or more items are
selected, only the frequent itemsets associated with all of them could be high-
lighted. Thus selecting more than one item could serve as a query with an AND
operator. This concept is illustrated in Figure 2 (b) where Items 39, 32 and 38
are assumed to be selected. One can notice in this figure that even though all
the three items have high support levels each (as reflected by their large sizes)
the frequent itemset F27 (in the middle) that consists of all the three items has
a very low support level. When analyzed in more detail it can be seen that this
is mainly due to the low support level of the frequent itemset F13 which consists
of Items 32 and 38. This suggests that the association between Items 32 and 38
is significantly low, and these items can be placed into separate clusters.
    When experimenting with various graph layout algorithms within yEd we
found the Interactive Hierarchical Layout particularly helpful. The obtained vi-
sualization is given in Figure 3, where items are sorted such that the edges have
minimal crossings and related items are positioned in close proximity to each
other. This visualization can be used directly in planning the supermarket shelf
layouts. For example assuming that we would place these items into a single
A Framework for Visualizing Association Mining Results                                                                                                   7

aisle in the supermarket and that all the items have same unit volume, we can
lay out the items according to their positions in Figure 3, allocating shelf space
proportional to their node sizes (support levels).


                                                                                                                                                      
                                                                                                                       
                                                                                                                                                                                                    




)   )   )    )   )   )   )   )   )   )    )   )   )   )   )   )   )   )    )   )        )   )   )   )    )   )   )   )   )   )   )    )   )   )   )




      




Fig. 3. Visualization of the frequent itemsets through an Interactive Hierarchical Lay-
out


    Of course in a real world setting there would be many aisles and the items
would not have the same unit volume. For adopting our approach to the former
situation we can pursue the following steps: We start with the analysis of a
Classic Organic Layout as in Figure 1, and determine from this graph groups of
items that will go into aisles together. When forming the groups we try to make
sure that the total area of the items in each group is roughly the same. Once the
groups are determined we can generate and analyze an Interactive Hierarchical
Layout as in Figure 3 for each group and then decide on the shelf layouts at
each aisle. Incorporating the situation where the items have significantly varying
unit volumes is a more challenging task, since it requires consideration of these
volumes in addition to consideration of the support levels.

5.2               Visualizing Association Rules
Figure 4 depicts the results of the Apriori algorithm that generated the associ-
ation rules at support level of 2% and confidence level of 20% for the data set.
This graph was drawn by selecting the Classic Hierarchical Layout in yEd and
visualizes 27 association rules. In the graph, only rules with a single item in the
antecedent are shown.
    The figure shows which items are “sales drivers” that push the sales of other
items. For example one can observe at the top of the figure that the rules A01
(Item 110 ⇒ Item 38), A02 (Item 36 ⇒ Item 38), and A04 (Item 170 ⇒ Item 38)
all have high confidence levels, as reflected by their red colors. This observation
related with high confidence levels can be verified by seeing that the node sizes
of the rules A01, A02, and A04 are almost same as the node sizes of Items 110,
36, and 170. For increasing the sales of Item 38 in a retail setting we could use
the insights that we gained from Figure 4. Initiating a promotional campaign for
any combination of Items 110, 36, or 170 and placing these items next to Item
38 could boost sales for Item 38. This type of a campaign should especially be
8       G¨ rdal Ertek and Ayhan Demiriz
         u

considered if the unit profits of the driver items (which reside in the antecedent
of the association rule) are lower than the unit profit of the item whose sale is
to be boosted (which resides in the antecedent of the association rule).
    We can also identify Items 32, 89, 65, 310, 237, and 225 as sales drivers since
they have an indegree of zero (except Item 32) and affect sales of “downstream”
items with a high confidence.


6    Conclusions and Future Work

We have introduced a novel framework for knowledge discovery from association
mining results. We demonstrated the applicability of our framework through
a market basket analysis case study where we visualize the frequent itemsets
and binary association rules derived from transactional sales data of a Belgian
supermarket.
    Ideally, the steps in our framework should be carried out automatically by a
single integrated program or at least from within a single modelling and analysis
environment that readily communicates with the association mining and graph
visualization software. Such a software does not currently exist.
    In the retail industry new products emerge and consumer preferences change
constantly. Thus one would be interested in laying the foundation of an analysis
framework that can fit to the dynamic nature of retailing data. Our framework
can be adapted for analysis of frequent itemsets and association rules over time
by incorporating latest research on evolving graphs [9].
    Another avenue of future research is testing other graph visualization algo-
rithms and software [19] and investigating whether new insights can be discov-
ered by their application. For example, Pajek graph visualization software [20]
enables the mapping of attributes to line thickness, which is not possible in yEd.
The visualizations that we presented in this paper were all 2D. It is an inter-
esting research question to determine whether exploring association rules in 3D
can enable new styles of analysis and new types of insights.
    As a final word, we conclude by remarking that visualization of association
mining results in particular, and data mining results in general is a promising
area of future research. Educational, research, government and business institu-
tions can benefit significantly from the symbiosis of data mining and information
visualization disciplines.


References

1. http://www.advizorsolutions.com
2. Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. Proceedings
   of the 20th VLDB Conference,Santiago, Chile. (1994) 487–499
3. Battista, G. D., Eades, P., Tamassia, R., Tollis, I. G.: Graph Drawing: Algorithms
   for the Visualization of Graphs. Prentice Hall PTR.(1998)
4. http://fuzzy.cs.uni-magdeburg.de/∼borgelt/apriori.html
A Framework for Visualizing Association Mining Results                       9



                                                                                   


                             $         $      $     $       $
                                                                                          
                                             




                     $           $                  $



                      




           $             $                 $            $



                             
                                                            




               $     $             $     $        $         $         $



                     
                                                                       




         $                     $            $                 $           $




                                                                




                           $                    $                       $




Fig. 4. Visualization of the association rules through an Interactive Hierarchical Layout
10      G¨rdal Ertek and Ayhan Demiriz
         u

5. Brijs, T., Swinnen, G., Vanhoof, K., Wets, G.: The use of association rules for
   product assortment decisions: a case study. Proceedings of the Fifth International
   Conference on Knowledge Discovery and Data Mining, San Diego (USA), (1999)
   254–260.
6. de Oliveira, M. C. F., Levkowitz, H.: From visual data exploration to visual data
   mining: a survey. IEEE Transactions on Visualization and Computer Graphics 9,
   no.3 (2003) 378–394
7. Demiriz, A.: Enhancing product recommender systems on sparse binary data. Jour-
   nal of Data Mining and Knowledge Discovery. 9, no.2 (2004) 378–394
8. Eick, S. G.: Visual discovery and analysis. IEEE Transactions on Visualization and
   Computer Graphics 6, no.1 (2000) 44–58
9. Erten, D. A., Harding, P. J., Kobourov, S. G., Wampler, K., Yee, G.: GraphAEL:
   Graph animations with evolving layouts. Lecture Notes in Computer Science. 2913,
   (2004) 98–110
10. http://fimi.cs.helsinki.fi/data/
11. Han, J., Kamber, M.: Data Mining Concepts and Techniques. Morgan Kaufman
   Publishers. (2001)
12. Herman, I., Melan¸on, G., Marshall, M. S.: Graph visualization and navigation
                        c
   in information visualization: A survey. IEEE Transactions on Visualization and
   Computer Graphics. 6, no.1 (2000) 24–43
13. Hoffman, P. E., Grinstein, G. G.: A survey of visualizations for high-dimensional
   data mining. Chapter 2, Information visualization in data mining and knowledge
   discovery, Eds: Fayyad, U., Grinstein, G. G., Wierse, A. (2002) 47–82
14. Hofmann, H., Siebes, A. P. J. M., Wilhelm, A. F. X.: Visualizing association rules
   with interactive mosaic plots. Proceedings of the Sixth International Conference on
   Knowledge Discovery and Data Mining. (2000) 227–235
15. http://www-306.ibm.com/software/data/iminer/visualization/
16. Keim, D. A.: Information visualization and visual data mining. IEEE Transactions
   on Visualization and Computer Graphics. 8, no.1 (2002) 1–8
17. Kopanakis, I., Theodoulidis, B.: Visual data mining modeling techniques for the
   visualization of mining outcomes. Journal of Visual Languages and Computing. 14
   (2003) 543–589
18. http://www.miner3d.com/
19. http://www.netvis.org/resources.php
20. http://vlado.fmf.uni-lj.si/pub/networks/pajek/
21. http://www.spotfire.com/
22. Tan, P., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for
   association patterns. Proceedings of SIGKDD. (2002) 32–41
23. Wong, P. C., Whitney, P., Thomas, J.: Visualizing association rules for text mining.
   Proceedings of the 1999 IEEE Symposium on Information Visualization. (1999)
24. http://www.yworks.com/

Contenu connexe

Tendances

An effective adaptive approach for joining data in data
An effective adaptive approach for joining data in dataAn effective adaptive approach for joining data in data
An effective adaptive approach for joining data in dataeSAT Publishing House
 
Semi-Supervised Discriminant Analysis Based On Data Structure
Semi-Supervised Discriminant Analysis Based On Data StructureSemi-Supervised Discriminant Analysis Based On Data Structure
Semi-Supervised Discriminant Analysis Based On Data Structureiosrjce
 
Data mining techniques application for prediction in OLAP cube
Data mining techniques application for prediction in OLAP cubeData mining techniques application for prediction in OLAP cube
Data mining techniques application for prediction in OLAP cubeIJECEIAES
 
Comparative study of various supervisedclassification methodsforanalysing def...
Comparative study of various supervisedclassification methodsforanalysing def...Comparative study of various supervisedclassification methodsforanalysing def...
Comparative study of various supervisedclassification methodsforanalysing def...eSAT Publishing House
 
Effect of elements on linear elastic stress analysis
Effect of elements on linear elastic stress analysisEffect of elements on linear elastic stress analysis
Effect of elements on linear elastic stress analysiseSAT Publishing House
 
An experimental evaluation of similarity-based and embedding-based link predi...
An experimental evaluation of similarity-based and embedding-based link predi...An experimental evaluation of similarity-based and embedding-based link predi...
An experimental evaluation of similarity-based and embedding-based link predi...IJDKP
 
IRJET- Customer Segmentation from Massive Customer Transaction Data
IRJET- Customer Segmentation from Massive Customer Transaction DataIRJET- Customer Segmentation from Massive Customer Transaction Data
IRJET- Customer Segmentation from Massive Customer Transaction DataIRJET Journal
 
UNIT III NON LINEAR DATA STRUCTURES – TREES
UNIT III 	NON LINEAR DATA STRUCTURES – TREESUNIT III 	NON LINEAR DATA STRUCTURES – TREES
UNIT III NON LINEAR DATA STRUCTURES – TREESKathirvel Ayyaswamy
 
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIESENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIEScsandit
 
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUES
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUESGI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUES
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUESAM Publications
 
Application Of Extreme Value Theory To Bursts Prediction
Application Of Extreme Value Theory To Bursts PredictionApplication Of Extreme Value Theory To Bursts Prediction
Application Of Extreme Value Theory To Bursts PredictionCSCJournals
 
PATTERN GENERATION FOR COMPLEX DATA USING HYBRID MINING
PATTERN GENERATION FOR COMPLEX DATA USING HYBRID MININGPATTERN GENERATION FOR COMPLEX DATA USING HYBRID MINING
PATTERN GENERATION FOR COMPLEX DATA USING HYBRID MININGIJDKP
 
IRJET- Customer Relationship and Management System
IRJET-  	  Customer Relationship and Management SystemIRJET-  	  Customer Relationship and Management System
IRJET- Customer Relationship and Management SystemIRJET Journal
 
Fuzzy logic applications for data acquisition systems of practical measurement
Fuzzy logic applications for data acquisition systems  of practical measurement Fuzzy logic applications for data acquisition systems  of practical measurement
Fuzzy logic applications for data acquisition systems of practical measurement IJECEIAES
 

Tendances (16)

Ijetcas14 347
Ijetcas14 347Ijetcas14 347
Ijetcas14 347
 
An effective adaptive approach for joining data in data
An effective adaptive approach for joining data in dataAn effective adaptive approach for joining data in data
An effective adaptive approach for joining data in data
 
Semi-Supervised Discriminant Analysis Based On Data Structure
Semi-Supervised Discriminant Analysis Based On Data StructureSemi-Supervised Discriminant Analysis Based On Data Structure
Semi-Supervised Discriminant Analysis Based On Data Structure
 
Data mining techniques application for prediction in OLAP cube
Data mining techniques application for prediction in OLAP cubeData mining techniques application for prediction in OLAP cube
Data mining techniques application for prediction in OLAP cube
 
Comparative study of various supervisedclassification methodsforanalysing def...
Comparative study of various supervisedclassification methodsforanalysing def...Comparative study of various supervisedclassification methodsforanalysing def...
Comparative study of various supervisedclassification methodsforanalysing def...
 
F04463437
F04463437F04463437
F04463437
 
Effect of elements on linear elastic stress analysis
Effect of elements on linear elastic stress analysisEffect of elements on linear elastic stress analysis
Effect of elements on linear elastic stress analysis
 
An experimental evaluation of similarity-based and embedding-based link predi...
An experimental evaluation of similarity-based and embedding-based link predi...An experimental evaluation of similarity-based and embedding-based link predi...
An experimental evaluation of similarity-based and embedding-based link predi...
 
IRJET- Customer Segmentation from Massive Customer Transaction Data
IRJET- Customer Segmentation from Massive Customer Transaction DataIRJET- Customer Segmentation from Massive Customer Transaction Data
IRJET- Customer Segmentation from Massive Customer Transaction Data
 
UNIT III NON LINEAR DATA STRUCTURES – TREES
UNIT III 	NON LINEAR DATA STRUCTURES – TREESUNIT III 	NON LINEAR DATA STRUCTURES – TREES
UNIT III NON LINEAR DATA STRUCTURES – TREES
 
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIESENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
ENHANCING KEYWORD SEARCH OVER RELATIONAL DATABASES USING ONTOLOGIES
 
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUES
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUESGI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUES
GI-ANFIS APPROACH FOR ENVISAGE HEART ATTACK DISEASE USING DATA MINING TECHNIQUES
 
Application Of Extreme Value Theory To Bursts Prediction
Application Of Extreme Value Theory To Bursts PredictionApplication Of Extreme Value Theory To Bursts Prediction
Application Of Extreme Value Theory To Bursts Prediction
 
PATTERN GENERATION FOR COMPLEX DATA USING HYBRID MINING
PATTERN GENERATION FOR COMPLEX DATA USING HYBRID MININGPATTERN GENERATION FOR COMPLEX DATA USING HYBRID MINING
PATTERN GENERATION FOR COMPLEX DATA USING HYBRID MINING
 
IRJET- Customer Relationship and Management System
IRJET-  	  Customer Relationship and Management SystemIRJET-  	  Customer Relationship and Management System
IRJET- Customer Relationship and Management System
 
Fuzzy logic applications for data acquisition systems of practical measurement
Fuzzy logic applications for data acquisition systems  of practical measurement Fuzzy logic applications for data acquisition systems  of practical measurement
Fuzzy logic applications for data acquisition systems of practical measurement
 

En vedette

Developing Competitive Strategies in Higher Education through Visual Data Mining
Developing Competitive Strategies in Higher Education through Visual Data MiningDeveloping Competitive Strategies in Higher Education through Visual Data Mining
Developing Competitive Strategies in Higher Education through Visual Data Miningertekg
 
Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...
Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...
Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...ertekg
 
Anastásia Projetos Especiais
Anastásia Projetos EspeciaisAnastásia Projetos Especiais
Anastásia Projetos EspeciaisTássia Jaeger
 
Best B2B Ecommerce Solution In India, China, Taiwan, Germany & Korea
Best B2B Ecommerce Solution In India, China, Taiwan, Germany & KoreaBest B2B Ecommerce Solution In India, China, Taiwan, Germany & Korea
Best B2B Ecommerce Solution In India, China, Taiwan, Germany & KoreaPrathamesh Landge
 
INFOGRAPHIC: How to Measure Your Upfront/NewFront ROI
INFOGRAPHIC: How to Measure Your Upfront/NewFront ROIINFOGRAPHIC: How to Measure Your Upfront/NewFront ROI
INFOGRAPHIC: How to Measure Your Upfront/NewFront ROIagencyEA
 
Vocabulary- Adelina's Whales
Vocabulary- Adelina's WhalesVocabulary- Adelina's Whales
Vocabulary- Adelina's Whaleslindaecc
 
Clear Language report for Draft_White_Paper_PGHD_Policy_Framework
Clear Language report for Draft_White_Paper_PGHD_Policy_FrameworkClear Language report for Draft_White_Paper_PGHD_Policy_Framework
Clear Language report for Draft_White_Paper_PGHD_Policy_FrameworkDon Taylor
 

En vedette (12)

Readme
ReadmeReadme
Readme
 
Developing Competitive Strategies in Higher Education through Visual Data Mining
Developing Competitive Strategies in Higher Education through Visual Data MiningDeveloping Competitive Strategies in Higher Education through Visual Data Mining
Developing Competitive Strategies in Higher Education through Visual Data Mining
 
Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...
Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...
Insights into the Efficiencies of On-Shore Wind Turbines: A Data-Centric Anal...
 
Anastásia Projetos Especiais
Anastásia Projetos EspeciaisAnastásia Projetos Especiais
Anastásia Projetos Especiais
 
Motivacion jmg
Motivacion jmgMotivacion jmg
Motivacion jmg
 
PaulKozLetter-2
PaulKozLetter-2PaulKozLetter-2
PaulKozLetter-2
 
Best B2B Ecommerce Solution In India, China, Taiwan, Germany & Korea
Best B2B Ecommerce Solution In India, China, Taiwan, Germany & KoreaBest B2B Ecommerce Solution In India, China, Taiwan, Germany & Korea
Best B2B Ecommerce Solution In India, China, Taiwan, Germany & Korea
 
INFOGRAPHIC: How to Measure Your Upfront/NewFront ROI
INFOGRAPHIC: How to Measure Your Upfront/NewFront ROIINFOGRAPHIC: How to Measure Your Upfront/NewFront ROI
INFOGRAPHIC: How to Measure Your Upfront/NewFront ROI
 
Vocabulary- Adelina's Whales
Vocabulary- Adelina's WhalesVocabulary- Adelina's Whales
Vocabulary- Adelina's Whales
 
Clear Language report for Draft_White_Paper_PGHD_Policy_Framework
Clear Language report for Draft_White_Paper_PGHD_Policy_FrameworkClear Language report for Draft_White_Paper_PGHD_Policy_Framework
Clear Language report for Draft_White_Paper_PGHD_Policy_Framework
 
Android Kit Kat
Android Kit KatAndroid Kit Kat
Android Kit Kat
 
Big Data Readiness Assessment
Big Data Readiness AssessmentBig Data Readiness Assessment
Big Data Readiness Assessment
 

Similaire à Visualizing Association Mining Results

A framework for visualizing association mining results
A framework for visualizing association mining resultsA framework for visualizing association mining results
A framework for visualizing association mining resultsGurdal Ertek
 
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-MiningLinking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-Miningertekg
 
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-MiningLinking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-MiningGurdal Ertek
 
Re-Mining Association Mining Results through Visualization, Data Envelopment ...
Re-Mining Association Mining Results through Visualization, Data Envelopment ...Re-Mining Association Mining Results through Visualization, Data Envelopment ...
Re-Mining Association Mining Results through Visualization, Data Envelopment ...Gurdal Ertek
 
Recommendation system using unsupervised machine learning algorithm & assoc
Recommendation system using unsupervised machine learning algorithm & assocRecommendation system using unsupervised machine learning algorithm & assoc
Recommendation system using unsupervised machine learning algorithm & associjerd
 
Developing Competitive Strategies in Higher Education through Visual Data Mining
Developing Competitive Strategies in Higher Education through Visual Data MiningDeveloping Competitive Strategies in Higher Education through Visual Data Mining
Developing Competitive Strategies in Higher Education through Visual Data MiningGurdal Ertek
 
Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...ertekg
 
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...IJERA Editor
 
Estimating project development effort using clustered regression approach
Estimating project development effort using clustered regression approachEstimating project development effort using clustered regression approach
Estimating project development effort using clustered regression approachcsandit
 
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACHESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACHcscpconf
 
A Framework for Automated Association Mining Over Multiple Databases
A Framework for Automated Association Mining Over Multiple DatabasesA Framework for Automated Association Mining Over Multiple Databases
A Framework for Automated Association Mining Over Multiple Databasesertekg
 
Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...Gurdal Ertek
 
Usage and Research Challenges in the Area of Frequent Pattern in Data Mining
Usage and Research Challenges in the Area of Frequent Pattern in Data MiningUsage and Research Challenges in the Area of Frequent Pattern in Data Mining
Usage and Research Challenges in the Area of Frequent Pattern in Data MiningIOSR Journals
 
A Novel Multi- Viewpoint based Similarity Measure for Document Clustering
A Novel Multi- Viewpoint based Similarity Measure for Document ClusteringA Novel Multi- Viewpoint based Similarity Measure for Document Clustering
A Novel Multi- Viewpoint based Similarity Measure for Document ClusteringIJMER
 
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...acijjournal
 
A Framework for Automated Association Mining Over Multiple Databases
A Framework for Automated Association Mining Over Multiple DatabasesA Framework for Automated Association Mining Over Multiple Databases
A Framework for Automated Association Mining Over Multiple DatabasesGurdal Ertek
 
Pak eko 4412ijdms01
Pak eko 4412ijdms01Pak eko 4412ijdms01
Pak eko 4412ijdms01hyuviridvic
 
Re-mining Positive and Negative Association Mining Results
Re-mining Positive and Negative Association Mining ResultsRe-mining Positive and Negative Association Mining Results
Re-mining Positive and Negative Association Mining Resultsertekg
 

Similaire à Visualizing Association Mining Results (20)

A framework for visualizing association mining results
A framework for visualizing association mining resultsA framework for visualizing association mining results
A framework for visualizing association mining results
 
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-MiningLinking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
 
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-MiningLinking Behavioral Patterns to Personal Attributes through Data Re-Mining
Linking Behavioral Patterns to Personal Attributes through Data Re-Mining
 
Re-Mining Association Mining Results through Visualization, Data Envelopment ...
Re-Mining Association Mining Results through Visualization, Data Envelopment ...Re-Mining Association Mining Results through Visualization, Data Envelopment ...
Re-Mining Association Mining Results through Visualization, Data Envelopment ...
 
Recommendation system using unsupervised machine learning algorithm & assoc
Recommendation system using unsupervised machine learning algorithm & assocRecommendation system using unsupervised machine learning algorithm & assoc
Recommendation system using unsupervised machine learning algorithm & assoc
 
G1803054653
G1803054653G1803054653
G1803054653
 
Developing Competitive Strategies in Higher Education through Visual Data Mining
Developing Competitive Strategies in Higher Education through Visual Data MiningDeveloping Competitive Strategies in Higher Education through Visual Data Mining
Developing Competitive Strategies in Higher Education through Visual Data Mining
 
M033059064
M033059064M033059064
M033059064
 
Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...
 
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
Comparison of Cost Estimation Methods using Hybrid Artificial Intelligence on...
 
Estimating project development effort using clustered regression approach
Estimating project development effort using clustered regression approachEstimating project development effort using clustered regression approach
Estimating project development effort using clustered regression approach
 
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACHESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
 
A Framework for Automated Association Mining Over Multiple Databases
A Framework for Automated Association Mining Over Multiple DatabasesA Framework for Automated Association Mining Over Multiple Databases
A Framework for Automated Association Mining Over Multiple Databases
 
Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...Analyzing the solutions of DEA through information visualization and data min...
Analyzing the solutions of DEA through information visualization and data min...
 
Usage and Research Challenges in the Area of Frequent Pattern in Data Mining
Usage and Research Challenges in the Area of Frequent Pattern in Data MiningUsage and Research Challenges in the Area of Frequent Pattern in Data Mining
Usage and Research Challenges in the Area of Frequent Pattern in Data Mining
 
A Novel Multi- Viewpoint based Similarity Measure for Document Clustering
A Novel Multi- Viewpoint based Similarity Measure for Document ClusteringA Novel Multi- Viewpoint based Similarity Measure for Document Clustering
A Novel Multi- Viewpoint based Similarity Measure for Document Clustering
 
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
MAP/REDUCE DESIGN AND IMPLEMENTATION OF APRIORIALGORITHM FOR HANDLING VOLUMIN...
 
A Framework for Automated Association Mining Over Multiple Databases
A Framework for Automated Association Mining Over Multiple DatabasesA Framework for Automated Association Mining Over Multiple Databases
A Framework for Automated Association Mining Over Multiple Databases
 
Pak eko 4412ijdms01
Pak eko 4412ijdms01Pak eko 4412ijdms01
Pak eko 4412ijdms01
 
Re-mining Positive and Negative Association Mining Results
Re-mining Positive and Negative Association Mining ResultsRe-mining Positive and Negative Association Mining Results
Re-mining Positive and Negative Association Mining Results
 

Plus de ertekg

Rule-based expert systems for supporting university students
Rule-based expert systems for supporting university studentsRule-based expert systems for supporting university students
Rule-based expert systems for supporting university studentsertekg
 
Optimizing the electric charge station network of EŞARJ
Optimizing the electric charge station network of EŞARJOptimizing the electric charge station network of EŞARJ
Optimizing the electric charge station network of EŞARJertekg
 
Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...
Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...
Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...ertekg
 
Industrial Benchmarking through Information Visualization and Data Envelopmen...
Industrial Benchmarking through Information Visualization and Data Envelopmen...Industrial Benchmarking through Information Visualization and Data Envelopmen...
Industrial Benchmarking through Information Visualization and Data Envelopmen...ertekg
 
Modelling the supply chain perception gaps
Modelling the supply chain perception gapsModelling the supply chain perception gaps
Modelling the supply chain perception gapsertekg
 
Risk Factors and Identifiers for Alzheimer’s Disease: A Data Mining Analysis
Risk Factors and Identifiers for Alzheimer’s Disease:  A Data Mining AnalysisRisk Factors and Identifiers for Alzheimer’s Disease:  A Data Mining Analysis
Risk Factors and Identifiers for Alzheimer’s Disease: A Data Mining Analysisertekg
 
Text Mining with RapidMiner
Text Mining with RapidMinerText Mining with RapidMiner
Text Mining with RapidMinerertekg
 
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...ertekg
 
Supplier and Buyer Driven Channels in a Two-Stage Supply Chain
Supplier and Buyer Driven Channels in a  Two-Stage Supply ChainSupplier and Buyer Driven Channels in a  Two-Stage Supply Chain
Supplier and Buyer Driven Channels in a Two-Stage Supply Chainertekg
 
Simulation Modeling For Quality And Productivity In Steel Cord Manufacturing
Simulation Modeling For Quality And Productivity In Steel Cord ManufacturingSimulation Modeling For Quality And Productivity In Steel Cord Manufacturing
Simulation Modeling For Quality And Productivity In Steel Cord Manufacturingertekg
 
Visual and analytical mining of transactions data for production planning f...
Visual and analytical mining of transactions data  for production planning  f...Visual and analytical mining of transactions data  for production planning  f...
Visual and analytical mining of transactions data for production planning f...ertekg
 
A Tutorial On Crossdocking
A Tutorial On CrossdockingA Tutorial On Crossdocking
A Tutorial On Crossdockingertekg
 
Application Of Local Search Methods For Solving A Quadratic Assignment Probl...
Application Of Local Search Methods For Solving  A Quadratic Assignment Probl...Application Of Local Search Methods For Solving  A Quadratic Assignment Probl...
Application Of Local Search Methods For Solving A Quadratic Assignment Probl...ertekg
 
Financial Benchmarking Of Transportation Companies In The New York Stock Exc...
Financial Benchmarking Of Transportation Companies  In The New York Stock Exc...Financial Benchmarking Of Transportation Companies  In The New York Stock Exc...
Financial Benchmarking Of Transportation Companies In The New York Stock Exc...ertekg
 
Optimizing Waste Collection In An Organized Industrial Region: A Case Study
Optimizing Waste Collection  In An Organized Industrial Region: A Case StudyOptimizing Waste Collection  In An Organized Industrial Region: A Case Study
Optimizing Waste Collection In An Organized Industrial Region: A Case Studyertekg
 
Demonstrating Warehousing Concepts Through Interactive Animations
Demonstrating Warehousing Concepts Through Interactive AnimationsDemonstrating Warehousing Concepts Through Interactive Animations
Demonstrating Warehousing Concepts Through Interactive Animationsertekg
 
Compiere kurulumu
Compiere kurulumuCompiere kurulumu
Compiere kurulumuertekg
 
Application of the Cutting Stock Problem to a Construction Company: A Case Study
Application of the Cutting Stock Problem to a Construction Company: A Case StudyApplication of the Cutting Stock Problem to a Construction Company: A Case Study
Application of the Cutting Stock Problem to a Construction Company: A Case Studyertekg
 
Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...
Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...
Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...ertekg
 
Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...
Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...
Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...ertekg
 

Plus de ertekg (20)

Rule-based expert systems for supporting university students
Rule-based expert systems for supporting university studentsRule-based expert systems for supporting university students
Rule-based expert systems for supporting university students
 
Optimizing the electric charge station network of EŞARJ
Optimizing the electric charge station network of EŞARJOptimizing the electric charge station network of EŞARJ
Optimizing the electric charge station network of EŞARJ
 
Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...
Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...
Competitiveness of Top 100 U.S. Universities: A Benchmark Study Using Data En...
 
Industrial Benchmarking through Information Visualization and Data Envelopmen...
Industrial Benchmarking through Information Visualization and Data Envelopmen...Industrial Benchmarking through Information Visualization and Data Envelopmen...
Industrial Benchmarking through Information Visualization and Data Envelopmen...
 
Modelling the supply chain perception gaps
Modelling the supply chain perception gapsModelling the supply chain perception gaps
Modelling the supply chain perception gaps
 
Risk Factors and Identifiers for Alzheimer’s Disease: A Data Mining Analysis
Risk Factors and Identifiers for Alzheimer’s Disease:  A Data Mining AnalysisRisk Factors and Identifiers for Alzheimer’s Disease:  A Data Mining Analysis
Risk Factors and Identifiers for Alzheimer’s Disease: A Data Mining Analysis
 
Text Mining with RapidMiner
Text Mining with RapidMinerText Mining with RapidMiner
Text Mining with RapidMiner
 
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...
Competitive Pattern-Based Strategies under Complexity: The Case of Turkish Ma...
 
Supplier and Buyer Driven Channels in a Two-Stage Supply Chain
Supplier and Buyer Driven Channels in a  Two-Stage Supply ChainSupplier and Buyer Driven Channels in a  Two-Stage Supply Chain
Supplier and Buyer Driven Channels in a Two-Stage Supply Chain
 
Simulation Modeling For Quality And Productivity In Steel Cord Manufacturing
Simulation Modeling For Quality And Productivity In Steel Cord ManufacturingSimulation Modeling For Quality And Productivity In Steel Cord Manufacturing
Simulation Modeling For Quality And Productivity In Steel Cord Manufacturing
 
Visual and analytical mining of transactions data for production planning f...
Visual and analytical mining of transactions data  for production planning  f...Visual and analytical mining of transactions data  for production planning  f...
Visual and analytical mining of transactions data for production planning f...
 
A Tutorial On Crossdocking
A Tutorial On CrossdockingA Tutorial On Crossdocking
A Tutorial On Crossdocking
 
Application Of Local Search Methods For Solving A Quadratic Assignment Probl...
Application Of Local Search Methods For Solving  A Quadratic Assignment Probl...Application Of Local Search Methods For Solving  A Quadratic Assignment Probl...
Application Of Local Search Methods For Solving A Quadratic Assignment Probl...
 
Financial Benchmarking Of Transportation Companies In The New York Stock Exc...
Financial Benchmarking Of Transportation Companies  In The New York Stock Exc...Financial Benchmarking Of Transportation Companies  In The New York Stock Exc...
Financial Benchmarking Of Transportation Companies In The New York Stock Exc...
 
Optimizing Waste Collection In An Organized Industrial Region: A Case Study
Optimizing Waste Collection  In An Organized Industrial Region: A Case StudyOptimizing Waste Collection  In An Organized Industrial Region: A Case Study
Optimizing Waste Collection In An Organized Industrial Region: A Case Study
 
Demonstrating Warehousing Concepts Through Interactive Animations
Demonstrating Warehousing Concepts Through Interactive AnimationsDemonstrating Warehousing Concepts Through Interactive Animations
Demonstrating Warehousing Concepts Through Interactive Animations
 
Compiere kurulumu
Compiere kurulumuCompiere kurulumu
Compiere kurulumu
 
Application of the Cutting Stock Problem to a Construction Company: A Case Study
Application of the Cutting Stock Problem to a Construction Company: A Case StudyApplication of the Cutting Stock Problem to a Construction Company: A Case Study
Application of the Cutting Stock Problem to a Construction Company: A Case Study
 
Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...
Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...
Benchmarking The Turkish Apparel Retail Industry Through Data Envelopment Ana...
 
Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...
Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...
Visual Mining of Science Citation Data for Benchmarking Scientific and Techno...
 

Dernier

India Consumer 2024 Redacted Sample Report
India Consumer 2024 Redacted Sample ReportIndia Consumer 2024 Redacted Sample Report
India Consumer 2024 Redacted Sample ReportMintel Group
 
Market Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMarket Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMintel Group
 
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...ictsugar
 
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort ServiceCall US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Servicecallgirls2057
 
8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCRashishs7044
 
MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?Olivia Kresic
 
Future Of Sample Report 2024 | Redacted Version
Future Of Sample Report 2024 | Redacted VersionFuture Of Sample Report 2024 | Redacted Version
Future Of Sample Report 2024 | Redacted VersionMintel Group
 
Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Riya Pathan
 
Annual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesAnnual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesKeppelCorporation
 
Marketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent ChirchirMarketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent Chirchirictsugar
 
Islamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in IslamabadIslamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in IslamabadAyesha Khan
 
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607dollysharma2066
 
IoT Insurance Observatory: summary 2024
IoT Insurance Observatory:  summary 2024IoT Insurance Observatory:  summary 2024
IoT Insurance Observatory: summary 2024Matteo Carbone
 
Case study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailCase study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailAriel592675
 
Digital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfDigital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfJos Voskuil
 
8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCRashishs7044
 
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / NcrCall Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncrdollysharma2066
 
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCRashishs7044
 

Dernier (20)

India Consumer 2024 Redacted Sample Report
India Consumer 2024 Redacted Sample ReportIndia Consumer 2024 Redacted Sample Report
India Consumer 2024 Redacted Sample Report
 
Market Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMarket Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 Edition
 
Corporate Profile 47Billion Information Technology
Corporate Profile 47Billion Information TechnologyCorporate Profile 47Billion Information Technology
Corporate Profile 47Billion Information Technology
 
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...
 
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort ServiceCall US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
Call US-88OO1O2216 Call Girls In Mahipalpur Female Escort Service
 
8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR8447779800, Low rate Call girls in Saket Delhi NCR
8447779800, Low rate Call girls in Saket Delhi NCR
 
MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?MAHA Global and IPR: Do Actions Speak Louder Than Words?
MAHA Global and IPR: Do Actions Speak Louder Than Words?
 
Future Of Sample Report 2024 | Redacted Version
Future Of Sample Report 2024 | Redacted VersionFuture Of Sample Report 2024 | Redacted Version
Future Of Sample Report 2024 | Redacted Version
 
Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737Independent Call Girls Andheri Nightlaila 9967584737
Independent Call Girls Andheri Nightlaila 9967584737
 
Annual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesAnnual General Meeting Presentation Slides
Annual General Meeting Presentation Slides
 
Marketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent ChirchirMarketplace and Quality Assurance Presentation - Vincent Chirchir
Marketplace and Quality Assurance Presentation - Vincent Chirchir
 
Islamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in IslamabadIslamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in Islamabad
 
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607FULL ENJOY Call girls in Paharganj Delhi | 8377087607
FULL ENJOY Call girls in Paharganj Delhi | 8377087607
 
IoT Insurance Observatory: summary 2024
IoT Insurance Observatory:  summary 2024IoT Insurance Observatory:  summary 2024
IoT Insurance Observatory: summary 2024
 
Case study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailCase study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detail
 
Digital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfDigital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdf
 
8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR
 
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / NcrCall Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
 
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
8447779800, Low rate Call girls in Kotla Mubarakpur Delhi NCR
 
Call Us ➥9319373153▻Call Girls In North Goa
Call Us ➥9319373153▻Call Girls In North GoaCall Us ➥9319373153▻Call Girls In North Goa
Call Us ➥9319373153▻Call Girls In North Goa
 

Visualizing Association Mining Results

  • 1. Ertek, G. and Demiriz, A. (2006). "A framework for visualizing association mining results.” Lecture Notes in Computer Science, vol: 4263, pp. 593-602. Note: This is the final draft version of this paper. Please cite this paper (or this final draft) as above. You can download this final draft from http://research.sabanciuniv.edu. A Framework for Visualizing Association Mining Results Gürdal Ertek1 and Ayhan Demiriz2 1 Sabanci University Faculty of Engineering and Natural Sciences Orhanli, Tuzla, 34956, Istanbul, Turkey 2 Department of Industrial Engineering Sakarya University 54187, Sakarya, Turkey
  • 2. A Framework for Visualizing Association Mining Results G¨rdal Ertek1 and Ayhan Demiriz2 u 1 Sabanci University Faculty of Engineering and Natural Sciences Orhanli, Tuzla, 34956, Istanbul, Turkey ertekg@sabanciuniv.edu 2 Department of Industrial Engineering Sakarya University 54187, Sakarya, Turkey ademiriz@gmail.com Abstract. Association mining is one of the most used data mining tech- niques due to interpretable and actionable results. In this study we pro- pose a framework to visualize the association mining results, specifically frequent itemsets and association rules, as graphs. We demonstrate the applicability and usefulness of our approach through a Market Basket Analysis (MBA) case study where we visually explore the data mining results for a supermarket data set. In this case study we derive several interesting insights regarding the relationships among the items and sug- gest how they can be used as basis for decision making in retailing. 1 Introduction Association mining is an increasingly used data mining and business tool among practitioners and business analysts [7]. Interpretable and actionable results of association mining can be considered as the major reasons for the popularity of this type of data mining tools. Association rules can be classified based on several criteria, as outlined in [11]. In this paper, we focus on single-dimensional, single-level boolean association rules, in the context of market basket analysis. By utilizing efficient algorithms such as Apriori [2] to analyze very large trans- actional data -frequently from transactional sales data- will result in a large set of association rules. Commonly, finding the association rules from very large data sets is considered and emphasized as the most challenging step in associa- tion mining. Often, results are presented in a text (or table) format with some querying and sorting functionalities. The rules include “if” clauses by default. The structure of such rules is as follows: “If the customer purchases Item A, then with probability C% he/she will buy Item B.” This probability, C%, is referred to as confidence level. More formally, the confidence level can be computed as follows: C = f requency(A∩B) , where A ∩ B f requency(A) refers to the transactions that have both Item A and Item B. Confidence level is also equivalent to the conditional probability of having Item B given Item A.
  • 3. 2 G¨ rdal Ertek and Ayhan Demiriz u Another important statistic in association mining is the support level. Support level is basically equal to the fraction of the transactions that have both Item A and Item B. Thus the support level S is computed as follows: S = f requency(A∩B) T where T is equal to the total number of the transactions. Left and right hand sides of the rule are called antecedent and consequent of the rule respectively. There exists an extensive literature where a multitude of interestingness mea- sures for association rules are suggested [22] and efficient data mining algorithms are presented for deriving these measures. However, there is considerably less work that focuses on the interpretation of the association mining results. Information visualization, a growing field of research in computer science [6, 8, 13], investigates ways of visually representing multi-dimensional data with the purpose of knowledge discovery. The significance and the impact of information visualization is reflected by the development and availability of highly user- friendly and successful software tools such as Miner3D [18], Spotfire [21], Advizor [1], DBMiner [11], and IBM Intelligent Miner Visualization [15]. Our motivation for this study stems from the idea that visualizing the results of association mining can help end-users significantly by enabling them to derive previously unknown insights. We provide a framework that is easy to implement (since it simply merges two existing fields of computer science) and that provides a flexible and human-centered way of discovering insights. In spite of successful visualization tools mentioned above, the visualization of the association mining results in particular is somewhat a fertile field of study. Some of the studies done in this field are summarized in the next section. We then introduce our proposed framework in Section 3. We explain our implementation in Section 4. We report our findings from the case study in Section 5. We then conclude with future research directions in Section 6. 2 Literature Review The visualization of association mining results has attracted attention recently, due to the invention of information visualization schemes such as parallel coor- dinate plots (||-coords). Here we summarize some of the studies that we believe are the most interesting. Hofmann et al. [14] elegantly visualize association rules through Mosaic plots and Double Decker plots. While Mosaic plots allow display of association rules with two items in the antecedent, Double Decker plots enable visualization of association rules with more items in the antecedent. The interestingness mea- sure of “differences of confidence” can be directly seen in Double Decker plots. Discovering intersection and sequential structures are also discussed. Kopanakis and Theodoulidis [17] present several visualization schemes for displaying and exploring association rules, including bar charts, grid form mod- els, and ||-coords. In their extensive work they also discuss a similarity based approach for the layout of multiple association rules on the screen. Two popular approaches for visualizing association rules are summarized in [23]: The first, matrix based approach, maps items and itemsets in the antecedent
  • 4. A Framework for Visualizing Association Mining Results 3 and consequent to the X and Y axes respectively. Wong et al. [23] bring an alternative to this approach by mapping rules -instead of items- to the X axis. In the second, directed graph approach, the items and rules are mapped to nodes and edges of a directed graph respectively. Our framework departs from the second approach since we map both the items and the rules to nodes. 3 Proposed Framework We propose a graph-based framework to visualize and interpret the results of well-known association mining algorithms as directed graphs. In our visualiza- tions, the items, the itemsets, and the association rules are all represented as nodes. Edges represent the links between the items and the itemsets or associa- tions. In visualizing frequent itemsets (Figure 1) the nodes that represent the items are shown with no color, whereas the nodes that represent the itemsets are col- ored reflecting the cardinality of the itemsets. For example, in our case study the lightest shade of blue denotes an itemset with two items and the darkest shade of blue denotes an itemset with four items. The sizes (the areas) of the nodes show the support levels. The directed edges symbolize which items constitute a given frequent itemset. In visualizing association rules (Figure 4), the items are again represented by nodes without coloring, and the rules are shown by colored nodes. The node sizes (the areas) again show the support levels, but this time the node colors show the confidence levels. In our case study, the confidence levels are shown in a linearly mapped yellow-red color spectrum with the yellow representing the lowest and the red representing the highest confidence levels. The directed edges are color- coded depending on whether they are incoming or outgoing edges. Incoming edges of the rule nodes are shown in grey and outgoing are shown in black. For example, in Figure 4 the association rule A01 is indeed (Item 110 ⇒ Item 38). The main idea in our framework is to exploit already existing graph drawing algorithms [3] and software in the information visualization literature [12] for vi- sualization of association mining results which are generated by already existing algorithms and software in the data mining literature [11]. 4 Steps in Implementing the Framework To demonstrate our framework we have carried out a case study using a real word data set from retail industry which we describe in the next section. In this section, we briefly outline the steps in implementing our framework. The first step is to run an efficient implementation of the Apriori algorithm: We have selected to use the application developed by C. Borgelt which is available on the internet [4] and is well documented. To generate both the frequent itemsets and association rules, it is required to run Borgelt’s application twice because this particular application is capable of
  • 5. 4 G¨ rdal Ertek and Ayhan Demiriz u generating either the frequent itemsets or the association rules. One important issue that we paid attention to was using the right options while computing the support levels of the association rules. The default computation of the support level in Borgelt’s application is different from the original definition by Agrawal and Srikant [2]. The option “-o” is included in command line to adhere to the original definition which we have defined in the Introduction section. The second step is to translate the results of the Apriori algorithm into a graph specification. In our case study, we carried out this step by converting the support and confidence levels to corresponding node diameters and colors in a spreadsheet. We then created the graph objects based on the calculated specifications in the previous step. There are a multitude of tools available on the internet for graph visualization [19]. We have selected the yEd Graph Editor [24] for drawing our initial graph and generating visually interpretable graph layouts. yEd im- plements several types of graph drawing algorithms including those that create hierarchical, organic, orthogonal, and circular layouts. For interested readers, the detailed information on the algorithms and explanations of the various settings can be found under the program’s Help menu. The final step is to run the available graph layout algorithms and try to visually discover interesting and actionable insights. Our case study revealed that different layout algorithms should be selected depending on whether one is visualizing frequent itemsets or association rules, and on the underlying purpose of the analysis e.g. catalog design, shelf layout, and promotional pricing. 5 Case Study: Market Basket Analysis The benchmark data set used in this study is provided at [10] and initially analyzed in [5] for assortment planning purposes. The data set is composed of 88,163 transactions and was collected at a Belgian supermarket. There are 16,470 unique items in it. The data set lists only the composition of transactions; different visits by the same customer cannot be identified and the monetary value of the transactions are omitted. In this section, we present our findings through visual analysis of frequent itemsets and association rules. Frequent itemset graphs generated by yEd pro- vided us with guidelines for catalog design and supermarket shelf design. Asso- ciation rule graph supplied us with inherent relationships between the items and enabled development of promotional pricing strategies. 5.1 Visualizing Frequent Itemsets Figure 1 depicts the results of the Apriori algorithm that generated the frequent itemsets at support level of 2% for the data set. This graph was drawn by selecting the Classic Organic Layout in yEd. Items that belong to similar frequent itemsets and have high support levels are placed in close proximity of each other. From Figure 1, one can easily notice the Items 39, 48, 32, 41, and 38 have very
  • 6. A Framework for Visualizing Association Mining Results 5 ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) Fig. 1. Visualization of the frequent itemsets through a Classic Organic Layout ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) Fig. 2. Querying the visualization of the frequent itemsets (a) Selecting a single item. (b) Selecting three items together.
  • 7. 6 G¨ rdal Ertek and Ayhan Demiriz u high support levels, and form frequent itemsets (with up to four elements) among themselves. On the upper corner of the figure, there is a set of items which meet the minimum support level condition, but have no significant associations with any other item. Besides identifying the most significant and interdependent items, one can also see items that are independent from all but one of the items. For example, Items 310, 237, and 225 are independent from all items but Item 39. This phe- nomenon enables us to visually cluster (group) the items into sets that are fairly independent of each other. In the context of retailing, catalog design requires selecting sets of items that would be displayed on each page of a catalog. From Figure 1, one can easily determine that Items 310, 237, and 225 could be on the same catalog page as Item 39. One might suggest putting Items 39, 48, 32, 41, and 38 on the same page since these form frequent itemsets with high confidence levels. However, this would result in a catalog with only one page of popular items and many pages of much less popular items. We suggest that the popu- lar items be placed on different pages so that they serve as attractors, drawing attention to less popular items related to them. Another type of insight that can be derived from Figure 1 is the identifica- tion of items which form frequent itemsets with the same item(s) but do not form any frequent itemsets with each other. For example, Items 65 and 89 each independently form frequent itemsets with Items 39 and 48, but do not form frequent itemsets with one another. These items may be substitute items and their relationship deserves further investigation. The frequent itemset visualization can be enhanced by incorporating inter- active visual querying capabilities. One such capability could be that once an item is selected, the items and the frequent itemsets associated with it are high- lighted. This concept is illustrated in Figure 2 (a) where Item 38 is assumed to be selected. It can be seen that Item 38 plays a very influential role since it forms frequent itemsets with seven of the 13 items. When two or more items are selected, only the frequent itemsets associated with all of them could be high- lighted. Thus selecting more than one item could serve as a query with an AND operator. This concept is illustrated in Figure 2 (b) where Items 39, 32 and 38 are assumed to be selected. One can notice in this figure that even though all the three items have high support levels each (as reflected by their large sizes) the frequent itemset F27 (in the middle) that consists of all the three items has a very low support level. When analyzed in more detail it can be seen that this is mainly due to the low support level of the frequent itemset F13 which consists of Items 32 and 38. This suggests that the association between Items 32 and 38 is significantly low, and these items can be placed into separate clusters. When experimenting with various graph layout algorithms within yEd we found the Interactive Hierarchical Layout particularly helpful. The obtained vi- sualization is given in Figure 3, where items are sorted such that the edges have minimal crossings and related items are positioned in close proximity to each other. This visualization can be used directly in planning the supermarket shelf layouts. For example assuming that we would place these items into a single
  • 8. A Framework for Visualizing Association Mining Results 7 aisle in the supermarket and that all the items have same unit volume, we can lay out the items according to their positions in Figure 3, allocating shelf space proportional to their node sizes (support levels). ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) ) Fig. 3. Visualization of the frequent itemsets through an Interactive Hierarchical Lay- out Of course in a real world setting there would be many aisles and the items would not have the same unit volume. For adopting our approach to the former situation we can pursue the following steps: We start with the analysis of a Classic Organic Layout as in Figure 1, and determine from this graph groups of items that will go into aisles together. When forming the groups we try to make sure that the total area of the items in each group is roughly the same. Once the groups are determined we can generate and analyze an Interactive Hierarchical Layout as in Figure 3 for each group and then decide on the shelf layouts at each aisle. Incorporating the situation where the items have significantly varying unit volumes is a more challenging task, since it requires consideration of these volumes in addition to consideration of the support levels. 5.2 Visualizing Association Rules Figure 4 depicts the results of the Apriori algorithm that generated the associ- ation rules at support level of 2% and confidence level of 20% for the data set. This graph was drawn by selecting the Classic Hierarchical Layout in yEd and visualizes 27 association rules. In the graph, only rules with a single item in the antecedent are shown. The figure shows which items are “sales drivers” that push the sales of other items. For example one can observe at the top of the figure that the rules A01 (Item 110 ⇒ Item 38), A02 (Item 36 ⇒ Item 38), and A04 (Item 170 ⇒ Item 38) all have high confidence levels, as reflected by their red colors. This observation related with high confidence levels can be verified by seeing that the node sizes of the rules A01, A02, and A04 are almost same as the node sizes of Items 110, 36, and 170. For increasing the sales of Item 38 in a retail setting we could use the insights that we gained from Figure 4. Initiating a promotional campaign for any combination of Items 110, 36, or 170 and placing these items next to Item 38 could boost sales for Item 38. This type of a campaign should especially be
  • 9. 8 G¨ rdal Ertek and Ayhan Demiriz u considered if the unit profits of the driver items (which reside in the antecedent of the association rule) are lower than the unit profit of the item whose sale is to be boosted (which resides in the antecedent of the association rule). We can also identify Items 32, 89, 65, 310, 237, and 225 as sales drivers since they have an indegree of zero (except Item 32) and affect sales of “downstream” items with a high confidence. 6 Conclusions and Future Work We have introduced a novel framework for knowledge discovery from association mining results. We demonstrated the applicability of our framework through a market basket analysis case study where we visualize the frequent itemsets and binary association rules derived from transactional sales data of a Belgian supermarket. Ideally, the steps in our framework should be carried out automatically by a single integrated program or at least from within a single modelling and analysis environment that readily communicates with the association mining and graph visualization software. Such a software does not currently exist. In the retail industry new products emerge and consumer preferences change constantly. Thus one would be interested in laying the foundation of an analysis framework that can fit to the dynamic nature of retailing data. Our framework can be adapted for analysis of frequent itemsets and association rules over time by incorporating latest research on evolving graphs [9]. Another avenue of future research is testing other graph visualization algo- rithms and software [19] and investigating whether new insights can be discov- ered by their application. For example, Pajek graph visualization software [20] enables the mapping of attributes to line thickness, which is not possible in yEd. The visualizations that we presented in this paper were all 2D. It is an inter- esting research question to determine whether exploring association rules in 3D can enable new styles of analysis and new types of insights. As a final word, we conclude by remarking that visualization of association mining results in particular, and data mining results in general is a promising area of future research. Educational, research, government and business institu- tions can benefit significantly from the symbiosis of data mining and information visualization disciplines. References 1. http://www.advizorsolutions.com 2. Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. Proceedings of the 20th VLDB Conference,Santiago, Chile. (1994) 487–499 3. Battista, G. D., Eades, P., Tamassia, R., Tollis, I. G.: Graph Drawing: Algorithms for the Visualization of Graphs. Prentice Hall PTR.(1998) 4. http://fuzzy.cs.uni-magdeburg.de/∼borgelt/apriori.html
  • 10. A Framework for Visualizing Association Mining Results 9 $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ $ Fig. 4. Visualization of the association rules through an Interactive Hierarchical Layout
  • 11. 10 G¨rdal Ertek and Ayhan Demiriz u 5. Brijs, T., Swinnen, G., Vanhoof, K., Wets, G.: The use of association rules for product assortment decisions: a case study. Proceedings of the Fifth International Conference on Knowledge Discovery and Data Mining, San Diego (USA), (1999) 254–260. 6. de Oliveira, M. C. F., Levkowitz, H.: From visual data exploration to visual data mining: a survey. IEEE Transactions on Visualization and Computer Graphics 9, no.3 (2003) 378–394 7. Demiriz, A.: Enhancing product recommender systems on sparse binary data. Jour- nal of Data Mining and Knowledge Discovery. 9, no.2 (2004) 378–394 8. Eick, S. G.: Visual discovery and analysis. IEEE Transactions on Visualization and Computer Graphics 6, no.1 (2000) 44–58 9. Erten, D. A., Harding, P. J., Kobourov, S. G., Wampler, K., Yee, G.: GraphAEL: Graph animations with evolving layouts. Lecture Notes in Computer Science. 2913, (2004) 98–110 10. http://fimi.cs.helsinki.fi/data/ 11. Han, J., Kamber, M.: Data Mining Concepts and Techniques. Morgan Kaufman Publishers. (2001) 12. Herman, I., Melan¸on, G., Marshall, M. S.: Graph visualization and navigation c in information visualization: A survey. IEEE Transactions on Visualization and Computer Graphics. 6, no.1 (2000) 24–43 13. Hoffman, P. E., Grinstein, G. G.: A survey of visualizations for high-dimensional data mining. Chapter 2, Information visualization in data mining and knowledge discovery, Eds: Fayyad, U., Grinstein, G. G., Wierse, A. (2002) 47–82 14. Hofmann, H., Siebes, A. P. J. M., Wilhelm, A. F. X.: Visualizing association rules with interactive mosaic plots. Proceedings of the Sixth International Conference on Knowledge Discovery and Data Mining. (2000) 227–235 15. http://www-306.ibm.com/software/data/iminer/visualization/ 16. Keim, D. A.: Information visualization and visual data mining. IEEE Transactions on Visualization and Computer Graphics. 8, no.1 (2002) 1–8 17. Kopanakis, I., Theodoulidis, B.: Visual data mining modeling techniques for the visualization of mining outcomes. Journal of Visual Languages and Computing. 14 (2003) 543–589 18. http://www.miner3d.com/ 19. http://www.netvis.org/resources.php 20. http://vlado.fmf.uni-lj.si/pub/networks/pajek/ 21. http://www.spotfire.com/ 22. Tan, P., Kumar, V., Srivastava, J.: Selecting the right interestingness measure for association patterns. Proceedings of SIGKDD. (2002) 32–41 23. Wong, P. C., Whitney, P., Thomas, J.: Visualizing association rules for text mining. Proceedings of the 1999 IEEE Symposium on Information Visualization. (1999) 24. http://www.yworks.com/