SlideShare a Scribd company logo
1 of 8
Download to read offline
International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014
DOI : 10.5121/ijcses.2014.5205 49
A Vague Improved Markov Model Approach for
Web Page Prediction
Priya Bajaj and Supriya Raheja
Department of Computer Science & Engineering, ITM University
Gurgaon, Haryana 122001, India
ABSTRACT
Today most of the information in all areas is available over the web. It increases the web utilization as
well as attracts the interest of researchers to improve the effectiveness of web access and web utilization.
As the number of web clients gets increased, the bandwidth sharing is performed that decreases the web
access efficiency. Web page prefetching improves the effectiveness of web access by availing the next
required web page before the user demand. It is an intelligent predictive mining that analyze the user web
access history and predict the next page. In this work, vague improved markov model is presented to
perform the prediction. In this work, vague rules are suggested to perform the pruning at different levels of
markov model. Once the prediction table is generated, the association mining will be implemented to
identify the most effective next page. In this paper, an integrated model is suggested to improve the
prediction accuracy and effectiveness.
KEYWORDS
Vague Rule, Markov Model, predictive, Web Usage Mining
1. INTRODUCTION
Web caching or prefetching is one of the adaptive utility or the approach that analyze the web
usage done by the particular user or the users. Based on this analysis, it defines some prediction
approach to identify the next expected visiting page before the user demand. When a user is
reading his current accessed page, the next predicted page is loaded into the user cache memory.
It decreases the loading time for next page access at user end so that the web page retrieval
efficiency will be improved.
The concept of web page prediction is the application comes under the web page mining along
with data mining. When the page access is performed, it comes under the web content mining to
locate and load the predicted page into the cache. When the history of the web server is collected
in the form of user web usage history and presented in the form of web pages. The basic attributes
of web page history is shown in table 1.
International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014
50
Table 1: Effective Attributes of Web Usage DB
Once the information database gets available, the next work is to perform the data mining
operations to prediction. But generally, the size of this kind of datasets is quite large, because of
this to reduce the dataset size, some clustering process is required. The clustering can be static
session based clustering or an intelligent clustering using some analytical approach. Once the
clustering is performed, the identification of the appropriate cluster is performed to that relates the
user existence. This identified cluster is selected as the working dataset based on which the
prediction is performed.
The prediction process is basically to identify the frequency of next visiting pages in relevancy to
the current page. Once the prediction analysis is performed, the association identification is
performed to identify most associated next page. This page is then selected as the next predicted
web page. The basic structural model of this working process is shown in figure 1.
Figure 1 : Basic Structure of Web Page Prediction
International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014
51
In this paper, an improved web page prediction model is presented. The presented work is the
improved with the association of three main concepts: markov model, vague rules and the
association mining. Markov model will work as the intelligent prediction approach that will be
filtered at two different levels using vague rules. Vague will define the intelligent ruleset by
performing the dataset analysis. At the later stage, the association mining will be implemented to
perform the web page prediction for the caching.
In this section, an introduction to the web page prediction is defined with the specification of the
mining dataset and the structure. The structure is the most effective process steps followed by
most of the researchers. In section II, the work done by the earlier researchers for web page
prediction is discussed. In section III, the exploration of proposed vague improved markov model
is defined. In section IV, the conclusion obtained from the work is presented.
2.LITERATURE REVIEW
Lot of work is already done in the area of web page prediction and web caching. In this section,
the work done by the earlier researchers in this area is presented and discussed.
In this paper, author has defined the optimization process to reduce the web information access
and to reduce the error. Author optimized the search mechanism along with encoded search.
Author improved the quality of the search algorithm with the reduction of integration error[1].
Another work on the improvement of web page access was defined by the author. Author
presented the prediction analysis approach to improve the web page caching. The work proposed
by the author considered a realistic prefetching architecture using real and representative traces.
Author implemented the work in real web environment and the obtained results shows the
significant improvement over the existing approaches [2]. In this paper Author present an online
prediction model that does not have an offline component and fit in the memory with good
prediction accuracy. Presented algorithm is based on LZ78 and LZW algorithms that are adapted
for modeling the user navigation in Web. Presented model decreases computational complexities
which are a serious problem in developing online prediction systems. A performance evaluation
is presented using real Web logs. This evaluation shows that Presented model needs much less
memory than PPM family of algorithms with good prediction accuracy [3].
In this paper preliminary work in the area of Web page prediction is presented. The designed and
implemented prototype offers personalized interaction by predicting the user's behavior from
previous Web browsing history. Those predictions are afterwards used to simplify the user's
future interactions. Rather simple and feasible prototype enhancements are offered and discussed.
Its simplicity and effectiveness makes it potentially useful for widespread application [4]. In this
paper, Author presented an improvement over the caching scheme so that the page access
consistency will be improved. Author performed the analysis over the cache parameters in terms
of size, frequency analysis etc so that effective web page modeling under prefetching will be done
[5]. In this paper three different schemes for Web Prefetching and caching are proposed i.e.
Prefetching only, Prefetching with Caching and Prefetching from Caching. Prediction of the next
accessed Web page for prefetching and caching is achieved by modeling the Web log using
Dynamic Nested Markov model. Dynamic Nested Markov model is analyzed on these three
Prefetching and Caching schemes. Experiments have been conducted on real world data sets [6].
V.V.R.Maheswara Rao defined a Markov Prediction Model called HSMP. The HSMP model is
initially predicts the possible wanted categories using Relevance factor, which can be used to
infer the users’ browsing behavior between Web categories. Then predict the pages in predicted
categories using techniques for intelligently combining different order Markov models so that the
resulting model has low state complexity, improved prediction accuracy and retains the coverage
International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014
52
of the all higher order Markov model [7]. R.Khanchana defined an approach that uses a HMM
model to perform the single page analysis. Author uses a directed graph based weightage analysis
approach to identify the integration links between the web pages. Author generated the navigation
path to reduce the size of processing dataset and to perform accurate prediction of the web page.
The obtained results from the system gives the effective ranking the web page in terms of rank
assignment and the page prediction [8]. Naveed Ahmad focused on when a user requests for a
Web page, how to improve the overall performance of Web prefetching mechanism? The
proposed mechanism provides the pages locally available to a user or group of users by utilizing
bandwidth of the network. The server contains an algorithm for the prediction of Web pages and
the prediction of a Web page is based on counting the number of times a page is accessed by a
user from each cluster [9].
Yaser Alosefer has presented an algorithm that is able to detect the potential malicious behavior
of a Web server based on current and past interactions between the Web client and the server and
can also predict possible future behaviors. The prediction algorithm learns from previously
scanned behaviors recorded by a client honeypot system. Author group such behaviors in order to
enable common characteristics to be investigated across these groups [10]. Shreya Dubey has
defined a hybrid model for the Web page prediction. This model includes some intelligent
approaches called SBM, Association rule mining and the Markov model to perform the page
prediction. The author combined two main concepts called the prediction mining and the Web
personalization [11]. Sina Bahram has defined a work on the prediction of Web pages under the
machine learning approaches. Author defined the structural and featured analysis on the Web
pages to identify the individual and the relation features over the Web access. Author has defined
three main datasets to perform the classification process. Author implemented the work in real
environment and obtained results shows the effectiveness of the work [12].
3.PROPOSED WORK
The presented work is the improvement of the existing prediction model with an effective
prediction model with rule based pruning process. The presented process is effectual for large
dataset as the work includes four levels of filtration process. This filtration process includes the
static session based filtration, dynamic clustering process and two layers of pruning process. The
complete work is divided in three main stages called filtration stage, Analysis stage and
Prediction Stage. The basic structure of proposed architecture is shown in figure 2.
International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014
53
Figure 2 : Structure of Proposed Work
3.1 Filtration Stage
The filtration is actually the identification stage of most relevant dataset on which the actually
prediction and analysis process will be performed. When the raw web data is collected, it contains
number of attributes and having a large tupleset. It is not feasible to process on complete dataset
at one time. Because of this, the filtration stage is implied over it perform the dataset reduction.
This dataset reduction process includes the horizontal and vertical filtration. The horizontal
dataset reduction includes the elimination of non required attributes from the dataset and
identifies the most relevant attributes. The vertical dataset reduction process includes the static
and dynamic dataset reduction process. The static dataset reduction is defined in terms of session
based reduction. The session can be defined in terms of time line or the proxy server. In dynamic
dataset reduction is actually the clustering process that will collect the most relevant pages. The
filtration stage process is here shown in figure 3.
Filtration Stage
(Attribute fileration, Static
Session based Filration,
Dynamic Clustering)
Analysis Stage
(Two Level Markov Model
and each Level is refined
under Vague rules)
Prediction Stage
(The Association Mining will
be implied to identify most
associated combination)
International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014
54
Figure 3 : Filtration Process
This filtration stage actually identifies the most relevant sub-dataset on which the actual
prediction process will be performed. This stage will improve the accuracy and efficiency in the
prediction of end result.
3.2 Analysis Stage
This is actually the process stage defined in the paper. The formation of this stage includes an
integrated combination of markov model with vague rule set. In this stage, most relevant cluster
will be accepted as the input dataset on which the markov model will be implemented at two level
. In level 1, the single page analysis will be performed under the frequency analysis.
Figure 4 : Analysis Process
This stage will be followed by the vague ruleset to perform the dataset pruning so that the
irrelevant and less frequency pages will be eliminated from the list. After the pruning process, the
International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014
55
level two analysis will be performed under markov model to perform the associated web page
analysis in combination of two pages. This analysis again include the identification of the
associatively and the frequency of associated page combination. This level 2 analysis will be
followed by the pruning process defined using vague ruleset. The model of this analysis stage is
shown in figure 4.
3.3 Prediction Stage
This stage is actually the conclusion stage where the identification of the most associated page
will be performed. In this stage, the input will be taken as the most effective page pair
combination driven from the anlaysis stage. Now, the association mining will be implemented on
this anlytical dataset to obtain the effective result. The most associated page combination will be
elected as the final cached page.
4. CONCLUSION
In this present work, an effective page prediction model is presented using vague improved
markov model. This paper has presented the conceptual model of the presented work with
detailed exploration of each stage.
References
[1] Yong Zhen Guo, “ Error Correcting Output Coding-based Conditional Random Fields forWeb Page
Prediction”, International Conference on Web Intelligence and Intelligent Agent Technology, IEEE,
2008.
[2] B. de la Ossa, “An Empirical Study on Maximum Latency Saving in Web Prefetching”, International
Conference on Web Intelligence and Intelligent Agent Technology, IEEE, 2009.
[3] Alborz moghaddam, “ Dynamic and memory efficient Web page prediction model using LZ78 and
LZW algorithms”, Proceedings of the 14th International CSI Computer Conference (CSICC'09),
IEEE ,2009.
[4] Andrea Bacic, “ Intelligent Interaction: A Case Study of Web Page Prediction”, Proceedings of the
31st Int. Conf. on Information Technology Interfaces, ITI, 2009.
[5] Yanjun Liu, “Strong Cache Consistency on World Wide Web”, 3rd International Conference on
Advanced Computer Theory and Engineering (ICACTE), 2010.
[6] Bhawna Nigam, “ Analysis of Markov Model On Different Web Prefecting And Caching Schemes”,
IEEE, 2010.
[7] V.V.R.Maheswara Rao, “An Efficient Hybrid Successive Markov Model for Predicting Web User
Usage Behavior using Web Usage Mining”, International Journal of Data Engineering (IJDE) ,2010.
[8] R.Khanchana, “ An Efficient Web Page Prediction Based on Access Time-Length and Frequency”,
IEEE , 2011.
[9] Naveed Ahmad, “ Reducing User Latency in Web Prefetching Using Integrated Techniques”, IEEE,
2011.
[10] Yaser Alosefer, “ Predicting Client-side Attacks via Behaviour Analysis using Honeypot Data”,
IEEE, 2011.
[11] Shreya Dubey," Web Page Prediction using Hybrid Model", International Journal on Computer
Science and Engineering (IJCSE) ,2011.
[12] Sina Bahram," Prediction of Web Page Accessibility Based on Structural and Textual Features", Co-
Located with the 20th International World Wide Web Conference, W4A2011 - Communications
paper, March 28-29, 2011.
International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014
56
Authors
First Author Priya Bajaj, Have completed engineering in Computer Science from
Maharishi Dayanand University, Rohtak in 2012 and pursuing M-tech in Computer
Science from ITM University (2012-2014).
Second Author Supriya Raheja, Assistant Professor, ITM University, is pursuing
her PhD in Computer Science from Banasthali University. She had done her
engineering from Hindu college of Engineering, Sonepat and masters from Guru
Jambeshwar University of Science and Technology, Hisar. Her total Research
publications are thirteen in International Conferences and Journals. She is working as
a Reviewer/Committee member of various International Journals and Conferences.

More Related Content

What's hot

An effective search on web log from most popular downloaded content
An effective search on web log from most popular downloaded contentAn effective search on web log from most popular downloaded content
An effective search on web log from most popular downloaded contentijdpsjournal
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
User Navigation Pattern Prediction from Web Log Data: A Survey
User Navigation Pattern Prediction from Web Log Data:  A SurveyUser Navigation Pattern Prediction from Web Log Data:  A Survey
User Navigation Pattern Prediction from Web Log Data: A SurveyIJMER
 
A NEW IMPROVED WEIGHTED ASSOCIATION RULE MINING WITH DYNAMIC PROGRAMMING APPR...
A NEW IMPROVED WEIGHTED ASSOCIATION RULE MINING WITH DYNAMIC PROGRAMMING APPR...A NEW IMPROVED WEIGHTED ASSOCIATION RULE MINING WITH DYNAMIC PROGRAMMING APPR...
A NEW IMPROVED WEIGHTED ASSOCIATION RULE MINING WITH DYNAMIC PROGRAMMING APPR...cscpconf
 
Recommendation generation by integrating sequential
Recommendation generation by integrating sequentialRecommendation generation by integrating sequential
Recommendation generation by integrating sequentialeSAT Publishing House
 
Recommendation generation by integrating sequential pattern mining and semantics
Recommendation generation by integrating sequential pattern mining and semanticsRecommendation generation by integrating sequential pattern mining and semantics
Recommendation generation by integrating sequential pattern mining and semanticseSAT Journals
 
AN INTELLIGENT OPTIMAL GENETIC MODEL TO INVESTIGATE THE USER USAGE BEHAVIOUR ...
AN INTELLIGENT OPTIMAL GENETIC MODEL TO INVESTIGATE THE USER USAGE BEHAVIOUR ...AN INTELLIGENT OPTIMAL GENETIC MODEL TO INVESTIGATE THE USER USAGE BEHAVIOUR ...
AN INTELLIGENT OPTIMAL GENETIC MODEL TO INVESTIGATE THE USER USAGE BEHAVIOUR ...ijdkp
 
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web Crawler
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web CrawlerIRJET-Deep Web Crawling Efficiently using Dynamic Focused Web Crawler
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web CrawlerIRJET Journal
 
Study on Theoretical Aspects of Virtual Data Integration and its Applications
Study on Theoretical Aspects of Virtual Data Integration and its ApplicationsStudy on Theoretical Aspects of Virtual Data Integration and its Applications
Study on Theoretical Aspects of Virtual Data Integration and its ApplicationsIJERA Editor
 
DESIGN AND IMPLEMENTATION OF CARPOOL DATA ACQUISITION PROGRAM BASED ON WEB CR...
DESIGN AND IMPLEMENTATION OF CARPOOL DATA ACQUISITION PROGRAM BASED ON WEB CR...DESIGN AND IMPLEMENTATION OF CARPOOL DATA ACQUISITION PROGRAM BASED ON WEB CR...
DESIGN AND IMPLEMENTATION OF CARPOOL DATA ACQUISITION PROGRAM BASED ON WEB CR...ijmech
 
User search goal inference and feedback session using fast generalized – fuzz...
User search goal inference and feedback session using fast generalized – fuzz...User search goal inference and feedback session using fast generalized – fuzz...
User search goal inference and feedback session using fast generalized – fuzz...eSAT Publishing House
 

What's hot (13)

An effective search on web log from most popular downloaded content
An effective search on web log from most popular downloaded contentAn effective search on web log from most popular downloaded content
An effective search on web log from most popular downloaded content
 
Keyword query routing
Keyword query routingKeyword query routing
Keyword query routing
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
User Navigation Pattern Prediction from Web Log Data: A Survey
User Navigation Pattern Prediction from Web Log Data:  A SurveyUser Navigation Pattern Prediction from Web Log Data:  A Survey
User Navigation Pattern Prediction from Web Log Data: A Survey
 
A NEW IMPROVED WEIGHTED ASSOCIATION RULE MINING WITH DYNAMIC PROGRAMMING APPR...
A NEW IMPROVED WEIGHTED ASSOCIATION RULE MINING WITH DYNAMIC PROGRAMMING APPR...A NEW IMPROVED WEIGHTED ASSOCIATION RULE MINING WITH DYNAMIC PROGRAMMING APPR...
A NEW IMPROVED WEIGHTED ASSOCIATION RULE MINING WITH DYNAMIC PROGRAMMING APPR...
 
Recommendation generation by integrating sequential
Recommendation generation by integrating sequentialRecommendation generation by integrating sequential
Recommendation generation by integrating sequential
 
Recommendation generation by integrating sequential pattern mining and semantics
Recommendation generation by integrating sequential pattern mining and semanticsRecommendation generation by integrating sequential pattern mining and semantics
Recommendation generation by integrating sequential pattern mining and semantics
 
AN INTELLIGENT OPTIMAL GENETIC MODEL TO INVESTIGATE THE USER USAGE BEHAVIOUR ...
AN INTELLIGENT OPTIMAL GENETIC MODEL TO INVESTIGATE THE USER USAGE BEHAVIOUR ...AN INTELLIGENT OPTIMAL GENETIC MODEL TO INVESTIGATE THE USER USAGE BEHAVIOUR ...
AN INTELLIGENT OPTIMAL GENETIC MODEL TO INVESTIGATE THE USER USAGE BEHAVIOUR ...
 
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web Crawler
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web CrawlerIRJET-Deep Web Crawling Efficiently using Dynamic Focused Web Crawler
IRJET-Deep Web Crawling Efficiently using Dynamic Focused Web Crawler
 
Study on Theoretical Aspects of Virtual Data Integration and its Applications
Study on Theoretical Aspects of Virtual Data Integration and its ApplicationsStudy on Theoretical Aspects of Virtual Data Integration and its Applications
Study on Theoretical Aspects of Virtual Data Integration and its Applications
 
DESIGN AND IMPLEMENTATION OF CARPOOL DATA ACQUISITION PROGRAM BASED ON WEB CR...
DESIGN AND IMPLEMENTATION OF CARPOOL DATA ACQUISITION PROGRAM BASED ON WEB CR...DESIGN AND IMPLEMENTATION OF CARPOOL DATA ACQUISITION PROGRAM BASED ON WEB CR...
DESIGN AND IMPLEMENTATION OF CARPOOL DATA ACQUISITION PROGRAM BASED ON WEB CR...
 
User search goal inference and feedback session using fast generalized – fuzz...
User search goal inference and feedback session using fast generalized – fuzz...User search goal inference and feedback session using fast generalized – fuzz...
User search goal inference and feedback session using fast generalized – fuzz...
 
Web crawling
Web crawlingWeb crawling
Web crawling
 

Viewers also liked

Pcf investigation to improve the
Pcf investigation to improve thePcf investigation to improve the
Pcf investigation to improve theIJCSES Journal
 
Time alignment techniques for experimental sensor data
Time alignment techniques for experimental sensor dataTime alignment techniques for experimental sensor data
Time alignment techniques for experimental sensor dataIJCSES Journal
 
Sept 2014 cloud security presentation
Sept 2014   cloud security presentationSept 2014   cloud security presentation
Sept 2014 cloud security presentationJoan Dembowski
 
FZahir Technology Final
FZahir Technology FinalFZahir Technology Final
FZahir Technology Finalfareehazahir23
 
STUDY ON THE PATH TRACKING AND POSITIONING METHOD OF WHEELED MOBILE ROBOT
STUDY ON THE PATH TRACKING AND POSITIONING METHOD OF WHEELED MOBILE ROBOTSTUDY ON THE PATH TRACKING AND POSITIONING METHOD OF WHEELED MOBILE ROBOT
STUDY ON THE PATH TRACKING AND POSITIONING METHOD OF WHEELED MOBILE ROBOTIJCSES Journal
 
ADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSA
ADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSAADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSA
ADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSAIJCSES Journal
 
発声練習用スライド
発声練習用スライド発声練習用スライド
発声練習用スライドaika_a
 
Semantic web service discovery approaches
Semantic web service discovery approachesSemantic web service discovery approaches
Semantic web service discovery approachesIJCSES Journal
 
PENSE QUE ME MORIA
PENSE QUE ME MORIA PENSE QUE ME MORIA
PENSE QUE ME MORIA David Rivero
 
STUDY OF VARIOUS INTERNET OF THINGS PLATFORMS
STUDY OF VARIOUS INTERNET OF THINGS PLATFORMSSTUDY OF VARIOUS INTERNET OF THINGS PLATFORMS
STUDY OF VARIOUS INTERNET OF THINGS PLATFORMSIJCSES Journal
 
Jpeg image compression using discrete cosine transform a survey
Jpeg image compression using discrete cosine transform   a surveyJpeg image compression using discrete cosine transform   a survey
Jpeg image compression using discrete cosine transform a surveyIJCSES Journal
 
PERFORMANCE EVALUATION OF MC-CDMA SYSTEM OVER RAYLEIGH FADING CHANNEL
PERFORMANCE EVALUATION OF MC-CDMA SYSTEM OVER RAYLEIGH FADING CHANNELPERFORMANCE EVALUATION OF MC-CDMA SYSTEM OVER RAYLEIGH FADING CHANNEL
PERFORMANCE EVALUATION OF MC-CDMA SYSTEM OVER RAYLEIGH FADING CHANNELIJCSES Journal
 
Introduction to Quality Improvement
Introduction to Quality ImprovementIntroduction to Quality Improvement
Introduction to Quality ImprovementJoan Dembowski
 

Viewers also liked (18)

Pcf investigation to improve the
Pcf investigation to improve thePcf investigation to improve the
Pcf investigation to improve the
 
Time alignment techniques for experimental sensor data
Time alignment techniques for experimental sensor dataTime alignment techniques for experimental sensor data
Time alignment techniques for experimental sensor data
 
Sept 2014 cloud security presentation
Sept 2014   cloud security presentationSept 2014   cloud security presentation
Sept 2014 cloud security presentation
 
FZahir Technology Final
FZahir Technology FinalFZahir Technology Final
FZahir Technology Final
 
STUDY ON THE PATH TRACKING AND POSITIONING METHOD OF WHEELED MOBILE ROBOT
STUDY ON THE PATH TRACKING AND POSITIONING METHOD OF WHEELED MOBILE ROBOTSTUDY ON THE PATH TRACKING AND POSITIONING METHOD OF WHEELED MOBILE ROBOT
STUDY ON THE PATH TRACKING AND POSITIONING METHOD OF WHEELED MOBILE ROBOT
 
ADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSA
ADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSAADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSA
ADVANCES IN HIGHER EDUCATIONAL RESOURCE SHARING AND CLOUD SERVICES FOR KSA
 
Feedback that works
Feedback that worksFeedback that works
Feedback that works
 
発声練習用スライド
発声練習用スライド発声練習用スライド
発声練習用スライド
 
Tf2 helloween
Tf2 helloweenTf2 helloween
Tf2 helloween
 
Sav efinal
Sav efinalSav efinal
Sav efinal
 
Semantic web service discovery approaches
Semantic web service discovery approachesSemantic web service discovery approaches
Semantic web service discovery approaches
 
PENSE QUE ME MORIA
PENSE QUE ME MORIA PENSE QUE ME MORIA
PENSE QUE ME MORIA
 
STUDY OF VARIOUS INTERNET OF THINGS PLATFORMS
STUDY OF VARIOUS INTERNET OF THINGS PLATFORMSSTUDY OF VARIOUS INTERNET OF THINGS PLATFORMS
STUDY OF VARIOUS INTERNET OF THINGS PLATFORMS
 
Community fresh organics 3
Community fresh organics 3Community fresh organics 3
Community fresh organics 3
 
Jpeg image compression using discrete cosine transform a survey
Jpeg image compression using discrete cosine transform   a surveyJpeg image compression using discrete cosine transform   a survey
Jpeg image compression using discrete cosine transform a survey
 
PERFORMANCE EVALUATION OF MC-CDMA SYSTEM OVER RAYLEIGH FADING CHANNEL
PERFORMANCE EVALUATION OF MC-CDMA SYSTEM OVER RAYLEIGH FADING CHANNELPERFORMANCE EVALUATION OF MC-CDMA SYSTEM OVER RAYLEIGH FADING CHANNEL
PERFORMANCE EVALUATION OF MC-CDMA SYSTEM OVER RAYLEIGH FADING CHANNEL
 
Introduction to Quality Improvement
Introduction to Quality ImprovementIntroduction to Quality Improvement
Introduction to Quality Improvement
 
Examen resuelto
Examen resueltoExamen resuelto
Examen resuelto
 

Similar to A vague improved markov model approach for web page prediction

Integrating Vague Association Mining with Markov Model
Integrating Vague Association Mining with Markov Model  Integrating Vague Association Mining with Markov Model
Integrating Vague Association Mining with Markov Model ijsc
 
Methodologies on user Behavior Analysis and Future Request Prediction in Web ...
Methodologies on user Behavior Analysis and Future Request Prediction in Web ...Methodologies on user Behavior Analysis and Future Request Prediction in Web ...
Methodologies on user Behavior Analysis and Future Request Prediction in Web ...ijbuiiir1
 
Certain Issues in Web Page Prediction, Classification and Clustering in Data ...
Certain Issues in Web Page Prediction, Classification and Clustering in Data ...Certain Issues in Web Page Prediction, Classification and Clustering in Data ...
Certain Issues in Web Page Prediction, Classification and Clustering in Data ...IJAEMSJORNAL
 
Volume 2-issue-6-2056-2060
Volume 2-issue-6-2056-2060Volume 2-issue-6-2056-2060
Volume 2-issue-6-2056-2060Editor IJARCET
 
3 iaetsd semantic web page recommender system
3 iaetsd semantic web page recommender system3 iaetsd semantic web page recommender system
3 iaetsd semantic web page recommender systemIaetsd Iaetsd
 
COST-SENSITIVE TOPICAL DATA ACQUISITION FROM THE WEB
COST-SENSITIVE TOPICAL DATA ACQUISITION FROM THE WEBCOST-SENSITIVE TOPICAL DATA ACQUISITION FROM THE WEB
COST-SENSITIVE TOPICAL DATA ACQUISITION FROM THE WEBIJDKP
 
User Navigation Pattern Prediction from Web Log Data: A Survey
User Navigation Pattern Prediction from Web Log Data: A SurveyUser Navigation Pattern Prediction from Web Log Data: A Survey
User Navigation Pattern Prediction from Web Log Data: A SurveyIJMER
 
2000-08.doc
2000-08.doc2000-08.doc
2000-08.docbutest
 
2000-08.doc
2000-08.doc2000-08.doc
2000-08.docbutest
 
Prediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining TechniquesPrediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining TechniquesEditor IJCATR
 
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...IRJET Journal
 
Performance of Real Time Web Traffic Analysis Using Feed Forward Neural Netw...
Performance of Real Time Web Traffic Analysis Using Feed  Forward Neural Netw...Performance of Real Time Web Traffic Analysis Using Feed  Forward Neural Netw...
Performance of Real Time Web Traffic Analysis Using Feed Forward Neural Netw...IOSR Journals
 
Pdd crawler a focused web
Pdd crawler  a focused webPdd crawler  a focused web
Pdd crawler a focused webcsandit
 
Web Page Recommendation Using Web Mining
Web Page Recommendation Using Web MiningWeb Page Recommendation Using Web Mining
Web Page Recommendation Using Web MiningIJERA Editor
 
Data mining in web search engine optimization
Data mining in web search engine optimizationData mining in web search engine optimization
Data mining in web search engine optimizationBookStoreLib
 
Annotation for query result records based on domain specific ontology
Annotation for query result records based on domain specific ontologyAnnotation for query result records based on domain specific ontology
Annotation for query result records based on domain specific ontologyijnlc
 
Integrated Web Recommendation Model with Improved Weighted Association Rule M...
Integrated Web Recommendation Model with Improved Weighted Association Rule M...Integrated Web Recommendation Model with Improved Weighted Association Rule M...
Integrated Web Recommendation Model with Improved Weighted Association Rule M...ijdkp
 
Evaluation of models for predicting user’s next request in web usage mining
Evaluation of models for predicting user’s next request in web usage miningEvaluation of models for predicting user’s next request in web usage mining
Evaluation of models for predicting user’s next request in web usage miningIJCI JOURNAL
 

Similar to A vague improved markov model approach for web page prediction (20)

Integrating Vague Association Mining with Markov Model
Integrating Vague Association Mining with Markov Model  Integrating Vague Association Mining with Markov Model
Integrating Vague Association Mining with Markov Model
 
Methodologies on user Behavior Analysis and Future Request Prediction in Web ...
Methodologies on user Behavior Analysis and Future Request Prediction in Web ...Methodologies on user Behavior Analysis and Future Request Prediction in Web ...
Methodologies on user Behavior Analysis and Future Request Prediction in Web ...
 
H0314450
H0314450H0314450
H0314450
 
Certain Issues in Web Page Prediction, Classification and Clustering in Data ...
Certain Issues in Web Page Prediction, Classification and Clustering in Data ...Certain Issues in Web Page Prediction, Classification and Clustering in Data ...
Certain Issues in Web Page Prediction, Classification and Clustering in Data ...
 
Volume 2-issue-6-2056-2060
Volume 2-issue-6-2056-2060Volume 2-issue-6-2056-2060
Volume 2-issue-6-2056-2060
 
3 iaetsd semantic web page recommender system
3 iaetsd semantic web page recommender system3 iaetsd semantic web page recommender system
3 iaetsd semantic web page recommender system
 
COST-SENSITIVE TOPICAL DATA ACQUISITION FROM THE WEB
COST-SENSITIVE TOPICAL DATA ACQUISITION FROM THE WEBCOST-SENSITIVE TOPICAL DATA ACQUISITION FROM THE WEB
COST-SENSITIVE TOPICAL DATA ACQUISITION FROM THE WEB
 
User Navigation Pattern Prediction from Web Log Data: A Survey
User Navigation Pattern Prediction from Web Log Data: A SurveyUser Navigation Pattern Prediction from Web Log Data: A Survey
User Navigation Pattern Prediction from Web Log Data: A Survey
 
A017250106
A017250106A017250106
A017250106
 
2000-08.doc
2000-08.doc2000-08.doc
2000-08.doc
 
2000-08.doc
2000-08.doc2000-08.doc
2000-08.doc
 
Prediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining TechniquesPrediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining Techniques
 
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
Enactment of Firefly Algorithm and Fuzzy C-Means Clustering For Consumer Requ...
 
Performance of Real Time Web Traffic Analysis Using Feed Forward Neural Netw...
Performance of Real Time Web Traffic Analysis Using Feed  Forward Neural Netw...Performance of Real Time Web Traffic Analysis Using Feed  Forward Neural Netw...
Performance of Real Time Web Traffic Analysis Using Feed Forward Neural Netw...
 
Pdd crawler a focused web
Pdd crawler  a focused webPdd crawler  a focused web
Pdd crawler a focused web
 
Web Page Recommendation Using Web Mining
Web Page Recommendation Using Web MiningWeb Page Recommendation Using Web Mining
Web Page Recommendation Using Web Mining
 
Data mining in web search engine optimization
Data mining in web search engine optimizationData mining in web search engine optimization
Data mining in web search engine optimization
 
Annotation for query result records based on domain specific ontology
Annotation for query result records based on domain specific ontologyAnnotation for query result records based on domain specific ontology
Annotation for query result records based on domain specific ontology
 
Integrated Web Recommendation Model with Improved Weighted Association Rule M...
Integrated Web Recommendation Model with Improved Weighted Association Rule M...Integrated Web Recommendation Model with Improved Weighted Association Rule M...
Integrated Web Recommendation Model with Improved Weighted Association Rule M...
 
Evaluation of models for predicting user’s next request in web usage mining
Evaluation of models for predicting user’s next request in web usage miningEvaluation of models for predicting user’s next request in web usage mining
Evaluation of models for predicting user’s next request in web usage mining
 

Recently uploaded

Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

A vague improved markov model approach for web page prediction

  • 1. International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014 DOI : 10.5121/ijcses.2014.5205 49 A Vague Improved Markov Model Approach for Web Page Prediction Priya Bajaj and Supriya Raheja Department of Computer Science & Engineering, ITM University Gurgaon, Haryana 122001, India ABSTRACT Today most of the information in all areas is available over the web. It increases the web utilization as well as attracts the interest of researchers to improve the effectiveness of web access and web utilization. As the number of web clients gets increased, the bandwidth sharing is performed that decreases the web access efficiency. Web page prefetching improves the effectiveness of web access by availing the next required web page before the user demand. It is an intelligent predictive mining that analyze the user web access history and predict the next page. In this work, vague improved markov model is presented to perform the prediction. In this work, vague rules are suggested to perform the pruning at different levels of markov model. Once the prediction table is generated, the association mining will be implemented to identify the most effective next page. In this paper, an integrated model is suggested to improve the prediction accuracy and effectiveness. KEYWORDS Vague Rule, Markov Model, predictive, Web Usage Mining 1. INTRODUCTION Web caching or prefetching is one of the adaptive utility or the approach that analyze the web usage done by the particular user or the users. Based on this analysis, it defines some prediction approach to identify the next expected visiting page before the user demand. When a user is reading his current accessed page, the next predicted page is loaded into the user cache memory. It decreases the loading time for next page access at user end so that the web page retrieval efficiency will be improved. The concept of web page prediction is the application comes under the web page mining along with data mining. When the page access is performed, it comes under the web content mining to locate and load the predicted page into the cache. When the history of the web server is collected in the form of user web usage history and presented in the form of web pages. The basic attributes of web page history is shown in table 1.
  • 2. International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014 50 Table 1: Effective Attributes of Web Usage DB Once the information database gets available, the next work is to perform the data mining operations to prediction. But generally, the size of this kind of datasets is quite large, because of this to reduce the dataset size, some clustering process is required. The clustering can be static session based clustering or an intelligent clustering using some analytical approach. Once the clustering is performed, the identification of the appropriate cluster is performed to that relates the user existence. This identified cluster is selected as the working dataset based on which the prediction is performed. The prediction process is basically to identify the frequency of next visiting pages in relevancy to the current page. Once the prediction analysis is performed, the association identification is performed to identify most associated next page. This page is then selected as the next predicted web page. The basic structural model of this working process is shown in figure 1. Figure 1 : Basic Structure of Web Page Prediction
  • 3. International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014 51 In this paper, an improved web page prediction model is presented. The presented work is the improved with the association of three main concepts: markov model, vague rules and the association mining. Markov model will work as the intelligent prediction approach that will be filtered at two different levels using vague rules. Vague will define the intelligent ruleset by performing the dataset analysis. At the later stage, the association mining will be implemented to perform the web page prediction for the caching. In this section, an introduction to the web page prediction is defined with the specification of the mining dataset and the structure. The structure is the most effective process steps followed by most of the researchers. In section II, the work done by the earlier researchers for web page prediction is discussed. In section III, the exploration of proposed vague improved markov model is defined. In section IV, the conclusion obtained from the work is presented. 2.LITERATURE REVIEW Lot of work is already done in the area of web page prediction and web caching. In this section, the work done by the earlier researchers in this area is presented and discussed. In this paper, author has defined the optimization process to reduce the web information access and to reduce the error. Author optimized the search mechanism along with encoded search. Author improved the quality of the search algorithm with the reduction of integration error[1]. Another work on the improvement of web page access was defined by the author. Author presented the prediction analysis approach to improve the web page caching. The work proposed by the author considered a realistic prefetching architecture using real and representative traces. Author implemented the work in real web environment and the obtained results shows the significant improvement over the existing approaches [2]. In this paper Author present an online prediction model that does not have an offline component and fit in the memory with good prediction accuracy. Presented algorithm is based on LZ78 and LZW algorithms that are adapted for modeling the user navigation in Web. Presented model decreases computational complexities which are a serious problem in developing online prediction systems. A performance evaluation is presented using real Web logs. This evaluation shows that Presented model needs much less memory than PPM family of algorithms with good prediction accuracy [3]. In this paper preliminary work in the area of Web page prediction is presented. The designed and implemented prototype offers personalized interaction by predicting the user's behavior from previous Web browsing history. Those predictions are afterwards used to simplify the user's future interactions. Rather simple and feasible prototype enhancements are offered and discussed. Its simplicity and effectiveness makes it potentially useful for widespread application [4]. In this paper, Author presented an improvement over the caching scheme so that the page access consistency will be improved. Author performed the analysis over the cache parameters in terms of size, frequency analysis etc so that effective web page modeling under prefetching will be done [5]. In this paper three different schemes for Web Prefetching and caching are proposed i.e. Prefetching only, Prefetching with Caching and Prefetching from Caching. Prediction of the next accessed Web page for prefetching and caching is achieved by modeling the Web log using Dynamic Nested Markov model. Dynamic Nested Markov model is analyzed on these three Prefetching and Caching schemes. Experiments have been conducted on real world data sets [6]. V.V.R.Maheswara Rao defined a Markov Prediction Model called HSMP. The HSMP model is initially predicts the possible wanted categories using Relevance factor, which can be used to infer the users’ browsing behavior between Web categories. Then predict the pages in predicted categories using techniques for intelligently combining different order Markov models so that the resulting model has low state complexity, improved prediction accuracy and retains the coverage
  • 4. International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014 52 of the all higher order Markov model [7]. R.Khanchana defined an approach that uses a HMM model to perform the single page analysis. Author uses a directed graph based weightage analysis approach to identify the integration links between the web pages. Author generated the navigation path to reduce the size of processing dataset and to perform accurate prediction of the web page. The obtained results from the system gives the effective ranking the web page in terms of rank assignment and the page prediction [8]. Naveed Ahmad focused on when a user requests for a Web page, how to improve the overall performance of Web prefetching mechanism? The proposed mechanism provides the pages locally available to a user or group of users by utilizing bandwidth of the network. The server contains an algorithm for the prediction of Web pages and the prediction of a Web page is based on counting the number of times a page is accessed by a user from each cluster [9]. Yaser Alosefer has presented an algorithm that is able to detect the potential malicious behavior of a Web server based on current and past interactions between the Web client and the server and can also predict possible future behaviors. The prediction algorithm learns from previously scanned behaviors recorded by a client honeypot system. Author group such behaviors in order to enable common characteristics to be investigated across these groups [10]. Shreya Dubey has defined a hybrid model for the Web page prediction. This model includes some intelligent approaches called SBM, Association rule mining and the Markov model to perform the page prediction. The author combined two main concepts called the prediction mining and the Web personalization [11]. Sina Bahram has defined a work on the prediction of Web pages under the machine learning approaches. Author defined the structural and featured analysis on the Web pages to identify the individual and the relation features over the Web access. Author has defined three main datasets to perform the classification process. Author implemented the work in real environment and obtained results shows the effectiveness of the work [12]. 3.PROPOSED WORK The presented work is the improvement of the existing prediction model with an effective prediction model with rule based pruning process. The presented process is effectual for large dataset as the work includes four levels of filtration process. This filtration process includes the static session based filtration, dynamic clustering process and two layers of pruning process. The complete work is divided in three main stages called filtration stage, Analysis stage and Prediction Stage. The basic structure of proposed architecture is shown in figure 2.
  • 5. International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014 53 Figure 2 : Structure of Proposed Work 3.1 Filtration Stage The filtration is actually the identification stage of most relevant dataset on which the actually prediction and analysis process will be performed. When the raw web data is collected, it contains number of attributes and having a large tupleset. It is not feasible to process on complete dataset at one time. Because of this, the filtration stage is implied over it perform the dataset reduction. This dataset reduction process includes the horizontal and vertical filtration. The horizontal dataset reduction includes the elimination of non required attributes from the dataset and identifies the most relevant attributes. The vertical dataset reduction process includes the static and dynamic dataset reduction process. The static dataset reduction is defined in terms of session based reduction. The session can be defined in terms of time line or the proxy server. In dynamic dataset reduction is actually the clustering process that will collect the most relevant pages. The filtration stage process is here shown in figure 3. Filtration Stage (Attribute fileration, Static Session based Filration, Dynamic Clustering) Analysis Stage (Two Level Markov Model and each Level is refined under Vague rules) Prediction Stage (The Association Mining will be implied to identify most associated combination)
  • 6. International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014 54 Figure 3 : Filtration Process This filtration stage actually identifies the most relevant sub-dataset on which the actual prediction process will be performed. This stage will improve the accuracy and efficiency in the prediction of end result. 3.2 Analysis Stage This is actually the process stage defined in the paper. The formation of this stage includes an integrated combination of markov model with vague rule set. In this stage, most relevant cluster will be accepted as the input dataset on which the markov model will be implemented at two level . In level 1, the single page analysis will be performed under the frequency analysis. Figure 4 : Analysis Process This stage will be followed by the vague ruleset to perform the dataset pruning so that the irrelevant and less frequency pages will be eliminated from the list. After the pruning process, the
  • 7. International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014 55 level two analysis will be performed under markov model to perform the associated web page analysis in combination of two pages. This analysis again include the identification of the associatively and the frequency of associated page combination. This level 2 analysis will be followed by the pruning process defined using vague ruleset. The model of this analysis stage is shown in figure 4. 3.3 Prediction Stage This stage is actually the conclusion stage where the identification of the most associated page will be performed. In this stage, the input will be taken as the most effective page pair combination driven from the anlaysis stage. Now, the association mining will be implemented on this anlytical dataset to obtain the effective result. The most associated page combination will be elected as the final cached page. 4. CONCLUSION In this present work, an effective page prediction model is presented using vague improved markov model. This paper has presented the conceptual model of the presented work with detailed exploration of each stage. References [1] Yong Zhen Guo, “ Error Correcting Output Coding-based Conditional Random Fields forWeb Page Prediction”, International Conference on Web Intelligence and Intelligent Agent Technology, IEEE, 2008. [2] B. de la Ossa, “An Empirical Study on Maximum Latency Saving in Web Prefetching”, International Conference on Web Intelligence and Intelligent Agent Technology, IEEE, 2009. [3] Alborz moghaddam, “ Dynamic and memory efficient Web page prediction model using LZ78 and LZW algorithms”, Proceedings of the 14th International CSI Computer Conference (CSICC'09), IEEE ,2009. [4] Andrea Bacic, “ Intelligent Interaction: A Case Study of Web Page Prediction”, Proceedings of the 31st Int. Conf. on Information Technology Interfaces, ITI, 2009. [5] Yanjun Liu, “Strong Cache Consistency on World Wide Web”, 3rd International Conference on Advanced Computer Theory and Engineering (ICACTE), 2010. [6] Bhawna Nigam, “ Analysis of Markov Model On Different Web Prefecting And Caching Schemes”, IEEE, 2010. [7] V.V.R.Maheswara Rao, “An Efficient Hybrid Successive Markov Model for Predicting Web User Usage Behavior using Web Usage Mining”, International Journal of Data Engineering (IJDE) ,2010. [8] R.Khanchana, “ An Efficient Web Page Prediction Based on Access Time-Length and Frequency”, IEEE , 2011. [9] Naveed Ahmad, “ Reducing User Latency in Web Prefetching Using Integrated Techniques”, IEEE, 2011. [10] Yaser Alosefer, “ Predicting Client-side Attacks via Behaviour Analysis using Honeypot Data”, IEEE, 2011. [11] Shreya Dubey," Web Page Prediction using Hybrid Model", International Journal on Computer Science and Engineering (IJCSE) ,2011. [12] Sina Bahram," Prediction of Web Page Accessibility Based on Structural and Textual Features", Co- Located with the 20th International World Wide Web Conference, W4A2011 - Communications paper, March 28-29, 2011.
  • 8. International Journal of Computer Science & Engineering Survey (IJCSES) Vol.5, No.2, April 2014 56 Authors First Author Priya Bajaj, Have completed engineering in Computer Science from Maharishi Dayanand University, Rohtak in 2012 and pursuing M-tech in Computer Science from ITM University (2012-2014). Second Author Supriya Raheja, Assistant Professor, ITM University, is pursuing her PhD in Computer Science from Banasthali University. She had done her engineering from Hindu college of Engineering, Sonepat and masters from Guru Jambeshwar University of Science and Technology, Hisar. Her total Research publications are thirteen in International Conferences and Journals. She is working as a Reviewer/Committee member of various International Journals and Conferences.