SlideShare une entreprise Scribd logo
1  sur  8
Télécharger pour lire hors ligne
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME
107
WEB INFORMATION RETRIEVAL USING AUTOMATIC
MULTI-DOCUMENT SUMMARIZATION
Rahul Shankarrao Khokale
Department of Computer Science & Engineering,
Priyadarshini Indira Gandhi College of Engineering,
NAGPUR (India)
Mohammad Atique
Post Graduate Department of Computer Science,
Sant Gadge Baba Amravati University,
AMRAVATI (India)
ABSTRACT
Today, internet has become the most important source of information. People are highly
accustomed to the use of internet for acquiring information which they need. Many times, it is
revealed that, the information seeker does not get relevant information very easily due to the
presence of non-relevant web pages. This paper addresses the problem of effective information
retrieval from the web. In this paper, the notion of Web Information Retrieval using Automatic
Multi-document Summarization is presented. The proposed work is blend of Web technology and
Natural Language Processing. When user will fire the query, the system tries to fetch web pages
from different web servers, and they are indexed as per the order of relevance. The degree of
relevance is not determined by the how many times the keywords of query is present in the document
but it is determined on the basis of semantic content of the document and the user query
Keywords: Web Information Retrieval, Automatic Multi-Document Summarizations,
Web Technology, Information Retrieval and Natural Language Processing
INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING &
TECHNOLOGY (IJCET)
ISSN 0976 – 6367(Print)
ISSN 0976 – 6375(Online)
Volume 5, Issue 3, March (2014), pp. 107-114
© IAEME: www.iaeme.com/ijcet.asp
Journal Impact Factor (2014): 8.5328 (Calculated by GISI)
www.jifactor.com
IJCET
© I A E M E
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME
108
I. INTRODUCTION
1.1 Information Retrieval
The Internet and the Web offer new opportunities and challenges to information retrieval
researchers. With the information explosion and never ending increase of web pages as well as
digital data, it is very hard to retrieve useful and reliable information from the Web. Materials from
millions of web pages from organizations, institutions and personnel have been made public
electronically accessible to millions of interested users. The Web uses an addressing system called
Uniform Resource Locators (URLs) to represent links to documents on web servers. These URLs
provide location information. Like titles of books in traditional libraries, no one can remember all
URLs on the Web. Web search engines allow us to locate the internet resources through thousands of
Web pages. It is almost impossible to get the right information as there is too much irrelevant and out
dated information. Information retrieval systems provide useful information in libraries to
researchers.
The Web can be viewed as a virtual library. Information retrieval is an important and major
component of the Internet and the Web in the information age and should play an important role in
knowledge discovery. General search engines such as, Google, AltaVista, Excite are considered as
the powerful search engines so far. Most of the current search engines are based on words, not the
concepts. When searching for certain information or knowledge with a search engine, one can only
use a few key words to narrow down the search. The result of the search is tens or maybe hundreds
of relevant and irrelevant links to various Web pages.
In spite of the voluminous studies in the field of intelligent retrieval systems, effective
retrieving of information has been remained an important unsolved problem. Implementations of
different conceptual knowledge in the information retrieval process such as ontology have been
considered as a solution to enhance the quality of results. Furthermore, the conceptual formalism
supported by typical ontology may not be sufficient to represent uncertainty information due to the
lack of clear-cut boundaries between concepts of the domains [1] “Information retrieval is a field
concerned with the structure, analysis, organization, storage, searching, and retrieval of information.”
(Salton, 1968).
1.2 Automatic Text Summarization
Automatic Text Summarization means the process of extraction and representation of most
important content from the source document in the condensed form. This process involves Document
Preprocessing, Feature Extraction, Sentence Ranking and Summary Generation [3]. Preprocessing is
accomplished by Tokenization, Sentence Splitting, POS Tagging etc. Feature Extraction includes
Word Frequency Extraction and Sentence Ordering. Weighing Sentences helps to score sentences
required for Sentence Ranking. Summary Generation is the resulting phase of automatic text
summarization.
1.3 Multi-Document Summarization
Multi-document text summarization deals with retrieval of salient information about a topic
from various sources. The task of multi-document summarization is to identify a set of sentences,
phrases or some generated semantically correct language units carrying some useful information.
Then significant sentences are extracted from this set and re-organized them to get multi-documents’
summary [1]. Let D = {D1, D2,.......,Dn} be a set of documents. Let S = {S1, S2, ...... ,Sn} be a set of
summary of each document respectively.
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME
109
Figure 1: Multi –Document Summarization
II. RELATED WORK
In recent years, the research focus in the domain of natural language processing and
information retrieval has been shifted to the area of automatic document summarization. Automatic
document summarization is of two types : abstractive and extractive.
The research in this field began with Term Frequency based summarization. Following
researchers used term frequency based approach for document summarization. G. Salton, 1989,
Jun'ichi Fukumoto, 2004, You Ouyang, 2009 and Mr.Vikrant Gupta, 2012 [1]. Inderjeet Mani, 1997,
Rada Mihalcea, 2004, Junlin Zhang, 2005, Xiaojun Wan, 2008, Kokil Jaidka, 2010 [1] carried out
research for document summarization using Graph-based approach. Kathleen McKeown, 1995,
Xiaojun Wan, 2007 used Time-Based method for document summarization. Sentence Correlation
method was implemented for document summarization by Shanmugasundaram Hariharan, 2012,
Tiedan Zhu, 2012. Clustering-Based method for document summarization was proposed by Jade
Goldstein, 2000. Vikrant Gupta el at [2], developed an auto-summarization tool using statistical
techniques. The techniques involve finding the frequency of words, scoring the sentences, ranking
the sentences etc. Yogan Jaya Kumar et al. [3] discussed Automatic Multi Document Summarization
Approaches. Y. Surendranadha Reddy el at [4] presented a summarization system that produces a
summary for a given web document based on sentence importance measures such as sentence
ranking. Tiedan Zhu et al [5] proposed an improved approach to sentence ordering for Multi-
document Summarization.
III. PROPOSED WORK
This paper deals with the framework for “Web Information Retrieval based on Multi-Document
Summarization”. The proposed framework is shown in Figure 2. However, in this paper, the
emphasis is given on the multi-level document summarization. The basic purpose of this framework
is to enhance the effectiveness of web information retrieval. As the indexing and ranking of the
retrieved documents is supported by intelligent decision making system which is based on fuzzy
inference rules, the degree of relevance can be increased.
Document
(D1)
Document
(D2)
Document
(Dn)
Summary
(S1)
Summary
(S2)
Summary
(Sn)
Significant and
common
sentences
extraction
Sentences Re-
ordering based
on semantic
contents
Multi-
Document
Summary
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME
110
User
Query
Figure 2: Framework for web information retrieval
3.1 Intelligent Query Processing
When a query is written by the user and submitted to the system, it is required to manipulate
it and represent it in proper form. Intelligent Query Processing (IQP) module helps user to formulate
his query. WordNet is used for identifying synonyms and thesaurus. This intelligent unit tries to
understand the user need and accordingly it classifies the query into any of three types :
Informational, Navigational and Transactional.
3.2 Search Engine
Search Engine uses web crawler to traverse the World Wide Web to fetch matching URLs.
3.3 Multi-Document Summarization
The n number of links/URLs is the input to the Multi-Document Summarization (MDS) unit.
Each of these inputs can be HTML web page or a text/PDF file. MDS finds summary of each
document separately and finally combines all of them to form single summary. It involves significant
sentences identification, sentences reordering etc. The detail algorithm for this is discussed in section
IV.
3.4 Indexing
The process of generating inverted indices of the retrieved documents is Indexing. It is also
an important step in information retrieval.
Intelligent
Query
Processing
Search Engine
URL1
URL2
URL3
URLn
Multi-
document
Summarization
Indexing
Ranking
Documents
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME
111
3.5 Ranking
Ranking means determining the weight or rank of each retrieved web page or web document.
Our page rank strategy is based on the summary generated by MDS unit. The Page Rank Algorithm
will use fuzzy inference system to judge the relevance of the web page according to the user query.
IV. AUTOMATIC DOCUMENT SUMMARIZATION MODEL
To summarize a document or documents, a reader has to understand the document(s) and
integrate information and make connections across sentences to form a coherent discourse
representation. We designed and developed a new generic algorithm for automatic document
summarization based on the analysis of human cognition and intelligence. In order make this model
applicable for summarization we define the concept of ‘event’. ‘Event’ is a cognitive psychological
concept, and can be either a story or a sentence in microstructure. We have treated each sentence as
an event in this paper. The cognitive model is shown below
document
Figure 3: Cognitive Model
As each event is representing sentence in the document, it is a combination of two parts :
Subject and Predicate. e.g. Consider the example of an event :
Event : Tiger is a wild animal
Subject Predicate
This event can be represented in predicate form by using FOPL syntax
wild_animal(tiger)
The representation of events in predicate form is required to create the knowledge about the
document. In addition to that, we have defined inference rules to understand the connection of one
sentence to others. This helps us to decide the significance of sentence within the document and used
for sentence re-ordering.
Event 1 Event 1 Event N
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME
112
The Algorithm for Automatic text/document summarization is discussed below:
Figure 4: Algorithm for Text summarization
V. EXPERIMENTAL RESULTS
We have implemented the multi-document summarization system in JAVA.
5.1 Test Data/Corpus : We have used the standard CACM test data with 3204 test samples.
Following are the documents chosen from CACM test data.
Document : CACM—0276.html
START
Read the
document
Tokenization
Stemmer
POS Tagger
Summary
Cognitive Model
Program Organization and Record Keeping for Dynamic Storage
Allocation
The material presented in this paper is part
of the design plan of the core allocation portion
of the ASCII-MATIC Programming System. Project ASCII-MATIC
is concerned with the application of computer
techniques to the activities of certain headquarters
military intelligence operations of the U. Army.
CACM October, 1961
Holt, A. W.
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME
113
Summary :
5.2 Term Frequency Computation
The term frequency of all the terms in the sentence is calculated individually. The sum of all
these frequencies will give the T(si,),term frequency of sentence si. This can be calculated by:
ܶሺܵ݅ሻ ൌ ෍ ‫ݐ‬ ‫כ‬ ݂ሺ‫݅ݐ‬ሻ
௡
௧௜ୀଵ
Where, ti is the ith
term in the sentence. And t*f(ti) is the term frequency of term ti.
Therefore, T(Si) for document 1
T(Si)=(1+1+1+1+1+1+1+1+2+5+1+1+1+1+1+1+3+1+1+1+1+2+2+1+1+1+1+1+1+1+1+1
+1+1+1+1+1+1+1+1+1)
= 50
5.3 TF-IDF or term-frequency inverse-document-frequency
TF-IDF or term-frequency inverse-document-frequency is computed as the ratio of the
quantity of terms in that document to the frequency of the quantity of documents containing that
terms. For above document the TF_IDF is given below:
ܶ‫ܨܦܫ_ܨ‬ ൌ
41
50
=0.82
V. CONCLUSION
In this paper, we have proposed a framework for web information retrieval using multi-
document summarization. The emphasis was given on the multi-level document summarization. The
basic purpose of this framework is to enhance the effectiveness of web information retrieval. As the
indexing and ranking of the retrieved documents is supported by intelligent decision making system
which is based on fuzzy inference rules, the degree of relevance can be increased. The experiment
are carried out on the CACM test data, and it is found that information retrieval results can be
improved after multi-document summarization process is performed.
Title : Program Organization and Record Keeping for Dynamic
Storage Allocation
Author : Holt, A. W.
Publication & Year : CACM October, 1961
Part of the design plan of the core allocation portion of
the ASCII-MATIC Programming System. Project ASCII-MATIC deals
with the application of computer techniques to the activities
of certain headquarters military intelligence operations of the
U. Army.
International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print),
ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME
114
REFERENCES
[1] Md. Majharul Haque, Suraiya Pervin, and Zerina Begum, Literature Review of Automatic
Multiple Documents Text Summarization, International Journal of Innovation and Applied
Studies, Vol. 3 No. 1 May 2013, pp. 121-129
[2] Vikrant Gupta, Priya Chauhan, Sohan Garg, Anita Borude, Shobha Krishnan, An Statistical
Tool for Multi-Document summarization, International Journal of Scientific and Research
Publications, Volume 2, Issue 5, May 2012
[3] Yogan Jaya Kumar and Naomie Salim, Automatic Multi Document Summarization
Approaches, Journal of Computer Science 8 (1): 133-140,
[4] Y. Surendranadha Reddy and A.P. Siva Kumar, An Efficient Approach for Web document
summarization by Sentence Ranking, International Journal of Advanced Research in
Computer Science and Software Engineering, Volume 2, Issue 7, July 2012
[5] Tiedan Zhu, Xinxin Zhao, An Improved Approach to Sentence Ordering For Multi-document
Summarization, 2012 IACSIT Hong Kong Conferences, IPCSIT vol. 25 (2012) © (2012)
IACSIT Press, Singapore
[6] Nikola Vlahovic, Information Retrieval and Information Extraction in Web 2.0 environment,
International Journal of Computers, Issue 1, Volume 5, 2011
[7] Yi Guo and George Stylios, An Intelligent Algorithm For Automatic Document
Summarization.
[8] Prakasha S, Shashidhar HR and Dr. G T Raju, “A Survey on Various Architectures, Models
and Methodologies for Information Retrieval”, International Journal of Computer
Engineering & Technology (IJCET), Volume 4, Issue 1, 2013, pp. 182 - 194, ISSN Print:
0976 – 6367, ISSN Online: 0976 – 6375.
[9] Mousmi Chaurasia and Dr. Sushil Kumar, “Natural Language Processing Based Information
Retrieval for the Purpose of Author Identification”, International Journal of Information
Technology and Management Information Systems (IJITMIS), Volume 1, Issue 1, 2010,
pp. 45 - 54, ISSN Print: 0976 – 6405, ISSN Online: 0976 – 6413.

Contenu connexe

Tendances

Sentimental classification analysis of polarity multi-view textual data using...
Sentimental classification analysis of polarity multi-view textual data using...Sentimental classification analysis of polarity multi-view textual data using...
Sentimental classification analysis of polarity multi-view textual data using...IJECEIAES
 
QUERY SENSITIVE COMPARATIVE SUMMARIZATION OF SEARCH RESULTS USING CONCEPT BAS...
QUERY SENSITIVE COMPARATIVE SUMMARIZATION OF SEARCH RESULTS USING CONCEPT BAS...QUERY SENSITIVE COMPARATIVE SUMMARIZATION OF SEARCH RESULTS USING CONCEPT BAS...
QUERY SENSITIVE COMPARATIVE SUMMARIZATION OF SEARCH RESULTS USING CONCEPT BAS...cseij
 
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...IRJET Journal
 
Development of pattern knowledge discovery framework using
Development of pattern knowledge discovery framework usingDevelopment of pattern knowledge discovery framework using
Development of pattern knowledge discovery framework usingIAEME Publication
 
The Survey of Data Mining Applications And Feature Scope
The Survey of Data Mining Applications  And Feature Scope The Survey of Data Mining Applications  And Feature Scope
The Survey of Data Mining Applications And Feature Scope IJCSEIT Journal
 
Comparative Study on Graph-based Information Retrieval: the Case of XML Document
Comparative Study on Graph-based Information Retrieval: the Case of XML DocumentComparative Study on Graph-based Information Retrieval: the Case of XML Document
Comparative Study on Graph-based Information Retrieval: the Case of XML DocumentIJAEMSJORNAL
 
710201947
710201947710201947
710201947IJRAT
 
A Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsA Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsIRJET Journal
 
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...IRJET Journal
 
Context Driven Technique for Document Classification
Context Driven Technique for Document ClassificationContext Driven Technique for Document Classification
Context Driven Technique for Document ClassificationIDES Editor
 
Multikeyword Hunt on Progressive Graphs
Multikeyword Hunt on Progressive GraphsMultikeyword Hunt on Progressive Graphs
Multikeyword Hunt on Progressive GraphsIRJET Journal
 
Information retrieval-systems notes
Information retrieval-systems notesInformation retrieval-systems notes
Information retrieval-systems notesBAIRAVI T
 
Generic Algorithm based Data Retrieval Technique in Data Mining
Generic Algorithm based Data Retrieval Technique in Data MiningGeneric Algorithm based Data Retrieval Technique in Data Mining
Generic Algorithm based Data Retrieval Technique in Data MiningAM Publications,India
 
Fuzzy clustering technique
Fuzzy clustering techniqueFuzzy clustering technique
Fuzzy clustering techniqueprjpublications
 
Structured and Unstructured Information Extraction Using Text Mining and Natu...
Structured and Unstructured Information Extraction Using Text Mining and Natu...Structured and Unstructured Information Extraction Using Text Mining and Natu...
Structured and Unstructured Information Extraction Using Text Mining and Natu...rahulmonikasharma
 
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...IRJET Journal
 

Tendances (18)

Sentimental classification analysis of polarity multi-view textual data using...
Sentimental classification analysis of polarity multi-view textual data using...Sentimental classification analysis of polarity multi-view textual data using...
Sentimental classification analysis of polarity multi-view textual data using...
 
Bx044461467
Bx044461467Bx044461467
Bx044461467
 
QUERY SENSITIVE COMPARATIVE SUMMARIZATION OF SEARCH RESULTS USING CONCEPT BAS...
QUERY SENSITIVE COMPARATIVE SUMMARIZATION OF SEARCH RESULTS USING CONCEPT BAS...QUERY SENSITIVE COMPARATIVE SUMMARIZATION OF SEARCH RESULTS USING CONCEPT BAS...
QUERY SENSITIVE COMPARATIVE SUMMARIZATION OF SEARCH RESULTS USING CONCEPT BAS...
 
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
A Robust Keywords Based Document Retrieval by Utilizing Advanced Encryption S...
 
Development of pattern knowledge discovery framework using
Development of pattern knowledge discovery framework usingDevelopment of pattern knowledge discovery framework using
Development of pattern knowledge discovery framework using
 
The Survey of Data Mining Applications And Feature Scope
The Survey of Data Mining Applications  And Feature Scope The Survey of Data Mining Applications  And Feature Scope
The Survey of Data Mining Applications And Feature Scope
 
AN IMPROVED TECHNIQUE FOR DOCUMENT CLUSTERING
AN IMPROVED TECHNIQUE FOR DOCUMENT CLUSTERINGAN IMPROVED TECHNIQUE FOR DOCUMENT CLUSTERING
AN IMPROVED TECHNIQUE FOR DOCUMENT CLUSTERING
 
Comparative Study on Graph-based Information Retrieval: the Case of XML Document
Comparative Study on Graph-based Information Retrieval: the Case of XML DocumentComparative Study on Graph-based Information Retrieval: the Case of XML Document
Comparative Study on Graph-based Information Retrieval: the Case of XML Document
 
710201947
710201947710201947
710201947
 
A Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsA Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search Results
 
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
 
Context Driven Technique for Document Classification
Context Driven Technique for Document ClassificationContext Driven Technique for Document Classification
Context Driven Technique for Document Classification
 
Multikeyword Hunt on Progressive Graphs
Multikeyword Hunt on Progressive GraphsMultikeyword Hunt on Progressive Graphs
Multikeyword Hunt on Progressive Graphs
 
Information retrieval-systems notes
Information retrieval-systems notesInformation retrieval-systems notes
Information retrieval-systems notes
 
Generic Algorithm based Data Retrieval Technique in Data Mining
Generic Algorithm based Data Retrieval Technique in Data MiningGeneric Algorithm based Data Retrieval Technique in Data Mining
Generic Algorithm based Data Retrieval Technique in Data Mining
 
Fuzzy clustering technique
Fuzzy clustering techniqueFuzzy clustering technique
Fuzzy clustering technique
 
Structured and Unstructured Information Extraction Using Text Mining and Natu...
Structured and Unstructured Information Extraction Using Text Mining and Natu...Structured and Unstructured Information Extraction Using Text Mining and Natu...
Structured and Unstructured Information Extraction Using Text Mining and Natu...
 
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
IRJET- Cluster Analysis for Effective Information Retrieval through Cohesive ...
 

En vedette (20)

Dishpointer
DishpointerDishpointer
Dishpointer
 
Handiek txikiei laguntzen
Handiek txikiei laguntzenHandiek txikiei laguntzen
Handiek txikiei laguntzen
 
Tavo
TavoTavo
Tavo
 
Photostory Board
Photostory BoardPhotostory Board
Photostory Board
 
Gimp final
Gimp finalGimp final
Gimp final
 
brents slide
brents slidebrents slide
brents slide
 
Śniadanie Daje Moc
Śniadanie Daje MocŚniadanie Daje Moc
Śniadanie Daje Moc
 
Micronic Mobility
Micronic MobilityMicronic Mobility
Micronic Mobility
 
Ed
EdEd
Ed
 
Marrying Math & Art Project
Marrying Math & Art ProjectMarrying Math & Art Project
Marrying Math & Art Project
 
Q3
Q3Q3
Q3
 
Iphone
IphoneIphone
Iphone
 
Archives: Digital outreach towards the teen user group
Archives: Digital outreach towards the teen user groupArchives: Digital outreach towards the teen user group
Archives: Digital outreach towards the teen user group
 
Gov 2.0: Going Further with the Wikinomics Folks - Michelle Springer
Gov 2.0: Going Further with the Wikinomics Folks - Michelle SpringerGov 2.0: Going Further with the Wikinomics Folks - Michelle Springer
Gov 2.0: Going Further with the Wikinomics Folks - Michelle Springer
 
Dvms Student Expectations 2010 2011-11
Dvms Student Expectations 2010 2011-11Dvms Student Expectations 2010 2011-11
Dvms Student Expectations 2010 2011-11
 
Culture of Slavery in the South
Culture of Slavery in the SouthCulture of Slavery in the South
Culture of Slavery in the South
 
Minh cuong
Minh cuongMinh cuong
Minh cuong
 
492 dunder mifflin
492 dunder mifflin492 dunder mifflin
492 dunder mifflin
 
Análisis del audio del Fiscal Marcelo Sosa
Análisis del audio del Fiscal Marcelo SosaAnálisis del audio del Fiscal Marcelo Sosa
Análisis del audio del Fiscal Marcelo Sosa
 
Hashing
HashingHashing
Hashing
 

Similaire à 50120140503012

Extraction and Retrieval of Web based Content in Web Engineering
Extraction and Retrieval of Web based Content in Web EngineeringExtraction and Retrieval of Web based Content in Web Engineering
Extraction and Retrieval of Web based Content in Web EngineeringIRJET Journal
 
A novel method to search information through multi agent search and retrie
A novel method to search information through multi agent search and retrieA novel method to search information through multi agent search and retrie
A novel method to search information through multi agent search and retrieIAEME Publication
 
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMCANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMIRJET Journal
 
MULTI-DOCUMENT SUMMARIZATION SYSTEM: USING FUZZY LOGIC AND GENETIC ALGORITHM
MULTI-DOCUMENT SUMMARIZATION SYSTEM: USING FUZZY LOGIC AND GENETIC ALGORITHM MULTI-DOCUMENT SUMMARIZATION SYSTEM: USING FUZZY LOGIC AND GENETIC ALGORITHM
MULTI-DOCUMENT SUMMARIZATION SYSTEM: USING FUZZY LOGIC AND GENETIC ALGORITHM IAEME Publication
 
An Improved Annotation Based Summary Generation For Unstructured Data
An Improved Annotation Based Summary Generation For Unstructured DataAn Improved Annotation Based Summary Generation For Unstructured Data
An Improved Annotation Based Summary Generation For Unstructured DataMelinda Watson
 
Perception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document ClusteringPerception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document ClusteringIRJET Journal
 
A survey on various architectures, models and methodologies for information r...
A survey on various architectures, models and methodologies for information r...A survey on various architectures, models and methodologies for information r...
A survey on various architectures, models and methodologies for information r...IAEME Publication
 
UML MODELING AND SYSTEM ARCHITECTURE FOR AGENT BASED INFORMATION RETRIEVAL
UML MODELING AND SYSTEM ARCHITECTURE FOR AGENT BASED INFORMATION RETRIEVALUML MODELING AND SYSTEM ARCHITECTURE FOR AGENT BASED INFORMATION RETRIEVAL
UML MODELING AND SYSTEM ARCHITECTURE FOR AGENT BASED INFORMATION RETRIEVALijcsit
 
Information Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis ApproachInformation Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis ApproachAIRCC Publishing Corporation
 
INFORMATION RETRIEVAL BASED ON CLUSTER ANALYSIS APPROACH
INFORMATION RETRIEVAL BASED ON CLUSTER ANALYSIS APPROACHINFORMATION RETRIEVAL BASED ON CLUSTER ANALYSIS APPROACH
INFORMATION RETRIEVAL BASED ON CLUSTER ANALYSIS APPROACHijcsit
 
An Improved Mining Of Biomedical Data From Web Documents Using Clustering
An Improved Mining Of Biomedical Data From Web Documents Using ClusteringAn Improved Mining Of Biomedical Data From Web Documents Using Clustering
An Improved Mining Of Biomedical Data From Web Documents Using ClusteringKelly Lipiec
 
Ijarcet vol-2-issue-4-1339-1341
Ijarcet vol-2-issue-4-1339-1341Ijarcet vol-2-issue-4-1339-1341
Ijarcet vol-2-issue-4-1339-1341Editor IJARCET
 
Performance Evaluation of Query Processing Techniques in Information Retrieval
Performance Evaluation of Query Processing Techniques in Information RetrievalPerformance Evaluation of Query Processing Techniques in Information Retrieval
Performance Evaluation of Query Processing Techniques in Information Retrievalidescitation
 
An effective pre processing algorithm for information retrieval systems
An effective pre processing algorithm for information retrieval systemsAn effective pre processing algorithm for information retrieval systems
An effective pre processing algorithm for information retrieval systemsijdms
 
IRJET- Multi-Document Summarization using Fuzzy and Hierarchical Approach
IRJET-  	  Multi-Document Summarization using Fuzzy and Hierarchical ApproachIRJET-  	  Multi-Document Summarization using Fuzzy and Hierarchical Approach
IRJET- Multi-Document Summarization using Fuzzy and Hierarchical ApproachIRJET Journal
 
Review on an automatic extraction of educational digital objects and metadata...
Review on an automatic extraction of educational digital objects and metadata...Review on an automatic extraction of educational digital objects and metadata...
Review on an automatic extraction of educational digital objects and metadata...IRJET Journal
 

Similaire à 50120140503012 (20)

Extraction and Retrieval of Web based Content in Web Engineering
Extraction and Retrieval of Web based Content in Web EngineeringExtraction and Retrieval of Web based Content in Web Engineering
Extraction and Retrieval of Web based Content in Web Engineering
 
A novel method to search information through multi agent search and retrie
A novel method to search information through multi agent search and retrieA novel method to search information through multi agent search and retrie
A novel method to search information through multi agent search and retrie
 
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEMCANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
CANDIDATE SET KEY DOCUMENT RETRIEVAL SYSTEM
 
50120140502013
5012014050201350120140502013
50120140502013
 
50120140502013
5012014050201350120140502013
50120140502013
 
MULTI-DOCUMENT SUMMARIZATION SYSTEM: USING FUZZY LOGIC AND GENETIC ALGORITHM
MULTI-DOCUMENT SUMMARIZATION SYSTEM: USING FUZZY LOGIC AND GENETIC ALGORITHM MULTI-DOCUMENT SUMMARIZATION SYSTEM: USING FUZZY LOGIC AND GENETIC ALGORITHM
MULTI-DOCUMENT SUMMARIZATION SYSTEM: USING FUZZY LOGIC AND GENETIC ALGORITHM
 
An Improved Annotation Based Summary Generation For Unstructured Data
An Improved Annotation Based Summary Generation For Unstructured DataAn Improved Annotation Based Summary Generation For Unstructured Data
An Improved Annotation Based Summary Generation For Unstructured Data
 
Perception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document ClusteringPerception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document Clustering
 
Ijetcas14 409
Ijetcas14 409Ijetcas14 409
Ijetcas14 409
 
A survey on various architectures, models and methodologies for information r...
A survey on various architectures, models and methodologies for information r...A survey on various architectures, models and methodologies for information r...
A survey on various architectures, models and methodologies for information r...
 
50320130403007
5032013040300750320130403007
50320130403007
 
UML MODELING AND SYSTEM ARCHITECTURE FOR AGENT BASED INFORMATION RETRIEVAL
UML MODELING AND SYSTEM ARCHITECTURE FOR AGENT BASED INFORMATION RETRIEVALUML MODELING AND SYSTEM ARCHITECTURE FOR AGENT BASED INFORMATION RETRIEVAL
UML MODELING AND SYSTEM ARCHITECTURE FOR AGENT BASED INFORMATION RETRIEVAL
 
Information Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis ApproachInformation Retrieval based on Cluster Analysis Approach
Information Retrieval based on Cluster Analysis Approach
 
INFORMATION RETRIEVAL BASED ON CLUSTER ANALYSIS APPROACH
INFORMATION RETRIEVAL BASED ON CLUSTER ANALYSIS APPROACHINFORMATION RETRIEVAL BASED ON CLUSTER ANALYSIS APPROACH
INFORMATION RETRIEVAL BASED ON CLUSTER ANALYSIS APPROACH
 
An Improved Mining Of Biomedical Data From Web Documents Using Clustering
An Improved Mining Of Biomedical Data From Web Documents Using ClusteringAn Improved Mining Of Biomedical Data From Web Documents Using Clustering
An Improved Mining Of Biomedical Data From Web Documents Using Clustering
 
Ijarcet vol-2-issue-4-1339-1341
Ijarcet vol-2-issue-4-1339-1341Ijarcet vol-2-issue-4-1339-1341
Ijarcet vol-2-issue-4-1339-1341
 
Performance Evaluation of Query Processing Techniques in Information Retrieval
Performance Evaluation of Query Processing Techniques in Information RetrievalPerformance Evaluation of Query Processing Techniques in Information Retrieval
Performance Evaluation of Query Processing Techniques in Information Retrieval
 
An effective pre processing algorithm for information retrieval systems
An effective pre processing algorithm for information retrieval systemsAn effective pre processing algorithm for information retrieval systems
An effective pre processing algorithm for information retrieval systems
 
IRJET- Multi-Document Summarization using Fuzzy and Hierarchical Approach
IRJET-  	  Multi-Document Summarization using Fuzzy and Hierarchical ApproachIRJET-  	  Multi-Document Summarization using Fuzzy and Hierarchical Approach
IRJET- Multi-Document Summarization using Fuzzy and Hierarchical Approach
 
Review on an automatic extraction of educational digital objects and metadata...
Review on an automatic extraction of educational digital objects and metadata...Review on an automatic extraction of educational digital objects and metadata...
Review on an automatic extraction of educational digital objects and metadata...
 

Plus de IAEME Publication

IAEME_Publication_Call_for_Paper_September_2022.pdf
IAEME_Publication_Call_for_Paper_September_2022.pdfIAEME_Publication_Call_for_Paper_September_2022.pdf
IAEME_Publication_Call_for_Paper_September_2022.pdfIAEME Publication
 
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...IAEME Publication
 
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURSA STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURSIAEME Publication
 
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURSBROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURSIAEME Publication
 
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONSDETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONSIAEME Publication
 
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONSANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONSIAEME Publication
 
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINOVOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINOIAEME Publication
 
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...IAEME Publication
 
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMYVISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMYIAEME Publication
 
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...IAEME Publication
 
GANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICEGANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICEIAEME Publication
 
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...IAEME Publication
 
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...IAEME Publication
 
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...IAEME Publication
 
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...IAEME Publication
 
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...IAEME Publication
 
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...IAEME Publication
 
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...IAEME Publication
 
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...IAEME Publication
 
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENTA MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENTIAEME Publication
 

Plus de IAEME Publication (20)

IAEME_Publication_Call_for_Paper_September_2022.pdf
IAEME_Publication_Call_for_Paper_September_2022.pdfIAEME_Publication_Call_for_Paper_September_2022.pdf
IAEME_Publication_Call_for_Paper_September_2022.pdf
 
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
MODELING AND ANALYSIS OF SURFACE ROUGHNESS AND WHITE LATER THICKNESS IN WIRE-...
 
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURSA STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
A STUDY ON THE REASONS FOR TRANSGENDER TO BECOME ENTREPRENEURS
 
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURSBROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
BROAD UNEXPOSED SKILLS OF TRANSGENDER ENTREPRENEURS
 
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONSDETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
DETERMINANTS AFFECTING THE USER'S INTENTION TO USE MOBILE BANKING APPLICATIONS
 
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONSANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
ANALYSE THE USER PREDILECTION ON GPAY AND PHONEPE FOR DIGITAL TRANSACTIONS
 
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINOVOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
VOICE BASED ATM FOR VISUALLY IMPAIRED USING ARDUINO
 
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
IMPACT OF EMOTIONAL INTELLIGENCE ON HUMAN RESOURCE MANAGEMENT PRACTICES AMONG...
 
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMYVISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
VISUALISING AGING PARENTS & THEIR CLOSE CARERS LIFE JOURNEY IN AGING ECONOMY
 
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
A STUDY ON THE IMPACT OF ORGANIZATIONAL CULTURE ON THE EFFECTIVENESS OF PERFO...
 
GANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICEGANDHI ON NON-VIOLENT POLICE
GANDHI ON NON-VIOLENT POLICE
 
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
A STUDY ON TALENT MANAGEMENT AND ITS IMPACT ON EMPLOYEE RETENTION IN SELECTED...
 
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
ATTRITION IN THE IT INDUSTRY DURING COVID-19 PANDEMIC: LINKING EMOTIONAL INTE...
 
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
INFLUENCE OF TALENT MANAGEMENT PRACTICES ON ORGANIZATIONAL PERFORMANCE A STUD...
 
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
A STUDY OF VARIOUS TYPES OF LOANS OF SELECTED PUBLIC AND PRIVATE SECTOR BANKS...
 
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
EXPERIMENTAL STUDY OF MECHANICAL AND TRIBOLOGICAL RELATION OF NYLON/BaSO4 POL...
 
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
ROLE OF SOCIAL ENTREPRENEURSHIP IN RURAL DEVELOPMENT OF INDIA - PROBLEMS AND ...
 
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
OPTIMAL RECONFIGURATION OF POWER DISTRIBUTION RADIAL NETWORK USING HYBRID MET...
 
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
APPLICATION OF FRUGAL APPROACH FOR PRODUCTIVITY IMPROVEMENT - A CASE STUDY OF...
 
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENTA MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
A MULTIPLE – CHANNEL QUEUING MODELS ON FUZZY ENVIRONMENT
 

Dernier

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 

Dernier (20)

Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

50120140503012

  • 1. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME 107 WEB INFORMATION RETRIEVAL USING AUTOMATIC MULTI-DOCUMENT SUMMARIZATION Rahul Shankarrao Khokale Department of Computer Science & Engineering, Priyadarshini Indira Gandhi College of Engineering, NAGPUR (India) Mohammad Atique Post Graduate Department of Computer Science, Sant Gadge Baba Amravati University, AMRAVATI (India) ABSTRACT Today, internet has become the most important source of information. People are highly accustomed to the use of internet for acquiring information which they need. Many times, it is revealed that, the information seeker does not get relevant information very easily due to the presence of non-relevant web pages. This paper addresses the problem of effective information retrieval from the web. In this paper, the notion of Web Information Retrieval using Automatic Multi-document Summarization is presented. The proposed work is blend of Web technology and Natural Language Processing. When user will fire the query, the system tries to fetch web pages from different web servers, and they are indexed as per the order of relevance. The degree of relevance is not determined by the how many times the keywords of query is present in the document but it is determined on the basis of semantic content of the document and the user query Keywords: Web Information Retrieval, Automatic Multi-Document Summarizations, Web Technology, Information Retrieval and Natural Language Processing INTERNATIONAL JOURNAL OF COMPUTER ENGINEERING & TECHNOLOGY (IJCET) ISSN 0976 – 6367(Print) ISSN 0976 – 6375(Online) Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME: www.iaeme.com/ijcet.asp Journal Impact Factor (2014): 8.5328 (Calculated by GISI) www.jifactor.com IJCET © I A E M E
  • 2. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME 108 I. INTRODUCTION 1.1 Information Retrieval The Internet and the Web offer new opportunities and challenges to information retrieval researchers. With the information explosion and never ending increase of web pages as well as digital data, it is very hard to retrieve useful and reliable information from the Web. Materials from millions of web pages from organizations, institutions and personnel have been made public electronically accessible to millions of interested users. The Web uses an addressing system called Uniform Resource Locators (URLs) to represent links to documents on web servers. These URLs provide location information. Like titles of books in traditional libraries, no one can remember all URLs on the Web. Web search engines allow us to locate the internet resources through thousands of Web pages. It is almost impossible to get the right information as there is too much irrelevant and out dated information. Information retrieval systems provide useful information in libraries to researchers. The Web can be viewed as a virtual library. Information retrieval is an important and major component of the Internet and the Web in the information age and should play an important role in knowledge discovery. General search engines such as, Google, AltaVista, Excite are considered as the powerful search engines so far. Most of the current search engines are based on words, not the concepts. When searching for certain information or knowledge with a search engine, one can only use a few key words to narrow down the search. The result of the search is tens or maybe hundreds of relevant and irrelevant links to various Web pages. In spite of the voluminous studies in the field of intelligent retrieval systems, effective retrieving of information has been remained an important unsolved problem. Implementations of different conceptual knowledge in the information retrieval process such as ontology have been considered as a solution to enhance the quality of results. Furthermore, the conceptual formalism supported by typical ontology may not be sufficient to represent uncertainty information due to the lack of clear-cut boundaries between concepts of the domains [1] “Information retrieval is a field concerned with the structure, analysis, organization, storage, searching, and retrieval of information.” (Salton, 1968). 1.2 Automatic Text Summarization Automatic Text Summarization means the process of extraction and representation of most important content from the source document in the condensed form. This process involves Document Preprocessing, Feature Extraction, Sentence Ranking and Summary Generation [3]. Preprocessing is accomplished by Tokenization, Sentence Splitting, POS Tagging etc. Feature Extraction includes Word Frequency Extraction and Sentence Ordering. Weighing Sentences helps to score sentences required for Sentence Ranking. Summary Generation is the resulting phase of automatic text summarization. 1.3 Multi-Document Summarization Multi-document text summarization deals with retrieval of salient information about a topic from various sources. The task of multi-document summarization is to identify a set of sentences, phrases or some generated semantically correct language units carrying some useful information. Then significant sentences are extracted from this set and re-organized them to get multi-documents’ summary [1]. Let D = {D1, D2,.......,Dn} be a set of documents. Let S = {S1, S2, ...... ,Sn} be a set of summary of each document respectively.
  • 3. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME 109 Figure 1: Multi –Document Summarization II. RELATED WORK In recent years, the research focus in the domain of natural language processing and information retrieval has been shifted to the area of automatic document summarization. Automatic document summarization is of two types : abstractive and extractive. The research in this field began with Term Frequency based summarization. Following researchers used term frequency based approach for document summarization. G. Salton, 1989, Jun'ichi Fukumoto, 2004, You Ouyang, 2009 and Mr.Vikrant Gupta, 2012 [1]. Inderjeet Mani, 1997, Rada Mihalcea, 2004, Junlin Zhang, 2005, Xiaojun Wan, 2008, Kokil Jaidka, 2010 [1] carried out research for document summarization using Graph-based approach. Kathleen McKeown, 1995, Xiaojun Wan, 2007 used Time-Based method for document summarization. Sentence Correlation method was implemented for document summarization by Shanmugasundaram Hariharan, 2012, Tiedan Zhu, 2012. Clustering-Based method for document summarization was proposed by Jade Goldstein, 2000. Vikrant Gupta el at [2], developed an auto-summarization tool using statistical techniques. The techniques involve finding the frequency of words, scoring the sentences, ranking the sentences etc. Yogan Jaya Kumar et al. [3] discussed Automatic Multi Document Summarization Approaches. Y. Surendranadha Reddy el at [4] presented a summarization system that produces a summary for a given web document based on sentence importance measures such as sentence ranking. Tiedan Zhu et al [5] proposed an improved approach to sentence ordering for Multi- document Summarization. III. PROPOSED WORK This paper deals with the framework for “Web Information Retrieval based on Multi-Document Summarization”. The proposed framework is shown in Figure 2. However, in this paper, the emphasis is given on the multi-level document summarization. The basic purpose of this framework is to enhance the effectiveness of web information retrieval. As the indexing and ranking of the retrieved documents is supported by intelligent decision making system which is based on fuzzy inference rules, the degree of relevance can be increased. Document (D1) Document (D2) Document (Dn) Summary (S1) Summary (S2) Summary (Sn) Significant and common sentences extraction Sentences Re- ordering based on semantic contents Multi- Document Summary
  • 4. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME 110 User Query Figure 2: Framework for web information retrieval 3.1 Intelligent Query Processing When a query is written by the user and submitted to the system, it is required to manipulate it and represent it in proper form. Intelligent Query Processing (IQP) module helps user to formulate his query. WordNet is used for identifying synonyms and thesaurus. This intelligent unit tries to understand the user need and accordingly it classifies the query into any of three types : Informational, Navigational and Transactional. 3.2 Search Engine Search Engine uses web crawler to traverse the World Wide Web to fetch matching URLs. 3.3 Multi-Document Summarization The n number of links/URLs is the input to the Multi-Document Summarization (MDS) unit. Each of these inputs can be HTML web page or a text/PDF file. MDS finds summary of each document separately and finally combines all of them to form single summary. It involves significant sentences identification, sentences reordering etc. The detail algorithm for this is discussed in section IV. 3.4 Indexing The process of generating inverted indices of the retrieved documents is Indexing. It is also an important step in information retrieval. Intelligent Query Processing Search Engine URL1 URL2 URL3 URLn Multi- document Summarization Indexing Ranking Documents
  • 5. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME 111 3.5 Ranking Ranking means determining the weight or rank of each retrieved web page or web document. Our page rank strategy is based on the summary generated by MDS unit. The Page Rank Algorithm will use fuzzy inference system to judge the relevance of the web page according to the user query. IV. AUTOMATIC DOCUMENT SUMMARIZATION MODEL To summarize a document or documents, a reader has to understand the document(s) and integrate information and make connections across sentences to form a coherent discourse representation. We designed and developed a new generic algorithm for automatic document summarization based on the analysis of human cognition and intelligence. In order make this model applicable for summarization we define the concept of ‘event’. ‘Event’ is a cognitive psychological concept, and can be either a story or a sentence in microstructure. We have treated each sentence as an event in this paper. The cognitive model is shown below document Figure 3: Cognitive Model As each event is representing sentence in the document, it is a combination of two parts : Subject and Predicate. e.g. Consider the example of an event : Event : Tiger is a wild animal Subject Predicate This event can be represented in predicate form by using FOPL syntax wild_animal(tiger) The representation of events in predicate form is required to create the knowledge about the document. In addition to that, we have defined inference rules to understand the connection of one sentence to others. This helps us to decide the significance of sentence within the document and used for sentence re-ordering. Event 1 Event 1 Event N
  • 6. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME 112 The Algorithm for Automatic text/document summarization is discussed below: Figure 4: Algorithm for Text summarization V. EXPERIMENTAL RESULTS We have implemented the multi-document summarization system in JAVA. 5.1 Test Data/Corpus : We have used the standard CACM test data with 3204 test samples. Following are the documents chosen from CACM test data. Document : CACM—0276.html START Read the document Tokenization Stemmer POS Tagger Summary Cognitive Model Program Organization and Record Keeping for Dynamic Storage Allocation The material presented in this paper is part of the design plan of the core allocation portion of the ASCII-MATIC Programming System. Project ASCII-MATIC is concerned with the application of computer techniques to the activities of certain headquarters military intelligence operations of the U. Army. CACM October, 1961 Holt, A. W.
  • 7. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME 113 Summary : 5.2 Term Frequency Computation The term frequency of all the terms in the sentence is calculated individually. The sum of all these frequencies will give the T(si,),term frequency of sentence si. This can be calculated by: ܶሺܵ݅ሻ ൌ ෍ ‫ݐ‬ ‫כ‬ ݂ሺ‫݅ݐ‬ሻ ௡ ௧௜ୀଵ Where, ti is the ith term in the sentence. And t*f(ti) is the term frequency of term ti. Therefore, T(Si) for document 1 T(Si)=(1+1+1+1+1+1+1+1+2+5+1+1+1+1+1+1+3+1+1+1+1+2+2+1+1+1+1+1+1+1+1+1 +1+1+1+1+1+1+1+1+1) = 50 5.3 TF-IDF or term-frequency inverse-document-frequency TF-IDF or term-frequency inverse-document-frequency is computed as the ratio of the quantity of terms in that document to the frequency of the quantity of documents containing that terms. For above document the TF_IDF is given below: ܶ‫ܨܦܫ_ܨ‬ ൌ 41 50 =0.82 V. CONCLUSION In this paper, we have proposed a framework for web information retrieval using multi- document summarization. The emphasis was given on the multi-level document summarization. The basic purpose of this framework is to enhance the effectiveness of web information retrieval. As the indexing and ranking of the retrieved documents is supported by intelligent decision making system which is based on fuzzy inference rules, the degree of relevance can be increased. The experiment are carried out on the CACM test data, and it is found that information retrieval results can be improved after multi-document summarization process is performed. Title : Program Organization and Record Keeping for Dynamic Storage Allocation Author : Holt, A. W. Publication & Year : CACM October, 1961 Part of the design plan of the core allocation portion of the ASCII-MATIC Programming System. Project ASCII-MATIC deals with the application of computer techniques to the activities of certain headquarters military intelligence operations of the U. Army.
  • 8. International Journal of Computer Engineering and Technology (IJCET), ISSN 0976-6367(Print), ISSN 0976 - 6375(Online), Volume 5, Issue 3, March (2014), pp. 107-114 © IAEME 114 REFERENCES [1] Md. Majharul Haque, Suraiya Pervin, and Zerina Begum, Literature Review of Automatic Multiple Documents Text Summarization, International Journal of Innovation and Applied Studies, Vol. 3 No. 1 May 2013, pp. 121-129 [2] Vikrant Gupta, Priya Chauhan, Sohan Garg, Anita Borude, Shobha Krishnan, An Statistical Tool for Multi-Document summarization, International Journal of Scientific and Research Publications, Volume 2, Issue 5, May 2012 [3] Yogan Jaya Kumar and Naomie Salim, Automatic Multi Document Summarization Approaches, Journal of Computer Science 8 (1): 133-140, [4] Y. Surendranadha Reddy and A.P. Siva Kumar, An Efficient Approach for Web document summarization by Sentence Ranking, International Journal of Advanced Research in Computer Science and Software Engineering, Volume 2, Issue 7, July 2012 [5] Tiedan Zhu, Xinxin Zhao, An Improved Approach to Sentence Ordering For Multi-document Summarization, 2012 IACSIT Hong Kong Conferences, IPCSIT vol. 25 (2012) © (2012) IACSIT Press, Singapore [6] Nikola Vlahovic, Information Retrieval and Information Extraction in Web 2.0 environment, International Journal of Computers, Issue 1, Volume 5, 2011 [7] Yi Guo and George Stylios, An Intelligent Algorithm For Automatic Document Summarization. [8] Prakasha S, Shashidhar HR and Dr. G T Raju, “A Survey on Various Architectures, Models and Methodologies for Information Retrieval”, International Journal of Computer Engineering & Technology (IJCET), Volume 4, Issue 1, 2013, pp. 182 - 194, ISSN Print: 0976 – 6367, ISSN Online: 0976 – 6375. [9] Mousmi Chaurasia and Dr. Sushil Kumar, “Natural Language Processing Based Information Retrieval for the Purpose of Author Identification”, International Journal of Information Technology and Management Information Systems (IJITMIS), Volume 1, Issue 1, 2010, pp. 45 - 54, ISSN Print: 0976 – 6405, ISSN Online: 0976 – 6413.