SlideShare une entreprise Scribd logo
1  sur  6
Télécharger pour lire hors ligne
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 05 | May -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 1
Model for semantic processing in information retrieval systems
Ph.D Roberto Passailaigue Baquerizo1, MSc. Hubert Viltres Sala2, Ing. Paúl Rodríguez Leyva3,
Ph.D Vivian Estrada Sentí4
1Canciller Universidad Tecnológica (ECOTEC)
Guayaquil, Ecuador
2Departamento de Práctica Profesional,
Universidad de las Ciencias Informáticas,
La Habana, Cuba
3Departamento de Soluciones Informáticas para Internet,
Universidad de las Ciencias Informáticas,
La Habana, Cuba
4Departamento Metodológico de Postgrado
Universidad de las Ciencias Informáticas,
La Habana, Cuba
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - The processing of information with semantic
annotation allows to identify the intention of search of the
users and to adjust the result according to the context of
the information. The present research proposes a model
for the retrieval of information with semantic annotation
that allows to help the user to recover the most relevant
information among all the information available on the
web. In the model, three components (Trace-Indexing,
Processing and Presentation) are developed that allow
identifying the need for user information through the
processing, selection and subsequent publication of the
retrieved information. The crawling and indexing
component allows the identification of available web sites
to extract information and perform semantic annotation
by applying different information processing techniques.
The processing component analyzes the preferences of the
user and processes the query performed to calculate the
similarity of the indexed information. Subsequently the
results are sorted according to the relevance to show in
the Presentation component a quantity of information
that can be assimilated by the users. For the validation of
the proposal, the metrics of precision and completeness
were used to demonstrate the quality and relevance of the
information retrieval with semantic annotation.
Key Words: Semantic Web, information retrieval,
processing, relevance, semantic annotation, similarity
1. INTRODUCTION
The development of society, the emergence of
technologies and tools to improve access to information
and the rapid growth of the Internet in recent years, has
enabled a large volume of web content to be generated.
The information available on the web is dispersed,
poorly structured or invisible to the common user,
making it difficult to access information of high quality
and value to the user. In this context, users when they
access the Internet are overwhelmed by information
overload and do not easily and quickly obtain the
information that best suits their needs, limit their
experience in the use of information retrieval systems.
There are more than a trillion websites on the Internet
and every day there is an exponential increase in the
amount of information available. Generating new
opportunities and different challenges for users when
they try to obtain relevant information. Due to the large
amount of information available on the Internet and the
difficulty of assimilating it, users rely on information
retrieval systems (IRS) to find what they are looking for.
Information retrieval systems using different tools,
methods and techniques retrieve public information
from the web for later analysis, selecting and ordering
the most relevant information for the user's needs.
Among the main sources of information are the
component repositories, databases and search engines
that allow to simplify and group relevant information,
using certain concepts of information organization. The
main objective of an SRI as proposed in [1] is to satisfy
the user's need for information in a natural language
query specified through a set of key words (see figure 1),
which help identify the most relevant to the user.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 05 | May -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 2
Figura-1: Source information search process [2]
Several authors in [3], [4], [5], suggest that the
Information Search and Recovery has as main objective
to provide relevant information to the user to satisfy
their information need. Within the BRI, five main
activities are defined (locating, selecting, interpreting,
synthesizing and communicating information) to guide
the process of obtaining information tailored to the
user's needs. These five activities are covered in the
three main components of a search engine today
(crawler, indexer and processor).
During the information retrieval process, traditional
search engines generally use techniques that determine
relevance by matching keywords in documents and do
not analyze the relationships between the implicit
meaning of keywords and the document. For them it is
necessary to carry out a process of identifying the user's
intention behind the question asked and adjust the result
to the context of the question.
Several authors argue that the semantic retrieval of
information improves the quality and relevance of the
information shown to users, since it uses natural
language processing techniques, uses ontologies to
identify the context and the relevance is established by
the semantic similarity of the query And indexed
documents.
1.1 Semantic retrieval of information
The Semantic Web is changing the way of obtaining
information on the Internet, it is one of the technologies
that have generated the most impact for Internet users
because of the quality of the information they get.
Berners-Lee in [6] defines the Semantic Web as "... an
extension of the current Web, in which information has a
well-defined meaning, facilitating computers to work
better in cooperation with humans" and its main
objective has been Allowing data stored on the Web to
be intelligently processed by the machines, making it
easier for people to search, integrate and analyze
available information.
The semantic web has as principle the processing of
information automatically by the use of artificial
intelligence using a great variety of algorithms. It also
aims to understand the need expressed by the user in a
query performed and provide the search for meaning,
identifying and providing reliable information. To
perform the semantic search semantic search engines
are used that are "information retrieval systems that
understand the user's need and analyze the information
available on the Web through the use of algorithms that
simulate understanding or understanding."
The general functioning of a semantic search engine
in [7] is associated to the following characteristics:
 Performs field searches.
 Has ability to extend query terms using
synonyms or related words.
 Identifies named entities, such as company
names, organizations or individuals that are
used with that meaning in the search process.
 Uses grouping techniques to construct
categorizations of content on which to search
or group key terms. This is the case of tag
clouds that show the key terms of a website
according to its importance.
 Detects relationships between search terms
and words that appear in content based on
knowledge models represented through
ontologies.
 It offers the possibility of using natural
language to express queries and even factual
questions, for which concrete answers are
obtained [7].
The characteristics discussed above demonstrate
the semantic web's possibilities in retrieving information
where a user expresses in natural language his or her
search intention and the searcher analyzes and selects
the information adjusted to that need. In the context of
the Cuban web where technological limitations difficulty
the information retrieval process to solve this problem it
is necessary to employ the retrieval of semantic
information.
1.2 Information retrieval on the Cuban web
In Cuba there are more than 6 thousand websites
hosted under the .cu domain with varied information. In
order to access the information stored on the Cuban web,
users use different information retrieval systems but do
not always obtain relevant information, mainly due to:
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 05 | May -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 3
 Heterogeneity of sources of information.
 Quality of information.
 Visibility of information.
 Accessibility of information.
In addition to the above mentioned elements
another factor that affects the information retrieval is
the use of systems that use algorithms to calculate the
similarity by words, where the semantics of the
information is not analyzed. An analysis of the systems
that determine the similarity by keywords showed the
following deficiencies:
 Difficulty understanding the user's need
expressed in natural language.
 Low accuracy of results because the similarity of
keywords is enhanced.
 Sensitivity of the results against the exact terms
introduced.
 Selection of the information by the relevance of
the positioning of the website.
The above difficulties show little exactitude and
accuracy in the information retrieval process and
decrease the user experience when performing a search
for information. These deficiencies coupled with the
need to provide users with high-quality information
raises the need to develop an information retrieval
system with semantic annotation that allows the
selection of information that is more adjusted to the
needs of users and thereby improve their experience in
the Cuban web.
1.3 Semantic search of information
The semantic web is an extension of the current web,
several authors [2] [6] [7] [8] [9] [10] suggest that
information can be efficiently obtained by integrating,
automating and reusing data using various techniques to
Improve the relevance of the information collected.
Semantic searches provide relevant results by
understanding the need for user information expressed
in natural language.
According to Redondo in [8] the aim of semantic search
is to improve the accuracy of the search by
understanding the user's intention when making a query
and the contextual meaning of the data in the knowledge
source. Semantic search predicts what the user explicitly
expresses (search intent) and adjusts their need (context)
to available information by selecting the most relevant
one for the user.
Information retrieval systems focus their
implementation on understanding search using query
processing, extracting knowledge from data sources,
adjusting user preferences, and calculating relevance.
The model proposed in the research is based on the
retrieval of relevant information for the user using
semantic technology.
2. METHODOLOGY
In order to obtain relevant information for users, a
computational model is implemented that allows the
processing of the information available semantically. In
the model, the three main components (Tracking-
Indexing, Processing and Presentation) are considered;
which will identify the need for user information
through the processing, selection and subsequent
publication of the retrieved information. Figure 2
presents the components that support the process of
searching and retrieving information on the web. Each of
the three components is described below.
Fig - 2: Computational model for the semantic
processing of information (own elaboration)
2.1 Tracking and indexing component
The crawling and indexing component allows the
identification of available web sites, as well as retrieving
and storing information from each web page for further
processing and presentation to users when making a
query.
The crawlers are in charge of exploring the web
identifying the pages that have been created or updated
to continue updating its index of information. After
tracking different metadata (url, content summary, links,
keywords, language) are stored that are used to extract
knowledge using semantic web techniques.
2.1.1 Tracking the web
The crawl process starts with a list of links to
websites provided by previous crawls or sitemap; The
greater the number of links the given to new websites,
changes to current websites and broken links. The
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 05 | May -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 4
crawler analyzes each page, downloads its content and
identifies new links to continue the process on a
recurring basis. It is used to carry out the Nutch tracing
in a distributed way using the policies of selection, re-
visit, courtesy and parallelization that allow a thorough
search. The crawler configuration determines which
sites to crawl, how often and how many pages to scan on
each site (Google, 2017).
2.1.2 Indexing the web
After performing the tracking process, each web
page is analyzed to identify the main elements and then
store the information and create an index of contents
that allows to improve the information retrieval process.
In the indexing process, the information tracked is
standardized, defining the necessary metadata for the
processing of the information.
Subsequently, the knowledge graph is generated by
extracting from each page the content according to the
context through the use of a general ontology and a
specific one according to the category of the web page.
Solr and Apache Jena use different techniques and
algorithms to extract the implicit knowledge of web
pages.
Solr implements the vector space model and uses an
inverted file system to create the index; In addition to
performing the normalization process has multiple
analyzers and can define own analyzers [11]. For
semantic reasoning the information uses Apache Jena
which provides an API for reading, writing, extracting
and processing RDF graphs. It also has an inference
engine to reason about ontologies and to perform
queries with SPARQL specification. In addition, the
algorithm CF-IDF (concept frequency - reverse document
frequency) is used for the creation of the index based on
the annotations made, which according to [12] and [9]
improves the information retrieval process.
2.2 Processing component
It is responsible for processing and analyzing texts
in natural language by associating each sentence of a text
with a semantic representation based on an ontology
with thousands of words, where words are categorized
according to the different meanings they have and where
the relationships between them are defined.
Gruber defines an ontology in [13] as "an explicit
specification of a conceptualization" that allows to add a
sense to the information that needs to be processed. It
consists of 5 components (concepts, relationships,
functions, instances and axioms) that describe the
relationships of words and add a natural meaning to it.
The use of Ontologies makes it possible to improve the
natural language processing of the query performed by
the user and the information collected by the crawlers
on the web.
2.2.1 Query processing
Users when accessing an information retrieval
system formulate the questions in natural language. In
order to understand the intention behind the question
asked, different techniques need to be processed and
applied to identify the user's need for information. The
query processing has as main objective the
disambiguation of the terms entered by the user
generating as output a triplet in RDF format.
2.2.2 User Profile Processing
It allows you to generate and update the user
profile according to your implicit and explicit
preferences using various elements (categories selected
in your profile, search history and user location) to get
better results when a user performs a search.
2.2.3 Calculation of similarity
In order to determine the similarity between the
query performed by the user and the information
indexed in the searcher, the results of the query
processing, the user profile processing, and the
relevance index of the semantic annotation performed
during the storage process of information.
The similarity is determined using Levenshtein's
algorithm for short texts and the cosine function.
2.2.4 Calculation of relevance
After obtaining the semantic similarity we proceed
to calculate the relevance to show the most relevant
information for the user. In this process the algorithm
proposed in [14] [15] is used to determine the relevance
coefficient according to the user profile, the query and
the semantic similarity index.
The relevance coefficient obtained is used to order
the results and show a number of elements that can be
assimilated by the user.
2.3 Presentation component
Employing user experience techniques, the system
interface is designed where the user can perform the
query and obtain the results. The information retrieval
system has a simple search and an advanced search that
comply with the principles of user-centered design. In
the simple search the user enters the question and
shows the most relevant results. Advanced search allows
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 05 | May -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 5
the user a greater level of customization of results using
one of the following filters:
 With any of the words: returns results that
contain one or some of the words in the search
criteria
 With all words: return results that specifically
contain all the words in the criterion
 With the exact phrase: returns results that
specifically contain the exact phrase entered in
the search criteria
 Site: allows you to search for results by defining
the websites or domain
2.4 Validation of the model
In the evaluation of the proposed model, we used
the Precision and Completeness metrics that allow us to
check the quality of the results obtained. For the
validation an experiment was designed on the
information published on the Cuban web. In the
experiment we analyzed the results provided to the
questions formulated by the users using an SRI without
semantic processing and the proposed model.
The precision values obtained were 8.3 and
exhaustiveness of 8.5, corroborating that the retrieval of
information with semantic annotation improves the
retrieval of information. In addition, an expert
consultation was conducted where the concordance
showed a high level of satisfaction with the application
of the proposed model.
The evaluation using the metrics and the expert
consultation demonstrates the quality, relevance and
relevance of the information retrieval with semantic
annotation.
Allowing to adjust the most relevant results to the
needs of the user, increasing their experience in the use
of systems of retrieval of semantic information.
3. CONCLUSIONS
The analysis of the information retrieval process
identified as the main deficiencies the overload of
information, heterogeneity of information sources and
interoperability that greatly hinder the adequate
processing of available information.
The use of a component for the tracing-indexing,
processing and presentation of the information allowed
to retrieve relevant information for the users.
The calculation of the relevance using the semantic
similarity allows to improve the information retrieval
process.
The validation of the model using the metrics of
Precision and Completeness and the consultation of
experts allows to check the quality of the obtained
results.
REFERENCES
DECO, C.; REYES, N. y BENDER, C: Recuperación de
Información en Bases de datos no estructuradas, XIV
Workshop de Investigadores en Ciencias de la
Computación, 2012
VUOTTO, A.; BOGETTI, C. y FERNÁNDEZ, G. Application
of TF-IDF factor in the semantic analysis of a
documentary collection, biblios, 015, vol 60, p. 1-
13.
SALTON, G. y MCGILL, M. Introduction to
ModernInformation Retrieval. McGraw-Hill, Inc.,
1983.
GONZALO, C.; CODINA, L., et al. Recuperación de
información centrada en el usuario y SEO:
Categorización y determinación de las intenciones
de búsqueda en la Web. [Consultado el: 15 de enero
de 2017] Disponible en:
http://journals.sfu.ca/indexcomunicacion/index.ph
p/indexcomunicacion/article/download/197/175
MARTÍNEZ MÉNDEZ, F. J. Recuperación de información:
modelos, sistemas y evaluación. Murcia, KIOSKO
JMC, 2004. 106 p.
BERNERS-LEE,T. et al. “The semantic web,” Scientific
american, vol. 284, no. 5, pp. 28-37, 2001
Martínez-Fernández,J. L. et al. Búsqueda semántica a
través del Procesamiento de Lenguaje Natural, 2010
p. 2-3.
Redondo, S. ¿Qué es la búsqueda semántica y por qué
me debe importar? [Consultado el: 15 de marzo de
2017] Disponible en:
http://www.senormunoz.es/SEO-MARBELLA/que-
es-la-busqueda-semantica-y-por-que-me-debe-
importar
GARCÍA MORENO, C. "Desarrollo de un modelo para la
gestión de la I+D+i soportado por tecnologías de la
Web Semántica" ,2015.
RODRÍGUEZ-GARCÍA, M. A., et al. Creating a
semantically-enhanced cloud services environment
through ontology evolution. Future Generations in
Computer Systems, 32, 2014, p 295–306.
MONTERO PUÑALES, E. M. y PLACENCIA SALGUEIRO, A.
Sistema de recuperación y análisis de información
para
investigadores del Instituto Investigativo ICIMAF. INFO
2016, 2016, p 2-15.
[1] GOOSSEN, Frank, et al. News personalization using
the CF-IDF semantic recommender. En Proceedings
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056
Volume: 04 Issue: 05 | May -2017 www.irjet.net p-ISSN: 2395-0072
© 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 6
of the International Conference on Web Intelligence,
Mining and Semantics. ACM, 2011. p. 10.
[2] GRUBER, .T. R. “A Translation Approach to Portable
Ontology Specifications”. Knowledge Acquisition,
5(2), 1993. pp.199-220.
[15] PASSAILAIGUE Baquerizo, R., et al. Algorithm for
calculating relevance of documents in information
retrieval systems. International Research Journal of
Engineering and Technology (IRJET). Volume: 04
Issue: 3, Marzo. 2017. e-ISSN: 2395-0056.
BIOGRAPHIES
Ing. en Informática. Jefe de
departamento de Soluciones
Informáticas para Internet.
Habana
Ph.D Education, Canciller
University ECOTEC, Ecuador
MSc. En Informática. Universidad
de las Ciencias Informáticas,
La Habana.
Ph.D Computing, Adviser
postgraduate, Habana, Cuba

Contenu connexe

Tendances

Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional DatasetsProjection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional DatasetsIRJET Journal
 
IRJET- A Literature Review and Classification of Semantic Web Approaches for ...
IRJET- A Literature Review and Classification of Semantic Web Approaches for ...IRJET- A Literature Review and Classification of Semantic Web Approaches for ...
IRJET- A Literature Review and Classification of Semantic Web Approaches for ...IRJET Journal
 
IRJET- A Novel Technique for Inferring User Search using Feedback Sessions
IRJET- A Novel Technique for Inferring User Search using Feedback SessionsIRJET- A Novel Technique for Inferring User Search using Feedback Sessions
IRJET- A Novel Technique for Inferring User Search using Feedback SessionsIRJET Journal
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...inventionjournals
 
CONTENT AND USER CLICK BASED PAGE RANKING FOR IMPROVED WEB INFORMATION RETRIEVAL
CONTENT AND USER CLICK BASED PAGE RANKING FOR IMPROVED WEB INFORMATION RETRIEVALCONTENT AND USER CLICK BASED PAGE RANKING FOR IMPROVED WEB INFORMATION RETRIEVAL
CONTENT AND USER CLICK BASED PAGE RANKING FOR IMPROVED WEB INFORMATION RETRIEVALijcsa
 
Comparative Analysis of Collaborative Filtering Technique
Comparative Analysis of Collaborative Filtering TechniqueComparative Analysis of Collaborative Filtering Technique
Comparative Analysis of Collaborative Filtering TechniqueIOSR Journals
 
SEMANTIC INFORMATION EXTRACTION IN UNIVERSITY DOMAIN
SEMANTIC INFORMATION EXTRACTION IN UNIVERSITY DOMAINSEMANTIC INFORMATION EXTRACTION IN UNIVERSITY DOMAIN
SEMANTIC INFORMATION EXTRACTION IN UNIVERSITY DOMAINcscpconf
 
Application of fuzzy logic for user
Application of fuzzy logic for userApplication of fuzzy logic for user
Application of fuzzy logic for userIJCI JOURNAL
 
A detail survey of page re ranking various web features and techniques
A detail survey of page re ranking various web features and techniquesA detail survey of page re ranking various web features and techniques
A detail survey of page re ranking various web features and techniquesijctet
 
Classification of search_engine
Classification of search_engineClassification of search_engine
Classification of search_engineBookStoreLib
 
Designing and configuring context-aware semantic web applications
Designing and configuring context-aware semantic web applicationsDesigning and configuring context-aware semantic web applications
Designing and configuring context-aware semantic web applicationsTELKOMNIKA JOURNAL
 
A Generic Model for Student Data Analytic Web Service (SDAWS)
A Generic Model for Student Data Analytic Web Service (SDAWS)A Generic Model for Student Data Analytic Web Service (SDAWS)
A Generic Model for Student Data Analytic Web Service (SDAWS)Editor IJCATR
 
IRJET-Computational model for the processing of documents and support to the ...
IRJET-Computational model for the processing of documents and support to the ...IRJET-Computational model for the processing of documents and support to the ...
IRJET-Computational model for the processing of documents and support to the ...IRJET Journal
 
Data mining in web search engine optimization
Data mining in web search engine optimizationData mining in web search engine optimization
Data mining in web search engine optimizationBookStoreLib
 
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...IRJET Journal
 
Personalized web search using browsing history and domain knowledge
Personalized web search using browsing history and domain knowledgePersonalized web search using browsing history and domain knowledge
Personalized web search using browsing history and domain knowledgeRishikesh Pathak
 

Tendances (19)

Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional DatasetsProjection Multi Scale Hashing Keyword Search in Multidimensional Datasets
Projection Multi Scale Hashing Keyword Search in Multidimensional Datasets
 
IRJET- A Literature Review and Classification of Semantic Web Approaches for ...
IRJET- A Literature Review and Classification of Semantic Web Approaches for ...IRJET- A Literature Review and Classification of Semantic Web Approaches for ...
IRJET- A Literature Review and Classification of Semantic Web Approaches for ...
 
IRJET- A Novel Technique for Inferring User Search using Feedback Sessions
IRJET- A Novel Technique for Inferring User Search using Feedback SessionsIRJET- A Novel Technique for Inferring User Search using Feedback Sessions
IRJET- A Novel Technique for Inferring User Search using Feedback Sessions
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
 
CONTENT AND USER CLICK BASED PAGE RANKING FOR IMPROVED WEB INFORMATION RETRIEVAL
CONTENT AND USER CLICK BASED PAGE RANKING FOR IMPROVED WEB INFORMATION RETRIEVALCONTENT AND USER CLICK BASED PAGE RANKING FOR IMPROVED WEB INFORMATION RETRIEVAL
CONTENT AND USER CLICK BASED PAGE RANKING FOR IMPROVED WEB INFORMATION RETRIEVAL
 
Abcd
AbcdAbcd
Abcd
 
Comparative Analysis of Collaborative Filtering Technique
Comparative Analysis of Collaborative Filtering TechniqueComparative Analysis of Collaborative Filtering Technique
Comparative Analysis of Collaborative Filtering Technique
 
SEMANTIC INFORMATION EXTRACTION IN UNIVERSITY DOMAIN
SEMANTIC INFORMATION EXTRACTION IN UNIVERSITY DOMAINSEMANTIC INFORMATION EXTRACTION IN UNIVERSITY DOMAIN
SEMANTIC INFORMATION EXTRACTION IN UNIVERSITY DOMAIN
 
Application of fuzzy logic for user
Application of fuzzy logic for userApplication of fuzzy logic for user
Application of fuzzy logic for user
 
A detail survey of page re ranking various web features and techniques
A detail survey of page re ranking various web features and techniquesA detail survey of page re ranking various web features and techniques
A detail survey of page re ranking various web features and techniques
 
Classification of search_engine
Classification of search_engineClassification of search_engine
Classification of search_engine
 
Designing and configuring context-aware semantic web applications
Designing and configuring context-aware semantic web applicationsDesigning and configuring context-aware semantic web applications
Designing and configuring context-aware semantic web applications
 
A Generic Model for Student Data Analytic Web Service (SDAWS)
A Generic Model for Student Data Analytic Web Service (SDAWS)A Generic Model for Student Data Analytic Web Service (SDAWS)
A Generic Model for Student Data Analytic Web Service (SDAWS)
 
IRJET-Computational model for the processing of documents and support to the ...
IRJET-Computational model for the processing of documents and support to the ...IRJET-Computational model for the processing of documents and support to the ...
IRJET-Computational model for the processing of documents and support to the ...
 
H0314450
H0314450H0314450
H0314450
 
Data mining in web search engine optimization
Data mining in web search engine optimizationData mining in web search engine optimization
Data mining in web search engine optimization
 
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
IRJET- Text-based Domain and Image Categorization of Google Search Engine usi...
 
Personalized web search using browsing history and domain knowledge
Personalized web search using browsing history and domain knowledgePersonalized web search using browsing history and domain knowledge
Personalized web search using browsing history and domain knowledge
 

Similaire à IRJET-Model for semantic processing in information retrieval systems

Perception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document ClusteringPerception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document ClusteringIRJET Journal
 
Mining Social Media Data for Understanding Drugs Usage
Mining Social Media Data for Understanding Drugs  UsageMining Social Media Data for Understanding Drugs  Usage
Mining Social Media Data for Understanding Drugs UsageIRJET Journal
 
`A Survey on approaches of Web Mining in Varied Areas
`A Survey on approaches of Web Mining in Varied Areas`A Survey on approaches of Web Mining in Varied Areas
`A Survey on approaches of Web Mining in Varied Areasinventionjournals
 
Semantic Information Retrieval Using Ontology in University Domain
Semantic Information Retrieval Using Ontology in University Domain Semantic Information Retrieval Using Ontology in University Domain
Semantic Information Retrieval Using Ontology in University Domain dannyijwest
 
Search Engine Scrapper
Search Engine ScrapperSearch Engine Scrapper
Search Engine ScrapperIRJET Journal
 
Performance Evaluation of Query Processing Techniques in Information Retrieval
Performance Evaluation of Query Processing Techniques in Information RetrievalPerformance Evaluation of Query Processing Techniques in Information Retrieval
Performance Evaluation of Query Processing Techniques in Information Retrievalidescitation
 
Review on an automatic extraction of educational digital objects and metadata...
Review on an automatic extraction of educational digital objects and metadata...Review on an automatic extraction of educational digital objects and metadata...
Review on an automatic extraction of educational digital objects and metadata...IRJET Journal
 
Recommendation generation by integrating sequential
Recommendation generation by integrating sequentialRecommendation generation by integrating sequential
Recommendation generation by integrating sequentialeSAT Publishing House
 
Recommendation generation by integrating sequential pattern mining and semantics
Recommendation generation by integrating sequential pattern mining and semanticsRecommendation generation by integrating sequential pattern mining and semantics
Recommendation generation by integrating sequential pattern mining and semanticseSAT Journals
 
Building a recommendation system based on the job offers extracted from the w...
Building a recommendation system based on the job offers extracted from the w...Building a recommendation system based on the job offers extracted from the w...
Building a recommendation system based on the job offers extracted from the w...IJECEIAES
 
Product Comparison Website using Web scraping and Machine learning.
Product Comparison Website using Web scraping and Machine learning.Product Comparison Website using Web scraping and Machine learning.
Product Comparison Website using Web scraping and Machine learning.IRJET Journal
 
Semantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based SystemSemantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based Systemijcnes
 
Intelligent Semantic Web Search Engines: A Brief Survey
Intelligent Semantic Web Search Engines: A Brief Survey  Intelligent Semantic Web Search Engines: A Brief Survey
Intelligent Semantic Web Search Engines: A Brief Survey dannyijwest
 
A Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsA Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsIRJET Journal
 
WEB BASED INFORMATION SYSTEMS OF E-COMMERCE USER SATISFACTION USING ZACHMAN ...
WEB BASED INFORMATION SYSTEMS OF  E-COMMERCE USER SATISFACTION USING ZACHMAN ...WEB BASED INFORMATION SYSTEMS OF  E-COMMERCE USER SATISFACTION USING ZACHMAN ...
WEB BASED INFORMATION SYSTEMS OF E-COMMERCE USER SATISFACTION USING ZACHMAN ...AM Publications
 
Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...IRJET Journal
 
Semantic Web concepts used in Web 3.0 applications
Semantic Web concepts used in Web 3.0 applicationsSemantic Web concepts used in Web 3.0 applications
Semantic Web concepts used in Web 3.0 applicationsIRJET Journal
 

Similaire à IRJET-Model for semantic processing in information retrieval systems (20)

Perception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document ClusteringPerception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document Clustering
 
Introduction abstract
Introduction abstractIntroduction abstract
Introduction abstract
 
Mining Social Media Data for Understanding Drugs Usage
Mining Social Media Data for Understanding Drugs  UsageMining Social Media Data for Understanding Drugs  Usage
Mining Social Media Data for Understanding Drugs Usage
 
`A Survey on approaches of Web Mining in Varied Areas
`A Survey on approaches of Web Mining in Varied Areas`A Survey on approaches of Web Mining in Varied Areas
`A Survey on approaches of Web Mining in Varied Areas
 
50120140502013
5012014050201350120140502013
50120140502013
 
50120140502013
5012014050201350120140502013
50120140502013
 
Semantic Information Retrieval Using Ontology in University Domain
Semantic Information Retrieval Using Ontology in University Domain Semantic Information Retrieval Using Ontology in University Domain
Semantic Information Retrieval Using Ontology in University Domain
 
Search Engine Scrapper
Search Engine ScrapperSearch Engine Scrapper
Search Engine Scrapper
 
Performance Evaluation of Query Processing Techniques in Information Retrieval
Performance Evaluation of Query Processing Techniques in Information RetrievalPerformance Evaluation of Query Processing Techniques in Information Retrieval
Performance Evaluation of Query Processing Techniques in Information Retrieval
 
Review on an automatic extraction of educational digital objects and metadata...
Review on an automatic extraction of educational digital objects and metadata...Review on an automatic extraction of educational digital objects and metadata...
Review on an automatic extraction of educational digital objects and metadata...
 
Recommendation generation by integrating sequential
Recommendation generation by integrating sequentialRecommendation generation by integrating sequential
Recommendation generation by integrating sequential
 
Recommendation generation by integrating sequential pattern mining and semantics
Recommendation generation by integrating sequential pattern mining and semanticsRecommendation generation by integrating sequential pattern mining and semantics
Recommendation generation by integrating sequential pattern mining and semantics
 
Building a recommendation system based on the job offers extracted from the w...
Building a recommendation system based on the job offers extracted from the w...Building a recommendation system based on the job offers extracted from the w...
Building a recommendation system based on the job offers extracted from the w...
 
Product Comparison Website using Web scraping and Machine learning.
Product Comparison Website using Web scraping and Machine learning.Product Comparison Website using Web scraping and Machine learning.
Product Comparison Website using Web scraping and Machine learning.
 
Semantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based SystemSemantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based System
 
Intelligent Semantic Web Search Engines: A Brief Survey
Intelligent Semantic Web Search Engines: A Brief Survey  Intelligent Semantic Web Search Engines: A Brief Survey
Intelligent Semantic Web Search Engines: A Brief Survey
 
A Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search ResultsA Survey on Automatically Mining Facets for Queries from their Search Results
A Survey on Automatically Mining Facets for Queries from their Search Results
 
WEB BASED INFORMATION SYSTEMS OF E-COMMERCE USER SATISFACTION USING ZACHMAN ...
WEB BASED INFORMATION SYSTEMS OF  E-COMMERCE USER SATISFACTION USING ZACHMAN ...WEB BASED INFORMATION SYSTEMS OF  E-COMMERCE USER SATISFACTION USING ZACHMAN ...
WEB BASED INFORMATION SYSTEMS OF E-COMMERCE USER SATISFACTION USING ZACHMAN ...
 
Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...Algorithm for calculating relevance of documents in information retrieval sys...
Algorithm for calculating relevance of documents in information retrieval sys...
 
Semantic Web concepts used in Web 3.0 applications
Semantic Web concepts used in Web 3.0 applicationsSemantic Web concepts used in Web 3.0 applications
Semantic Web concepts used in Web 3.0 applications
 

Plus de IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

Plus de IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Dernier

Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingrakeshbaidya232001
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)Suman Mia
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdfankushspencer015
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...ranjana rawat
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 

Dernier (20)

Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
Porous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writingPorous Ceramics seminar and technical writing
Porous Ceramics seminar and technical writing
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)Software Development Life Cycle By  Team Orange (Dept. of Pharmacy)
Software Development Life Cycle By Team Orange (Dept. of Pharmacy)
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
Call Girls in Nagpur Suman Call 7001035870 Meet With Nagpur Escorts
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(PRIYA) Rajgurunagar Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 

IRJET-Model for semantic processing in information retrieval systems

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 05 | May -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 1 Model for semantic processing in information retrieval systems Ph.D Roberto Passailaigue Baquerizo1, MSc. Hubert Viltres Sala2, Ing. Paúl Rodríguez Leyva3, Ph.D Vivian Estrada Sentí4 1Canciller Universidad Tecnológica (ECOTEC) Guayaquil, Ecuador 2Departamento de Práctica Profesional, Universidad de las Ciencias Informáticas, La Habana, Cuba 3Departamento de Soluciones Informáticas para Internet, Universidad de las Ciencias Informáticas, La Habana, Cuba 4Departamento Metodológico de Postgrado Universidad de las Ciencias Informáticas, La Habana, Cuba ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - The processing of information with semantic annotation allows to identify the intention of search of the users and to adjust the result according to the context of the information. The present research proposes a model for the retrieval of information with semantic annotation that allows to help the user to recover the most relevant information among all the information available on the web. In the model, three components (Trace-Indexing, Processing and Presentation) are developed that allow identifying the need for user information through the processing, selection and subsequent publication of the retrieved information. The crawling and indexing component allows the identification of available web sites to extract information and perform semantic annotation by applying different information processing techniques. The processing component analyzes the preferences of the user and processes the query performed to calculate the similarity of the indexed information. Subsequently the results are sorted according to the relevance to show in the Presentation component a quantity of information that can be assimilated by the users. For the validation of the proposal, the metrics of precision and completeness were used to demonstrate the quality and relevance of the information retrieval with semantic annotation. Key Words: Semantic Web, information retrieval, processing, relevance, semantic annotation, similarity 1. INTRODUCTION The development of society, the emergence of technologies and tools to improve access to information and the rapid growth of the Internet in recent years, has enabled a large volume of web content to be generated. The information available on the web is dispersed, poorly structured or invisible to the common user, making it difficult to access information of high quality and value to the user. In this context, users when they access the Internet are overwhelmed by information overload and do not easily and quickly obtain the information that best suits their needs, limit their experience in the use of information retrieval systems. There are more than a trillion websites on the Internet and every day there is an exponential increase in the amount of information available. Generating new opportunities and different challenges for users when they try to obtain relevant information. Due to the large amount of information available on the Internet and the difficulty of assimilating it, users rely on information retrieval systems (IRS) to find what they are looking for. Information retrieval systems using different tools, methods and techniques retrieve public information from the web for later analysis, selecting and ordering the most relevant information for the user's needs. Among the main sources of information are the component repositories, databases and search engines that allow to simplify and group relevant information, using certain concepts of information organization. The main objective of an SRI as proposed in [1] is to satisfy the user's need for information in a natural language query specified through a set of key words (see figure 1), which help identify the most relevant to the user.
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 05 | May -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 2 Figura-1: Source information search process [2] Several authors in [3], [4], [5], suggest that the Information Search and Recovery has as main objective to provide relevant information to the user to satisfy their information need. Within the BRI, five main activities are defined (locating, selecting, interpreting, synthesizing and communicating information) to guide the process of obtaining information tailored to the user's needs. These five activities are covered in the three main components of a search engine today (crawler, indexer and processor). During the information retrieval process, traditional search engines generally use techniques that determine relevance by matching keywords in documents and do not analyze the relationships between the implicit meaning of keywords and the document. For them it is necessary to carry out a process of identifying the user's intention behind the question asked and adjust the result to the context of the question. Several authors argue that the semantic retrieval of information improves the quality and relevance of the information shown to users, since it uses natural language processing techniques, uses ontologies to identify the context and the relevance is established by the semantic similarity of the query And indexed documents. 1.1 Semantic retrieval of information The Semantic Web is changing the way of obtaining information on the Internet, it is one of the technologies that have generated the most impact for Internet users because of the quality of the information they get. Berners-Lee in [6] defines the Semantic Web as "... an extension of the current Web, in which information has a well-defined meaning, facilitating computers to work better in cooperation with humans" and its main objective has been Allowing data stored on the Web to be intelligently processed by the machines, making it easier for people to search, integrate and analyze available information. The semantic web has as principle the processing of information automatically by the use of artificial intelligence using a great variety of algorithms. It also aims to understand the need expressed by the user in a query performed and provide the search for meaning, identifying and providing reliable information. To perform the semantic search semantic search engines are used that are "information retrieval systems that understand the user's need and analyze the information available on the Web through the use of algorithms that simulate understanding or understanding." The general functioning of a semantic search engine in [7] is associated to the following characteristics:  Performs field searches.  Has ability to extend query terms using synonyms or related words.  Identifies named entities, such as company names, organizations or individuals that are used with that meaning in the search process.  Uses grouping techniques to construct categorizations of content on which to search or group key terms. This is the case of tag clouds that show the key terms of a website according to its importance.  Detects relationships between search terms and words that appear in content based on knowledge models represented through ontologies.  It offers the possibility of using natural language to express queries and even factual questions, for which concrete answers are obtained [7]. The characteristics discussed above demonstrate the semantic web's possibilities in retrieving information where a user expresses in natural language his or her search intention and the searcher analyzes and selects the information adjusted to that need. In the context of the Cuban web where technological limitations difficulty the information retrieval process to solve this problem it is necessary to employ the retrieval of semantic information. 1.2 Information retrieval on the Cuban web In Cuba there are more than 6 thousand websites hosted under the .cu domain with varied information. In order to access the information stored on the Cuban web, users use different information retrieval systems but do not always obtain relevant information, mainly due to:
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 05 | May -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 3  Heterogeneity of sources of information.  Quality of information.  Visibility of information.  Accessibility of information. In addition to the above mentioned elements another factor that affects the information retrieval is the use of systems that use algorithms to calculate the similarity by words, where the semantics of the information is not analyzed. An analysis of the systems that determine the similarity by keywords showed the following deficiencies:  Difficulty understanding the user's need expressed in natural language.  Low accuracy of results because the similarity of keywords is enhanced.  Sensitivity of the results against the exact terms introduced.  Selection of the information by the relevance of the positioning of the website. The above difficulties show little exactitude and accuracy in the information retrieval process and decrease the user experience when performing a search for information. These deficiencies coupled with the need to provide users with high-quality information raises the need to develop an information retrieval system with semantic annotation that allows the selection of information that is more adjusted to the needs of users and thereby improve their experience in the Cuban web. 1.3 Semantic search of information The semantic web is an extension of the current web, several authors [2] [6] [7] [8] [9] [10] suggest that information can be efficiently obtained by integrating, automating and reusing data using various techniques to Improve the relevance of the information collected. Semantic searches provide relevant results by understanding the need for user information expressed in natural language. According to Redondo in [8] the aim of semantic search is to improve the accuracy of the search by understanding the user's intention when making a query and the contextual meaning of the data in the knowledge source. Semantic search predicts what the user explicitly expresses (search intent) and adjusts their need (context) to available information by selecting the most relevant one for the user. Information retrieval systems focus their implementation on understanding search using query processing, extracting knowledge from data sources, adjusting user preferences, and calculating relevance. The model proposed in the research is based on the retrieval of relevant information for the user using semantic technology. 2. METHODOLOGY In order to obtain relevant information for users, a computational model is implemented that allows the processing of the information available semantically. In the model, the three main components (Tracking- Indexing, Processing and Presentation) are considered; which will identify the need for user information through the processing, selection and subsequent publication of the retrieved information. Figure 2 presents the components that support the process of searching and retrieving information on the web. Each of the three components is described below. Fig - 2: Computational model for the semantic processing of information (own elaboration) 2.1 Tracking and indexing component The crawling and indexing component allows the identification of available web sites, as well as retrieving and storing information from each web page for further processing and presentation to users when making a query. The crawlers are in charge of exploring the web identifying the pages that have been created or updated to continue updating its index of information. After tracking different metadata (url, content summary, links, keywords, language) are stored that are used to extract knowledge using semantic web techniques. 2.1.1 Tracking the web The crawl process starts with a list of links to websites provided by previous crawls or sitemap; The greater the number of links the given to new websites, changes to current websites and broken links. The
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 05 | May -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 4 crawler analyzes each page, downloads its content and identifies new links to continue the process on a recurring basis. It is used to carry out the Nutch tracing in a distributed way using the policies of selection, re- visit, courtesy and parallelization that allow a thorough search. The crawler configuration determines which sites to crawl, how often and how many pages to scan on each site (Google, 2017). 2.1.2 Indexing the web After performing the tracking process, each web page is analyzed to identify the main elements and then store the information and create an index of contents that allows to improve the information retrieval process. In the indexing process, the information tracked is standardized, defining the necessary metadata for the processing of the information. Subsequently, the knowledge graph is generated by extracting from each page the content according to the context through the use of a general ontology and a specific one according to the category of the web page. Solr and Apache Jena use different techniques and algorithms to extract the implicit knowledge of web pages. Solr implements the vector space model and uses an inverted file system to create the index; In addition to performing the normalization process has multiple analyzers and can define own analyzers [11]. For semantic reasoning the information uses Apache Jena which provides an API for reading, writing, extracting and processing RDF graphs. It also has an inference engine to reason about ontologies and to perform queries with SPARQL specification. In addition, the algorithm CF-IDF (concept frequency - reverse document frequency) is used for the creation of the index based on the annotations made, which according to [12] and [9] improves the information retrieval process. 2.2 Processing component It is responsible for processing and analyzing texts in natural language by associating each sentence of a text with a semantic representation based on an ontology with thousands of words, where words are categorized according to the different meanings they have and where the relationships between them are defined. Gruber defines an ontology in [13] as "an explicit specification of a conceptualization" that allows to add a sense to the information that needs to be processed. It consists of 5 components (concepts, relationships, functions, instances and axioms) that describe the relationships of words and add a natural meaning to it. The use of Ontologies makes it possible to improve the natural language processing of the query performed by the user and the information collected by the crawlers on the web. 2.2.1 Query processing Users when accessing an information retrieval system formulate the questions in natural language. In order to understand the intention behind the question asked, different techniques need to be processed and applied to identify the user's need for information. The query processing has as main objective the disambiguation of the terms entered by the user generating as output a triplet in RDF format. 2.2.2 User Profile Processing It allows you to generate and update the user profile according to your implicit and explicit preferences using various elements (categories selected in your profile, search history and user location) to get better results when a user performs a search. 2.2.3 Calculation of similarity In order to determine the similarity between the query performed by the user and the information indexed in the searcher, the results of the query processing, the user profile processing, and the relevance index of the semantic annotation performed during the storage process of information. The similarity is determined using Levenshtein's algorithm for short texts and the cosine function. 2.2.4 Calculation of relevance After obtaining the semantic similarity we proceed to calculate the relevance to show the most relevant information for the user. In this process the algorithm proposed in [14] [15] is used to determine the relevance coefficient according to the user profile, the query and the semantic similarity index. The relevance coefficient obtained is used to order the results and show a number of elements that can be assimilated by the user. 2.3 Presentation component Employing user experience techniques, the system interface is designed where the user can perform the query and obtain the results. The information retrieval system has a simple search and an advanced search that comply with the principles of user-centered design. In the simple search the user enters the question and shows the most relevant results. Advanced search allows
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 05 | May -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 5 the user a greater level of customization of results using one of the following filters:  With any of the words: returns results that contain one or some of the words in the search criteria  With all words: return results that specifically contain all the words in the criterion  With the exact phrase: returns results that specifically contain the exact phrase entered in the search criteria  Site: allows you to search for results by defining the websites or domain 2.4 Validation of the model In the evaluation of the proposed model, we used the Precision and Completeness metrics that allow us to check the quality of the results obtained. For the validation an experiment was designed on the information published on the Cuban web. In the experiment we analyzed the results provided to the questions formulated by the users using an SRI without semantic processing and the proposed model. The precision values obtained were 8.3 and exhaustiveness of 8.5, corroborating that the retrieval of information with semantic annotation improves the retrieval of information. In addition, an expert consultation was conducted where the concordance showed a high level of satisfaction with the application of the proposed model. The evaluation using the metrics and the expert consultation demonstrates the quality, relevance and relevance of the information retrieval with semantic annotation. Allowing to adjust the most relevant results to the needs of the user, increasing their experience in the use of systems of retrieval of semantic information. 3. CONCLUSIONS The analysis of the information retrieval process identified as the main deficiencies the overload of information, heterogeneity of information sources and interoperability that greatly hinder the adequate processing of available information. The use of a component for the tracing-indexing, processing and presentation of the information allowed to retrieve relevant information for the users. The calculation of the relevance using the semantic similarity allows to improve the information retrieval process. The validation of the model using the metrics of Precision and Completeness and the consultation of experts allows to check the quality of the obtained results. REFERENCES DECO, C.; REYES, N. y BENDER, C: Recuperación de Información en Bases de datos no estructuradas, XIV Workshop de Investigadores en Ciencias de la Computación, 2012 VUOTTO, A.; BOGETTI, C. y FERNÁNDEZ, G. Application of TF-IDF factor in the semantic analysis of a documentary collection, biblios, 015, vol 60, p. 1- 13. SALTON, G. y MCGILL, M. Introduction to ModernInformation Retrieval. McGraw-Hill, Inc., 1983. GONZALO, C.; CODINA, L., et al. Recuperación de información centrada en el usuario y SEO: Categorización y determinación de las intenciones de búsqueda en la Web. [Consultado el: 15 de enero de 2017] Disponible en: http://journals.sfu.ca/indexcomunicacion/index.ph p/indexcomunicacion/article/download/197/175 MARTÍNEZ MÉNDEZ, F. J. Recuperación de información: modelos, sistemas y evaluación. Murcia, KIOSKO JMC, 2004. 106 p. BERNERS-LEE,T. et al. “The semantic web,” Scientific american, vol. 284, no. 5, pp. 28-37, 2001 Martínez-Fernández,J. L. et al. Búsqueda semántica a través del Procesamiento de Lenguaje Natural, 2010 p. 2-3. Redondo, S. ¿Qué es la búsqueda semántica y por qué me debe importar? [Consultado el: 15 de marzo de 2017] Disponible en: http://www.senormunoz.es/SEO-MARBELLA/que- es-la-busqueda-semantica-y-por-que-me-debe- importar GARCÍA MORENO, C. "Desarrollo de un modelo para la gestión de la I+D+i soportado por tecnologías de la Web Semántica" ,2015. RODRÍGUEZ-GARCÍA, M. A., et al. Creating a semantically-enhanced cloud services environment through ontology evolution. Future Generations in Computer Systems, 32, 2014, p 295–306. MONTERO PUÑALES, E. M. y PLACENCIA SALGUEIRO, A. Sistema de recuperación y análisis de información para investigadores del Instituto Investigativo ICIMAF. INFO 2016, 2016, p 2-15. [1] GOOSSEN, Frank, et al. News personalization using the CF-IDF semantic recommender. En Proceedings
  • 6. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395 -0056 Volume: 04 Issue: 05 | May -2017 www.irjet.net p-ISSN: 2395-0072 © 2017, IRJET | Impact Factor value: 5.181 | ISO 9001:2008 Certified Journal | Page 6 of the International Conference on Web Intelligence, Mining and Semantics. ACM, 2011. p. 10. [2] GRUBER, .T. R. “A Translation Approach to Portable Ontology Specifications”. Knowledge Acquisition, 5(2), 1993. pp.199-220. [15] PASSAILAIGUE Baquerizo, R., et al. Algorithm for calculating relevance of documents in information retrieval systems. International Research Journal of Engineering and Technology (IRJET). Volume: 04 Issue: 3, Marzo. 2017. e-ISSN: 2395-0056. BIOGRAPHIES Ing. en Informática. Jefe de departamento de Soluciones Informáticas para Internet. Habana Ph.D Education, Canciller University ECOTEC, Ecuador MSc. En Informática. Universidad de las Ciencias Informáticas, La Habana. Ph.D Computing, Adviser postgraduate, Habana, Cuba