Swoogle is a search engine and crawler for ontologies, documents, terms and data published on the Semantic Web. It crawls and indexes documents written in RDF and OWL and provides search services through a web interface and web services. Its objective is to organize the physically distributed Semantic Web documents in a systematic way so that humans and agents can easily search and query the repository. It allows users to search for existing ontologies matching their needs and domains before creating new ones.
2. What is Swoogle?
• Started as a research project of the Ebiquity research group in
University of Maryland
• Swoogle is a search engine for Semantic Web ontologies, documents,
terms and data published on the Web
• A distributed online repository of Semantic Web documents (SWDs)
• A crawler-based indexing and retrieval system for the Semantic Web
• Crawls and discovers documents written in RDF and OWL
• provides services to human users through a browser interface and to
software agents via RESTful web services
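Since agents reach Swoogle through RESTful web services, a query is just a GET URL. The sketch below shows how an agent might build such a URL; the host, path, and parameter names (`queryType`, `searchString`) are illustrative assumptions, not the documented Swoogle API.

```python
from urllib.parse import urlencode

# Hypothetical sketch of a Swoogle-style REST query URL.
# Endpoint and parameter names are assumptions for illustration.
def build_query_url(base, query_type, search_string):
    """Build the GET URL an agent would fetch for a search."""
    params = urlencode({"queryType": query_type,
                        "searchString": search_string})
    return f"{base}?{params}"

url = build_query_url("http://swoogle.example.org/q",
                      "search_swd_ontology", "person")
```

An agent would fetch this URL and parse the returned result list; the same service backs the human-facing browser interface.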
3. Objective of Swoogle
• More and more SWDs, both ontologies and instance data, are
physically distributed all over the web
• A retrieval system that organizes these documents in a systematic
way
• Both humans and agents can easily conduct searches and queries
against this repository
4. Why use Swoogle?
• Avoid creating new ontologies
• Need for reuse
5. Services
• Search Semantic Web ontologies
• Search Semantic Web instance data
• Search Semantic Web terms, i.e., URIs that have been defined as
classes and properties
• Provide metadata of Semantic Web documents and support browsing
the Semantic Web
• Archive different versions of Semantic Web documents
6. What does Swoogle search?
• Find out whether suitable ontologies matching the user’s needs
already exist in the domain of interest
• User inputs a specific term
• Swoogle replies with existing ontologies that use the entered
term
• Follow the link and see whether the provided ontology satisfies the
need
• Query SWDs with constraints on classes and properties used by them
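The term-to-ontology lookup described on this slide can be pictured as a simple index lookup. The sketch below uses a small in-memory index; its contents are invented examples, not real Swoogle data.

```python
# Illustrative sketch: map term local names to the ontologies that
# define or use them, as a stand-in for Swoogle's term index.
# The entries below are invented for illustration.
TERM_INDEX = {
    "Person": ["http://xmlns.com/foaf/0.1/",
               "http://example.org/hr.owl"],
    "Project": ["http://example.org/hr.owl"],
}

def find_ontologies(term):
    """Return URIs of ontologies that use the given term."""
    return TERM_INDEX.get(term, [])
```

The user would then follow each returned URI and inspect whether that ontology satisfies the need.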
8. Swoogle Architecture
• SWD discovery component
• Metadata creation component
• Data analysis component
• Indexation and retrieval component
• User interface
9. Swoogle Crawler
• The crawler visits the web to collect SWDs, ignoring all other
documents (HTML, PDF, image files)
• For each SWD discovered, Swoogle extracts metadata from the
document and indexes it into an information retrieval system for later
searches and queries
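The metadata-extraction step can be sketched roughly as follows: parse an RDF/XML document and collect the (namespace, localname) pairs of the terms it uses, in line with the note later in this deck that Swoogle indexes namespaces and localnames rather than full triples. The real system uses a proper RDF parser (Jena); this stdlib version is only an approximation for RDF/XML input.

```python
import xml.etree.ElementTree as ET

# Rough sketch of metadata extraction from an RDF/XML document:
# collect (namespace, localname) pairs of the element tags used.
def extract_terms(rdf_xml):
    terms = set()
    for elem in ET.fromstring(rdf_xml).iter():
        if elem.tag.startswith("{"):        # Clark notation: {ns}local
            ns, local = elem.tag[1:].split("}", 1)
            terms.add((ns, local))
    return terms

sample = """<rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
                     xmlns:foaf="http://xmlns.com/foaf/0.1/">
  <foaf:Person rdf:about="http://example.org/alice"/>
</rdf:RDF>"""
```

These pairs are what a later indexing stage would store for term search.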
10. How does Swoogle crawl the semantic web?
• Manual submission
• Google-based meta-crawling
• Bounded HTML crawling
• RDF crawling
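The meta-crawling and bounded HTML crawling strategies above both need to decide which discovered links are worth fetching as candidate SWDs. A minimal filtering sketch, assuming a heuristic based on typical RDF file extensions (the speaker notes mention .rdf and .owl; the fuller list here is an assumption):

```python
from urllib.parse import urlparse

# Heuristic URL filter a Swoogle-style crawler might apply:
# keep links with RDF-bearing extensions, skip html/pdf/images.
# The extension list is an illustrative assumption.
RDF_EXTENSIONS = (".rdf", ".owl", ".n3", ".nt")

def is_candidate_swd(url):
    """Heuristically decide whether a URL likely points at an SWD."""
    return urlparse(url).path.lower().endswith(RDF_EXTENSIONS)
```

Extension matching alone is only a first pass; documents that survive the filter still have to be parsed (e.g., by Jena) to confirm they really are SWDs.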
11. Check URL
• A Semantic Web archive service that helps users
• check whether a URL has been indexed
• track previous versions of the Semantic Web document retrieved from the
URL
13. Submit URL
• Submit a new URL or a web page containing hyperlinks to the user’s
Semantic Web documents
• Swoogle will run regular crawling starting from the provided URL
15. • The index is updated regularly
• Already-indexed or outdated URL submissions are ignored
• Documents that are not accessible will eventually be removed from
the database
Because if everyone comes up with a new ontology, there is no shared understanding about anything and no interoperability between two agents.
The entire Swoogle website is built on these web services as well.
Swoogle uses Google to find SWDs; Google provides an API to add constraints to a search, e.g., files with .rdf or .owl extensions.
Jena confirms that the discovered documents are SWDs.
Software agents and services can use the Swoogle APIs.
There are two crawlers, a focused crawler and SwoogleBot, which keep the information about SWDs up to date.
Swoogle creates metadata for each SWD for the necessary computations and navigation.
It calculates relationships among SWDs and their ranks (OntoRank and TermRank).
Currently, Swoogle indexes only some metadata (namespace and localname) about Semantic Web documents. It neither stores nor searches all the triples in a Semantic Web document the way a triple store does.
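The OntoRank/TermRank note above can be illustrated with a toy PageRank-style iteration over a small SWD reference graph. The graph, damping factor, and iteration count are invented for illustration; the actual OntoRank weighting scheme is not reproduced here.

```python
# Toy sketch in the spirit of OntoRank/TermRank: iterative
# PageRank-style ranking over a graph of SWD references.
def rank(graph, damping=0.85, iters=50):
    """graph: dict mapping each SWD to the SWDs it references."""
    nodes = list(graph)
    r = {n: 1.0 / len(nodes) for n in nodes}
    for _ in range(iters):
        new = {n: (1 - damping) / len(nodes) for n in nodes}
        for n, outs in graph.items():
            if outs:
                share = damping * r[n] / len(outs)
                for m in outs:
                    new[m] += share
            else:
                # dangling node: distribute its mass uniformly
                for m in nodes:
                    new[m] += damping * r[n] / len(nodes)
        r = new
    return r
```

An SWD referenced by many others ends up with a higher rank, which is the intuition behind ordering search results by OntoRank.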