SlideShare une entreprise Scribd logo
1  sur  43
Bethesda, Maryland, April 6, 1999 Amit Sheth Large Scale Distributed Information Systems Lab University of Georgia http://lsdis.cs.uga.edu Semantic Interoperability and Information Brokering in Global Information Systems
Three perspectives to GlobIS Information Integration Perspective distribution autonomy heterogeneity Information Brokering Perspective data meta-data semantic (terminological, contextual) “ Vision” Perspective data connectivity computing information knowledge
Evolving targets and approaches in integrating data and information  (a personal perspective) Infocosm Mermaid DDTS Multibase, MRDSM, ADDS,  IISS, Omnibase, ... Generation I 1980s DL-II projects ADEPT, InfoQuilt Generation III 1997... InfoSleuth, KMed, DL-I projects Infoscopes, HERMES, SIMS,  Garlic,TSIMMIS,Harvest, RUFUS,...  Generation II 1990s VisualHarness InfoHarness a society for ubiquitous exchange of (tradeable) information in all digital forms of representation; information anywhere, anytime, any forms
[object Object],[object Object],[object Object],[object Object],[object Object],Generation I
(heterogeneity in FDBMSs) Generation I C o m m u n i c a t i o n ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],1970s 1980s
Generation I (Federated Database Systems: Schema Architecture) ,[object Object],[object Object],[object Object],Component DBS Local Schema Component Schema Export Schema Export Schema Export Schema Federated Schema External Schema External Schema . . . Component DBS Local Schema Component Schema . . . . . . . . . . . . schema translation schema integration
(characterization of schematic conflicts in multidatabase systems) Sheth & Kashyap, Kim & Seo Generation I Schematic Conflicts Generalization Conflicts Aggregation Conflicts Abstraction Level Incompatibility Data Value Attribute Conflict  Entity Attribute Conflict Data Value Entity Conflict Schematic Discrepancies Naming Conflicts Database Identifier Conflicts Schema Isomorphism Conflicts Missing Data Items Conflicts Entity Definition Incompatibility Naming Conflicts Data Representation Conflicts Data Scaling Conflicts Data Precision Conflicts Default Value Conflicts Attribute Integrity Constraint Conflicts Domain Definition Incompatibility Known Inconsistency Temporal Inconsistency Acceptable Inconsistency Data Value Incompatibility B U T these techniques for dealing with schematic heterogeneity do not directly map to dealing with much larger variety of heterogeneous media
(observations and lessons learnt) ,[object Object],[object Object],[object Object],[object Object],[object Object],Generation I
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Generation II
(limited types of metadata, extractors, mappers, wrappers) Generation II METADATA EXTRACTORS Find Marketing Manager positions in a company that is within 15 miles of San Francisco and whose stock price has been growing at a rate of at least 25% per year over the last three years Junglee, SIGMOD Record, Dec. 1997 Global/Enterprise Web Repositories Digital Maps Nexis UPI AP Documents Digital Audios Data Stores Digital Videos Digital Images . . . . . . . . .
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],(a metadata classification: the informartion pyramid) Generation II Data   (Heterogeneous Types/Media) Content Independent Metadata   (creation-date, location, type-of-sensor...) Content Dependent Metadata   (size, max colors, rows, columns...) Direct Content Based Metadata (inverted lists,  document vectors, WAIS, Glimpse, LSI) Domain Independent (structural) Metadata   (C++ class-subclass relationships, HTML/SGML Document Type Definitions, C program structure...) Domain Specific Metadata area, population (Census), land-cover, relief (GIS),metadata  concept descriptions from ontologies Ontologies Classifications Domain Models User Move in this   direction to tackle information  overload!!
VisualHarness – an example
Query processing and information requests What’s next (after comprehensive use of metadata)? NOW ,[object Object],[object Object],[object Object],NEXT ,[object Object],[object Object]
GIS Data Representation – Example multiple heterogeneous metadata models with different tag names for the same data in the same GIS domain Kansas State FGDC Metadata Model Theme keywords :  digital line graph, hydrography, transportation... Title : Dakota Aquifer Online linkage : http://gisdasc.kgs.ukans.edu/dasc/ Direct Spatial Reference Method:  Vector Horizontal Coordinate System Definition: Universal Transverse Mercator   … … … ... UDK Metadata Model Search terms :  digital line graph,  hydrography, transportation... Topic :  Dakota Aquifer Adress Id: http://gisdasc.kgs.ukans.edu/dasc/ Measuring Techniques:  Vector Co-ordinate System: Universal Transverse Mercator … … … ...
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Generation III
Information Brokering: An Enabler for the Infocosm   INFORMATION/DATA OVERLOAD INFORMATION PROVIDERS Newswires Universities Corporations Research Labs Information System Data Repository Information System INFORMATION CONSUMERS Corporations Universities People Government Programs User  Query User  Query User  Query arbitration between information consumers and providers for resolving  information impedance INFORMATION BROKERING Information System Data Repository Information System Information Request Information Request Information Request dynamic reinterpretation of  information requests  for determination of relevant  information services  and products — dynamic creation and composition of  information products
Information Brokering: Three Dimensions   Objective:   Reduce the problem of knowing structure and semantics of data in the huge number of information sources on a global scale to: understanding and navigating a significantly smaller number of domain ontologies S  E  M  A  N  T  I  C  S S  T  R  U  C  T  U  R  E S  Y  N  T  A  X S  Y  S  T  E  M C  O  N  S  U  M  E  R  S B  R  O  K  E  R  S P  R  O  V  I  D  E  R  S D  A  T  A M  E  T  A  D  A  T  A V O C A B U L A R Y T H R E E  D I M E N S I O N S
What else can Information Brokering do?   W W W a confusing heterogeneity of media, formats (Tower of Babel) information correlation using physical (HREF) links at the extensional data level location dependent browsing of information using physical (HREF) links user has to keep track of information content !! W W W  + Information Brokering Domain Specific Ontologies as  “semantic conceptual views” Information correlation using concept mappings at the intensional concept level Browsing of information using terminological relationships across ontologies Higher level of abstraction, closer to user view of information !!
Concepts, tools and techniques to support semantics context media-independent information correlations semantic proximity inter-ontological relations ontologies (esp. domain-specific) profiles domain-specific metadata
[object Object],[object Object],[object Object],[object Object],[object Object],Tools to support semantics BIG challenge:   identifying relationship or similarity between objects of different media,  developed and managed by different persons and systems
Information Brokering over Heterogeneous  Digital Data: A Metadata-based Approach ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],We shall focus on these! I N F O R M A T I O N  O V E R L O A D  = HETEROGENEITY  +  GLOBALIZATION
Heterogeneity... …  is a Babel Tower!! SEMANTIC INTEROPERABILITY metadata ontologies contexts SEMANTIC HETEROGENEITY
The InfoQuilt Project   ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],http://lsdis.cs.uga.edu/proj/iq/iq.html
InfoQuilt Project: using the  M etadata  REF erence link   http://lsdis.cs.uga.edu/proj/iq/iq.html MREF  Complements HREF, creating a “logical web” through media independent ontology & metadata based correlation It is a description of the information asset we want to retrieve MREF domain ontologies IQ_Asset ontology + extension ontologies attributes relations constraints keywords content attributes (color, scene cuts, …) Semantic Correlation using MREF MREF Concept Model for logical correlation using ontological terms and metadata Framework for representing MREF’s Serialization (one implementation choice) X M L M R E F R D F
Domain Specific Correlation – example   Potential locations for a future shopping mall identified by all   regions   having a   population   greater than 5000, and   area   greater than 50 sq. ft. having an urban land cover  and moderate   relief   <A MREF ATTRIBUTES(population > 5000; area > 50;  region-type  = ‘block’; land-cover = ‘urban’; relief = ‘moderate’)  can be viewed here </A> =>   media-independent    relationships   between domain    specific metadata :   population,    area, land cover, relief =>   correlation  between image    and structured data at a    higher  domain specific level    as   opposed to physical “link-   chasing” in the WWW domain specific metadata: terms chosen from domain specific ontologies Population: Area : Land cover: Relief: Boundaries: Census DB TIGER/Line DB US Geological Survey Regions (SQL) :    Boundaries    Image Features (image processing routines)
Domain Specific Correlation – example
A DL II approach for Information Brokering   CONSTRUCTING ADDITIONAL META-INFORMATION RESOURCES Physical/Simulation World DISCOVERING COLLECTIONS OF HETEROGENEOUS INFORMATION AND META-INFORMATION RESOURCES Images Data Stores Documents Digital Media Domain Specific Ontologies Domain Independent Ontologies Iscape N CONSTRUCTING APPROPRIATE INFORMATION LANDSCAPES Iscape 1
ADEPT Information Landscape Concept Prototype (a scenario for Digital Earth:   learning in the context of the “El Niño” phenomenon)   ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],TRY  ISCAPE CONCEPT DEMO ,[object Object],[object Object],[object Object],[object Object]
Putting MREFs to work User Agent Profile Manager user information MREF request retrieve profile User display results change profile design MREF domain ontologies MREF Builder IQ_Asset ontology + extension ontologies construct  new MREF Broker Agent send MREF send results retrieve MREF retrieve MREF MREF repository MREF repository User profiles
[object Object],[object Object],Context: the lynchpin of semantics Cricket
Constructing c-contexts from ontological terms ,[object Object],[object Object],[object Object],[object Object],[object Object],C-CONTEXT: “ All  documents  stored in the database   have been published  by some  agency ” => C def (DOC) = <(hasOrganization, AgencyConcept)> C-Context = <(C 1  , V 1 ) (C 2  , V 2 ) ... (C k  , V k ) > a collection of  contextual coordinates   C i  s   (roles) and  values   V i  s   (concepts/concept descriptions) AGENCY (RegNo, Name, Affiliation)  DOC (Id, Title, Agency)   ONTOLOGICAL TERMS Agency Concept DATABASE OBJECTS Document Concept hasOrganization
Using c-contexts to reason about  information in database - Reasoning with c-contexts:  glb(C def (DOC), C Q )   - Ontological Inferences: - DocumentConcept - (hasOrganization,  { “USGS” } ) Challenge 1:  use of multiple ontologies Challenge 2:  estimating the loss of information EXAMPLE C def (DOC) <(hasOrganization, AgencyConcept)> C Q <(hasOrganization,   {   “USGS”} )> glb(C def (DOC), C Q ) <(self, DocumentConcept),(hasOrganization,  { “USGS” } )>
Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system  OBSERVER architecture   Eduardo Mena (III’98) Data Repositories Mappings Ontologies COMPONENT NODE Data Repositories Mappings Ontologies COMPONENT NODE Data Repositories Mappings Ontology Server Query Processor User Query Ontologies USER NODE Interontologies Terminological Relationships IRM IRM NODE Ontology Server Ontology Server Query Processor Query Processor
Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system  “ Get title and number of pages of books written by Carl Sagan” Query construction - Example   Eduardo Mena (III’98) User ontology:  WN [name pages] for  (AND book (FILLS creator “Carl Sagan”)) Target ontology:  Stanford-I Integrated ontology WN-Stanford-I [title number-of-pages] for  (AND  book  (FILLS doc-author-name “Carl Sagan”)) Ontologies sites: http://www.cogsci.princeton.edu/~wn/w3wn.html http://www-ksl.stanford.edu/knowledge-sharing/ontologies/html/bibliographic-data/
Re-use of Knowledge: Bibliography Data Ontology Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system  “ Get title and number of pages of books written by Carl Sagan” Query construction - Example   Eduardo Mena (III’98) Biblio-Thing Document Book Edited-Book Technical-Report Periodical-Publication Journal Magazine Newspaper Miscellaneous-Publication Technical-Manual Computer-Program Multimedia-Document Artwork Cartographic-Map Thesis Doctoral-Thesis Master-Thesis Proceedings Conference Agent Person Author Organization Publisher University Stanford-I User ontology:  WN [name pages] for  (AND book (FILLS creator “Carl Sagan”)) Target ontology:  Stanford-I Integrated ontology WN-Stanford-I [title number-of-pages] for  (AND  book  (FILLS doc-author-name “Carl Sagan”)) Ontologies sites: http://www.cogsci.princeton.edu/~wn/w3wn.html http://www-ksl.stanford.edu/knowledge-sharing/ontologies/html/bibliographic-data/
Re-use of Knowledge: A subset of WordNet 1.5 Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system  “ Get title and number of pages of books written by Carl Sagan” Query construction - Example   Eduardo Mena (III’98) User ontology:  WN [name pages] for  (AND book (FILLS creator “Carl Sagan”)) Target ontology:  Stanford-I Integrated ontology WN-Stanford-I [title number-of-pages] for  (AND  book  (FILLS doc-author-name “Carl Sagan”)) Ontologies sites: http://www.cogsci.princeton.edu/~wn/w3wn.html http://www-ksl.stanford.edu/knowledge-sharing/ontologies/html/bibliographic-data/ Print-Media Press Publication Journalism Newspaper Magazine Book Periodical Trade-Book Brochure TextBook Reference-Book SongBook PrayerBook Pictorial Series Journals CookBook Instruction-Book WordBook HandBook Directory Annual Encyclopedia Manual Bible GuideBook Instructions Reference-Manual
WN ontology and user query Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system  “ Get title and number of pages of books written by Carl Sagan” Query construction - Example   Eduardo Mena (III’98) User ontology:  WN [name pages] for  (AND book (FILLS creator “Carl Sagan”)) Target ontology:  Stanford-I Integrated ontology WN-Stanford-I [title number-of-pages] for  (AND  book  (FILLS doc-author-name “Carl Sagan”)) Ontologies sites: http://www.cogsci.princeton.edu/~wn/w3wn.html http://www-ksl.stanford.edu/knowledge-sharing/ontologies/html/bibliographic-data/
Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system  Estimating the loss of information   Eduardo Mena (III’98) ,[object Object],[object Object],[object Object],[object Object],Plans in the example   User Query:  (AND  book   (FILLS doc-author-name “Carl Sagan”)) Plan 1:  (AND   document   (FILLS doc-author-name “Carl Sagan”)) Plan 2:  (AND   periodical-publication  (FILLS doc-author-name “Carl Sagan”)) Plan 3:  (AND   journal   (FILLS doc-author-name “Carl Sagan”)) Plan 4:  (AND   UNION(book, proceedings, thesis, misc-publication, technical-report)   (FILLS doc-author-name “Carl Sagan”))
Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system  Loss of information based on intensional information   Eduardo Mena (III’98) User Query:  (AND  book  (FILLS doc-author-name “Carl Sagan”)) Plan 1: (AND   document  (FILLS doc-author-name “Carl Sagan”)) book:=(AND publication ( AT-LEAST 1 ISBN )) publication:=(AND document ( AT-LEAST 1 place-of-publication )) Loss:   “Instead of books written by Carl Sagan, OBSERVER is providing all the documents written by Carl Sagan (even if they do not have an ISBN and place of publication)”
Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system  Example: loss for the plans   Eduardo Mena (III’98) Plan 1:   (AND  document  (FILLS doc-author-name “Carl Sagan”))  [case 2] 91.57% < (1-Loss) < 91.75% Plan 2: (AND  periodical-publication  (FILLS doc-author-name “Carl Sagan”))  94.03% < (1-Loss) < 100% [case 3] Plan 3: (AND  journal  (FILLS doc-author-name “Carl Sagan”)) [case 3] 98.56% < (1-Loss) < 100% Plan 4: (AND  UNION(book, proceedings, thesis, misc-publication, technical-report)  (FILLS doc-author-name “Carl Sagan”))  [case 1] 0% < (1-Loss) < 7.22%
Summary  Text Structured Databases Data Syntax, System Federated DB Semi-structured Metadata Structural, Schematic Mediator, Federated IS Visual, Scientific/Eng. Knowledge Semantic Knowledge Mgmt., Information Brokering, Cooperative IS
Agenda for research  ,[object Object],[object Object],[object Object],[object Object],[object Object]
Related Reading   ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],http://lsdis.cs.uga.edu [See publications on Metadata, Semantics,Context, InfoHarness/InfoQuilt] [email_address] Acknowledgements: Tarcisio Lima Vipul Kashyap

Contenu connexe

Tendances

Intelligent expert systems for location planning
Intelligent expert systems for location planningIntelligent expert systems for location planning
Intelligent expert systems for location planningNavid Milanizadeh
 
Cutting The Trees Of Knowledge
Cutting The Trees Of KnowledgeCutting The Trees Of Knowledge
Cutting The Trees Of Knowledgewenqiang
 
Relationship Web: Trailblazing, Analytics and Computing for Human Experience
Relationship Web: Trailblazing, Analytics and Computing for Human ExperienceRelationship Web: Trailblazing, Analytics and Computing for Human Experience
Relationship Web: Trailblazing, Analytics and Computing for Human ExperienceAmit Sheth
 
Using Maltego Tungsten to Explore Cyber-Physical Confluence in Geolocation
Using Maltego Tungsten to Explore Cyber-Physical Confluence in GeolocationUsing Maltego Tungsten to Explore Cyber-Physical Confluence in Geolocation
Using Maltego Tungsten to Explore Cyber-Physical Confluence in GeolocationShalin Hai-Jew
 
Riding The Semantic Wave
Riding The Semantic WaveRiding The Semantic Wave
Riding The Semantic WaveKaniska Mandal
 
Creating Effective Data Visualizations for Online Learning
Creating Effective Data Visualizations for Online Learning Creating Effective Data Visualizations for Online Learning
Creating Effective Data Visualizations for Online Learning Shalin Hai-Jew
 
Scraping and Clustering Techniques for the Characterization of Linkedin Profiles
Scraping and Clustering Techniques for the Characterization of Linkedin ProfilesScraping and Clustering Techniques for the Characterization of Linkedin Profiles
Scraping and Clustering Techniques for the Characterization of Linkedin Profilescsandit
 
2015 07-tuto3-mining hin
2015 07-tuto3-mining hin2015 07-tuto3-mining hin
2015 07-tuto3-mining hinjins0618
 
Data Sharing and the Polar Information Commons
Data Sharing and the Polar Information CommonsData Sharing and the Polar Information Commons
Data Sharing and the Polar Information CommonsKaitlin Thaney
 
Role of Ontologies in Semantic Digital Libraries
Role of Ontologies in Semantic Digital LibrariesRole of Ontologies in Semantic Digital Libraries
Role of Ontologies in Semantic Digital LibrariesSebastian Ryszard Kruk
 
Linked Data Workshop Stanford University
Linked Data Workshop Stanford University Linked Data Workshop Stanford University
Linked Data Workshop Stanford University Talis Consulting
 
Information Technology
Information TechnologyInformation Technology
Information TechnologyRhea Ann Andao
 
JeromeDL - the Semantic Digital Library
JeromeDL - the Semantic Digital LibraryJeromeDL - the Semantic Digital Library
JeromeDL - the Semantic Digital LibrarySebastian Ryszard Kruk
 
Toward universal information access on the digital object cloud
Toward universal information access on the digital object cloudToward universal information access on the digital object cloud
Toward universal information access on the digital object cloudNational Institute of Informatics
 
Data Anonymization for Privacy Preservation in Big Data
Data Anonymization for Privacy Preservation in Big DataData Anonymization for Privacy Preservation in Big Data
Data Anonymization for Privacy Preservation in Big Datarahulmonikasharma
 

Tendances (19)

Intelligent expert systems for location planning
Intelligent expert systems for location planningIntelligent expert systems for location planning
Intelligent expert systems for location planning
 
Cutting The Trees Of Knowledge
Cutting The Trees Of KnowledgeCutting The Trees Of Knowledge
Cutting The Trees Of Knowledge
 
Relationship Web: Trailblazing, Analytics and Computing for Human Experience
Relationship Web: Trailblazing, Analytics and Computing for Human ExperienceRelationship Web: Trailblazing, Analytics and Computing for Human Experience
Relationship Web: Trailblazing, Analytics and Computing for Human Experience
 
Web Mining
Web MiningWeb Mining
Web Mining
 
Using Maltego Tungsten to Explore Cyber-Physical Confluence in Geolocation
Using Maltego Tungsten to Explore Cyber-Physical Confluence in GeolocationUsing Maltego Tungsten to Explore Cyber-Physical Confluence in Geolocation
Using Maltego Tungsten to Explore Cyber-Physical Confluence in Geolocation
 
Riding The Semantic Wave
Riding The Semantic WaveRiding The Semantic Wave
Riding The Semantic Wave
 
Creating Effective Data Visualizations for Online Learning
Creating Effective Data Visualizations for Online Learning Creating Effective Data Visualizations for Online Learning
Creating Effective Data Visualizations for Online Learning
 
Scraping and Clustering Techniques for the Characterization of Linkedin Profiles
Scraping and Clustering Techniques for the Characterization of Linkedin ProfilesScraping and Clustering Techniques for the Characterization of Linkedin Profiles
Scraping and Clustering Techniques for the Characterization of Linkedin Profiles
 
Angels_in_our_Midst
Angels_in_our_MidstAngels_in_our_Midst
Angels_in_our_Midst
 
2015 07-tuto3-mining hin
2015 07-tuto3-mining hin2015 07-tuto3-mining hin
2015 07-tuto3-mining hin
 
Semantic web
Semantic webSemantic web
Semantic web
 
Data Sharing and the Polar Information Commons
Data Sharing and the Polar Information CommonsData Sharing and the Polar Information Commons
Data Sharing and the Polar Information Commons
 
Shifting from librarian to data manager
Shifting from librarian to data managerShifting from librarian to data manager
Shifting from librarian to data manager
 
Role of Ontologies in Semantic Digital Libraries
Role of Ontologies in Semantic Digital LibrariesRole of Ontologies in Semantic Digital Libraries
Role of Ontologies in Semantic Digital Libraries
 
Linked Data Workshop Stanford University
Linked Data Workshop Stanford University Linked Data Workshop Stanford University
Linked Data Workshop Stanford University
 
Information Technology
Information TechnologyInformation Technology
Information Technology
 
JeromeDL - the Semantic Digital Library
JeromeDL - the Semantic Digital LibraryJeromeDL - the Semantic Digital Library
JeromeDL - the Semantic Digital Library
 
Toward universal information access on the digital object cloud
Toward universal information access on the digital object cloudToward universal information access on the digital object cloud
Toward universal information access on the digital object cloud
 
Data Anonymization for Privacy Preservation in Big Data
Data Anonymization for Privacy Preservation in Big DataData Anonymization for Privacy Preservation in Big Data
Data Anonymization for Privacy Preservation in Big Data
 

En vedette

Data and education 16 may 2014 haggard london
Data and education 16 may 2014 haggard londonData and education 16 may 2014 haggard london
Data and education 16 may 2014 haggard londonStephen Haggard
 
The Ballad Of The Weimar Jew
The Ballad Of The Weimar JewThe Ballad Of The Weimar Jew
The Ballad Of The Weimar JewSarah Evins
 
Tips For Twitter Usage
Tips For Twitter UsageTips For Twitter Usage
Tips For Twitter UsageInteract
 
eWomenNetwork - Pam Vaccaro presentation 1/8/10
eWomenNetwork - Pam Vaccaro presentation 1/8/10eWomenNetwork - Pam Vaccaro presentation 1/8/10
eWomenNetwork - Pam Vaccaro presentation 1/8/10dgamache
 
Interact Online Tv
Interact Online TvInteract Online Tv
Interact Online TvInteract
 

En vedette (6)

Data and education 16 may 2014 haggard london
Data and education 16 may 2014 haggard londonData and education 16 may 2014 haggard london
Data and education 16 may 2014 haggard london
 
The Ballad Of The Weimar Jew
The Ballad Of The Weimar JewThe Ballad Of The Weimar Jew
The Ballad Of The Weimar Jew
 
ÖW Marketingkampagne 2013 Niederlande
ÖW Marketingkampagne 2013 NiederlandeÖW Marketingkampagne 2013 Niederlande
ÖW Marketingkampagne 2013 Niederlande
 
Tips For Twitter Usage
Tips For Twitter UsageTips For Twitter Usage
Tips For Twitter Usage
 
eWomenNetwork - Pam Vaccaro presentation 1/8/10
eWomenNetwork - Pam Vaccaro presentation 1/8/10eWomenNetwork - Pam Vaccaro presentation 1/8/10
eWomenNetwork - Pam Vaccaro presentation 1/8/10
 
Interact Online Tv
Interact Online TvInteract Online Tv
Interact Online Tv
 

Similaire à Semantic Interoperability and Information Brokering in Global Information Systems

Managing Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS caseManaging Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS caseRinke Hoekstra
 
Data Integration in Multi-sources Information Systems
Data Integration in Multi-sources Information SystemsData Integration in Multi-sources Information Systems
Data Integration in Multi-sources Information Systemsijceronline
 
The Mysteries of Metadata
The Mysteries of MetadataThe Mysteries of Metadata
The Mysteries of MetadataAmit Sheth
 
Coping with Data Variety in the Big Data Era: The Semantic Computing Approach
Coping with Data Variety in the Big Data Era: The Semantic Computing ApproachCoping with Data Variety in the Big Data Era: The Semantic Computing Approach
Coping with Data Variety in the Big Data Era: The Semantic Computing ApproachAndre Freitas
 
A Taxonomy of the Data Resource in the Networked Industry
A Taxonomy of the Data Resource in the Networked IndustryA Taxonomy of the Data Resource in the Networked Industry
A Taxonomy of the Data Resource in the Networked IndustryBoris Otto
 
Applications of Semantic Technology in the Real World Today
Applications of Semantic Technology in the Real World TodayApplications of Semantic Technology in the Real World Today
Applications of Semantic Technology in the Real World TodayAmit Sheth
 
Big-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdfBig-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdfrajsharma159890
 
Final Next Generation Content Management
Final    Next  Generation  Content  ManagementFinal    Next  Generation  Content  Management
Final Next Generation Content ManagementScott Abel
 
Big data visualization state of the art
Big data visualization state of the artBig data visualization state of the art
Big data visualization state of the artsoria musa
 
The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration James Hendler
 
Journalism and the Semantic Web
Journalism and the Semantic WebJournalism and the Semantic Web
Journalism and the Semantic WebKurt Cagle
 
How to clean data less through Linked (Open Data) approach?
How to clean data less through Linked (Open Data) approach?How to clean data less through Linked (Open Data) approach?
How to clean data less through Linked (Open Data) approach?andrea huang
 
Role of metadata in transportation agency data programs
Role of metadata in transportation agency data programsRole of metadata in transportation agency data programs
Role of metadata in transportation agency data programsJoseph Busch
 
9th International Conference on Database and Data Mining (DBDM 2021)
9th International Conference on Database and Data Mining (DBDM 2021)9th International Conference on Database and Data Mining (DBDM 2021)
9th International Conference on Database and Data Mining (DBDM 2021)albert ca
 
The technical case for a semantic web
The technical case for a semantic webThe technical case for a semantic web
The technical case for a semantic webTony Dobaj
 
Big Data Mining - Classification, Techniques and Issues
Big Data Mining - Classification, Techniques and IssuesBig Data Mining - Classification, Techniques and Issues
Big Data Mining - Classification, Techniques and IssuesKaran Deep Singh
 
Knowledge discovery thru data mining
Knowledge discovery thru data miningKnowledge discovery thru data mining
Knowledge discovery thru data miningDevakumar Jain
 
Coreon - Making Sure IoT Devices Understand Each Other!
Coreon - Making Sure IoT Devices Understand Each Other!Coreon - Making Sure IoT Devices Understand Each Other!
Coreon - Making Sure IoT Devices Understand Each Other!Jochen Hummel
 

Similaire à Semantic Interoperability and Information Brokering in Global Information Systems (20)

Managing Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS caseManaging Metadata for Science and Technology Studies: the RISIS case
Managing Metadata for Science and Technology Studies: the RISIS case
 
Data Integration in Multi-sources Information Systems
Data Integration in Multi-sources Information SystemsData Integration in Multi-sources Information Systems
Data Integration in Multi-sources Information Systems
 
The Mysteries of Metadata
The Mysteries of MetadataThe Mysteries of Metadata
The Mysteries of Metadata
 
Coping with Data Variety in the Big Data Era: The Semantic Computing Approach
Coping with Data Variety in the Big Data Era: The Semantic Computing ApproachCoping with Data Variety in the Big Data Era: The Semantic Computing Approach
Coping with Data Variety in the Big Data Era: The Semantic Computing Approach
 
A Taxonomy of the Data Resource in the Networked Industry
A Taxonomy of the Data Resource in the Networked IndustryA Taxonomy of the Data Resource in the Networked Industry
A Taxonomy of the Data Resource in the Networked Industry
 
Applications of Semantic Technology in the Real World Today
Applications of Semantic Technology in the Real World TodayApplications of Semantic Technology in the Real World Today
Applications of Semantic Technology in the Real World Today
 
Big-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdfBig-Data-Analytics.8592259.powerpoint.pdf
Big-Data-Analytics.8592259.powerpoint.pdf
 
Final Next Generation Content Management
Final    Next  Generation  Content  ManagementFinal    Next  Generation  Content  Management
Final Next Generation Content Management
 
Big data visualization state of the art
Big data visualization state of the artBig data visualization state of the art
Big data visualization state of the art
 
Unit 1
Unit 1Unit 1
Unit 1
 
The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration The Rensselaer IDEA: Data Exploration
The Rensselaer IDEA: Data Exploration
 
Journalism and the Semantic Web
Journalism and the Semantic WebJournalism and the Semantic Web
Journalism and the Semantic Web
 
How to clean data less through Linked (Open Data) approach?
How to clean data less through Linked (Open Data) approach?How to clean data less through Linked (Open Data) approach?
How to clean data less through Linked (Open Data) approach?
 
Role of metadata in transportation agency data programs
Role of metadata in transportation agency data programsRole of metadata in transportation agency data programs
Role of metadata in transportation agency data programs
 
9th International Conference on Database and Data Mining (DBDM 2021)
9th International Conference on Database and Data Mining (DBDM 2021)9th International Conference on Database and Data Mining (DBDM 2021)
9th International Conference on Database and Data Mining (DBDM 2021)
 
The technical case for a semantic web
The technical case for a semantic webThe technical case for a semantic web
The technical case for a semantic web
 
Big Data Mining - Classification, Techniques and Issues
Big Data Mining - Classification, Techniques and IssuesBig Data Mining - Classification, Techniques and Issues
Big Data Mining - Classification, Techniques and Issues
 
Knowledge discovery thru data mining
Knowledge discovery thru data miningKnowledge discovery thru data mining
Knowledge discovery thru data mining
 
Coreon - Making Sure IoT Devices Understand Each Other!
Coreon - Making Sure IoT Devices Understand Each Other!Coreon - Making Sure IoT Devices Understand Each Other!
Coreon - Making Sure IoT Devices Understand Each Other!
 
BigData
BigDataBigData
BigData
 

Dernier

Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application ) Sakshi Ghasle
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesFatimaKhan178732
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersChitralekhaTherkar
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docxPoojaSen20
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991RKavithamani
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 

Dernier (20)

Hybridoma Technology ( Production , Purification , and Application )
Hybridoma Technology  ( Production , Purification , and Application  ) Hybridoma Technology  ( Production , Purification , and Application  )
Hybridoma Technology ( Production , Purification , and Application )
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
Separation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and ActinidesSeparation of Lanthanides/ Lanthanides and Actinides
Separation of Lanthanides/ Lanthanides and Actinides
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Micromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of PowdersMicromeritics - Fundamental and Derived Properties of Powders
Micromeritics - Fundamental and Derived Properties of Powders
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
MENTAL STATUS EXAMINATION format.docx
MENTAL     STATUS EXAMINATION format.docxMENTAL     STATUS EXAMINATION format.docx
MENTAL STATUS EXAMINATION format.docx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
Industrial Policy - 1948, 1956, 1973, 1977, 1980, 1991
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 

Semantic Interoperability and Information Brokering in Global Information Systems

  • 1. Bethesda, Maryland, April 6, 1999 Amit Sheth Large Scale Distributed Information Systems Lab University of Georgia http://lsdis.cs.uga.edu Semantic Interoperability and Information Brokering in Global Information Systems
  • 2. Three perspectives to GlobIS Information Integration Perspective distribution autonomy heterogeneity Information Brokering Perspective data meta-data semantic (terminological, contextual) “ Vision” Perspective data connectivity computing information knowledge
  • 3. Evolving targets and approaches in integrating data and information (a personal perspective) Infocosm Mermaid DDTS Multibase, MRDSM, ADDS, IISS, Omnibase, ... Generation I 1980s DL-II projects ADEPT, InfoQuilt Generation III 1997... InfoSleuth, KMed, DL-I projects Infoscopes, HERMES, SIMS, Garlic,TSIMMIS,Harvest, RUFUS,... Generation II 1990s VisualHarness InfoHarness a society for ubiquitous exchange of (tradeable) information in all digital forms of representation; information anywhere, anytime, any forms
  • 4.
  • 5.
  • 6.
  • 7. (characterization of schematic conflicts in multidatabase systems) Sheth & Kashyap, Kim & Seo Generation I Schematic Conflicts Generalization Conflicts Aggregation Conflicts Abstraction Level Incompatibility Data Value Attribute Conflict Entity Attribute Conflict Data Value Entity Conflict Schematic Discrepancies Naming Conflicts Database Identifier Conflicts Schema Isomorphism Conflicts Missing Data Items Conflicts Entity Definition Incompatibility Naming Conflicts Data Representation Conflicts Data Scaling Conflicts Data Precision Conflicts Default Value Conflicts Attribute Integrity Constraint Conflicts Domain Definition Incompatibility Known Inconsistency Temporal Inconsistency Acceptable Inconsistency Data Value Incompatibility B U T these techniques for dealing with schematic heterogeneity do not directly map to dealing with much larger variety of heterogeneous media
  • 8.
  • 9.
  • 10. (limited types of metadata, extractors, mappers, wrappers) Generation II METADATA EXTRACTORS Find Marketing Manager positions in a company that is within 15 miles of San Francisco and whose stock price has been growing at a rate of at least 25% per year over the last three years Junglee, SIGMOD Record, Dec. 1997 Global/Enterprise Web Repositories Digital Maps Nexis UPI AP Documents Digital Audios Data Stores Digital Videos Digital Images . . . . . . . . .
  • 11.
  • 13.
  • 14. GIS Data Representation – Example multiple heterogeneous metadata models with different tag names for the same data in the same GIS domain Kansas State FGDC Metadata Model Theme keywords : digital line graph, hydrography, transportation... Title : Dakota Aquifer Online linkage : http://gisdasc.kgs.ukans.edu/dasc/ Direct Spatial Reference Method: Vector Horizontal Coordinate System Definition: Universal Transverse Mercator … … … ... UDK Metadata Model Search terms : digital line graph, hydrography, transportation... Topic : Dakota Aquifer Adress Id: http://gisdasc.kgs.ukans.edu/dasc/ Measuring Techniques: Vector Co-ordinate System: Universal Transverse Mercator … … … ...
  • 15.
  • 16. Information Brokering: An Enabler for the Infocosm INFORMATION/DATA OVERLOAD INFORMATION PROVIDERS Newswires Universities Corporations Research Labs Information System Data Repository Information System INFORMATION CONSUMERS Corporations Universities People Government Programs User Query User Query User Query arbitration between information consumers and providers for resolving information impedance INFORMATION BROKERING Information System Data Repository Information System Information Request Information Request Information Request dynamic reinterpretation of information requests for determination of relevant information services and products — dynamic creation and composition of information products
  • 17. Information Brokering: Three Dimensions Objective: Reduce the problem of knowing structure and semantics of data in the huge number of information sources on a global scale to: understanding and navigating a significantly smaller number of domain ontologies S E M A N T I C S S T R U C T U R E S Y N T A X S Y S T E M C O N S U M E R S B R O K E R S P R O V I D E R S D A T A M E T A D A T A V O C A B U L A R Y T H R E E D I M E N S I O N S
  • 18. What else can Information Brokering do? W W W a confusing heterogeneity of media, formats (Tower of Babel) information correlation using physical (HREF) links at the extensional data level location dependent browsing of information using physical (HREF) links user has to keep track of information content !! W W W + Information Brokering Domain Specific Ontologies as “semantic conceptual views” Information correlation using concept mappings at the intensional concept level Browsing of information using terminological relationships across ontologies Higher level of abstraction, closer to user view of information !!
  • 19. Concepts, tools and techniques to support semantics context media-independent information correlations semantic proximity inter-ontological relations ontologies (esp. domain-specific) profiles domain-specific metadata
  • 20.
  • 21.
  • 22. Heterogeneity... … is a Babel Tower!! SEMANTIC INTEROPERABILITY metadata ontologies contexts SEMANTIC HETEROGENEITY
  • 23.
  • 24. InfoQuilt Project: using the M etadata REF erence link http://lsdis.cs.uga.edu/proj/iq/iq.html MREF Complements HREF, creating a “logical web” through media independent ontology & metadata based correlation It is a description of the information asset we want to retrieve MREF domain ontologies IQ_Asset ontology + extension ontologies attributes relations constraints keywords content attributes (color, scene cuts, …) Semantic Correlation using MREF MREF Concept Model for logical correlation using ontological terms and metadata Framework for representing MREF’s Serialization (one implementation choice) X M L M R E F R D F
  • 25. Domain Specific Correlation – example Potential locations for a future shopping mall identified by all regions having a population greater than 5000, and area greater than 50 sq. ft. having an urban land cover and moderate relief <A MREF ATTRIBUTES(population > 5000; area > 50; region-type = ‘block’; land-cover = ‘urban’; relief = ‘moderate’) can be viewed here </A> => media-independent relationships between domain specific metadata : population, area, land cover, relief => correlation between image and structured data at a higher domain specific level as opposed to physical “link- chasing” in the WWW domain specific metadata: terms chosen from domain specific ontologies Population: Area : Land cover: Relief: Boundaries: Census DB TIGER/Line DB US Geological Survey Regions (SQL) :  Boundaries  Image Features (image processing routines)
  • 27. A DL II approach for Information Brokering CONSTRUCTING ADDITIONAL META-INFORMATION RESOURCES Physical/Simulation World DISCOVERING COLLECTIONS OF HETEROGENEOUS INFORMATION AND META-INFORMATION RESOURCES Images Data Stores Documents Digital Media Domain Specific Ontologies Domain Independent Ontologies Iscape N CONSTRUCTING APPROPRIATE INFORMATION LANDSCAPES Iscape 1
  • 28.
  • 29. Putting MREFs to work User Agent Profile Manager user information MREF request retrieve profile User display results change profile design MREF domain ontologies MREF Builder IQ_Asset ontology + extension ontologies construct new MREF Broker Agent send MREF send results retrieve MREF retrieve MREF MREF repository MREF repository User profiles
  • 30.
  • 31.
  • 32. Using c-contexts to reason about information in database - Reasoning with c-contexts: glb(C def (DOC), C Q ) - Ontological Inferences: - DocumentConcept - (hasOrganization, { “USGS” } ) Challenge 1: use of multiple ontologies Challenge 2: estimating the loss of information EXAMPLE C def (DOC) <(hasOrganization, AgencyConcept)> C Q <(hasOrganization, { “USGS”} )> glb(C def (DOC), C Q ) <(self, DocumentConcept),(hasOrganization, { “USGS” } )>
  • 33. Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system OBSERVER architecture Eduardo Mena (III’98) Data Repositories Mappings Ontologies COMPONENT NODE Data Repositories Mappings Ontologies COMPONENT NODE Data Repositories Mappings Ontology Server Query Processor User Query Ontologies USER NODE Interontologies Terminological Relationships IRM IRM NODE Ontology Server Ontology Server Query Processor Query Processor
  • 34. Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system “ Get title and number of pages of books written by Carl Sagan” Query construction - Example Eduardo Mena (III’98) User ontology: WN [name pages] for (AND book (FILLS creator “Carl Sagan”)) Target ontology: Stanford-I Integrated ontology WN-Stanford-I [title number-of-pages] for (AND book (FILLS doc-author-name “Carl Sagan”)) Ontologies sites: http://www.cogsci.princeton.edu/~wn/w3wn.html http://www-ksl.stanford.edu/knowledge-sharing/ontologies/html/bibliographic-data/
  • 35. Re-use of Knowledge: Bibliography Data Ontology Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system “ Get title and number of pages of books written by Carl Sagan” Query construction - Example Eduardo Mena (III’98) Biblio-Thing Document Book Edited-Book Technical-Report Periodical-Publication Journal Magazine Newspaper Miscellaneous-Publication Technical-Manual Computer-Program Multimedia-Document Artwork Cartographic-Map Thesis Doctoral-Thesis Master-Thesis Proceedings Conference Agent Person Author Organization Publisher University Stanford-I User ontology: WN [name pages] for (AND book (FILLS creator “Carl Sagan”)) Target ontology: Stanford-I Integrated ontology WN-Stanford-I [title number-of-pages] for (AND book (FILLS doc-author-name “Carl Sagan”)) Ontologies sites: http://www.cogsci.princeton.edu/~wn/w3wn.html http://www-ksl.stanford.edu/knowledge-sharing/ontologies/html/bibliographic-data/
  • 36. Re-use of Knowledge: A subset of WordNet 1.5 Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system “ Get title and number of pages of books written by Carl Sagan” Query construction - Example Eduardo Mena (III’98) User ontology: WN [name pages] for (AND book (FILLS creator “Carl Sagan”)) Target ontology: Stanford-I Integrated ontology WN-Stanford-I [title number-of-pages] for (AND book (FILLS doc-author-name “Carl Sagan”)) Ontologies sites: http://www.cogsci.princeton.edu/~wn/w3wn.html http://www-ksl.stanford.edu/knowledge-sharing/ontologies/html/bibliographic-data/ Print-Media Press Publication Journalism Newspaper Magazine Book Periodical Trade-Book Brochure TextBook Reference-Book SongBook PrayerBook Pictorial Series Journals CookBook Instruction-Book WordBook HandBook Directory Annual Encyclopedia Manual Bible GuideBook Instructions Reference-Manual
  • 37. WN ontology and user query Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system “ Get title and number of pages of books written by Carl Sagan” Query construction - Example Eduardo Mena (III’98) User ontology: WN [name pages] for (AND book (FILLS creator “Carl Sagan”)) Target ontology: Stanford-I Integrated ontology WN-Stanford-I [title number-of-pages] for (AND book (FILLS doc-author-name “Carl Sagan”)) Ontologies sites: http://www.cogsci.princeton.edu/~wn/w3wn.html http://www-ksl.stanford.edu/knowledge-sharing/ontologies/html/bibliographic-data/
  • 38.
  • 39. Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system Loss of information based on intensional information Eduardo Mena (III’98) User Query: (AND book (FILLS doc-author-name “Carl Sagan”)) Plan 1: (AND document (FILLS doc-author-name “Carl Sagan”)) book:=(AND publication ( AT-LEAST 1 ISBN )) publication:=(AND document ( AT-LEAST 1 place-of-publication )) Loss: “Instead of books written by Carl Sagan, OBSERVER is providing all the documents written by Carl Sagan (even if they do not have an ISBN and place of publication)”
  • 40. Estimating information loss for multi-ontology based query processing in the OBSERVER/InfoQuilt system Example: loss for the plans Eduardo Mena (III’98) Plan 1: (AND document (FILLS doc-author-name “Carl Sagan”)) [case 2] 91.57% < (1-Loss) < 91.75% Plan 2: (AND periodical-publication (FILLS doc-author-name “Carl Sagan”)) 94.03% < (1-Loss) < 100% [case 3] Plan 3: (AND journal (FILLS doc-author-name “Carl Sagan”)) [case 3] 98.56% < (1-Loss) < 100% Plan 4: (AND UNION(book, proceedings, thesis, misc-publication, technical-report) (FILLS doc-author-name “Carl Sagan”)) [case 1] 0% < (1-Loss) < 7.22%
  • 41. Summary Text Structured Databases Data Syntax, System Federated DB Semi-structured Metadata Structural, Schematic Mediator, Federated IS Visual, Scientific/Eng. Knowledge Semantic Knowledge Mgmt., Information Brokering, Cooperative IS
  • 42.
  • 43.