SlideShare une entreprise Scribd logo
1  sur  37
Télécharger pour lire hors ligne
Freebase Schema
        Jamie Taylor




Wednesday, December 8, 2010
Goals



        •   Schema: The Freebase Data Model
        •   Schema as API
        •   Schema patterns



Wednesday, December 8, 2010
Freebase is a collection of facts
                    Sofia Coppola directed Marie Antoinette




                                 {
                                 {
                      {

                        Freebase only contains
                           nodes and Links
Wednesday, December 8, 2010
Freebase is a Graph




Wednesday, December 8, 2010
Freebase is a labeled Graph
                              child                  wrote

                                                         directed
               parent            sibling

                                  directed

                                      starred_in                    directed

                                                   starred_in


Wednesday, December 8, 2010
Schema

                          "All the things you can say
                          about something in Freebase"




                       Schema is the data model for Freebase



Wednesday, December 8, 2010
All nodes are “/type/object”
 /m/02vyw                                     /people/person
         id                            type
                                        type /film/director



        [{
                                    /type/object/name
          "id":"/m/02vyw",            name
          "name":null,
          "type":[{}]
                                  “Francis Coppola”
        }]

Wednesday, December 8, 2010
Types suggest properties to use
 /m/02vyw
         id
                                                      type /film/director
                                  lm
                                /fi
                              or
                              ct
                    ire
                  /d
             lm
          /fi




                              /type/object/id /en/bram_stokers_dracula
Wednesday, December 8, 2010
Queries follow schema

                          [{
                             "id": "/en/francis_ford_coppola",
                             "/film/director/film": [{
                               "id":null,
                                "name":null
                             }]
                          }]




Wednesday, December 8, 2010
Properties link the graph together
 /m/02vyw
         id
                                                      type /film/director
                                  lm
                                /fi
                              or
                              ct
                    ire




                                         ten _by
                  /d




                                   wri t
             lm
          /fi




                              /type/object/id /en/bram_stokers_dracula
Wednesday, December 8, 2010
Queries follow schema

                          [{
                             "id": "/en/francis_ford_coppola",     Name is returned
                                                                   (how to get ID?)
                             "/film/director/film": [{               How to get all the
                               "id": "/en/bram_stokers_dracula",   writters for all of
                                                                   Coppola’s movies?
                                "written_by":null
                             }]
                          }]




Wednesday, December 8, 2010
Core Concepts




Wednesday, December 8, 2010
Core Concepts

                   Instance:
                        •     Topic: "a thing in the world"
                              •   Blade Runner, Ridley Scott, NBC, Last Proof

                   Schema:
                        •     Types - Categorical collections of instances
                        •     Properties - Relationships between instances


Wednesday, December 8, 2010
Core Concepts


   An instance may have multiple Types
        •   "Co-Types" (Types are mix-ins)
        •   Arnold Schwartzeneger
             •    Person, Actor, Politician, Sports Figure




Wednesday, December 8, 2010
Lessons from everyday vocabulary

                                                          Wikipedia Word Frequency

                                20000000


                                18000000


                                16000000


                                14000000


                                12000000
                    Frequency




                                10000000


                                 8000000


                                 6000000


                                 4000000


                                 2000000


                                       0
                                           0         20        40        60          80   100   120
                                                                        Rank
                   Data from Victor S. Grishchenko

Wednesday, December 8, 2010
Schema Principle #1
                                                                      Event Example:
                                                                      -Film Festival
                                                                      -Battle of Waterloo




                              Use Co-Types Liberally:
                                 Use a few large, encompassing Types to
                                 provide general information

                                 Use several smaller, fine grained Types to
                                 provide detailed information




Wednesday, December 8, 2010
Core Concepts                  Relational DB
                                                                vs
                                                                RDF




               Properties are defined on Types
                   •    Properties are the vocabulary for a specific Type
                   •    An instance must be “an instance of a type”
                        before it can use the Type’s properties to describe
                        itself




Wednesday, December 8, 2010
Core Concepts      Expected Type ~ RDFS Range




      •    A Property Value has a specific Type
           •    "Expected Type"

      •    A Property has exactly one Expected Type


                                  Manufactures




Wednesday, December 8, 2010
Core Concepts


                   Expected Types (Property Values):
                        •     Value Types (literals)
                              •   String (two flavors), Integer, Float, DateTime, boolean

                        •     Object Types
                              •   Everything Else




Wednesday, December 8, 2010
/type/object   contrast to common
                                                       topic




                Everything in Freebase has this Type
                Provides basic properties
                          •   Type

                          •   Name

                          •   .......

                All other Properties come from some other Type!



Wednesday, December 8, 2010
/common/topic
                   "Topics"
                        •     Things we have discourse about
                        •     Provides properties:
                              •   Alias

                              •   Article

                              •   Image

                              •   Weblinks

                        •     Assumed to be an "Included Type" for any
                              "standard" type
Wednesday, December 8, 2010
Schema Patterns


                Compound Value
                Mediator
                Phylogeny
                Enumeration



Wednesday, December 8, 2010
Compound Value                                      complex literal




                Two or more properties which can only be
                interpreted with regard to one another
                Population
                     •    Dated Integer ("when did this location have that many people")

                Movie Budget
                     •    Dated money value

                     •    Date, Currency, Amount

                Ticker Symbol
                     •    Exchange, Symbol


Wednesday, December 8, 2010
Compound Value
                                                 currency
                              estimated_budget     amount     31MM

       {
         "id":   "/en/apocalypse_now",
                                                 valid_date   1979
         "type": "/film/film",
         "estimated_budget": [{
           "currency":   null,
           "amount":     null,
           "valid_date": null
         }]
       }
Wednesday, December 8, 2010
Mediator               combine date annotation and
                                                        character = tv character



           An annotation on the link between two Topics
              • Requires an object between the two Topics
              • The Topics become separated by two properties


                actor             performance                       film


                                    character

           • Also useful for indicating the dates when a
                relationship existed (e.g., education, employment, etc.)
Wednesday, December 8, 2010
Mediator




  {
    "id":   "/en/marie_antoinette_2006",
    "type": "/film/film",
    "starring": [{
      "actor":null,
      "character":null
    }]
  }
Wednesday, December 8, 2010
Phylogeny
                  Used when instances form a hierarchy
                  Phylogeny properties have an expected
                  Type which is the same as the Type on
                  which the property is defined.

                              Examples:
                              •   /location/location/containedby

                              •   /computer/computer/parent_model

                              •   /tv/tv_program/spin_offs


Wednesday, December 8, 2010
Phylogeny




 {
   "id": "/en/fairfax_california",
   "/location/location/containedby": [{
     "id": null,
     "containedby": [{
       "id": null        Why can I use the
                         short name??
     }]
   }]
 }
Wednesday, December 8, 2010
Enumerated Value
                Closed collection of “values” for a property
                Constrains relations to fixed set of objects


        • /people/person/gender
                   { female, male, other }

        • /visual_art/visual_artist/art_forms
                   { drawing, painting, print making, photography.... }


Wednesday, December 8, 2010
Explore the Freebase Graph
                              child                  wrote

                                                         directed
               parent            sibling

                                  directed

                                      starred_in                    directed

                                                   starred_in


Wednesday, December 8, 2010
Explore the Freebase Graph


                              [{
                                "id":   null,
                                "type": "/film/director"
                              }]




Wednesday, December 8, 2010
Explore the Freebase Graph


                              [{
                                "id":   null,
                                "type": "/film/director",
                                "/people/person/children": [{
                                   "id":   null,
                                   "type": "/film/director"
                                }]
                              }]




Wednesday, December 8, 2010
Explore the Freebase Graph

                              [{
                                "id":   null,
                                "type": "/film/director",
                                "film":[ ],
                                "/people/person/children": [{
                                   "id":   null,
                                   "type": "/film/director"
                                   "film":[ ]
                                }]
                              }]




Wednesday, December 8, 2010
Explore the Freebase Graph
                              [{
                                "id":   null,
                                "type": "/film/director",
                                "film": [ ],
                                "/people/person/children": [{
                                   "id":   null,
                                   "type": "/film/director",
                                   "film": [{
                                     "name":null,
                                     "starring": [{
                                         "actor": null
                                     }]
                                   }]
                                }]
                              }]



Wednesday, December 8, 2010
namespace




   It’s all nodes
                                                                                                                                                                                                                                                /
                                                                                                                                                                                                                                                                         pe
                                                                                                                                                                                                                                                                            ople
                                                                                                                                                                                                                                                                                 ( key
                                                                                                                                                                                                 y)                                                                                       )
                                                                                                                                                                                             (ke
                                                                                                                                                                                       film

                                                                                                                                                                                                                                                                                                                          domain



                                                                                                                                                                                                                                                                                                                type
                                                                                                                               domain                                                                                                                                                                                     /people




      and links!
                                                                                                                                          type
                                                                                                                                /film


                                                                                                                                                                                                                                           e
                                                                                                                                                                         insta                                                       anc
                                                                                                                                                                               nce                      type                  inst


                                                                                                                                                                                                                 nam
                                                                                                                                                                                                                    e




                                                                                                                                                                                                                                                                                                     y)
                                                                                                                                                                                                                                                                                                (ke
                                                                                                                                                                                                e
                                                                                                                                                                                             typ
                                                                                                                                                                                                                                   "domain"




                                                                                                                                                                                                                                                                                               son
                                                                                                                                                                                                                                                                                                                                            type




                                                                                                                                                  film




                                                                                                                                                                                                                                                                                              per
                                                                                                                                                   (key
                                                                                                                                                                                                                                                                                                                                                          name           "property"




                                                                                                                                                                                             ins
                                                                                                                                                                                                                                ce                                                                                            type




                                                                                                                                                        )




                                                                                                                                                                                              tan
                                                                                                                                                                                                                                                                                                                                                                 propert
                                                                                                                                                                                                                             tan                                                                                                                                        ies
                                                                                                                                                                                                                        ins




                                                                                                                                                                                                   ce
                                                                                                                                                                                                                                                                                                                                                              ins
                                                                                                                                                                                                                                                                                                                                                                  tan
                                                                                                                                                                   "type"                                                                                                                                                                                            ce




                                                                                                                                                                                                                                                                                                                                             instance
                                                                                                                                                                                                          type
                                                                                                                                                                                             nam
                                                                                                                                                                                                   e                    type           instance                                                                                                                                   typ
                                                                                                                                                                                                                                                                                                                                                                                        e
                                                                                                                                                                                                                                                                                                                                                                                                property
                                                                                                                                                                                                                                       instance
                                                                                                                                                                                                                                                                                                                                                                     expected_type
                                                                                                                                                                                     ce
                                                                                                                                                                                 tan
                                                                                                                                                                           ins




                                                                                                                                                                                                                                                                                                                                                                                                       na
                                                                                                                                                                                                                       ins




                                                                                                                                                                                                                                                                                                                                                                                                          m
                                                                                                                                                                                                   nce




                                                                                                                                                                                                                                                                                                                                                                                                            e
                                                                                                                                                                                                                        tan
                                                                                                                                                                                                insta




                                                                                                                                                                                                                          ce
                                                                                                                                                                                                                                                                                                                                                                                                           "expected type"




                                                                                                                                                                                                                                                                                                                                     type
                                                                                                                                                                                                                                                                                                                               property

                                                                                                              type                                          type                                                                                            type
                                                                                                                                                                                                                                                                                                                                                                                        typ
                                                                                                                                type                                                                                                           type
                                                                                                                                                                                                                                                                                                                                                                                            e
                                                                                                                                                                            type                                                                                                properties                                                  na
                                                                                                                                                                                                                                                                                                                                                        expected
                                                                                                                                                                                                                                                                                                                                                                  _type                         date_time
                                                                                                                                                                                                                                                                                                                                               me
                                                                                                                                                                                                                                                                   na
                                                                                                                               na                                                                                                                                    me
                                                                                                                                 me                                       na
                                                                                                                                                                            m
                                                                                                                                                                                 e
                                                                                                                                                                                                                                                                                   "person"                                                             "date of birth"




                                                                                                                                                                                                                                                      nce
                                                                                                                                                 "film"




                                                                                                                                                              instance
                                                                                                                    instance




                                                                                                                                                                                                                                                    insta
                                                                                                                                                                                              "actor"




                                                                                                                      e                                                                                                                                                                LEGEND
                                                                                                 film           typ

                                                                                                             name
                                                                                                                                       "Blade Runner"                                                                                                                       /type/object
                                                                                                  starring




                                                                                                                                                                                                                                                                           /type/object + /common/topic
                                                                                                  film




                                                                                                                                                                                                                                                              obj type
                                                        film
                                                      character                               performance

                                                                                                                                                                                                                                                            /namespace
                                                  e
                                               nam
                              "Rick Deckard"
                                                                                                                                                                                                                                                                          outgoing                             incoming
                                                                                                  actor




                                                                                                                                                                                                                                                                                       keyvalue (key)
                                                                                                  film




                                                                                                 actor                                                                                                                                                                    out
                                                                                                                                                                                                                                                                                goin
                                                                                                                                                                                                                                                                                     gp
                                                                                                                                                                                                                                                                                        rop
                                                                                                person            e                                                                                                                                                                         e  rty
                                                                                                              typ
                                                                                                                      type                                                                                                                                                                                literal value
                                                                                                             nam
                                                                                         th                     e
                                                                               _o   f_bir
                                                                          date
                                                                                                                                "Harrison Ford"
                                                             1942-07-13




Wednesday, December 8, 2010
Domains, Bases and Commons

                              "commons"             individual's "bases"
                                          promo
                                               te


                                           Blade
                                           Runner



                              "domains"


Wednesday, December 8, 2010
Questions?!

                Docs: www.freebase.com/docs
                Wiki: wiki.freebase.com
                Mailing List: lists.freebase.com




Wednesday, December 8, 2010

Contenu connexe

Plus de Jamie Taylor

Social Fabric of Semantics - SemTech 2010
Social Fabric of Semantics - SemTech 2010Social Fabric of Semantics - SemTech 2010
Social Fabric of Semantics - SemTech 2010Jamie Taylor
 
Freebase - Semantic Technologies 2010 Code Camp
Freebase - Semantic Technologies 2010 Code CampFreebase - Semantic Technologies 2010 Code Camp
Freebase - Semantic Technologies 2010 Code CampJamie Taylor
 
Geo Location Semantics
Geo Location SemanticsGeo Location Semantics
Geo Location SemanticsJamie Taylor
 
The next phase of Web2.0: Data
The next phase of Web2.0: DataThe next phase of Web2.0: Data
The next phase of Web2.0: DataJamie Taylor
 
Using Semantics to Enhance Content
Using Semantics to Enhance ContentUsing Semantics to Enhance Content
Using Semantics to Enhance ContentJamie Taylor
 
Freebase Workshop, December 2009
Freebase Workshop, December 2009Freebase Workshop, December 2009
Freebase Workshop, December 2009Jamie Taylor
 
Using Semantics to Enhance Content Publishing
Using Semantics to Enhance Content PublishingUsing Semantics to Enhance Content Publishing
Using Semantics to Enhance Content PublishingJamie Taylor
 
ISWC 2009 Consuming LOD
ISWC 2009 Consuming LODISWC 2009 Consuming LOD
ISWC 2009 Consuming LODJamie Taylor
 
Drupal and the Semantic Web
Drupal and the Semantic WebDrupal and the Semantic Web
Drupal and the Semantic WebJamie Taylor
 
Freebase, RDF and the Semantic Web
Freebase, RDF and the Semantic WebFreebase, RDF and the Semantic Web
Freebase, RDF and the Semantic WebJamie Taylor
 

Plus de Jamie Taylor (10)

Social Fabric of Semantics - SemTech 2010
Social Fabric of Semantics - SemTech 2010Social Fabric of Semantics - SemTech 2010
Social Fabric of Semantics - SemTech 2010
 
Freebase - Semantic Technologies 2010 Code Camp
Freebase - Semantic Technologies 2010 Code CampFreebase - Semantic Technologies 2010 Code Camp
Freebase - Semantic Technologies 2010 Code Camp
 
Geo Location Semantics
Geo Location SemanticsGeo Location Semantics
Geo Location Semantics
 
The next phase of Web2.0: Data
The next phase of Web2.0: DataThe next phase of Web2.0: Data
The next phase of Web2.0: Data
 
Using Semantics to Enhance Content
Using Semantics to Enhance ContentUsing Semantics to Enhance Content
Using Semantics to Enhance Content
 
Freebase Workshop, December 2009
Freebase Workshop, December 2009Freebase Workshop, December 2009
Freebase Workshop, December 2009
 
Using Semantics to Enhance Content Publishing
Using Semantics to Enhance Content PublishingUsing Semantics to Enhance Content Publishing
Using Semantics to Enhance Content Publishing
 
ISWC 2009 Consuming LOD
ISWC 2009 Consuming LODISWC 2009 Consuming LOD
ISWC 2009 Consuming LOD
 
Drupal and the Semantic Web
Drupal and the Semantic WebDrupal and the Semantic Web
Drupal and the Semantic Web
 
Freebase, RDF and the Semantic Web
Freebase, RDF and the Semantic WebFreebase, RDF and the Semantic Web
Freebase, RDF and the Semantic Web
 

Dernier

Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 

Dernier (20)

Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 

Freebase Schema

  • 1. Freebase Schema Jamie Taylor Wednesday, December 8, 2010
  • 2. Goals • Schema: The Freebase Data Model • Schema as API • Schema patterns Wednesday, December 8, 2010
  • 3. Freebase is a collection of facts Sofia Coppola directed Marie Antoinette { { { Freebase only contains nodes and Links Wednesday, December 8, 2010
  • 4. Freebase is a Graph Wednesday, December 8, 2010
  • 5. Freebase is a labeled Graph child wrote directed parent sibling directed starred_in directed starred_in Wednesday, December 8, 2010
  • 6. Schema "All the things you can say about something in Freebase" Schema is the data model for Freebase Wednesday, December 8, 2010
  • 7. All nodes are “/type/object” /m/02vyw /people/person id type type /film/director [{ /type/object/name   "id":"/m/02vyw", name   "name":null,   "type":[{}] “Francis Coppola” }] Wednesday, December 8, 2010
  • 8. Types suggest properties to use /m/02vyw id type /film/director lm /fi or ct ire /d lm /fi /type/object/id /en/bram_stokers_dracula Wednesday, December 8, 2010
  • 9. Queries follow schema [{    "id": "/en/francis_ford_coppola",    "/film/director/film": [{      "id":null, "name":null    }] }] Wednesday, December 8, 2010
  • 10. Properties link the graph together /m/02vyw id type /film/director lm /fi or ct ire ten _by /d wri t lm /fi /type/object/id /en/bram_stokers_dracula Wednesday, December 8, 2010
  • 11. Queries follow schema [{    "id": "/en/francis_ford_coppola", Name is returned (how to get ID?)    "/film/director/film": [{ How to get all the      "id": "/en/bram_stokers_dracula", writters for all of Coppola’s movies? "written_by":null    }] }] Wednesday, December 8, 2010
  • 13. Core Concepts Instance: • Topic: "a thing in the world" • Blade Runner, Ridley Scott, NBC, Last Proof Schema: • Types - Categorical collections of instances • Properties - Relationships between instances Wednesday, December 8, 2010
  • 14. Core Concepts An instance may have multiple Types • "Co-Types" (Types are mix-ins) • Arnold Schwartzeneger • Person, Actor, Politician, Sports Figure Wednesday, December 8, 2010
  • 15. Lessons from everyday vocabulary Wikipedia Word Frequency 20000000 18000000 16000000 14000000 12000000 Frequency 10000000 8000000 6000000 4000000 2000000 0 0 20 40 60 80 100 120 Rank Data from Victor S. Grishchenko Wednesday, December 8, 2010
  • 16. Schema Principle #1 Event Example: -Film Festival -Battle of Waterloo Use Co-Types Liberally: Use a few large, encompassing Types to provide general information Use several smaller, fine grained Types to provide detailed information Wednesday, December 8, 2010
  • 17. Core Concepts Relational DB vs RDF Properties are defined on Types • Properties are the vocabulary for a specific Type • An instance must be “an instance of a type” before it can use the Type’s properties to describe itself Wednesday, December 8, 2010
  • 18. Core Concepts Expected Type ~ RDFS Range • A Property Value has a specific Type • "Expected Type" • A Property has exactly one Expected Type Manufactures Wednesday, December 8, 2010
  • 19. Core Concepts Expected Types (Property Values): • Value Types (literals) • String (two flavors), Integer, Float, DateTime, boolean • Object Types • Everything Else Wednesday, December 8, 2010
  • 20. /type/object contrast to common topic Everything in Freebase has this Type Provides basic properties • Type • Name • ....... All other Properties come from some other Type! Wednesday, December 8, 2010
  • 21. /common/topic "Topics" • Things we have discourse about • Provides properties: • Alias • Article • Image • Weblinks • Assumed to be an "Included Type" for any "standard" type Wednesday, December 8, 2010
  • 22. Schema Patterns Compound Value Mediator Phylogeny Enumeration Wednesday, December 8, 2010
  • 23. Compound Value complex literal Two or more properties which can only be interpreted with regard to one another Population • Dated Integer ("when did this location have that many people") Movie Budget • Dated money value • Date, Currency, Amount Ticker Symbol • Exchange, Symbol Wednesday, December 8, 2010
  • 24. Compound Value currency estimated_budget amount 31MM {   "id":   "/en/apocalypse_now", valid_date 1979   "type": "/film/film",   "estimated_budget": [{     "currency":   null,     "amount":     null,     "valid_date": null   }] } Wednesday, December 8, 2010
  • 25. Mediator combine date annotation and character = tv character An annotation on the link between two Topics • Requires an object between the two Topics • The Topics become separated by two properties actor performance film character • Also useful for indicating the dates when a relationship existed (e.g., education, employment, etc.) Wednesday, December 8, 2010
  • 26. Mediator {   "id":   "/en/marie_antoinette_2006",   "type": "/film/film",   "starring": [{     "actor":null,     "character":null   }] } Wednesday, December 8, 2010
  • 27. Phylogeny Used when instances form a hierarchy Phylogeny properties have an expected Type which is the same as the Type on which the property is defined. Examples: • /location/location/containedby • /computer/computer/parent_model • /tv/tv_program/spin_offs Wednesday, December 8, 2010
  • 28. Phylogeny {   "id": "/en/fairfax_california",   "/location/location/containedby": [{     "id": null,     "containedby": [{       "id": null Why can I use the short name??     }]   }] } Wednesday, December 8, 2010
  • 29. Enumerated Value Closed collection of “values” for a property Constrains relations to fixed set of objects • /people/person/gender { female, male, other } • /visual_art/visual_artist/art_forms { drawing, painting, print making, photography.... } Wednesday, December 8, 2010
  • 30. Explore the Freebase Graph child wrote directed parent sibling directed starred_in directed starred_in Wednesday, December 8, 2010
  • 31. Explore the Freebase Graph [{   "id":   null,   "type": "/film/director" }] Wednesday, December 8, 2010
  • 32. Explore the Freebase Graph [{   "id":   null,   "type": "/film/director",   "/people/person/children": [{      "id":   null,      "type": "/film/director"   }] }] Wednesday, December 8, 2010
  • 33. Explore the Freebase Graph [{   "id":   null,   "type": "/film/director",   "film":[ ],   "/people/person/children": [{      "id":   null,      "type": "/film/director"      "film":[ ]   }] }] Wednesday, December 8, 2010
  • 34. Explore the Freebase Graph [{   "id":   null,   "type": "/film/director",   "film": [ ],   "/people/person/children": [{      "id":   null,      "type": "/film/director",      "film": [{        "name":null,        "starring": [{          "actor": null        }]      }]   }] }] Wednesday, December 8, 2010
  • 35. namespace It’s all nodes / pe ople ( key y) ) (ke film domain type domain /people and links! type /film e insta anc nce type inst nam e y) (ke e typ "domain" son type film per (key name "property" ins ce type ) tan propert tan ies ins ce ins tan "type" ce instance type nam e type instance typ e property instance expected_type ce tan ins na ins m nce e tan insta ce "expected type" type property type type type typ type type e type properties na expected _type date_time me na na me me na m e "person" "date of birth" nce "film" instance instance insta "actor" e LEGEND film typ name "Blade Runner" /type/object starring /type/object + /common/topic film obj type film character performance /namespace e nam "Rick Deckard" outgoing incoming actor keyvalue (key) film actor out goin gp rop person e e rty typ type literal value nam th e _o f_bir date "Harrison Ford" 1942-07-13 Wednesday, December 8, 2010
  • 36. Domains, Bases and Commons "commons" individual's "bases" promo te Blade Runner "domains" Wednesday, December 8, 2010
  • 37. Questions?! Docs: www.freebase.com/docs Wiki: wiki.freebase.com Mailing List: lists.freebase.com Wednesday, December 8, 2010