SlideShare une entreprise Scribd logo
1  sur  1
Télécharger pour lire hors ligne
 
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
Figure 4: An Individual Record from the Gateway to Oklahoma History Website
Emily Kolvitz, University of Oklahoma, School of Library and Information Studies & Bynder LLC
RESEARCH	
  METHODS
INTRODUCTION
Figure 3: Detailed Information on the types, quality and quantity of metadata from Gateway to
Oklahoma History Image Resources
Image Source: http://gateway.okhistory.org/ark:/67531/metadc317314/
Image Source: http://gateway.okhistory.org/ark:/67531/metadc190366/
Image Source: http://gateway.okhistory.org/ark:/67531/metadc316961/
Image Source: http://gateway.okhistory.org/ark:/67531/metadc316903
Figure 2: Screen capture of Embedded Metadata
Figure 1: Structured Data
Linter Tool Results
CONCLUSION
© Copyright
Emily Kolvitz, Product Consultant at Bynder, LLC
MLIS from The University of Oklahoma
Emily Kolvitz emily@getbynder.com
+1 405 471 2570 https://www.theinformationprofessional.com
https://www.getbynder.com
FURTHER	
  INFORMATION	
  
ACKNOWLEDGEMENTS	
  
I would like to thank The University of Oklahoma, who offered resources to make this project
possible and also The Oklahoma History Center for allowing me to work on such a wonderful
digitization project for the Gateway to Oklahoma History during my study.
RESULTS	
  
Image Resource Findability on the World Wide Web is still
very much a land-grab. For the Semantic Web to become a
reality online businesses and individuals have to get their
hands dirty and also come face-to-face with the realization that
search engine giants are increasingly becoming the go-to tool
for information resource retrieval. “Increasingly, students use
Web search engines such as Google to locate information
resources rather than seek out library online catalogs or
databases of scholarly journal articles” (Lippincott 2013). This
puts the search engine giant in a unique position to dictate
how the future of search will work on the Web - and therefore,
your organization’s future presence (or lack thereof) on the
Web.
Image search and retrieval is a more difficult area than
text search and retrieval because accessibility to the
image content is largely dependent on the context
presented in and around the image resource. The
widespread adoption of Schema.org and structured data on
the Web did not gain traction until the big four search engines
(Google, Bing, Yahoo, and Yandex ) agreed that a standard
was needed to pave the way forward. “On-page markup helps
search engines understand the information on web pages and
provide richer search results. A shared markup vocabulary
makes easier for webmasters to decide on a markup schema
and get the maximum benefit for their efforts,” (Schema.org
2014).Structured data is only one piece of the findability
algorithm. Metadata near content, embedded within
content, or listed in the alt text of an html document all tell
machines something about the content inside of the
record as well.
There are no guardians of the Web, ensuring structured data
is uniformly applied to all records with equal attention and care
and there is no standard, mandated requirement for records on
the Web to provide context for image resource findability. Most
search engines do not crawl embedded XMP data or the
invisible Web, leaving text near images, file names or text in the
alt-text in html markup as the only context for image resources.
The search algorithms for image retrieval are subject to change
frequently (Kritzinger 2013) and additionally, social media sites
and organizations strip embedded data from images
(Embedded Metadata Manifesto 2014). Embedded metadata
provides context and provenance for image resources.
Even with the dramatic adoption of structured data markup
utilizing schema.org vocabularies, there still remains
metadata opportunities on the Web. Reicks recommends
embedded metadata as a strategy for online findability by
showcasing examples of applications that parse embedded
data into structured data around images on the Web such as
PhotoShelter and LicenseStream (2010).
The one variable in the equation of Web findability that remains
a staple is good quality metadata under the hood of the
Website. In this case study, a methodology is applied to the
Gateway to Oklahoma History’s Website. This study can
be generalized to organizations looking to benchmark their own
findability maturity on the Web from an image-centric viewpoint.
The following research question informed this project:
What are the types and quality of structured data, XMP, and metadata records available for image resources appearing on
the website?
Utilizing the Structured Data Linter Tool and Phil Harvey’s ExifTool, information was gathered to quantify these research questions.
Image records on the Gateway to Oklahoma History’s website were investigated for the types, quality and quantity of embedded
metadata and structured data.
The Gateway to Oklahoma History’s Website has a wealth of structured data and metadata pertaining to
its image resources. Search queries utilizing structured data markup tags and/or embedded metadata yielded
relevant and accurate results during a normal web search, but did not yield relevant and/or accurate image
resources during an image search. Descriptive filenames were not used for image resources, which is an
important part of image retrieval through web search engines.
Adding Schema.org tags to the on-page markup, to accompany the structured data already present is
another area for improvement. An interesting finding from this research was that embedded metadata was
only found on the largest, original version of the image resource, and never on smaller derivative images.
Structured data included in the on-page mark-up included Open Graph Protocol and Dublin Core. IPTC was the
primarily type of embedded metadata present for the image resources.
	
  
The results and methodology for this research can help GLAM institutions (Galleries, Libraries,
Archives & Museums) by bringing awareness to the state of structured data and image resource
findability for cultural heritage institutions on the Web. GLAMs must be active in the SEO space,
support machine-readable language in the markup of their sites, and utilize Schema.org
vocabularies and descriptive filenames for relevancy in search engine results.
The Digital Library Federation, which is a program of the Council on Library and Information
Resources, concludes that “Getting found means repository objects must be included in the indexes
of major search engines because most students and faculty now begin their research with Internet
search engines. Digital repositories created by libraries will be largely invisible to users if their
contents are not indexed in these search engines”  (Digital Library Foundation 2014).  
REFERENCES
Corlosquet, Stephane and Gregg Kellogg. Last accessed August 1, 2015 “Structured Data Linter,”
http://linter.structured-data.org/
Digital Library Federation. Last accessed October 20, 2014. “SEO for Digital Libraries.”
http://www.diglib.org/community/groups/seo-for-digital-libraries/
Harvey, Phil. Last accessed August 1, 2015. “ExifTool by Phil Harvey,”
http://www.sno.phy.queensu.ca/~phil/exiftool/
International Business, Times. 0006. "Bing, Google and Yahoo merge to make search easier with
schema.org." International Business Times, April.
IPTC International Press Telecommunications Council, 2014. “Embedded Metadata Manifesto” Last
accessed November 20, 2014.
http://www.embeddedmetadata.org/social-media-test-results.php(Embedded Metadata Manifesto 2014).
Kritzinger, W. T. "Search Engine Optimization and Pay-per-Click Marketing Strategies." Journal of
Organizational Computing and Electronic Commerce, no. 3 (2013): 273-86.
Lippincott, Joan K. “Net Generation Students and Libraries,” EDUCAUSE (2005), accessed November
19, 2014,
http://www.educause.edu/research-and-publications/books/educating-net-generation/net-generation-
students-and-libraries
Reicks, David. 2010. “Why Embedded Metadata Won’t Help Your SEO,” Last Updated December 30,
2013. Last Accessed November 23, 2014.
http://www.controlledvocabulary.com/blog/embedded-metadata-wont-help-seo.html
Schema.org. 2015. “About Schema.org” Last Updated Unknown. https://schema.org/docs/faq.html
 

Contenu connexe

Tendances

Sem tech2013 tutorial
Sem tech2013 tutorialSem tech2013 tutorial
Sem tech2013 tutorial
Thengo Kim
 
euclid_linkedup WWW tutorial (Besnik Fetahu)
euclid_linkedup WWW tutorial (Besnik Fetahu)euclid_linkedup WWW tutorial (Besnik Fetahu)
euclid_linkedup WWW tutorial (Besnik Fetahu)
Besnik Fetahu
 
A Term Based Ranking Methodology for Resources on the Semantic Web
A Term Based Ranking Methodology for Resources on the Semantic WebA Term Based Ranking Methodology for Resources on the Semantic Web
A Term Based Ranking Methodology for Resources on the Semantic Web
Aaron Huang
 

Tendances (18)

Web scale discovery tools
Web scale discovery tools Web scale discovery tools
Web scale discovery tools
 
Sem tech2013 tutorial
Sem tech2013 tutorialSem tech2013 tutorial
Sem tech2013 tutorial
 
Web Content Mining
Web Content MiningWeb Content Mining
Web Content Mining
 
Web mining
Web miningWeb mining
Web mining
 
Federated Search Falls Short
Federated Search Falls ShortFederated Search Falls Short
Federated Search Falls Short
 
Web Mining
Web MiningWeb Mining
Web Mining
 
euclid_linkedup WWW tutorial (Besnik Fetahu)
euclid_linkedup WWW tutorial (Besnik Fetahu)euclid_linkedup WWW tutorial (Besnik Fetahu)
euclid_linkedup WWW tutorial (Besnik Fetahu)
 
Web Usage Pattern
Web Usage PatternWeb Usage Pattern
Web Usage Pattern
 
Gt ea2009
Gt ea2009Gt ea2009
Gt ea2009
 
Linked data MLA 2015
Linked data MLA 2015Linked data MLA 2015
Linked data MLA 2015
 
Linked Data MLA 2015
Linked Data MLA 2015Linked Data MLA 2015
Linked Data MLA 2015
 
A Term Based Ranking Methodology for Resources on the Semantic Web
A Term Based Ranking Methodology for Resources on the Semantic WebA Term Based Ranking Methodology for Resources on the Semantic Web
A Term Based Ranking Methodology for Resources on the Semantic Web
 
Linked Data: Uses and Users
Linked Data: Uses and UsersLinked Data: Uses and Users
Linked Data: Uses and Users
 
Web content mining
Web content miningWeb content mining
Web content mining
 
Web mining (structure mining)
Web mining (structure mining)Web mining (structure mining)
Web mining (structure mining)
 
Web Scale Discovery Services: Google like search experience
Web Scale Discovery Services: Google like search experienceWeb Scale Discovery Services: Google like search experience
Web Scale Discovery Services: Google like search experience
 
Abcd
AbcdAbcd
Abcd
 
Subject information gateway in information technology (sigit) an introduction
Subject information gateway in information technology (sigit) an introductionSubject information gateway in information technology (sigit) an introduction
Subject information gateway in information technology (sigit) an introduction
 

En vedette

О компании (продукты и решения)
О компании (продукты и решения)О компании (продукты и решения)
О компании (продукты и решения)
NLMK Europe
 

En vedette (14)

151218 lunchlezing-prof-tops-verslag- (1)
151218 lunchlezing-prof-tops-verslag- (1)151218 lunchlezing-prof-tops-verslag- (1)
151218 lunchlezing-prof-tops-verslag- (1)
 
О компании (продукты и решения)
О компании (продукты и решения)О компании (продукты и решения)
О компании (продукты и решения)
 
My book my success
My book   my successMy book   my success
My book my success
 
UW Libraries: Research Smarter, Not Harder
UW Libraries: Research Smarter, Not HarderUW Libraries: Research Smarter, Not Harder
UW Libraries: Research Smarter, Not Harder
 
Question 7 - 2k15 Exam Preparation Day
Question 7 - 2k15 Exam Preparation DayQuestion 7 - 2k15 Exam Preparation Day
Question 7 - 2k15 Exam Preparation Day
 
AAG2014
AAG2014AAG2014
AAG2014
 
RESUMENEW5_30
RESUMENEW5_30RESUMENEW5_30
RESUMENEW5_30
 
Evaluation question 3
Evaluation question 3Evaluation question 3
Evaluation question 3
 
Single page apps with drupal 7
Single page apps with drupal 7Single page apps with drupal 7
Single page apps with drupal 7
 
SK-KD Seni Budaya SDLB – A(Tuna Netra)
SK-KD Seni Budaya SDLB – A(Tuna Netra)SK-KD Seni Budaya SDLB – A(Tuna Netra)
SK-KD Seni Budaya SDLB – A(Tuna Netra)
 
Building a Collaborative Data Architecture
Building a Collaborative Data ArchitectureBuilding a Collaborative Data Architecture
Building a Collaborative Data Architecture
 
SharePoint Designer Workflows - Nuts, Bolts and Examples
SharePoint Designer Workflows - Nuts, Bolts and ExamplesSharePoint Designer Workflows - Nuts, Bolts and Examples
SharePoint Designer Workflows - Nuts, Bolts and Examples
 
Vleiedag
Vleiedag Vleiedag
Vleiedag
 
Directing the Connected Human Experience by Brenda Fernandez, Victor Quintana...
Directing the Connected Human Experience by Brenda Fernandez, Victor Quintana...Directing the Connected Human Experience by Brenda Fernandez, Victor Quintana...
Directing the Connected Human Experience by Brenda Fernandez, Victor Quintana...
 

Similaire à Gateway to Oklahoma History Case Study: Structured Data and Metadata Evaluation for Improved Image Findability on the Web

Peter Mika's Presentation at SSSW 2011
Peter Mika's Presentation at SSSW 2011Peter Mika's Presentation at SSSW 2011
Peter Mika's Presentation at SSSW 2011
sssw2011
 
HIGH-LEVEL SEMANTICS OF IMAGES IN WEB DOCUMENTS USING WEIGHTED TAGS AND STREN...
HIGH-LEVEL SEMANTICS OF IMAGES IN WEB DOCUMENTS USING WEIGHTED TAGS AND STREN...HIGH-LEVEL SEMANTICS OF IMAGES IN WEB DOCUMENTS USING WEIGHTED TAGS AND STREN...
HIGH-LEVEL SEMANTICS OF IMAGES IN WEB DOCUMENTS USING WEIGHTED TAGS AND STREN...
IJCSEA Journal
 
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
inventionjournals
 

Similaire à Gateway to Oklahoma History Case Study: Structured Data and Metadata Evaluation for Improved Image Findability on the Web (20)

Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...
 
Introduction abstract
Introduction abstractIntroduction abstract
Introduction abstract
 
Literature Survey on Web Mining
Literature Survey on Web MiningLiterature Survey on Web Mining
Literature Survey on Web Mining
 
Peter Mika's Presentation at SSSW 2011
Peter Mika's Presentation at SSSW 2011Peter Mika's Presentation at SSSW 2011
Peter Mika's Presentation at SSSW 2011
 
HIGH-LEVEL SEMANTICS OF IMAGES IN WEB DOCUMENTS USING WEIGHTED TAGS AND STREN...
HIGH-LEVEL SEMANTICS OF IMAGES IN WEB DOCUMENTS USING WEIGHTED TAGS AND STREN...HIGH-LEVEL SEMANTICS OF IMAGES IN WEB DOCUMENTS USING WEIGHTED TAGS AND STREN...
HIGH-LEVEL SEMANTICS OF IMAGES IN WEB DOCUMENTS USING WEIGHTED TAGS AND STREN...
 
Recent Trends in Semantic Search Technologies
Recent Trends in Semantic Search TechnologiesRecent Trends in Semantic Search Technologies
Recent Trends in Semantic Search Technologies
 
Web Mining
Web MiningWeb Mining
Web Mining
 
IRJET - Re-Ranking of Google Search Results
IRJET - Re-Ranking of Google Search ResultsIRJET - Re-Ranking of Google Search Results
IRJET - Re-Ranking of Google Search Results
 
Linked dataresearch
Linked dataresearchLinked dataresearch
Linked dataresearch
 
IRJET-Model for semantic processing in information retrieval systems
IRJET-Model for semantic processing in information retrieval systemsIRJET-Model for semantic processing in information retrieval systems
IRJET-Model for semantic processing in information retrieval systems
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
Semantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based SystemSemantic Search of E-Learning Documents Using Ontology Based System
Semantic Search of E-Learning Documents Using Ontology Based System
 
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
Enhanced Web Usage Mining Using Fuzzy Clustering and Collaborative Filtering ...
 
3 Understanding Search
3 Understanding Search3 Understanding Search
3 Understanding Search
 
Comparative Analysis of Collaborative Filtering Technique
Comparative Analysis of Collaborative Filtering TechniqueComparative Analysis of Collaborative Filtering Technique
Comparative Analysis of Collaborative Filtering Technique
 
International conference On Computer Science And technology
International conference On Computer Science And technologyInternational conference On Computer Science And technology
International conference On Computer Science And technology
 
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
Relationships at the Heart of Semantic Web: Modeling, Discovering, Validating...
 
Web Search Engine, Web Crawler, and Semantics Web
Web Search Engine, Web Crawler, and Semantics WebWeb Search Engine, Web Crawler, and Semantics Web
Web Search Engine, Web Crawler, and Semantics Web
 
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A Review
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A ReviewIRJET-Multi -Stage Smart Deep Web Crawling Systems: A Review
IRJET-Multi -Stage Smart Deep Web Crawling Systems: A Review
 
Perception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document ClusteringPerception Determined Constructing Algorithm for Document Clustering
Perception Determined Constructing Algorithm for Document Clustering
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 

Gateway to Oklahoma History Case Study: Structured Data and Metadata Evaluation for Improved Image Findability on the Web

  • 1.                         Figure 4: An Individual Record from the Gateway to Oklahoma History Website Emily Kolvitz, University of Oklahoma, School of Library and Information Studies & Bynder LLC RESEARCH  METHODS INTRODUCTION Figure 3: Detailed Information on the types, quality and quantity of metadata from Gateway to Oklahoma History Image Resources Image Source: http://gateway.okhistory.org/ark:/67531/metadc317314/ Image Source: http://gateway.okhistory.org/ark:/67531/metadc190366/ Image Source: http://gateway.okhistory.org/ark:/67531/metadc316961/ Image Source: http://gateway.okhistory.org/ark:/67531/metadc316903 Figure 2: Screen capture of Embedded Metadata Figure 1: Structured Data Linter Tool Results CONCLUSION © Copyright Emily Kolvitz, Product Consultant at Bynder, LLC MLIS from The University of Oklahoma Emily Kolvitz emily@getbynder.com +1 405 471 2570 https://www.theinformationprofessional.com https://www.getbynder.com FURTHER  INFORMATION   ACKNOWLEDGEMENTS   I would like to thank The University of Oklahoma, who offered resources to make this project possible and also The Oklahoma History Center for allowing me to work on such a wonderful digitization project for the Gateway to Oklahoma History during my study. RESULTS   Image Resource Findability on the World Wide Web is still very much a land-grab. For the Semantic Web to become a reality online businesses and individuals have to get their hands dirty and also come face-to-face with the realization that search engine giants are increasingly becoming the go-to tool for information resource retrieval. “Increasingly, students use Web search engines such as Google to locate information resources rather than seek out library online catalogs or databases of scholarly journal articles” (Lippincott 2013). This puts the search engine giant in a unique position to dictate how the future of search will work on the Web - and therefore, your organization’s future presence (or lack thereof) on the Web. Image search and retrieval is a more difficult area than text search and retrieval because accessibility to the image content is largely dependent on the context presented in and around the image resource. The widespread adoption of Schema.org and structured data on the Web did not gain traction until the big four search engines (Google, Bing, Yahoo, and Yandex ) agreed that a standard was needed to pave the way forward. “On-page markup helps search engines understand the information on web pages and provide richer search results. A shared markup vocabulary makes easier for webmasters to decide on a markup schema and get the maximum benefit for their efforts,” (Schema.org 2014).Structured data is only one piece of the findability algorithm. Metadata near content, embedded within content, or listed in the alt text of an html document all tell machines something about the content inside of the record as well. There are no guardians of the Web, ensuring structured data is uniformly applied to all records with equal attention and care and there is no standard, mandated requirement for records on the Web to provide context for image resource findability. Most search engines do not crawl embedded XMP data or the invisible Web, leaving text near images, file names or text in the alt-text in html markup as the only context for image resources. The search algorithms for image retrieval are subject to change frequently (Kritzinger 2013) and additionally, social media sites and organizations strip embedded data from images (Embedded Metadata Manifesto 2014). Embedded metadata provides context and provenance for image resources. Even with the dramatic adoption of structured data markup utilizing schema.org vocabularies, there still remains metadata opportunities on the Web. Reicks recommends embedded metadata as a strategy for online findability by showcasing examples of applications that parse embedded data into structured data around images on the Web such as PhotoShelter and LicenseStream (2010). The one variable in the equation of Web findability that remains a staple is good quality metadata under the hood of the Website. In this case study, a methodology is applied to the Gateway to Oklahoma History’s Website. This study can be generalized to organizations looking to benchmark their own findability maturity on the Web from an image-centric viewpoint. The following research question informed this project: What are the types and quality of structured data, XMP, and metadata records available for image resources appearing on the website? Utilizing the Structured Data Linter Tool and Phil Harvey’s ExifTool, information was gathered to quantify these research questions. Image records on the Gateway to Oklahoma History’s website were investigated for the types, quality and quantity of embedded metadata and structured data. The Gateway to Oklahoma History’s Website has a wealth of structured data and metadata pertaining to its image resources. Search queries utilizing structured data markup tags and/or embedded metadata yielded relevant and accurate results during a normal web search, but did not yield relevant and/or accurate image resources during an image search. Descriptive filenames were not used for image resources, which is an important part of image retrieval through web search engines. Adding Schema.org tags to the on-page markup, to accompany the structured data already present is another area for improvement. An interesting finding from this research was that embedded metadata was only found on the largest, original version of the image resource, and never on smaller derivative images. Structured data included in the on-page mark-up included Open Graph Protocol and Dublin Core. IPTC was the primarily type of embedded metadata present for the image resources.   The results and methodology for this research can help GLAM institutions (Galleries, Libraries, Archives & Museums) by bringing awareness to the state of structured data and image resource findability for cultural heritage institutions on the Web. GLAMs must be active in the SEO space, support machine-readable language in the markup of their sites, and utilize Schema.org vocabularies and descriptive filenames for relevancy in search engine results. The Digital Library Federation, which is a program of the Council on Library and Information Resources, concludes that “Getting found means repository objects must be included in the indexes of major search engines because most students and faculty now begin their research with Internet search engines. Digital repositories created by libraries will be largely invisible to users if their contents are not indexed in these search engines”  (Digital Library Foundation 2014).   REFERENCES Corlosquet, Stephane and Gregg Kellogg. Last accessed August 1, 2015 “Structured Data Linter,” http://linter.structured-data.org/ Digital Library Federation. Last accessed October 20, 2014. “SEO for Digital Libraries.” http://www.diglib.org/community/groups/seo-for-digital-libraries/ Harvey, Phil. Last accessed August 1, 2015. “ExifTool by Phil Harvey,” http://www.sno.phy.queensu.ca/~phil/exiftool/ International Business, Times. 0006. "Bing, Google and Yahoo merge to make search easier with schema.org." International Business Times, April. IPTC International Press Telecommunications Council, 2014. “Embedded Metadata Manifesto” Last accessed November 20, 2014. http://www.embeddedmetadata.org/social-media-test-results.php(Embedded Metadata Manifesto 2014). Kritzinger, W. T. "Search Engine Optimization and Pay-per-Click Marketing Strategies." Journal of Organizational Computing and Electronic Commerce, no. 3 (2013): 273-86. Lippincott, Joan K. “Net Generation Students and Libraries,” EDUCAUSE (2005), accessed November 19, 2014, http://www.educause.edu/research-and-publications/books/educating-net-generation/net-generation- students-and-libraries Reicks, David. 2010. “Why Embedded Metadata Won’t Help Your SEO,” Last Updated December 30, 2013. Last Accessed November 23, 2014. http://www.controlledvocabulary.com/blog/embedded-metadata-wont-help-seo.html Schema.org. 2015. “About Schema.org” Last Updated Unknown. https://schema.org/docs/faq.html