Ce diaporama a bien été signalé.
Le téléchargement de votre SlideShare est en cours. ×

AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CEO, Access Innovations, USA) Marjorie Hlava (President of Access Innovation, USA)

Publicité
Publicité
Publicité
Publicité
Publicité
Publicité
Publicité
Publicité
Publicité
Publicité
Publicité
Publicité
Prochain SlideShare
Cinf flash v2 final
Cinf flash v2 final
Chargement dans…3
×

Consultez-les par la suite

1 sur 36 Publicité

AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CEO, Access Innovations, USA) Marjorie Hlava (President of Access Innovation, USA)

Télécharger pour lire hors ligne

How do you find video when you only have sparse data? While you can wander the stacks (if you can still find open stacks) for inspiration, video either physical or digital, is difficult to discover. Wandering the virtual stacks is, well, virtually impossible. Discovery platforms on the whole have not replicated the inspirational experience of wandering the stacks.

More companies are using archivable video for internal communication of the various research projects, product developments, test results, and more that are being considered, in progress, or completed. Showing how an experiment was conducted can convey considerably more information that is very difficult to communicate via text. How do you find a company video that might be helpful for your project?

A case study is presented of the problems and the solutions that were implemented by a large, multinational chemical company. A suite of content discovery technologies was used including a video to text to tagging system connected to their documents database and automatically indexed using several chemical as well as conceptual systems (rule-based, NLP, inference engine). To build the system and support the manuscript and video submission there is a metadata extraction program which pulls and inserts the metadata into the submission forms so the author can move quickly through that process.

How do you find video when you only have sparse data? While you can wander the stacks (if you can still find open stacks) for inspiration, video either physical or digital, is difficult to discover. Wandering the virtual stacks is, well, virtually impossible. Discovery platforms on the whole have not replicated the inspirational experience of wandering the stacks.

More companies are using archivable video for internal communication of the various research projects, product developments, test results, and more that are being considered, in progress, or completed. Showing how an experiment was conducted can convey considerably more information that is very difficult to communicate via text. How do you find a company video that might be helpful for your project?

A case study is presented of the problems and the solutions that were implemented by a large, multinational chemical company. A suite of content discovery technologies was used including a video to text to tagging system connected to their documents database and automatically indexed using several chemical as well as conceptual systems (rule-based, NLP, inference engine). To build the system and support the manuscript and video submission there is a metadata extraction program which pulls and inserts the metadata into the submission forms so the author can move quickly through that process.

Publicité
Publicité

Plus De Contenu Connexe

Similaire à AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CEO, Access Innovations, USA) Marjorie Hlava (President of Access Innovation, USA) (20)

Plus par Dr. Haxel Consult (20)

Publicité

Plus récents (20)

AI-SDV 2022: Where’s the one about…? Looney Tunes® Revisited Jay Ven Eman (CEO, Access Innovations, USA) Marjorie Hlava (President of Access Innovation, USA)

  1. 1. © 2022. Access Innovations, Inc. All rights reserved. Access Innovations, Inc. Marjorie M.K. Hlava mhlava@accessinn.com Jay Ven Eman j_ven_eman@accessinn.com www.accessinn.com www.dataharmony.com +1.505.998.0800 Albuquerque, NM Leveraging Your Content Semantically Where’s the one about… Looney Tunes® Revisited October 10, 2022
  2. 2. Wondering and wandering!
  3. 3. How do you find information… when you don’t know what you want, what it might be called, where to look? I used to wander the stacks…
  4. 4. Long Library, Trinity U., Dublin
  5. 5. Guinness Brewery
  6. 6. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Albuquerque, New Mexico Las Vegas, New Mexico Las Vegas, Nevada
  7. 7. Background
  8. 8. What if there is sparse metadata unlike library catalog cards? Video? - Notorious for no metadata - Maybe a title - Newspaper ‘slug’
  9. 9. Where’s the one about… Daffy Duck and Donald Duck and pianos?
  10. 10. What was the one about… Bugs Bunny - opera singer? © Warner Bros.
  11. 11. Do you recall the one about… a coyote and the what was it? © Warner Bros.
  12. 12. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Our neighborhood Coyote
  13. 13. The Road Runner
  14. 14. Questions to ponder ❖ How do you find what you’re looking for? ❖ How do you know what you want? ❖ How do you know you found it? ❖ How do you know, if you’ve missed something? ❖ How do you replicate wandering the stacks in the Age of Google?
  15. 15. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. And now… Case studies by Marjorie Hlava
  16. 16. Case Study ❖ Access Innovations, Inc. ❖ Changing ‘search’ to ‘found’ ❖ Why we do it – the problem ❖ How we do it – the solution ❖ Case study on metadata for video
  17. 17. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Clients Publishing & Media Education Government Non-profits & Societies Health/Pharma Manufacturing & Retail
  18. 18. Promising solutions for improving the accuracy of content metadata ❖ Standards - Check out the NISO Library on their web site ❖ Consortiums for clean data ❖ Share, check, and enhance metadata ❖ Automate as much of manuscript submission & peer review as possible ❖ Clean up the author synonymy ❖ Enhance your content and the audiences it represents worldwide ❖ THE GOAL ❖ High integrity, accurate, consistent content
  19. 19. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Semantic control and content enrichment ❖ Controlled vocabularies, authority files, taxonomies, thesaurus, ontologies, triple stores, and knowledge graphs ❖ Follow the standards ▪ Accepted Structure and Format Use • ANSI/NISO Z39.19 • ISO2788 • BS5723 • ISO25964 Parts 1 and 2
  20. 20. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Every Walk of Life Uses Constantly Changing Vernacular • Homeless • Unsheltered • Unhoused • Street people • Hobos • Vagrants • …. • Taxonomy • Ontology • Thesaurus • Knowledge Map • Metastatic breast cancer • Stage IV Breast Cancer • Invasive Breast Cancer • Covid-19 • Coronavirus • SARS-CoV-2 • Omicron • BA.4 • BA.5 • …..
  21. 21. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Differences in search results due to synonymy ❖ Invasive breast cancer: 520 results ❖ Metastatic breast cancer: 1803 results ❖ Stage IV breast cancer: 73 results ❖ Stage IV breast cancer: 46,400,000 results Lack of Synonymy Control Breaks Search
  22. 22. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. But How About Improving the Content Itself? ❖ Series of Metadata Filters and Enrichment ▪ The varying names used in the content of the publication ▪ Gene names – 19 or more synonyms per name ▪ Medicinal Plant names – nearly 17 synonyms per name ▪ Bad Cell Line references ▪ Suspect Science topics / Fake news ❖ Semantic enrichment supports metadata and search ❖ Time savings for researchers both authors and readers ❖ It allows the disambiguated information in the formation of a platform for better science ❖ Being able to reference a widely available authoritative source is crucial to all world health
  23. 23. Atypon Production How is it done 10/11/2022 Provisional Acceptance Article Submission Revision Review – Link to Portal Web based Deputy Editor Key Term Review Portal Review, add, delete, submit Key Term update in article XML New Taxonomy terms New Taxonomy updated SKOS file Accept Article XML Concept Taxonomy MPNS Name verification Taxogene Human Genome Tagging Suspect Science Filter Bad Cell Lines Identification SciGen Identification After Todd Ware of ACP ICD_10 CPT HCPCS Coding
  24. 24. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. TaxoGene ❖ Automatically find all synonyms and insert the consensus approved name. ❖ Special characters and extensions ❖ Directing all readers to the preferred name in either search or publication allows full retrieval recall of related material insures precision in search remove ambiguity in communication 10/11/2022 • Synonymy: Average of 19 synonyms per gene name • Sources: • Human Genome Project • https://www.ncbi.nlm.nih.gov/genome/guide/ human/
  25. 25. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Medicinal Plant Names Service (MPNS) ❖ How many kinds of “Ginger” are there?? ▪ At least 42 ❖ Better communication between researchers worldwide – no misidentification ❖ Link to full plant name record at MPNS ❖ Includes all known scientific names, common names, homonyms, and more ❖ Global coverage – not just regional which is important an integrated world ❖ Constantly updated and linked to the ▪ Kew International Plants Names Database. ▪ International Plant Names Index (IPNI) 10/11/2022 • Source Data: The Royal Botanical Gardens at Kew • Synonymy: Average of over 16 names per plant are used. • Includes all known scientific names, common names, homonyms, and more • www.kew.org/mpns
  26. 26. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. ❖ ICLAC - There are about 437 cell lines, which are documented as such and we mine those to highlight misuse…. ❖ List of known contaminated cell lines (many of them invaded by HeLa cells) ❖ Don’t let your authors and researchers work with known bad data. ▪ Over 32,000 papers that have worked on the wrong cells ▪ Cited by at least 500,000 more articles, ▪ https://blogs.sciencemag.org/pipeline/archives/2017/10/20/bad- cells-so-many-bad-cells 10/11/2022 Sources: 488 from ICLAC https://iclac.org/databases/cross-contaminations/ 757 from Swiss Institute of Bioinformatics (SIB) https://en.wikipedia.org/wiki/Cellosaurus https://en.wikipedia.org/wiki/List_of_contaminated_c ell_lines Offering: A rule base to quickly verify that the cell lines used are valid and not a contaminated line Bad Cell Lines
  27. 27. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Suspect Science Filter ❖ List of topics which require a closer look by acquisitions editors before sending out to potential peer reviewers ❖ Identifies questionable articles ❖ Autism and vaccination ▪ Flag for assessment before sending to peer review ❖ Saves time in acquisitions review ❖ Auto Identify at time of submission using a rule base 10/11/2022 Source: PLOS in conjunction with Access Innovations
  28. 28. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Access Integrity Coding ❖ Medical Coding – automatically for articles and reports etc. ❖ ICD-10 The international Classification of Diseases. !78,000 codes to give full details to medical professionals on where that article or report falls within medical diagnosis and procedures. ❖ CPT from the American Medical Association for Classification of Procedures and Techniques ❖ HCPCS also from the AMA to find the illusive materials and supplies needed to support this item described. 10/11/2022
  29. 29. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Taxonomy Links in the PLOS Editorial Workflow
  30. 30. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Increasing Audio Video Access ❖ The new horizon is indexing audio video content to make it accessible ▪ Conference proceedings ▪ Demonstrations, interviews online ▪ Lab experiments ❖ All disappear without tagging of the content ❖ Metadata without subject metadata does not give you access to the content (What was that about?) ❖ Add taxonomy terms to the audio layer using transcription via auto tagging ▪ USPTO case study ❖ VATT™ – video to text and tagging from Data Harmony®
  31. 31. © 2010. Access Innovations, Inc. All Rights Reserved. © 2022. Access Innovations, Inc. All Rights Reserved. Fiscal Impacts From Semantically Enriching your Content ❖ MUST use at both input and search ❖ 34% improvement in search ▪ With just semantic enrichment ▪ Ying-Hsang Liu , DC 2016, Copenhagen, Denmark ❖ 75% higher book sales with more complete metadata ▪ NIELSEN BOOK US STUDY: THE IMPORTANCE OF METADATA FOR DISCOVERABILITY AND SALES , ▪ David, Senior Director, Client Solutions, Nielsen Book’s Research and Commerce Solutions Published in the US December 31, 2016
  32. 32. Metadata is the Key!
  33. 33. © 2022. Access Innovations, Inc. All rights reserved. Access Innovations, Inc. Marjorie M.K. Hlava mhlava@accessinn.com Jay Ven Eman j_ven_eman@accessinn.com www.accessinn.com www.dataharmony.com +1.505.998.0800 Albuquerque, NM Leveraging Your Content Semantically Where’s the on about… Looney Tunes® Revisited October 10, 2022 Thank you!

×