Community content building for evolutionary biology: Lessons learned from LepTree and Encyclopedia of Life

•Télécharger en tant que PPTX, PDF•

2 j'aime•1,165 vues

Cyndy Parr

Presented at iEvoBio: Informatics for Phylogeny, Evolution, and Biodiversity in Portland, OR 29 June 2010

Technologie

Community content building for evolutionary biology Lessons learned from LepTree and Encyclopedia of Life Cynthia Parr Smithsonian Institution University of Maryland

Today’s story LepTree and Encyclopedia of Life built a couple of websites LepTree: slow for social content-building but highly useful content EOL: quick for content aggregation, but now need to atomize and semanticize Conclusion: Best of both worlds

Community features Blog Commenting Forum Working Groups

LepTree built semantic tools, then invited data entry Export

Freely accessible: open access, open source

Available from a single portal in a common format

Always growing as new species are discovered and new knowledge is generated,[object Object]

http://www.eol.org/content_partner Objects can come from many partners Objects are sorted by topic Each partner gets credit

EOL aggregates, then annotates Catalogue of Life IUCN Content providers Databases LifeDesks Public contribution Curating Commenting Tagging GBIF Biodiversity Heritage Library http://www.eol.org/content_partner

LepTree’s data approach is more complex and customized LepTree ,[object Object]

Big S semantics (OWL, RDF triple store). Tied to people and project ontologies

Custom data entry: required new workflowEOL ,[object Object]

Variety of data paths: avoid changes in workflow,[object Object]

1750 pages (107 rich pages + ~450 fossils) + ~1600 images)

Contenu connexe

Similaire à Community content building for evolutionary biology: Lessons learned from LepTree and Encyclopedia of Life

Introduction to EOL.org for scientistsCyndy Parr

The OER in COERLL: Defining Open EducationCenter for Open Educational Resources and Language Learning

5 steps to using open access in the classroom 11 9 2011 Elizabeth Brown

Introduction to Open Educational Resources (OER)Michael Paskevicius

Introduction to EOL v2 for Crossroads Cyndy Parr

Introducing Encyclopedia of Life version 2Cyndy Parr

Using Online Natural History Databases to Support Innovation in Undergraduate...Encyclopedia of Life Learning + Education

One Scientist’s Wish List for Scientific PublishersPhilip Bourne

Open Educational Resources (OER) for Enhancing Teaching and LearningZakir Hossain/ICS, Zurich

UCT Opencontent 1 Year AnniversaryMichael Paskevicius

Open science, open-source, and open data: Collaboration as an emergent property?Hilmar Lapp

Competitive & Saleable E-Content for Philippine LibrariesPhilippine Association of Academic/Research Librarians

Using OA ContentPhilip Bourne

If They Build It They Will ComeKeith Kirkwood

CRE Resource Creation and DiscoveryBill Warters

The OERs: Transforming Education for Sustainable Future by Dr. Sarita AnandDr. Sarita Anand

The repository ecology: an approach to understanding repository and service i...R. John Robertson

NORFest 2023 Lightning Talks Session Onedri_ireland

Libraries,librarians,social mediaAnne Peoples

OER: JTCCAchieving the Dream

Similaire à Community content building for evolutionary biology: Lessons learned from LepTree and Encyclopedia of Life (20)

Introduction to EOL.org for scientists

The OER in COERLL: Defining Open Education

5 steps to using open access in the classroom 11 9 2011

Introduction to Open Educational Resources (OER)

Introduction to EOL v2 for Crossroads

Introducing Encyclopedia of Life version 2

Using Online Natural History Databases to Support Innovation in Undergraduate...

One Scientist’s Wish List for Scientific Publishers

Open Educational Resources (OER) for Enhancing Teaching and Learning

UCT Opencontent 1 Year Anniversary

Open science, open-source, and open data: Collaboration as an emergent property?

Competitive & Saleable E-Content for Philippine Libraries

Using OA Content

If They Build It They Will Come

CRE Resource Creation and Discovery

The OERs: Transforming Education for Sustainable Future by Dr. Sarita Anand

The repository ecology: an approach to understanding repository and service i...

NORFest 2023 Lightning Talks Session One

Libraries,librarians,social media

OER: JTCC

Plus de Cyndy Parr

Open data and the ag data commonsCyndy Parr

Ag Data Commons for AgBioDataCyndy Parr

Biodiversity informatics and the agricultural data landscapeCyndy Parr

Public access to research results at USDACyndy Parr

Ag Data Commons: Agricultural research metadata and dataCyndy Parr

Ag Data Commons: A new USDA catalog and repository for agricultural research ...Cyndy Parr

Preparing for data-intensive science across domains.Cyndy Parr

Parr ag datacommonsnal_brownbagCyndy Parr

Ag Data Commons: Adding Value to open agricultural research dataCyndy Parr

Big Data Initiatives for AgroecosystemsCyndy Parr

TDWG 2014 opening talk: Chair's WelcomeCyndy Parr

Behavior ontology workshop princetonCyndy Parr

iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK Cyndy Parr

Frontiers of discovery with Encyclopedia of LifeCyndy Parr

Practical interoperability across semantic stores of data for ecological, tax...Cyndy Parr

Using and extending Darwin Core for structured attribute dataCyndy Parr

How the Encyclopedia of Life is wrangling organismal attribute dataCyndy Parr

The Road to TraitBank: What's Next for the Encyclopedia of LifeCyndy Parr

Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...Cyndy Parr

Encyclopedia of Life: Use cases for phenotypesCyndy Parr

Plus de Cyndy Parr (20)

Open data and the ag data commons

Ag Data Commons for AgBioData

Biodiversity informatics and the agricultural data landscape

Public access to research results at USDA

Ag Data Commons: Agricultural research metadata and data

Ag Data Commons: A new USDA catalog and repository for agricultural research ...

Preparing for data-intensive science across domains.

Parr ag datacommonsnal_brownbag

Ag Data Commons: Adding Value to open agricultural research data

Big Data Initiatives for Agroecosystems

TDWG 2014 opening talk: Chair's Welcome

Behavior ontology workshop princeton

iEvoBio Keynote: Frontiers of discovery with Encyclopedia of Life -- TRAITBANK

Frontiers of discovery with Encyclopedia of Life

Practical interoperability across semantic stores of data for ecological, tax...

Using and extending Darwin Core for structured attribute data

How the Encyclopedia of Life is wrangling organismal attribute data

The Road to TraitBank: What's Next for the Encyclopedia of Life

Encyclopedia of Life: Applying Concepts from Amazon and LEGO to Biodiversity ...

Encyclopedia of Life: Use cases for phenotypes

Dernier

Artificial intelligence in cctv survelliance.pptxhariprasad279825

Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson

Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays

Advanced Computer Architecture – An IntroductionDilum Bandara

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada

Gen AI in Business - Global Trends Report 2024.pdfAddepto

Anypoint Exchange: It’s Not Just a Repo!Manik S Magar

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada

TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc

Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar

Search Engine Optimization SEO PDF for 2024.pdfRankYa

Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm

What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett

Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely

Powerpoint exploring the locations used in television show Time Clashcharlottematthew16

Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos

DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy

Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation

Dernier (20)

Artificial intelligence in cctv survelliance.pptx

Are Multi-Cloud and Serverless Good or Bad?

Scanning the Internet for External Cloud Exposures via SSL Certs

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack

Advanced Computer Architecture – An Introduction

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

Gen AI in Business - Global Trends Report 2024.pdf

Anypoint Exchange: It’s Not Just a Repo!

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

TrustArc Webinar - How to Build Consumer Trust Through Data Privacy

Unleash Your Potential - Namagunga Girls Coding Club

Search Engine Optimization SEO PDF for 2024.pdf

Streamlining Python Development: A Guide to a Modern Project Setup

What's New in Teams Calling, Meetings and Devices March 2024

Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf

Powerpoint exploring the locations used in television show Time Clash

Developer Data Modeling Mistakes: From Postgres to NoSQL

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)

DevoxxFR 2024 Reproducible Builds with Apache Maven

Connect Wave/ connectwave Pitch Deck Presentation

Community content building for evolutionary biology: Lessons learned from LepTree and Encyclopedia of Life

1. Community content building for evolutionary biology Lessons learned from LepTree and Encyclopedia of Life Cynthia Parr Smithsonian Institution University of Maryland

2. Today’s story LepTree and Encyclopedia of Life built a couple of websites LepTree: slow for social content-building but highly useful content EOL: quick for content aggregation, but now need to atomize and semanticize Conclusion: Best of both worlds

3. LepTreehttp://leptree.net

4. Community features Blog Commenting Forum Working Groups

5. Complex LepTreetaxontemplate

6. LepTree built semantic tools, then invited data entry Export

8. Freely accessible: open access, open source

9. Available from a single portal in a common format

10. Quality

11.

12. http://www.eol.org/content_partner Objects can come from many partners Objects are sorted by topic Each partner gets credit

13. EOL aggregates, then annotates Catalogue of Life IUCN Content providers Databases LifeDesks Public contribution Curating Commenting Tagging GBIF Biodiversity Heritage Library http://www.eol.org/content_partner

14.

15. Big S semantics (OWL, RDF triple store). Tied to people and project ontologies

16.

17. XML schema

18.

19. 1750 pages (107 rich pages + ~450 fossils) + ~1600 images)

20.

21. 2.4 million pages390 thousand pages with objects

22.

23. Community areas of LepTree are flat

24. EOL’s content trajectory is promising Species pages with a vetted object Year

25.

26. Communities are hard!

27.

28. Phylogenies

29.

30. EOL Cornerstone Institutions Sample Content Partners AmphibiaWeb Animal Diversity Web AntWeb Catalogue of Life FishBase Global Biodiversity Information Facility (GBIF) International Union for the Conservation of Nature Tree of Life Web Project The Biodiversity Heritage Library The Field Museum of Natural History The Missouri Botanical Garden The Marine Biological Laboratory Harvard University The Smithsonian Institution

Notes de l'éditeur

I’m going to do a compare and contrast talk, so I have two projects to introduce you to. I apologize in advance if I go a bit quickly. Please feel free to catch me anytime in the next two days to get a demonstration of either of these projects
Conclusion is that these are complementary approaches – can pursue in parallel. Focus on community driven databases that can be customized for the needs of the users of the data – result in highly atomized specialist data. Then alllow that information to be aggregated on EOL where it might find broader reuse and reinterpretation.
LepTree is an Assembling the tree of life project whose major goal is to use nuclear genetic sequence to resolve deep nodes at the family and superfamily level in the Lepidoptera. This tree on the left shows our initial published findings which are not the point of this talk. I’ll just note that our analysis suggests that macrolepidoptera, shown by these orange bars, the very large moths and all butterflies, are clearly not a monophyletic group.The subject of today’s talk is the website tools we’ve created at leptree.net that include some features such as an interactive matrix visualizion of the sequencing status for the project of the where columns are each of the genes being sequenced and the rows show the hundreds of samples being used by the project, colors show our progress for each gene.We also have a fossil project and a morphology project that also have representation on our pages.
The leptree website is built on a core of the open source drupal platform, and includes a number of the out-of-the-box community features, blog, discussion forum, commenting, the ability to create private working areas.In addition we have added new modules to allow community members to add information about their own projects, to post protocols that they are using so that they can link to them and other people can use the same protocols. Finally, we have a references module that lists about 800 articles on lepidopteransystematics. Rather than using the relational database that is the backend of drupal, these are actually storing data semantically – as RDF triples linked to rich ontologies.
And finally, we also set up a custom module that presents a user with a complex temlpate for describing taxa. The checkboxes and data fields are the result of months of consultation with lepidopterists and are intended to cover the kinds of morphological and ecological variation across the group. Like the projects, protocols, and references modules, the data are stored in a sesame triple store repository. We can use this semantic representation to link our knowledge to that generated by other projects and use machine reasoning to come up with new results. This is the kind of data that would be appropriate to “decorate” a phylogenetic tree to look for patterns.The goal is to produce about 150 of these taxon pages but we designed the system to be expandable.
So to summarize, LepTree built some semantics-enabled tools, combine this with data and links from a couple of other projects to create the taxonomic information pages you can see on LepTree.net under “Knowledge project”In addition, the taxon information is now being exported as text objects and also appears on the Encyclopedia of Life taxon pages.
Objects such as these are essentially chunks of text sorted by topic.Each of these credits the source, and can receive comments or ratings, or can be trusted or untrusted by curators.
So, the approach of EOL is rather different. EOL is a giant mashup that creates pages, that are then available for curators to assess and rate, or for anybody to provide comments or tags.LepTree has foccuseed on data entry tools while EOL has not – though I should note that we have also developed a Drupal-based system called LifeDesks, which are one of the many ways that data flows to the central EOL.
On LepTree, burden on users to learn a new systemOn EOL, burden on programming staff, not on users
The effort we went to in Leptree to add semantics to the tools likely just slowed us down, and distracted us from the effort of developing a community effort. But once we had tools with lots of checkboxes we have been able to accumulate a lot of potentially useful atomized data.By divide and conquer I mean that it should be possible to continue to promote community databases – these can be tailored to the specific needs of a scientific community and its audiences, with data as structured as possible. And then The data from these projects can be aggregated, essentially cross-indexed, so that they are accesssible from a common portal, EOL. If EOL had tried to structure or semanticize from the beginning we never would have achieved the growth we have.
Build contentExpose triplesShare data

Community content building for evolutionary biology: Lessons learned from LepTree and Encyclopedia of Life

Recommandé

Recommandé

Contenu connexe

Similaire à Community content building for evolutionary biology: Lessons learned from LepTree and Encyclopedia of Life

Similaire à Community content building for evolutionary biology: Lessons learned from LepTree and Encyclopedia of Life (20)

Plus de Cyndy Parr

Plus de Cyndy Parr (20)

Dernier

Dernier (20)

Community content building for evolutionary biology: Lessons learned from LepTree and Encyclopedia of Life

Notes de l'éditeur