The emerging biodiversity data ecosystem

1. The emerging biodiversity data ecosystem. Cynthia Parr, Katja Schulz, Jennifer Hammock (Smithsonian Institution); Nathan Wilson, Patrick Leary (Marine Biological Laboratory); Richard Allen (Environmental Protection Agency)

2. Today’s story: what is EOL; core questions; network analysis; hot list development; page richness algorithm. Conclusion: improving the health and richness of our knowledge network advances understanding.

4. All species

5. Freely accessible & reusable: open access, open source

6. Available from a single portal in a common format

7. Quality

9. EOL is a content curation community. Content providers: databases, journals, LifeDesks, public contributions. Curating, aggregation, commenting, tagging. http://www.eol.org

10. Core questions: Where is our knowledge about biodiversity? Where are the gaps? What are the most effective ways to fill gaps given our limited resources?

11. Network analysis, with Anne Bowser (University of Maryland). EOL, GBIF, NCBI. EOL connects hubs.

12. The GBIF hub has subnetworks

13. Key individuals seek out hubs. TOLWeb

14. Implications and next steps: need more data; identify isolated projects and mechanisms for connecting them to the network; improve resilience and redundancy; distribute annotation and quality control; model data flow quantity and impact.

15. Viewer of Life on EOL – Kris Urie

16. Low % of descendants with text in Arthropods

17. Within arthropods, coverage varies ... perhaps as expected. http://synthesis.eol.org/media/treemap/

18. Developing the EOL hot list: consultation with taxonomic experts; development of criteria; assembly of critical lists; establishing targets for rich taxon pages and lesser-known pages.

19. EOL’s hot lists. Hot List (70,000 taxa): conservation concern, invasives, model organisms, ecologically important, pests, charismatics, data availability. Red Hot List (2,800 taxa): most searched, top 100 invasives, crops (food), zoos & aquaria, high traffic, higher taxa.

20. Taxon page richness algorithm. Richness = a (Breadth) + b (Depth) + c (Diversity), with weights of 60%, 30%, and 10%. Breadth: images, topics of text objects, references, maps, videos, sounds, conservation status. Depth: number of words per text object, number of words total. Diversity: sources (partners). Scores range from 0 to 1; threshold: 0.4.
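
A minimal sketch of the weighted score on slide 20, assuming the 60/30/10 weights apply to breadth, depth, and diversity in that order and that each component has already been normalized to the 0 to 1 range. The slide does not say how components are normalized or whether the threshold is inclusive, so `PageComponents`, `richness`, `is_rich`, and the `>=` comparison are illustrative assumptions, not EOL's implementation.

```python
from dataclasses import dataclass

# Weights from the slide, ASSUMED to map to breadth, depth, diversity in that order.
WEIGHTS = {"breadth": 0.6, "depth": 0.3, "diversity": 0.1}
RICH_THRESHOLD = 0.4  # threshold stated on the slide


@dataclass
class PageComponents:
    """Component scores for one taxon page, each ASSUMED pre-normalized to [0, 1]."""
    breadth: float
    depth: float
    diversity: float


def richness(page: PageComponents) -> float:
    """Weighted sum a*breadth + b*depth + c*diversity; stays in [0, 1]."""
    for name in WEIGHTS:
        value = getattr(page, name)
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} must be in [0, 1], got {value}")
    return sum(WEIGHTS[name] * getattr(page, name) for name in WEIGHTS)


def is_rich(page: PageComponents) -> bool:
    """ASSUMPTION: a page counts as rich when its score is at or above the threshold."""
    return richness(page) >= RICH_THRESHOLD
```

With hypothetical inputs, `PageComponents(breadth=0.5, depth=0.4, diversity=0.2)` scores about 0.44 and would count as rich, while a page scoring 0.2 on every component would not.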

21. Summary of EOL page richness. Overall: 640,000 have content; 2% are rich; 25% have only links to literature. Hot List: 28% of 75K are rich; average richness = 0.30. Red Hot List: 56% of 3K are rich; average richness = 0.43.

22. Strategies for improving richness: crowd-sourcing; leveraging Collections, Communities, mobile apps; enabling platforms; enabling journals; data mining BHL etc. Version 2 coming in Fall 2011!

23. The page richness index: helps fill gaps with existing knowledge; helps prioritize funding and training so that it has maximum impact on closing true gaps; will be available via API. Computing and storing the richness index on EOL is a step towards storing and serving computable data.

24. Dynamic data summaries = new knowledge. Summarize data within a partner, then across partners. For example: compute an average value for one taxon (x specimens), then compare it to the range of values across all taxa (621,393 samples). Atlantic Cod, Gadus morhua. Jen Hammock (EOL), Edward van den Berge (OBIS)
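
One way to read the example on slide 24 in code: average a measurement over one taxon's records, then place that average within the minimum-to-maximum range across all records. The record layout, the `depth_m` field, the `summarize` function, and the sample values are hypothetical; the slide does not say which variable was summarized.

```python
from statistics import mean


def summarize(records, taxon, field):
    """Return the taxon's mean for `field` and the (min, max) range over all records."""
    all_values = [r[field] for r in records]
    taxon_values = [r[field] for r in records if r["taxon"] == taxon]
    if not taxon_values:
        raise ValueError(f"no records for {taxon}")
    return {
        "taxon": taxon,
        "n_taxon": len(taxon_values),      # the "x specimens" for this taxon
        "taxon_mean": mean(taxon_values),
        "n_all": len(all_values),          # sample count across all taxa
        "all_range": (min(all_values), max(all_values)),
    }


# Made-up records for illustration only; not real partner data.
records = [
    {"taxon": "Gadus morhua", "depth_m": 100.0},
    {"taxon": "Gadus morhua", "depth_m": 200.0},
    {"taxon": "Other taxon A", "depth_m": 10.0},
    {"taxon": "Other taxon B", "depth_m": 1500.0},
]
summary = summarize(records, "Gadus morhua", "depth_m")
```

The same function could be applied first to a single partner's records and then to the pooled records of several partners, which mirrors the "within a partner, then across partners" order on the slide.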

26. Resilience

27. Richness assessment. Large-scale data summaries can foster gap-filling and standing, dynamic knowledge analyses.

28. Thank you. http://www.eol.org 160+ content partners; 2000 Flickr contributors; 1000s of Wikipedia contributors; 43,000 EOL members. Funding: John D. and Catherine T. MacArthur Foundation, Alfred P. Sloan Foundation, Cornerstone Institutions, private donors. See the demo and Version 2 sneak peek in the Software Bazaar. Leadership: Erick Mata, Bob Corrigan, Mark Westneat, Marie Studer, Tom Garnett, Jim Edwards, David Patterson. Developers: Peter Mangiafico, Jeremy Rice, Dimitri Mozzherin, David Shorthouse, Lisa Whalley, and others. Biologists: Tanya Dewey, Audrey Aronowsky, Leo Shapiro.
