FactPub: the open-access web platform for academic paper contents
Genome Institute of Singapore
22 Sep 2016
Sun Sagong, Xiaocheng Huang, Lucas Tan and Pauline Ng
DISCLAIMER: Nothing in this paper shall be construed as legal advice. The authors are not lawyers. Readers should
consult qualified legal counsel if they require an opinion on legal matters.
Background
- 72% of internet users look online for health information
- 26% of people seeking health information online have hit a paywall*
* Source: Pew Internet & American Life Project, Health Online 2013
Existing Solutions
- Advantages
  - Full text
- Disadvantages
  - Piracy!
  - Unknown how long content will be available
  - Temporary URLs
  - Popularity attracts lawsuits
    - In 2013, Academia.edu had to take down papers at Elsevier’s request, even though they had been uploaded by the authors.
Our Solution
• FactPub: http://factpub.org
• Points
- A permanent repository to provide scientific facts to the public
- A donor distributes facts, not academic papers
- The public can search for facts and find them
Step 1: Choose Academic PDF
- The user chooses a scientific paper with either the Desktop Application or the Browser Extension
- Desktop Application: https://github.com/happybelly/factify_GUI
- Browser Extension: https://github.com/sunsagong/factify_chrome_extension_nativeapp
- The user either browses to the PDF or drags & drops it
Step 2: Extraction Process
- Factify.jar : Fact Extraction Module (Java) - Runs on fact donor’s local machine
https://github.com/happybelly/factify
Scientific Paper -> Facts
1. The PDF is converted to structured text
2. A rule-based fact extraction algorithm is applied
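The two stages above can be sketched roughly as follows. This is a minimal illustration and not Factify's actual rule set: the `split_sentences` and `extract_facts` helpers, the cue-word list, and the sample text are all assumptions made for this example. The PDF-to-text stage is skipped by starting from already-extracted section text.

```python
import re

# Hypothetical cue words; Factify's real rules are not described in the slides.
CUE_WORDS = ("increase", "decrease", "associated", "significant", "cause", "reduce")


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' when followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def extract_facts(sections):
    """Keep sentences that contain a cue word or a digit, tagged with their section.

    `sections` maps a section heading to its plain text (the output of stage 1).
    """
    facts = []
    for heading, text in sections.items():
        for sentence in split_sentences(text):
            lowered = sentence.lower()
            has_cue = any(cue in lowered for cue in CUE_WORDS)
            has_number = re.search(r"\d", sentence) is not None
            if has_cue or has_number:
                facts.append({"section": heading, "fact": sentence})
    return facts


if __name__ == "__main__":
    paper = {
        "Results": "Treatment reduced symptoms in 40% of patients. We thank our colleagues.",
    }
    for fact in extract_facts(paper):
        print(fact["section"], "|", fact["fact"])
```

Keeping the section heading with each sentence is one simple way to preserve the paper's structure alongside the extracted facts.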
Step 3: Facts Sent & Publicized
- The facts file is sent to the server, and the user can see the URL of the generated page
- Server-side processing produces a wiki page from the facts file
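As a rough illustration of the server-side step, the sketch below turns a facts file into wiki-style markup. The JSON layout (`title`, `facts`, `section`, `fact`), the `render_wiki_page` helper, and the MediaWiki-like heading syntax are assumptions for this example; the slides do not specify the actual file format or wiki engine.

```python
import json


def render_wiki_page(facts_json):
    """Render a facts file (JSON text) as wiki-style markup, grouped by section."""
    data = json.loads(facts_json)
    lines = ["= " + data["title"] + " ="]
    current_section = None
    for item in data["facts"]:
        # Start a new heading whenever the section changes, preserving input order.
        if item["section"] != current_section:
            current_section = item["section"]
            lines.append("")
            lines.append("== " + current_section + " ==")
        lines.append("* " + item["fact"])
    return "\n".join(lines)


if __name__ == "__main__":
    example = json.dumps({
        "title": "Example paper",
        "facts": [
            {"section": "Results", "fact": "Treatment reduced symptoms in 40% of patients."},
        ],
    })
    print(render_wiki_page(example))
```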
An Example
- Search box
- Inferred title / DOI provide paper details
- Full abstract is provided
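One way a DOI might be located in extracted text is with a regular expression. The sketch below is an assumption about how such a lookup could work, not FactPub's documented method. The `find_doi` helper is hypothetical, and the pattern is a commonly used approximation that matches most modern DOIs but not every valid one.

```python
import re

# Approximate pattern for modern DOIs: "10.", a 4-9 digit registrant code, "/", then a suffix.
DOI_PATTERN = re.compile(r"\b10\.\d{4,9}/[-._;()/:A-Za-z0-9]+")


def find_doi(text):
    """Return the first DOI-like string in `text`, or None if nothing matches."""
    match = DOI_PATTERN.search(text)
    if match is None:
        return None
    # Trailing punctuation usually belongs to the sentence, not the DOI.
    return match.group(0).rstrip(".,;)")
```

For example, `find_doi("See doi:10.1234/example.5678.")` yields the made-up identifier `10.1234/example.5678` with the sentence-final period removed.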
An Example (continued)
- Acronyms & Tables
- Content structure is preserved
- Each sentence is converted to a fact
References
1. Pew Internet & American Life Project: Health Online 2013.
2. Ng, P.: Breaking Down Paywalls for Online Health. Data By The Bay 2016 / ODSC East 2016.
3. Klampfl, S. et al.: Unsupervised document structure analysis of digital scientific articles. International Journal on Digital Libraries 14(3-4): 83-99 (2014).
4. Huang, X. and Ng, P.: Enabling Public Access to Non-Open Access Biomedical Literature via Idea-Expression Dichotomy and Fact Extraction. AAAI Workshop on Scholarly Big Data, 2016.

InCoB2016 FactPub: the open-access web platform for academic paper contents
