SlideShare une entreprise Scribd logo
1  sur  38
1
competitive. intelligence.
Text
Mining
OSFair 2017
Manuel Noya
Sept 7, 2017. Athens
Who I am
Researcher turned entrepreneur:
BSc Chemical Engineering (USC, Spain) and MSc Materials Science (UPM, Spain).
Know-how: technology scouting, competitive intelligence, NPD (new product
development), early-stage startups and innovation strategy.
Researcher for NEOKER (Spain) and SRI International (CA): an awarded spin-off from
USC (Spain), now scaling up in China. R&D projects in materials science for over 3
years at SRI International (Menlo Park, CA).
Cofounded Linknovate.com in May 2012 in Palo Alto (CA) and went through Stanford
University accelerator program (StartX) in 2013.
The company provides clients such as BMW AG and REPSOL with competitive
intelligence software tools.
CEO since 2014, now a 7 people company growing internationally.
Our credentials
4 4
We founded Linknovate en 2012 in Stanford University, CA.
We have been awarded 3 EU projects (grants) by the EC and have gone through 2 of the most prestigious startup
accelerators in the world.
Some of our satisfied clients:
Practise today
Problem today
Over 560 organizations
826 identified
504 contacted
158 positive
answers
66 detailed
info
Experts funnel
“Biometric Sensors”
826 experts identified in over 460 organizations worldwide
12
Aspire Food Group
13
2. Fresh User-Generated data: “We focus on bringing modern agricultural techniques to the farming of
insects. Currently our focus is primarily on the production of insects as food and as an ingredient.”
“We have been focused on producing insects for food for over three years. With regard to TRL, we have products on
the market, so arguably TRL8-9.”
3.Willingness: “Joint tech development / R&D collaboration; Partnerships - e.g. grant applications like NSF, DOE or
EU projects; Consider an investment (equity or other). We re open to discussing opportunities.”
4. Companies/ Groups recommended: “(…) the only North American company that we have seen any
credible results from is Tiny Farms, though there are indications that at least one of the established CPG companies
has begin making significant investments in R&D. Looking further afield, there are a number of companies in Europe
(particularly France and the Netherlands) that seem sophisticated.”
Email:
gm@aspirefg.com
1. Technology Maturity (2 aspects)
Insects as Protein Source: TRL 8 or higher. Already in the market
Industrial Processing of Insects: TRL 7. Less than 1,5y to be in the market
User Case: “Insects as an Alternative Source of Proteins”
User Generated Data (detailed, fresh, assesing willingness). Expert profile
Otros
1-23
Camera-based
TRL
Investigación aplicada
Prueba de concepto validada
Wearables
1-2 3 4
4 5-65-6 7 7
User Case: “biometric sensors”
User Generated Data (classified in categories). Companies’ visualization
Universidades
Relevancia
2 3 4 5
14
Centros de
Investigación
Compañías
Prot. en entorno de lab.
Prot. en entorno de altura.
Mayor
powered by
Market - Clients
16
Text Mining [unstructured text]. Insights
Ranking of Active Entities Worldwide
17
Ranking of Active Entities Worldwide (II)
18
Working in systems for reduction of irrelevant information during searches.
Works in the text-mining based bioassay neighboring analysis as a standalone or as a complementary
tool for the PubChem bioassay neighboring process to enable efficient integration of assay results
and generate hypotheses for the discovery of bioactivities of the tested reagents.
Ranking of Active Entities Worldwide (III)
19
Study investigating the usefulness of natural language processing (NLP) as an adjunct to dictionary-
based concept normalization. Methods used: two biomedical concept normalization systems,
MetaMap and Peregrine, with and without the use of a rule-based NLP module.
Interested in automated extraction of useful biomedical information from unstructured text.
Especially in the importance of named entity recognition and relationship extraction as fundamental
approaches that are relevant to systems biology.
Insights from Linknovate.com
20
Academic sources dominate over
‘industrial signals’.
We may be still waiting for the ‘great’
stuff to land?
Algorithmia over
‘Document Classification’,
‘Ontologies’, ‘Feature
Analysis’, ‘Rule Extraction’,
etc
Insights from Linknovate.com (II)
21
High academic activity,
primarily with universities as
the most involved
organisations.
Highlight: small-medium size
companies are the most active
in 2017…
Global Data Comparison
Comparing search engines of complementary results
Results show similar trends in all 3 engines and data sources.
22
Sources
• Linknovate (publications, conf proceedings, grants, patent apps, news, web monitoring)
• PubMed (publications, conf proceedings)
• Google Patents (patents)
Text Mining [unstructured text]. Academic Key-players
Stanford University – U.S.A.
https://www.stanford.edu/
24
Text Mining for Adverse Drug Events: the Promise,
Challenges, and State of the Art
A Framework for the Automatic Extraction of Rules
from Online Text
Learning the Structure of Biomedical Relationships
from Unstructured Text
References in this topic
This article provides an overview of recent advances in
pharmacovigilance driven by the application of text mining.
This paper presents a general-purpose framework for acquiring more
complex relationships from text and then encoding this knowledge
as rules.
Here we describe a novel algorithm, Ensemble Biclustering for
Classification (EBC), that learns the structure of biomedical
relationships automatically from text, overcoming differences in
word choice and sentence structure.
Stanford University Network
25
Elon University – U.S.A.
https://www.elon.edu/home/
26
CRI: RUI: CI-EN: Infrastructure to Enable Mining and
Analysis of Open Source Software Engineering
Artifacts
Awarded with a NSF Grant in 2014
This NSF CRI supported Research at Undergraduate Institutions (RUI)
project will integrate, expand and enhance several distinct data
sources currently used by three research communities: those who
study Free, Libre, and Open Source Software (FLOSS), the larger
empirical software engineering research community, and researchers
engaged in data mining and text mining.
Old Dominion University – U.S.A.
https://www.odu.edu/
27
Collaboration with James Madison University in a publication
related to the conpetitive analysis that three companies make
around the content that their customers generate in social
networks
This paper describes an in-depth case study which applies text mining
to analyze unstructured text content on Facebook and Twitter sites of
the three largest pizza chains: Pizza Hut, Domino's Pizza and Papa
John's Pizza. The results reveal the value of social media competitive
analysis and the power of text mining as an effective technique to
extract business value from the vast amount of available social media
data.
Social media competitive analysis and text mining: A case
study in the pizza industry
University of Illinois at Urbana - Champaign – U.S.A.
http://illinois.edu/
28
In this paper we introduce a text cube architecture designed to
organize social media data in multiple dimensions and
hierarchies for efficient information query and visualization from
multiple perspectives.
SocialCube: A Text Cube Framework for Analyzing Social
Media Data
Collaboration with Cornell University and the company
Intelligent Automation, Inc. in a publication related to the
study of social and cultural behaviors through the contents
generated by users of social networks
Text Mining [unstructured text]. Industry Key-players
IBM – U.S.A.
https://www.ibm.com/us-en/
30
Towards comprehensive longitudinal healthcare data
capture
Collaboration with Wright State University in a publication
about the use of text mining in unstructured clinical texts.
In this work therefore, we explore a pattern-based approach for
extracting Smoker Semantic Types (SST) from unstructured clinical
notes.
IBM Network
31
Linguamatics – U.K.
https://www.linguamatics.com/
32
Linguamatics_
deploying innovative NLP text mining software
-> high-value knowledge discovery
& decision support.
KBSI (Knowledge Based Systems, Inc.) – U.S.A.
https://www.kbsi.com/
33
SOME PARTNERS
Amenity Analytics – U.S.A.
http://www.amenityanalytics.com/
34
$7.6M of raised funds in August 2017
INVESTORS
Amenity Analytics provides next-generation
Text-Mining AI Platform. A leading edge text
analytics platform that allows customers to
identify actionable signals from unstructured
data.
AYLIEN – Ireland
http://aylien.com/
35
$1.14M of raised funds
$580k in March 2016
Aylien is an artificial intelligence
startup that focuses on creating
technologies that help machines
understand humans better. The firm
provides text analysis and news
API's that allow users to make sense
of human-generated content at
scale. They also provide a range of
content analysis solutions to
developers, data scientists,
marketers and academics.
INVESTORS
Lexalytics – U.S.A.
https://www.lexalytics.com/
36
Lexalytics transforms global
conversations into meaningful and
actionable insights. Their leading
text analysis platforms process
billions of pieces of unstructured
data, translating thoughts and
feelings into profitable decisions for
their customers. Lexalytics helps
companies implement vital
feedback and monitoring programs
that create an ongoing dialogue
with their customers.
Bitext – U.S.A.
https://www.bitext.com/
37
$900k of raised funds in January 2015
Bitext develops multilingual analytics
technology in 30 languages. The company
takes an approach to text analysis, using
linguistic knowledge as a scientific base.
38
Text & Data Mining for
Competitive Intelligence
manuel@linknovate.com | skype: manu_noia
Gracias!

Contenu connexe

Tendances

OpenTox - an open community and framework supporting predictive toxicology an...
OpenTox - an open community and framework supporting predictive toxicology an...OpenTox - an open community and framework supporting predictive toxicology an...
OpenTox - an open community and framework supporting predictive toxicology an...
Barry Hardy
 

Tendances (20)

The European Open Science Cloud: just what is it?
The European Open Science Cloud: just what is it?The European Open Science Cloud: just what is it?
The European Open Science Cloud: just what is it?
 
Open Access: Open Access Looking for ways to increase the reach and impact of...
Open Access: Open Access Looking for ways to increase the reach and impact of...Open Access: Open Access Looking for ways to increase the reach and impact of...
Open Access: Open Access Looking for ways to increase the reach and impact of...
 
OpenAIRE: eInfrastructure for Open Science
OpenAIRE: eInfrastructure for Open ScienceOpenAIRE: eInfrastructure for Open Science
OpenAIRE: eInfrastructure for Open Science
 
OpenTox - an open community and framework supporting predictive toxicology an...
OpenTox - an open community and framework supporting predictive toxicology an...OpenTox - an open community and framework supporting predictive toxicology an...
OpenTox - an open community and framework supporting predictive toxicology an...
 
FAIR History and the Future
FAIR History and the FutureFAIR History and the Future
FAIR History and the Future
 
Linked Open Data_mlanet13
Linked Open Data_mlanet13Linked Open Data_mlanet13
Linked Open Data_mlanet13
 
OSFair2017 Workshop | Service provisioning for excellent sciences
OSFair2017 Workshop | Service provisioning for excellent sciencesOSFair2017 Workshop | Service provisioning for excellent sciences
OSFair2017 Workshop | Service provisioning for excellent sciences
 
How can we ensure research data is re-usable? The role of Publishers in Resea...
How can we ensure research data is re-usable? The role of Publishers in Resea...How can we ensure research data is re-usable? The role of Publishers in Resea...
How can we ensure research data is re-usable? The role of Publishers in Resea...
 
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and tools
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and toolsOpen Access Week 2017: Life Sciences and Open Sciences - worfkflows and tools
Open Access Week 2017: Life Sciences and Open Sciences - worfkflows and tools
 
OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011
OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011
OpenData Public Research, University of Toronto, Open Access Week, 25/11/2011
 
Implementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research DataImplementing Open Access: Effective Management of Your Research Data
Implementing Open Access: Effective Management of Your Research Data
 
The Future of Open Science
The Future of Open ScienceThe Future of Open Science
The Future of Open Science
 
Think Big about Data: Archaeology and the Big Data Challenge
Think Big about Data: Archaeology and the Big Data ChallengeThink Big about Data: Archaeology and the Big Data Challenge
Think Big about Data: Archaeology and the Big Data Challenge
 
The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...
The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...
The Needs of stakeholders in the RDM process - the role of LEARN. By Paul Ayr...
 
FAIR data and model management for systems biology.
FAIR data and model management for systems biology.FAIR data and model management for systems biology.
FAIR data and model management for systems biology.
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina Leonelli
 
November 10, 2015 NISO/ICSTI Joint Webinar: A Pathway from Open Access and Da...
November 10, 2015 NISO/ICSTI Joint Webinar: A Pathway from Open Access and Da...November 10, 2015 NISO/ICSTI Joint Webinar: A Pathway from Open Access and Da...
November 10, 2015 NISO/ICSTI Joint Webinar: A Pathway from Open Access and Da...
 
MIE2014: A Framework for Evaluating and Utilizing Medical Terminology Mappings
MIE2014: A Framework for Evaluating and Utilizing Medical Terminology Mappings MIE2014: A Framework for Evaluating and Utilizing Medical Terminology Mappings
MIE2014: A Framework for Evaluating and Utilizing Medical Terminology Mappings
 
The Developing Needs for e-infrastructures
The Developing Needs for e-infrastructuresThe Developing Needs for e-infrastructures
The Developing Needs for e-infrastructures
 
From Open Data to Open Science, by Geoffrey Boulton
 From Open Data to Open Science, by Geoffrey Boulton From Open Data to Open Science, by Geoffrey Boulton
From Open Data to Open Science, by Geoffrey Boulton
 

Similaire à OSFair2017 Workshop | Text mining

The Emerging Role of Data Scientists on Software Developmen.docx
The Emerging Role of Data Scientists  on Software Developmen.docxThe Emerging Role of Data Scientists  on Software Developmen.docx
The Emerging Role of Data Scientists on Software Developmen.docx
arnoldmeredith47041
 
The Emerging Role of Data Scientists on Software Developmen.docx
The Emerging Role of Data Scientists  on Software Developmen.docxThe Emerging Role of Data Scientists  on Software Developmen.docx
The Emerging Role of Data Scientists on Software Developmen.docx
todd701
 
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni MadridICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
Dr. Haxel Consult
 
resume for work in predictive analytics
resume for work in predictive analyticsresume for work in predictive analytics
resume for work in predictive analytics
butest
 
April 2014 building data science keynote at Boston Data Science Meetup - Crow...
April 2014 building data science keynote at Boston Data Science Meetup - Crow...April 2014 building data science keynote at Boston Data Science Meetup - Crow...
April 2014 building data science keynote at Boston Data Science Meetup - Crow...
Mona M. Vernon
 
TLNBusinessAnalytics_researchPoster_Final
TLNBusinessAnalytics_researchPoster_FinalTLNBusinessAnalytics_researchPoster_Final
TLNBusinessAnalytics_researchPoster_Final
Yi Qi
 

Similaire à OSFair2017 Workshop | Text mining (20)

F.S. Nucci - Search as an architectural component: searching for a new paradigm
F.S. Nucci - Search as an architectural component: searching for a new paradigmF.S. Nucci - Search as an architectural component: searching for a new paradigm
F.S. Nucci - Search as an architectural component: searching for a new paradigm
 
Insight into AstraZeneca's Technology Services.
Insight into AstraZeneca's Technology Services.Insight into AstraZeneca's Technology Services.
Insight into AstraZeneca's Technology Services.
 
Fake News Detection Using Machine Learning
Fake News Detection Using Machine LearningFake News Detection Using Machine Learning
Fake News Detection Using Machine Learning
 
WeSpline invdeck_oct2018
WeSpline invdeck_oct2018WeSpline invdeck_oct2018
WeSpline invdeck_oct2018
 
s00146-014-0549-4.pdf
s00146-014-0549-4.pdfs00146-014-0549-4.pdf
s00146-014-0549-4.pdf
 
The Emerging Role of Data Scientists on Software Developmen.docx
The Emerging Role of Data Scientists  on Software Developmen.docxThe Emerging Role of Data Scientists  on Software Developmen.docx
The Emerging Role of Data Scientists on Software Developmen.docx
 
The Emerging Role of Data Scientists on Software Developmen.docx
The Emerging Role of Data Scientists  on Software Developmen.docxThe Emerging Role of Data Scientists  on Software Developmen.docx
The Emerging Role of Data Scientists on Software Developmen.docx
 
We spline invdeck_may2018
We spline invdeck_may2018We spline invdeck_may2018
We spline invdeck_may2018
 
Rdaeu russia_fg_1_july2014_final
Rdaeu  russia_fg_1_july2014_finalRdaeu  russia_fg_1_july2014_final
Rdaeu russia_fg_1_july2014_final
 
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni MadridICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
ICIC 2013 Conference Proceedings Ricardo Eito Brun Uni Madrid
 
resume for work in predictive analytics
resume for work in predictive analyticsresume for work in predictive analytics
resume for work in predictive analytics
 
We spline invdeck_may2018
We spline invdeck_may2018We spline invdeck_may2018
We spline invdeck_may2018
 
Social Media and Text Analytics
Social Media and Text AnalyticsSocial Media and Text Analytics
Social Media and Text Analytics
 
Short CfP #DISC2016
Short CfP #DISC2016Short CfP #DISC2016
Short CfP #DISC2016
 
Final call for #DISC2016
Final call for #DISC2016Final call for #DISC2016
Final call for #DISC2016
 
April 2014 building data science keynote at Boston Data Science Meetup - Crow...
April 2014 building data science keynote at Boston Data Science Meetup - Crow...April 2014 building data science keynote at Boston Data Science Meetup - Crow...
April 2014 building data science keynote at Boston Data Science Meetup - Crow...
 
Some New Directions in the Economics of AI
Some New Directions in the Economics of AISome New Directions in the Economics of AI
Some New Directions in the Economics of AI
 
2011-12-02 Open PHACTS at STM Innovation
2011-12-02 Open PHACTS at STM Innovation2011-12-02 Open PHACTS at STM Innovation
2011-12-02 Open PHACTS at STM Innovation
 
TLNBusinessAnalytics_researchPoster_Final
TLNBusinessAnalytics_researchPoster_FinalTLNBusinessAnalytics_researchPoster_Final
TLNBusinessAnalytics_researchPoster_Final
 
Text mining and data mining
Text mining and data mining Text mining and data mining
Text mining and data mining
 

Plus de Open Science Fair

OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...
OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...
OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...
Open Science Fair
 
OSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social Sciences
OSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social SciencesOSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social Sciences
OSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social Sciences
Open Science Fair
 
OSFair2017 Worksop | Innovative dissemination practices & Altmetrics
OSFair2017 Worksop | Innovative dissemination practices & AltmetricsOSFair2017 Worksop | Innovative dissemination practices & Altmetrics
OSFair2017 Worksop | Innovative dissemination practices & Altmetrics
Open Science Fair
 
OSFair2017 | Barriers to Open Science for junior researchers
OSFair2017 | Barriers to Open Science for junior researchersOSFair2017 | Barriers to Open Science for junior researchers
OSFair2017 | Barriers to Open Science for junior researchers
Open Science Fair
 
OSFair2017 | The role of women in exploring, understanding and archiving the ...
OSFair2017 | The role of women in exploring, understanding and archiving the ...OSFair2017 | The role of women in exploring, understanding and archiving the ...
OSFair2017 | The role of women in exploring, understanding and archiving the ...
Open Science Fair
 

Plus de Open Science Fair (20)

OSFair2017 workshop | Monitoring open science trends in europe
OSFair2017 workshop | Monitoring open science trends in europeOSFair2017 workshop | Monitoring open science trends in europe
OSFair2017 workshop | Monitoring open science trends in europe
 
OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...
OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...
OSFair2017 Worksop | NUCLEUS project - Are you ready to perform in RRI ecosys...
 
OSFair2017 Workshop | Data Analytics meets Social Sciences: New Frontiers of ...
OSFair2017 Workshop | Data Analytics meets Social Sciences: New Frontiers of ...OSFair2017 Workshop | Data Analytics meets Social Sciences: New Frontiers of ...
OSFair2017 Workshop | Data Analytics meets Social Sciences: New Frontiers of ...
 
OSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social Sciences
OSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social SciencesOSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social Sciences
OSFair2017 Workshop | Research lifecycle in Arts, Humanities and Social Sciences
 
OSFair2017 Workshop | Big Mechanism: deep reading for cancer biology
OSFair2017 Workshop | Big Mechanism: deep reading for cancer biologyOSFair2017 Workshop | Big Mechanism: deep reading for cancer biology
OSFair2017 Workshop | Big Mechanism: deep reading for cancer biology
 
OSFair2017 Workshop | EOSCpilot governance
OSFair2017 Workshop | EOSCpilot governanceOSFair2017 Workshop | EOSCpilot governance
OSFair2017 Workshop | EOSCpilot governance
 
OSFair2017 Workshop | Brokering services facilitating interoperability and da...
OSFair2017 Workshop | Brokering services facilitating interoperability and da...OSFair2017 Workshop | Brokering services facilitating interoperability and da...
OSFair2017 Workshop | Brokering services facilitating interoperability and da...
 
OSFair2017 Theatrical Workshop | Are you ready to perform in the rri ecosystem
OSFair2017 Theatrical Workshop | Are you ready to perform in the rri ecosystemOSFair2017 Theatrical Workshop | Are you ready to perform in the rri ecosystem
OSFair2017 Theatrical Workshop | Are you ready to perform in the rri ecosystem
 
OSFair2017 Theatrical Workshop | Nucleus H2020 EU project
OSFair2017 Theatrical Workshop | Nucleus H2020 EU projectOSFair2017 Theatrical Workshop | Nucleus H2020 EU project
OSFair2017 Theatrical Workshop | Nucleus H2020 EU project
 
OSFair2017 Workshop | Open Knowledge Maps, A visual interface to the world's ...
OSFair2017 Workshop | Open Knowledge Maps, A visual interface to the world's ...OSFair2017 Workshop | Open Knowledge Maps, A visual interface to the world's ...
OSFair2017 Workshop | Open Knowledge Maps, A visual interface to the world's ...
 
OSFair2017 Training | Reproducibility in critical care research
OSFair2017 Training | Reproducibility in critical care researchOSFair2017 Training | Reproducibility in critical care research
OSFair2017 Training | Reproducibility in critical care research
 
OSFair2017 Training | Big data and evidence-based medicine in Greece
OSFair2017 Training | Big data and evidence-based medicine in GreeceOSFair2017 Training | Big data and evidence-based medicine in Greece
OSFair2017 Training | Big data and evidence-based medicine in Greece
 
OSFair2017 Training | What is Open Science and why should I care?
OSFair2017 Training | What is Open Science and why should I care?OSFair2017 Training | What is Open Science and why should I care?
OSFair2017 Training | What is Open Science and why should I care?
 
OSFair2017 Training | OpenAIRE monitoring services, EC FP7 & H2020 & other na...
OSFair2017 Training | OpenAIRE monitoring services, EC FP7 & H2020 & other na...OSFair2017 Training | OpenAIRE monitoring services, EC FP7 & H2020 & other na...
OSFair2017 Training | OpenAIRE monitoring services, EC FP7 & H2020 & other na...
 
OSFair2017 Training | Designing & implementing open access, open data & open ...
OSFair2017 Training | Designing & implementing open access, open data & open ...OSFair2017 Training | Designing & implementing open access, open data & open ...
OSFair2017 Training | Designing & implementing open access, open data & open ...
 
OSFair2017 Training | Best practice in Open Science
OSFair2017 Training | Best practice in Open ScienceOSFair2017 Training | Best practice in Open Science
OSFair2017 Training | Best practice in Open Science
 
OSFair2017 Worksop | Innovative dissemination practices & Altmetrics
OSFair2017 Worksop | Innovative dissemination practices & AltmetricsOSFair2017 Worksop | Innovative dissemination practices & Altmetrics
OSFair2017 Worksop | Innovative dissemination practices & Altmetrics
 
OSFair2017 | Barriers to Open Science for junior researchers
OSFair2017 | Barriers to Open Science for junior researchersOSFair2017 | Barriers to Open Science for junior researchers
OSFair2017 | Barriers to Open Science for junior researchers
 
OSFair2017 | The role of women in exploring, understanding and archiving the ...
OSFair2017 | The role of women in exploring, understanding and archiving the ...OSFair2017 | The role of women in exploring, understanding and archiving the ...
OSFair2017 | The role of women in exploring, understanding and archiving the ...
 
OSFair2017 | Open Science: A Global South Perspective
OSFair2017 | Open Science: A Global South PerspectiveOSFair2017 | Open Science: A Global South Perspective
OSFair2017 | Open Science: A Global South Perspective
 

Dernier

LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
Silpa
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
1301aanya
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
ANSARKHAN96
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
Silpa
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
Silpa
 

Dernier (20)

TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsTransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptxTHE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
THE ROLE OF BIOTECHNOLOGY IN THE ECONOMIC UPLIFT.pptx
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.Reboulia: features, anatomy, morphology etc.
Reboulia: features, anatomy, morphology etc.
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
 
Cyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptx
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdf
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
 

OSFair2017 Workshop | Text mining

  • 2.
  • 3. Who I am Researcher turned entrepreneur: BSc Chemical Engineering (USC, Spain) and MSc Materials Science (UPM, Spain). Know-how: technology scouting, competitive intelligence, NPD (new product development), early-stage startups and innovation strategy. Researcher for NEOKER (Spain) and SRI International (CA): an awarded spin-off from USC (Spain), now scaling up in China. R&D projects in materials science for over 3 years at SRI International (Menlo Park, CA). Cofounded Linknovate.com in May 2012 in Palo Alto (CA) and went through Stanford University accelerator program (StartX) in 2013. The company provides clients such as BMW AG and REPSOL with competitive intelligence software tools. CEO since 2014, now a 7 people company growing internationally.
  • 4. Our credentials 4 4 We founded Linknovate en 2012 in Stanford University, CA. We have been awarded 3 EU projects (grants) by the EC and have gone through 2 of the most prestigious startup accelerators in the world. Some of our satisfied clients:
  • 5.
  • 6.
  • 7.
  • 8.
  • 9.
  • 11. Over 560 organizations 826 identified 504 contacted 158 positive answers 66 detailed info Experts funnel “Biometric Sensors” 826 experts identified in over 460 organizations worldwide
  • 13. 13 2. Fresh User-Generated data: “We focus on bringing modern agricultural techniques to the farming of insects. Currently our focus is primarily on the production of insects as food and as an ingredient.” “We have been focused on producing insects for food for over three years. With regard to TRL, we have products on the market, so arguably TRL8-9.” 3.Willingness: “Joint tech development / R&D collaboration; Partnerships - e.g. grant applications like NSF, DOE or EU projects; Consider an investment (equity or other). We re open to discussing opportunities.” 4. Companies/ Groups recommended: “(…) the only North American company that we have seen any credible results from is Tiny Farms, though there are indications that at least one of the established CPG companies has begin making significant investments in R&D. Looking further afield, there are a number of companies in Europe (particularly France and the Netherlands) that seem sophisticated.” Email: gm@aspirefg.com 1. Technology Maturity (2 aspects) Insects as Protein Source: TRL 8 or higher. Already in the market Industrial Processing of Insects: TRL 7. Less than 1,5y to be in the market User Case: “Insects as an Alternative Source of Proteins” User Generated Data (detailed, fresh, assesing willingness). Expert profile
  • 14. Otros 1-23 Camera-based TRL Investigación aplicada Prueba de concepto validada Wearables 1-2 3 4 4 5-65-6 7 7 User Case: “biometric sensors” User Generated Data (classified in categories). Companies’ visualization Universidades Relevancia 2 3 4 5 14 Centros de Investigación Compañías Prot. en entorno de lab. Prot. en entorno de altura. Mayor powered by
  • 16. 16 Text Mining [unstructured text]. Insights
  • 17. Ranking of Active Entities Worldwide 17
  • 18. Ranking of Active Entities Worldwide (II) 18 Working in systems for reduction of irrelevant information during searches. Works in the text-mining based bioassay neighboring analysis as a standalone or as a complementary tool for the PubChem bioassay neighboring process to enable efficient integration of assay results and generate hypotheses for the discovery of bioactivities of the tested reagents.
  • 19. Ranking of Active Entities Worldwide (III) 19 Study investigating the usefulness of natural language processing (NLP) as an adjunct to dictionary- based concept normalization. Methods used: two biomedical concept normalization systems, MetaMap and Peregrine, with and without the use of a rule-based NLP module. Interested in automated extraction of useful biomedical information from unstructured text. Especially in the importance of named entity recognition and relationship extraction as fundamental approaches that are relevant to systems biology.
  • 20. Insights from Linknovate.com 20 Academic sources dominate over ‘industrial signals’. We may be still waiting for the ‘great’ stuff to land? Algorithmia over ‘Document Classification’, ‘Ontologies’, ‘Feature Analysis’, ‘Rule Extraction’, etc
  • 21. Insights from Linknovate.com (II) 21 High academic activity, primarily with universities as the most involved organisations. Highlight: small-medium size companies are the most active in 2017…
  • 22. Global Data Comparison Comparing search engines of complementary results Results show similar trends in all 3 engines and data sources. 22 Sources • Linknovate (publications, conf proceedings, grants, patent apps, news, web monitoring) • PubMed (publications, conf proceedings) • Google Patents (patents)
  • 23. Text Mining [unstructured text]. Academic Key-players
  • 24. Stanford University – U.S.A. https://www.stanford.edu/ 24 Text Mining for Adverse Drug Events: the Promise, Challenges, and State of the Art A Framework for the Automatic Extraction of Rules from Online Text Learning the Structure of Biomedical Relationships from Unstructured Text References in this topic This article provides an overview of recent advances in pharmacovigilance driven by the application of text mining. This paper presents a general-purpose framework for acquiring more complex relationships from text and then encoding this knowledge as rules. Here we describe a novel algorithm, Ensemble Biclustering for Classification (EBC), that learns the structure of biomedical relationships automatically from text, overcoming differences in word choice and sentence structure.
  • 26. Elon University – U.S.A. https://www.elon.edu/home/ 26 CRI: RUI: CI-EN: Infrastructure to Enable Mining and Analysis of Open Source Software Engineering Artifacts Awarded with a NSF Grant in 2014 This NSF CRI supported Research at Undergraduate Institutions (RUI) project will integrate, expand and enhance several distinct data sources currently used by three research communities: those who study Free, Libre, and Open Source Software (FLOSS), the larger empirical software engineering research community, and researchers engaged in data mining and text mining.
  • 27. Old Dominion University – U.S.A. https://www.odu.edu/ 27 Collaboration with James Madison University in a publication related to the conpetitive analysis that three companies make around the content that their customers generate in social networks This paper describes an in-depth case study which applies text mining to analyze unstructured text content on Facebook and Twitter sites of the three largest pizza chains: Pizza Hut, Domino's Pizza and Papa John's Pizza. The results reveal the value of social media competitive analysis and the power of text mining as an effective technique to extract business value from the vast amount of available social media data. Social media competitive analysis and text mining: A case study in the pizza industry
  • 28. University of Illinois at Urbana - Champaign – U.S.A. http://illinois.edu/ 28 In this paper we introduce a text cube architecture designed to organize social media data in multiple dimensions and hierarchies for efficient information query and visualization from multiple perspectives. SocialCube: A Text Cube Framework for Analyzing Social Media Data Collaboration with Cornell University and the company Intelligent Automation, Inc. in a publication related to the study of social and cultural behaviors through the contents generated by users of social networks
  • 29. Text Mining [unstructured text]. Industry Key-players
  • 30. IBM – U.S.A. https://www.ibm.com/us-en/ 30 Towards comprehensive longitudinal healthcare data capture Collaboration with Wright State University in a publication about the use of text mining in unstructured clinical texts. In this work therefore, we explore a pattern-based approach for extracting Smoker Semantic Types (SST) from unstructured clinical notes.
  • 32. Linguamatics – U.K. https://www.linguamatics.com/ 32 Linguamatics_ deploying innovative NLP text mining software -> high-value knowledge discovery & decision support.
  • 33. KBSI (Knowledge Based Systems, Inc.) – U.S.A. https://www.kbsi.com/ 33 SOME PARTNERS
  • 34. Amenity Analytics – U.S.A. http://www.amenityanalytics.com/ 34 $7.6M of raised funds in August 2017 INVESTORS Amenity Analytics provides next-generation Text-Mining AI Platform. A leading edge text analytics platform that allows customers to identify actionable signals from unstructured data.
  • 35. AYLIEN – Ireland http://aylien.com/ 35 $1.14M of raised funds $580k in March 2016 Aylien is an artificial intelligence startup that focuses on creating technologies that help machines understand humans better. The firm provides text analysis and news API's that allow users to make sense of human-generated content at scale. They also provide a range of content analysis solutions to developers, data scientists, marketers and academics. INVESTORS
  • 36. Lexalytics – U.S.A. https://www.lexalytics.com/ 36 Lexalytics transforms global conversations into meaningful and actionable insights. Their leading text analysis platforms process billions of pieces of unstructured data, translating thoughts and feelings into profitable decisions for their customers. Lexalytics helps companies implement vital feedback and monitoring programs that create an ongoing dialogue with their customers.
  • 37. Bitext – U.S.A. https://www.bitext.com/ 37 $900k of raised funds in January 2015 Bitext develops multilingual analytics technology in 30 languages. The company takes an approach to text analysis, using linguistic knowledge as a scientific base.
  • 38. 38 Text & Data Mining for Competitive Intelligence manuel@linknovate.com | skype: manu_noia Gracias!