SlideShare une entreprise Scribd logo
1  sur  29
The Future of Text AnalysisDr. Stuart ShulmanTexifter, LLC Wednesday, June 15, 2011
Briefing Agenda R&D in Annotation and Public Comments “The Future of Text Analysis” – The vision “What is DiscoverText?” – The software  The Features – The basics Capturing social media importing other text Creating archives, buckets and datasets Coding a dataset or training a classifier
Dr. Stuart W. Shulman Founder & CEO, Texifter, LLCAssistant Professor, Department of Political ScienceUniversity of Massachusetts Amherst Director, Qualitative Data Analysis Program (QDAP)Associate Director, National Center for Digital Government Editor, Journal of Information Technology & Politics 413-545-5375 stu@polsci.umass.eduhttp://people.umass.edu/stu/
The Future of Projects Projects leverage users’ credentials to control access to documents, tools, and resources Major Project Components Credentials Documents Peers Advanced ‘Social’ Search Metadata Networks Filtering Tools for Tagging Shared Analysis Qualitative & Quantitative Findings
The Future of Documents Import & archive data from multiple sources into a single, searchable, unified repository
The Future of Search eDiscovery will search, merge, filter & classifyunlimited amounts of text and other data
Well Worth Reading
The Future of Tools Text processing tools will enable quicker processing and more accurate results
The Future of Peer Relations Utilize trusted peers to scale your knowledge resources, increase productivity & lower total project costs
Peers Groups Securely segment your peers into project groups by agency, firm, department, location, or affiliation, while controlling their access via credentials
Security & Credentials Data will be encrypted, secure and accessible by only peers who are granted specific permissions via their credentials
Coding, Tagging or Labeling Annotation enhances your analysis by applying human interpretation to machine results
Coding in Flexible Teams
Crowdsourcing Crowdsourcing will bring widely distributed wisdom to process of text analysis “This is really the biggest paradigm shift in innovation since the Industrial Revolution” - MIT professor Eric von Hippel, specialist in innovation management
Active Machine Learning By utilizing information and decisions previously captured, we can enhance future machine-based decisions Active  Learning Loop
What is DiscoverText? DiscoverText is a: personal or organizational  archive in the cloud search engine for eDiscovery  social media comment aggregator de-duplication and near duplicate clustering engine FOIA redaction toolkit coding, reporting and validation team workbench repository of human annotation (text about text), and customizable machine-learning classifier  (beta launched April 2011)
Future of text analysis   forrester briefing
Future of text analysis   forrester briefing
Future of text analysis   forrester briefing
Future of text analysis   forrester briefing
Future of text analysis   forrester briefing
Future of text analysis   forrester briefing
Future of text analysis   forrester briefing
Future of text analysis   forrester briefing
Future of text analysis   forrester briefing
Future of text analysis   forrester briefing

Contenu connexe

Tendances

Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Tom Plasterer
 
Some Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your CompanySome Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your CompanyRobert Grossman
 
Contractor-Borner-SNA-SAC
Contractor-Borner-SNA-SACContractor-Borner-SNA-SAC
Contractor-Borner-SNA-SACwebuploader
 
Clinical Trials & Big Data-Final
Clinical Trials & Big Data-FinalClinical Trials & Big Data-Final
Clinical Trials & Big Data-FinalManoj Vig
 
Hscb Focus 2010 Data Acquisition Extraction Management Debrief Jgm R1
Hscb Focus 2010 Data Acquisition Extraction Management Debrief Jgm R1Hscb Focus 2010 Data Acquisition Extraction Management Debrief Jgm R1
Hscb Focus 2010 Data Acquisition Extraction Management Debrief Jgm R1jmorriso
 
“Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” “Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” diannepatricia
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...William Gunn
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?Anita de Waard
 
Asteroid Observations - Real Time Operational Intelligence Series
Asteroid Observations - Real Time Operational Intelligence SeriesAsteroid Observations - Real Time Operational Intelligence Series
Asteroid Observations - Real Time Operational Intelligence SeriesStormBourne, LLC
 
Building Protected Data Sharing Networks to Advance Cancer Risk Assessment an...
Building Protected Data Sharing Networks to Advance Cancer Risk Assessment an...Building Protected Data Sharing Networks to Advance Cancer Risk Assessment an...
Building Protected Data Sharing Networks to Advance Cancer Risk Assessment an...Mary Bass
 
Komatsoulis internet2 executive track
Komatsoulis internet2 executive trackKomatsoulis internet2 executive track
Komatsoulis internet2 executive trackGeorge Komatsoulis
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsVivien Bonazzi
 
LASYR Slides IEEE event 07 APR 2021
LASYR Slides IEEE event 07 APR 2021LASYR Slides IEEE event 07 APR 2021
LASYR Slides IEEE event 07 APR 2021Sean Manion PhD
 
A Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsA Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsMichel Dumontier
 
A New Model for Informed Consent - Participant Centered Consent - Broad Insti...
A New Model for Informed Consent - Participant Centered Consent - Broad Insti...A New Model for Informed Consent - Participant Centered Consent - Broad Insti...
A New Model for Informed Consent - Participant Centered Consent - Broad Insti...john wilbanks
 
Participant-Centered Consent Toolkit Overview
Participant-Centered Consent Toolkit OverviewParticipant-Centered Consent Toolkit Overview
Participant-Centered Consent Toolkit Overviewjohn wilbanks
 
Running Research as a Service. Implications for Privacy Policies and Ethics
Running Research as a Service. Implications for Privacy Policies and EthicsRunning Research as a Service. Implications for Privacy Policies and Ethics
Running Research as a Service. Implications for Privacy Policies and EthicsLancaster University Library
 
Twitter sentiment classifications 1
Twitter sentiment classifications 1Twitter sentiment classifications 1
Twitter sentiment classifications 1eshtiyak
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRMichel Dumontier
 

Tendances (20)

Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
 
Some Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your CompanySome Frameworks for Improving Analytic Operations at Your Company
Some Frameworks for Improving Analytic Operations at Your Company
 
Contractor-Borner-SNA-SAC
Contractor-Borner-SNA-SACContractor-Borner-SNA-SAC
Contractor-Borner-SNA-SAC
 
Clinical Trials & Big Data-Final
Clinical Trials & Big Data-FinalClinical Trials & Big Data-Final
Clinical Trials & Big Data-Final
 
Hscb Focus 2010 Data Acquisition Extraction Management Debrief Jgm R1
Hscb Focus 2010 Data Acquisition Extraction Management Debrief Jgm R1Hscb Focus 2010 Data Acquisition Extraction Management Debrief Jgm R1
Hscb Focus 2010 Data Acquisition Extraction Management Debrief Jgm R1
 
“Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services” “Semantic Technologies for Smart Services”
“Semantic Technologies for Smart Services”
 
SMART Emergency Medical Teams
SMART Emergency Medical TeamsSMART Emergency Medical Teams
SMART Emergency Medical Teams
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?
 
Asteroid Observations - Real Time Operational Intelligence Series
Asteroid Observations - Real Time Operational Intelligence SeriesAsteroid Observations - Real Time Operational Intelligence Series
Asteroid Observations - Real Time Operational Intelligence Series
 
Building Protected Data Sharing Networks to Advance Cancer Risk Assessment an...
Building Protected Data Sharing Networks to Advance Cancer Risk Assessment an...Building Protected Data Sharing Networks to Advance Cancer Risk Assessment an...
Building Protected Data Sharing Networks to Advance Cancer Risk Assessment an...
 
Komatsoulis internet2 executive track
Komatsoulis internet2 executive trackKomatsoulis internet2 executive track
Komatsoulis internet2 executive track
 
NIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
 
LASYR Slides IEEE event 07 APR 2021
LASYR Slides IEEE event 07 APR 2021LASYR Slides IEEE event 07 APR 2021
LASYR Slides IEEE event 07 APR 2021
 
A Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsA Framework to develop the FAIR Metrics
A Framework to develop the FAIR Metrics
 
A New Model for Informed Consent - Participant Centered Consent - Broad Insti...
A New Model for Informed Consent - Participant Centered Consent - Broad Insti...A New Model for Informed Consent - Participant Centered Consent - Broad Insti...
A New Model for Informed Consent - Participant Centered Consent - Broad Insti...
 
Participant-Centered Consent Toolkit Overview
Participant-Centered Consent Toolkit OverviewParticipant-Centered Consent Toolkit Overview
Participant-Centered Consent Toolkit Overview
 
Running Research as a Service. Implications for Privacy Policies and Ethics
Running Research as a Service. Implications for Privacy Policies and EthicsRunning Research as a Service. Implications for Privacy Policies and Ethics
Running Research as a Service. Implications for Privacy Policies and Ethics
 
Twitter sentiment classifications 1
Twitter sentiment classifications 1Twitter sentiment classifications 1
Twitter sentiment classifications 1
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIR
 

En vedette

Business plan presentation_format (1)
Business plan presentation_format (1)Business plan presentation_format (1)
Business plan presentation_format (1)BHARAT LIMAYE
 
Aws
Aws   Aws
Aws TauCt
 
Alt solunum yolu enfeksi̇yonlarinda opti̇mal örnek alim yerleri̇
Alt solunum yolu enfeksi̇yonlarinda  opti̇mal örnek alim yerleri̇Alt solunum yolu enfeksi̇yonlarinda  opti̇mal örnek alim yerleri̇
Alt solunum yolu enfeksi̇yonlarinda opti̇mal örnek alim yerleri̇edoktor
 
Finished! evaluation1
Finished! evaluation1 Finished! evaluation1
Finished! evaluation1 KendalMiles
 
Presentació treball: Va de llengua
Presentació treball: Va de llenguaPresentació treball: Va de llengua
Presentació treball: Va de llenguaeva
 
Bad effect of drug using
Bad effect of drug usingBad effect of drug using
Bad effect of drug usingerdemkarademir
 
Sharing Pengalaman & Tips Game Dev GGJ Bdg 2015
Sharing Pengalaman & Tips Game Dev GGJ Bdg 2015Sharing Pengalaman & Tips Game Dev GGJ Bdg 2015
Sharing Pengalaman & Tips Game Dev GGJ Bdg 2015Arya Wibowo
 
Cabells
CabellsCabells
Cabellsciminf
 
Ctrmc answers to osha nprm questions 7 10 final
Ctrmc answers to osha nprm questions 7 10 finalCtrmc answers to osha nprm questions 7 10 final
Ctrmc answers to osha nprm questions 7 10 finalDaddy DeVinney
 
per què tenim el cabell de diferent color?
per què tenim el cabell de diferent color?per què tenim el cabell de diferent color?
per què tenim el cabell de diferent color?ciminf
 
Technology for Citizen Voices
Technology for Citizen VoicesTechnology for Citizen Voices
Technology for Citizen VoicesStuart Shulman
 
Ctrmc presentation at acc 3 11
Ctrmc presentation at acc 3 11Ctrmc presentation at acc 3 11
Ctrmc presentation at acc 3 11Daddy DeVinney
 

En vedette (20)

iSigned intro
iSigned introiSigned intro
iSigned intro
 
Business plan presentation_format (1)
Business plan presentation_format (1)Business plan presentation_format (1)
Business plan presentation_format (1)
 
Aws
Aws   Aws
Aws
 
About HR Ace
About HR AceAbout HR Ace
About HR Ace
 
Alt solunum yolu enfeksi̇yonlarinda opti̇mal örnek alim yerleri̇
Alt solunum yolu enfeksi̇yonlarinda  opti̇mal örnek alim yerleri̇Alt solunum yolu enfeksi̇yonlarinda  opti̇mal örnek alim yerleri̇
Alt solunum yolu enfeksi̇yonlarinda opti̇mal örnek alim yerleri̇
 
Comic commands
Comic commandsComic commands
Comic commands
 
Finished! evaluation1
Finished! evaluation1 Finished! evaluation1
Finished! evaluation1
 
Presentació treball: Va de llengua
Presentació treball: Va de llenguaPresentació treball: Va de llengua
Presentació treball: Va de llengua
 
Bad effect of drug using
Bad effect of drug usingBad effect of drug using
Bad effect of drug using
 
Comic commands
Comic commandsComic commands
Comic commands
 
Sharing Pengalaman & Tips Game Dev GGJ Bdg 2015
Sharing Pengalaman & Tips Game Dev GGJ Bdg 2015Sharing Pengalaman & Tips Game Dev GGJ Bdg 2015
Sharing Pengalaman & Tips Game Dev GGJ Bdg 2015
 
Cabells
CabellsCabells
Cabells
 
They we are
They we areThey we are
They we are
 
A step towards Maturity...
A step towards Maturity...A step towards Maturity...
A step towards Maturity...
 
A camel-story
A camel-storyA camel-story
A camel-story
 
Ctrmc answers to osha nprm questions 7 10 final
Ctrmc answers to osha nprm questions 7 10 finalCtrmc answers to osha nprm questions 7 10 final
Ctrmc answers to osha nprm questions 7 10 final
 
per què tenim el cabell de diferent color?
per què tenim el cabell de diferent color?per què tenim el cabell de diferent color?
per què tenim el cabell de diferent color?
 
Technology for Citizen Voices
Technology for Citizen VoicesTechnology for Citizen Voices
Technology for Citizen Voices
 
How to treat_herpes
How to treat_herpesHow to treat_herpes
How to treat_herpes
 
Ctrmc presentation at acc 3 11
Ctrmc presentation at acc 3 11Ctrmc presentation at acc 3 11
Ctrmc presentation at acc 3 11
 

Similaire à Future of text analysis forrester briefing

Sa discover text webinar
Sa discover text webinarSa discover text webinar
Sa discover text webinarQuestionPro
 
Narrative Mind Week 2 H4D Stanford 2016
Narrative Mind Week 2 H4D Stanford 2016Narrative Mind Week 2 H4D Stanford 2016
Narrative Mind Week 2 H4D Stanford 2016Stanford University
 
Vellino presentationtocisti
Vellino presentationtocistiVellino presentationtocisti
Vellino presentationtocistiAndre Vellino
 
Recommendation system (1).pptx
Recommendation system (1).pptxRecommendation system (1).pptx
Recommendation system (1).pptxprathammishra28
 
recommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdfrecommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdf13DikshaDatir
 
UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...
UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...
UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...UXPA International
 
Session 0.0 poster minutes madness
Session 0.0   poster minutes madnessSession 0.0   poster minutes madness
Session 0.0 poster minutes madnesssemanticsconference
 
NOW! Get the internet to work for you!
NOW! Get the internet to work for you!NOW! Get the internet to work for you!
NOW! Get the internet to work for you!Philip Hannah
 
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)Avoiding Anonymous Users in Multiple Social Media Networks (SMN)
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)paperpublications3
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesASIS&T
 
Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04nihshowandtell
 
RUNNING HEADER Analytics Ecosystem1Analytics Ecosystem4.docx
RUNNING HEADER Analytics Ecosystem1Analytics Ecosystem4.docxRUNNING HEADER Analytics Ecosystem1Analytics Ecosystem4.docx
RUNNING HEADER Analytics Ecosystem1Analytics Ecosystem4.docxanhlodge
 
Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04David Phillips
 
Empowering Search Through 3RDi Semantic Enrichment
Empowering Search Through 3RDi Semantic EnrichmentEmpowering Search Through 3RDi Semantic Enrichment
Empowering Search Through 3RDi Semantic EnrichmentThe Digital Group
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better ResearchCarole Goble
 
Semantic business applications - case examples - Ontology Summit 2011
Semantic business applications - case examples - Ontology Summit 2011Semantic business applications - case examples - Ontology Summit 2011
Semantic business applications - case examples - Ontology Summit 2011Mills Davis
 
According To The Author Of “Build A Streamlined Refinery”,
According To The Author Of “Build A Streamlined Refinery”,According To The Author Of “Build A Streamlined Refinery”,
According To The Author Of “Build A Streamlined Refinery”,Jill Lyons
 
KnowNow Syndication-Oriented Architecture
KnowNow Syndication-Oriented ArchitectureKnowNow Syndication-Oriented Architecture
KnowNow Syndication-Oriented Architecturerohitkhare
 

Similaire à Future of text analysis forrester briefing (20)

Sa discover text webinar
Sa discover text webinarSa discover text webinar
Sa discover text webinar
 
Narrative Mind Week 2 H4D Stanford 2016
Narrative Mind Week 2 H4D Stanford 2016Narrative Mind Week 2 H4D Stanford 2016
Narrative Mind Week 2 H4D Stanford 2016
 
Information entanglement
Information entanglementInformation entanglement
Information entanglement
 
Vellino presentationtocisti
Vellino presentationtocistiVellino presentationtocisti
Vellino presentationtocisti
 
Recommendation system (1).pptx
Recommendation system (1).pptxRecommendation system (1).pptx
Recommendation system (1).pptx
 
recommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdfrecommendationsystem1-221109055232-c8b46131.pdf
recommendationsystem1-221109055232-c8b46131.pdf
 
UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...
UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...
UXPA 2023 Poster: Atomic Research in Practice: Using a Feedback Repository to...
 
Session 0.0 poster minutes madness
Session 0.0   poster minutes madnessSession 0.0   poster minutes madness
Session 0.0 poster minutes madness
 
NOW! Get the internet to work for you!
NOW! Get the internet to work for you!NOW! Get the internet to work for you!
NOW! Get the internet to work for you!
 
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)Avoiding Anonymous Users in Multiple Social Media Networks (SMN)
Avoiding Anonymous Users in Multiple Social Media Networks (SMN)
 
Hughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication RepositoriesHughes RDAP11 Data Publication Repositories
Hughes RDAP11 Data Publication Repositories
 
Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04
 
RUNNING HEADER Analytics Ecosystem1Analytics Ecosystem4.docx
RUNNING HEADER Analytics Ecosystem1Analytics Ecosystem4.docxRUNNING HEADER Analytics Ecosystem1Analytics Ecosystem4.docx
RUNNING HEADER Analytics Ecosystem1Analytics Ecosystem4.docx
 
Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04Show and tell program 04 2014-09-04
Show and tell program 04 2014-09-04
 
Empowering Search Through 3RDi Semantic Enrichment
Empowering Search Through 3RDi Semantic EnrichmentEmpowering Search Through 3RDi Semantic Enrichment
Empowering Search Through 3RDi Semantic Enrichment
 
Better Software, Better Research
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
 
Semantic business applications - case examples - Ontology Summit 2011
Semantic business applications - case examples - Ontology Summit 2011Semantic business applications - case examples - Ontology Summit 2011
Semantic business applications - case examples - Ontology Summit 2011
 
data analytics.pptx
data analytics.pptxdata analytics.pptx
data analytics.pptx
 
According To The Author Of “Build A Streamlined Refinery”,
According To The Author Of “Build A Streamlined Refinery”,According To The Author Of “Build A Streamlined Refinery”,
According To The Author Of “Build A Streamlined Refinery”,
 
KnowNow Syndication-Oriented Architecture
KnowNow Syndication-Oriented ArchitectureKnowNow Syndication-Oriented Architecture
KnowNow Syndication-Oriented Architecture
 

Plus de Stuart Shulman

Fear and loathing on the social campaign trail
Fear and loathing on the social campaign trailFear and loathing on the social campaign trail
Fear and loathing on the social campaign trailStuart Shulman
 
Fear and Loathing on the Social Campaign Trail
Fear and Loathing on the Social Campaign TrailFear and Loathing on the Social Campaign Trail
Fear and Loathing on the Social Campaign TrailStuart Shulman
 
Text Analytics: From Colored Pens and Crumbly Papers to Custom Machine Classi...
Text Analytics: From Colored Pens and Crumbly Papers to Custom Machine Classi...Text Analytics: From Colored Pens and Crumbly Papers to Custom Machine Classi...
Text Analytics: From Colored Pens and Crumbly Papers to Custom Machine Classi...Stuart Shulman
 
Texifter Presentation at Boston New Technology’s #BNT77 Startup Showcase!
Texifter Presentation at Boston New Technology’s #BNT77 Startup Showcase!Texifter Presentation at Boston New Technology’s #BNT77 Startup Showcase!
Texifter Presentation at Boston New Technology’s #BNT77 Startup Showcase!Stuart Shulman
 
CoderRank: Creating Gold Standards
CoderRank: Creating Gold StandardsCoderRank: Creating Gold Standards
CoderRank: Creating Gold StandardsStuart Shulman
 
Text Analytics for Social Data Using DiscoverText & Sifter
 Text Analytics for Social Data Using DiscoverText & Sifter Text Analytics for Social Data Using DiscoverText & Sifter
Text Analytics for Social Data Using DiscoverText & SifterStuart Shulman
 
Text Analytics for Social Data Using DiscoverText & Sifter
Text Analytics for Social Data Using DiscoverText & SifterText Analytics for Social Data Using DiscoverText & Sifter
Text Analytics for Social Data Using DiscoverText & SifterStuart Shulman
 
Sifting Social Data: Word Sense Disambiguation Using Machine Learning
Sifting Social Data: Word Sense Disambiguation Using Machine LearningSifting Social Data: Word Sense Disambiguation Using Machine Learning
Sifting Social Data: Word Sense Disambiguation Using Machine LearningStuart Shulman
 
CAQDAS 2014 Pecha Kucha - Stuart Shulman
CAQDAS 2014 Pecha Kucha - Stuart ShulmanCAQDAS 2014 Pecha Kucha - Stuart Shulman
CAQDAS 2014 Pecha Kucha - Stuart ShulmanStuart Shulman
 
Measuring reliability and validity in human coding and machine classification
Measuring reliability and validity in human coding and machine classificationMeasuring reliability and validity in human coding and machine classification
Measuring reliability and validity in human coding and machine classificationStuart Shulman
 
DiscoverText: Tools for Text
DiscoverText: Tools for TextDiscoverText: Tools for Text
DiscoverText: Tools for TextStuart Shulman
 
Citizen Voices in a Networked Age of #BigData
Citizen Voices in a Networked Age of #BigDataCitizen Voices in a Networked Age of #BigData
Citizen Voices in a Networked Age of #BigDataStuart Shulman
 
DiscoverText Product Overview
DiscoverText Product OverviewDiscoverText Product Overview
DiscoverText Product OverviewStuart Shulman
 
Importing bulk outlook email into DiscoverText - the .pst file upload
Importing bulk outlook email into DiscoverText - the .pst file uploadImporting bulk outlook email into DiscoverText - the .pst file upload
Importing bulk outlook email into DiscoverText - the .pst file uploadStuart Shulman
 

Plus de Stuart Shulman (17)

Fear and loathing on the social campaign trail
Fear and loathing on the social campaign trailFear and loathing on the social campaign trail
Fear and loathing on the social campaign trail
 
Fear and Loathing on the Social Campaign Trail
Fear and Loathing on the Social Campaign TrailFear and Loathing on the Social Campaign Trail
Fear and Loathing on the Social Campaign Trail
 
Text Analytics: From Colored Pens and Crumbly Papers to Custom Machine Classi...
Text Analytics: From Colored Pens and Crumbly Papers to Custom Machine Classi...Text Analytics: From Colored Pens and Crumbly Papers to Custom Machine Classi...
Text Analytics: From Colored Pens and Crumbly Papers to Custom Machine Classi...
 
Texifter Presentation at Boston New Technology’s #BNT77 Startup Showcase!
Texifter Presentation at Boston New Technology’s #BNT77 Startup Showcase!Texifter Presentation at Boston New Technology’s #BNT77 Startup Showcase!
Texifter Presentation at Boston New Technology’s #BNT77 Startup Showcase!
 
CoderRank: Creating Gold Standards
CoderRank: Creating Gold StandardsCoderRank: Creating Gold Standards
CoderRank: Creating Gold Standards
 
Text Analytics for Social Data Using DiscoverText & Sifter
 Text Analytics for Social Data Using DiscoverText & Sifter Text Analytics for Social Data Using DiscoverText & Sifter
Text Analytics for Social Data Using DiscoverText & Sifter
 
Text Analytics for Social Data Using DiscoverText & Sifter
Text Analytics for Social Data Using DiscoverText & SifterText Analytics for Social Data Using DiscoverText & Sifter
Text Analytics for Social Data Using DiscoverText & Sifter
 
Twitter for Research
Twitter for ResearchTwitter for Research
Twitter for Research
 
Sifting Social Data: Word Sense Disambiguation Using Machine Learning
Sifting Social Data: Word Sense Disambiguation Using Machine LearningSifting Social Data: Word Sense Disambiguation Using Machine Learning
Sifting Social Data: Word Sense Disambiguation Using Machine Learning
 
CAQDAS 2014 Pecha Kucha - Stuart Shulman
CAQDAS 2014 Pecha Kucha - Stuart ShulmanCAQDAS 2014 Pecha Kucha - Stuart Shulman
CAQDAS 2014 Pecha Kucha - Stuart Shulman
 
Measuring reliability and validity in human coding and machine classification
Measuring reliability and validity in human coding and machine classificationMeasuring reliability and validity in human coding and machine classification
Measuring reliability and validity in human coding and machine classification
 
DiscoverText: Tools for Text
DiscoverText: Tools for TextDiscoverText: Tools for Text
DiscoverText: Tools for Text
 
Summit slide loop ny
Summit slide loop nySummit slide loop ny
Summit slide loop ny
 
Citizen Voices in a Networked Age of #BigData
Citizen Voices in a Networked Age of #BigDataCitizen Voices in a Networked Age of #BigData
Citizen Voices in a Networked Age of #BigData
 
DiscoverText Product Overview
DiscoverText Product OverviewDiscoverText Product Overview
DiscoverText Product Overview
 
Importing bulk outlook email into DiscoverText - the .pst file upload
Importing bulk outlook email into DiscoverText - the .pst file uploadImporting bulk outlook email into DiscoverText - the .pst file upload
Importing bulk outlook email into DiscoverText - the .pst file upload
 
Texifter
TexifterTexifter
Texifter
 

Dernier

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 

Dernier (20)

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 

Future of text analysis forrester briefing

  • 1. The Future of Text AnalysisDr. Stuart ShulmanTexifter, LLC Wednesday, June 15, 2011
  • 2. Briefing Agenda R&D in Annotation and Public Comments “The Future of Text Analysis” – The vision “What is DiscoverText?” – The software The Features – The basics Capturing social media importing other text Creating archives, buckets and datasets Coding a dataset or training a classifier
  • 3. Dr. Stuart W. Shulman Founder & CEO, Texifter, LLCAssistant Professor, Department of Political ScienceUniversity of Massachusetts Amherst Director, Qualitative Data Analysis Program (QDAP)Associate Director, National Center for Digital Government Editor, Journal of Information Technology & Politics 413-545-5375 stu@polsci.umass.eduhttp://people.umass.edu/stu/
  • 4.
  • 5.
  • 6.
  • 7. The Future of Projects Projects leverage users’ credentials to control access to documents, tools, and resources Major Project Components Credentials Documents Peers Advanced ‘Social’ Search Metadata Networks Filtering Tools for Tagging Shared Analysis Qualitative & Quantitative Findings
  • 8. The Future of Documents Import & archive data from multiple sources into a single, searchable, unified repository
  • 9. The Future of Search eDiscovery will search, merge, filter & classifyunlimited amounts of text and other data
  • 11. The Future of Tools Text processing tools will enable quicker processing and more accurate results
  • 12. The Future of Peer Relations Utilize trusted peers to scale your knowledge resources, increase productivity & lower total project costs
  • 13. Peers Groups Securely segment your peers into project groups by agency, firm, department, location, or affiliation, while controlling their access via credentials
  • 14. Security & Credentials Data will be encrypted, secure and accessible by only peers who are granted specific permissions via their credentials
  • 15. Coding, Tagging or Labeling Annotation enhances your analysis by applying human interpretation to machine results
  • 17. Crowdsourcing Crowdsourcing will bring widely distributed wisdom to process of text analysis “This is really the biggest paradigm shift in innovation since the Industrial Revolution” - MIT professor Eric von Hippel, specialist in innovation management
  • 18. Active Machine Learning By utilizing information and decisions previously captured, we can enhance future machine-based decisions Active Learning Loop
  • 19. What is DiscoverText? DiscoverText is a: personal or organizational archive in the cloud search engine for eDiscovery social media comment aggregator de-duplication and near duplicate clustering engine FOIA redaction toolkit coding, reporting and validation team workbench repository of human annotation (text about text), and customizable machine-learning classifier (beta launched April 2011)