SlideShare une entreprise Scribd logo
1  sur  17
Télécharger pour lire hors ligne
“I meant what I said and I said what I meant.
An elephant's faithful one-hundred percent!”
Dr. Seuss, Horton Hears a Who!
Due thanks and apologies to Dr. Seuss and to the illustrator. Apart from the ones above, all the words are mine.
As for me, I’m neither a linguist nor a programmer. I’m a writer, interested in words, awed by their expressive and impressive powers.
The Conversation Company
presents
Programming:
“Joos” Rajagopal
Logic:
Suresh Manian
Arthur Koestler Noam ChomskyMax MuellerAristotle Tim Berners Lee
Inspired by the words and ideas of:
Metaphic
A Grammatically Different Approach to Text Analysis.
<createdBy>
Dedicated to my children, family, friends and foafs.
Suresh Manian, Bangalore 2014.
The Act of CreationThe Science of ThoughtPrior Analytics Context Free Grammar Semantic Web
One day, almost three years ago, I realized that for all intents and purposes, we are our words.
Our actions and attitudes, behaviours and beliefs, emotions and experiences are all captured in our words;
in the verbs, nouns, adjectives, adverbs… even the punctuations of our sentences.
Grammar, not mathematics or statistics, makes the rules in the world of text communication.
The Premise
When it comes to text and sentiment analysis, grammar knows best.
Words can make elephants fly,
or turn lamb into sheep.
Words, like people, are known by
the company they keep.
Words are not numbers, with precise values ruled by ratios. But words are alive in a way numbers are not. A word has an inherent awareness
of its significance and position in relation to its immediate neighbours. It has a dynamic and analogous relationship with other words,
governed by a set of universal rules and conventions. In the final analysis, words are anything but ambiguous.
Supple. Stable. Scalable. - The natural properties of English grammar.
While nouns and a few new verbs get added from time to time, the
“universals” of the language are predetermined and inviolable. No new
preposition has been added to the English language in the last 300 years.
Words are kept few, common and shared to make communication
possible.
Unlike data or content, words grow rather slowly @25,000 words per
year as per Wiki. The entirety of English words is estimated to be over
just a million words, give or take a few thousands.
Basically, the world of words seemed large but finite, and based on
predictable and observable patterns that could be easily retrieved,
analysed and interpreted, given our syntax and library based approach,
smileys, emoticons and wtf included.
So we set forth in our pursuit of meaning.
The Program Logic
Syntax are the rails semantics ride on to reach meaning.
People use same words to mean
same things and different words to
mean different things.
Sometimes they use different words
to mean same things and same
words to mean different things.
In general, people are quite precise
with their words.
More or less.
Our Content-Context framework
Subtext is to Text what Metadata is to Data, and other assumptions.
What is text? What is content? What is context? What is meaning? There were many questions that we had to ask and answer ourselves
during the development. We just “reasoned up” as Elon Musk of Tesla says, based on first principles and a natural order of questioning,
led by language, logic and StackOverflow. The program will return true or false before we could say good or bad was the thinking.
The Process Attitude
Reason up. Follow the Word. Focus on output, not outcome.
What is Data? We hold the view that “All is data” or “Sarvam Datum”, as we say in
Sanskrit-Latin. And that all data can be captured and categorized under four basic
types of metadata: Time|Place|Topic|Person, responding to a
Who|Where|What|When query framework. And grammar is the gopher.
Content can be text/sound/image but for our purposes, it is restricted to text.
The Proofs
A grammar book and a calculator can take you a long way in making sense of Google and Twitter.
It works!
From Search to Find: In Trouve, we employ a simple and practical framework for relevant content discovery, with categorization, entity/concept extraction, and
even publishing. You can use it like you would Google. In fact, you are using Google, with a little helper attached to it. There’s a lot of cleaning up to do still, but the
hypothesis seemed to work. For instance, an input like “text mining” is likely to give you Text Mining 101 as a category result, among others. Makes research simpler.
You can access the content in multiple ways. Helped us make a knowledge base out of our own history data. But that’s another localized demo.
The difference between “Search” and “Find” is one of expectation, so to speak.
Input term: ibm quarter. Feel free to try anything. And wait just a bit.
Topics, entities, people, places, questions, requests, demands, interests, opinions, comparisons,
superlatives… The combination of grammar tags and regexes gives us full control over the text to
mine all the information we need. And with colour and sentiment coded inline graphs like “Sentipede”
and features like Summarizer, Querifier etc., there are many facets we can show of the data. This
approach is ideal to create customized tl;dr templates. Add regular metadata information into the mix
and the possibilities increase substantially.
Demo url: http://metaphic.in/monitor.php?term=coke
What kind of messages are they?
Where are they talking from?
The advantage? The questions you can ask of the data increase significantly. And it is easy to customize it to brand or domain.
Who is talking? What are they talking about? Who else are they talking about? How many are talking? When are they talking mostly? Where are they from?
Are they saying good things or bad things? Are they being funny or sarcastic?...
The questions are but natural. Shouldn’t the answers be too?
Case in point:
With Parts of Speech tagging, there’s no such thing as unstructured data.
The “sentipede” and “message type” categorizations are early stages yet but hopefully proves the point. It is easy to build and train the libraries.
If you really want to know how your customers or employees are feeling, just follow the adjectives.
We analysed the employee survey results using our algorithm to assess needs and
aspirations of the employees of a very large Business Unit of a very large IT major, and
mapped it to their employee engagement programs. Extremely good results.
This was when we analysed Nokia’s social data
over three months. http://metaphic.in/nokia/
With this “whole-grained” approach to analysis, the data bends to yield insights.
The metadata is much richer now, providing greater flexibility to query and retrieve relevant information.
Linked text, with a syntactic grid that underpins it, is perhaps a way to go. Is perhaps the key to linked data and the dream of the semantic web.
Try a url/term/text at http://metaphic.in/home/index.php? The program reads at approx. 4000 word per minute so request a little
patience, especially if you want classification. The screenshots are from a link analysis from The Guardian. Nice article.
We generate a subject cloud which looks at the word in context and
doesn’t just look at frequency. Apologies for the lack of UI/UX and also
please note that the categorizations and the library work is not
complete.
If meaning is conveyed in the flow of words, then a preposition is where the flow bends at.
Biographer employs grammar and phrase patterns to make 2 minute dossiers of people. This is Arvind Kejriwal’s life story published by our
algorithm. On a related note, we’ve also developed a “job description to resume match” candidate finder (RecoDex, we call it).
http://metaphic.in/peop/e/
http://metaphic.in/home/whysguy.html?
Please keep the questions simple. For now.
What is the meaning of life?
Ask the Whysguy. Be patient.
Words are parts of speech. Literally, grammatically, programmatically and pragmatically speaking.
Answers, not analytics.
The Promise
Adjectives and verbs reveal attitudes and behaviour that statistics can never aspire to. Proper Nouns
and simple capitalization rules capture topics and people like no hashtag can. And prepositions can
access insights like metadata can't.
With so much emphasis on social media and unstructured text, with so much content being generated
that needs to be analysed and interpreted, text analytics is one of the hottest focus areas of the
technology investor. Text Analytics Market is worth $4.90 Billion by 2019 say the pundits.
Metaphic is one way of looking at the opportunity.
Mine the language to mind the business.
Data is the new oil, and text makes up nearly 90% of it.
The Potential
Lack of money, expertise and infrastructure to bring it to market readiness and revenue.
The Problem
These are all proofs of concept that now need to be tested, validated and applied to answering real business
questions. We have more or less reached the end of the road on this owing to lack of resources and I’m
looking for an angel that might see merit in taking the development further.
With a couple of projects, and a recently developed API, we have managed to and will continue to generate
some revenue but it is not enough. In my estimation, there is at least another six months of work left before
this can start generating steady revenue.
And in two years, working overtime, this can be of some serious value.
Grammatically Different
Tel: +919845057220 mail: manian@conversation.co.in
Thank you for your time and hope to hear back.
Metaphic

Contenu connexe

Similaire à Metaphic or the art of looking another way.

Language First Protocol from QSi
Language First Protocol from QSiLanguage First Protocol from QSi
Language First Protocol from QSiJohn O'Gorman
 
Content Analyst - Conceptualizing LSI Based Text Analytics White Paper
Content Analyst - Conceptualizing LSI Based Text Analytics White PaperContent Analyst - Conceptualizing LSI Based Text Analytics White Paper
Content Analyst - Conceptualizing LSI Based Text Analytics White PaperJohn Felahi
 
Essential Elements of Excellent Multilingual Search
Essential Elements of Excellent Multilingual SearchEssential Elements of Excellent Multilingual Search
Essential Elements of Excellent Multilingual Searchandrew_paulsen
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxBoston Institute of Analytics
 
Dictionary based concept mining an application for turkish
Dictionary based concept mining  an application for turkishDictionary based concept mining  an application for turkish
Dictionary based concept mining an application for turkishcsandit
 
Big Data and Natural Language Processing
Big Data and Natural Language ProcessingBig Data and Natural Language Processing
Big Data and Natural Language ProcessingMichel Bruley
 
Text-mining and Automation
Text-mining and AutomationText-mining and Automation
Text-mining and Automationbenosteen
 
Natural Language Search with Knowledge Graphs (Haystack 2019)
Natural Language Search with Knowledge Graphs (Haystack 2019)Natural Language Search with Knowledge Graphs (Haystack 2019)
Natural Language Search with Knowledge Graphs (Haystack 2019)Trey Grainger
 
Information Retrieval on Text using Concept Similarity
Information Retrieval on Text using Concept SimilarityInformation Retrieval on Text using Concept Similarity
Information Retrieval on Text using Concept Similarityrahulmonikasharma
 
Metadata & Taxonomy: The Foundation for Content and Digital Asset Management
Metadata & Taxonomy: The Foundation for Content and Digital Asset ManagementMetadata & Taxonomy: The Foundation for Content and Digital Asset Management
Metadata & Taxonomy: The Foundation for Content and Digital Asset ManagementVeeva Systems
 
DICTIONARY-BASED CONCEPT MINING: AN APPLICATION FOR TURKISH
DICTIONARY-BASED CONCEPT MINING: AN APPLICATION FOR TURKISHDICTIONARY-BASED CONCEPT MINING: AN APPLICATION FOR TURKISH
DICTIONARY-BASED CONCEPT MINING: AN APPLICATION FOR TURKISHcscpconf
 
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey GraingerHaystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey GraingerOpenSource Connections
 
Semantic Search_ NLP_ ML.pdf
Semantic Search_ NLP_ ML.pdfSemantic Search_ NLP_ ML.pdf
Semantic Search_ NLP_ ML.pdfPlamenaDzharadat
 
Deep misconceptions and the myth of data driven NLU
Deep misconceptions and the myth of data driven NLUDeep misconceptions and the myth of data driven NLU
Deep misconceptions and the myth of data driven NLUWalid Saba
 
Text Analytics for Dummies 2010
Text Analytics for Dummies 2010Text Analytics for Dummies 2010
Text Analytics for Dummies 2010Seth Grimes
 

Similaire à Metaphic or the art of looking another way. (20)

Language First Protocol from QSi
Language First Protocol from QSiLanguage First Protocol from QSi
Language First Protocol from QSi
 
Content Analyst - Conceptualizing LSI Based Text Analytics White Paper
Content Analyst - Conceptualizing LSI Based Text Analytics White PaperContent Analyst - Conceptualizing LSI Based Text Analytics White Paper
Content Analyst - Conceptualizing LSI Based Text Analytics White Paper
 
Textmining
TextminingTextmining
Textmining
 
Essential Elements of Excellent Multilingual Search
Essential Elements of Excellent Multilingual SearchEssential Elements of Excellent Multilingual Search
Essential Elements of Excellent Multilingual Search
 
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptxNLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
NLP Project PPT: Flipkart Product Reviews through NLP Data Science.pptx
 
Dictionary based concept mining an application for turkish
Dictionary based concept mining  an application for turkishDictionary based concept mining  an application for turkish
Dictionary based concept mining an application for turkish
 
Big Data and Natural Language Processing
Big Data and Natural Language ProcessingBig Data and Natural Language Processing
Big Data and Natural Language Processing
 
Word Embedding In IR
Word Embedding In IRWord Embedding In IR
Word Embedding In IR
 
sent_analysis_report
sent_analysis_reportsent_analysis_report
sent_analysis_report
 
Text-mining and Automation
Text-mining and AutomationText-mining and Automation
Text-mining and Automation
 
The impact of standardized terminologies and domain-ontologies in multilingua...
The impact of standardized terminologies and domain-ontologies in multilingua...The impact of standardized terminologies and domain-ontologies in multilingua...
The impact of standardized terminologies and domain-ontologies in multilingua...
 
Natural Language Search with Knowledge Graphs (Haystack 2019)
Natural Language Search with Knowledge Graphs (Haystack 2019)Natural Language Search with Knowledge Graphs (Haystack 2019)
Natural Language Search with Knowledge Graphs (Haystack 2019)
 
Information Retrieval on Text using Concept Similarity
Information Retrieval on Text using Concept SimilarityInformation Retrieval on Text using Concept Similarity
Information Retrieval on Text using Concept Similarity
 
Aq35241246
Aq35241246Aq35241246
Aq35241246
 
Metadata & Taxonomy: The Foundation for Content and Digital Asset Management
Metadata & Taxonomy: The Foundation for Content and Digital Asset ManagementMetadata & Taxonomy: The Foundation for Content and Digital Asset Management
Metadata & Taxonomy: The Foundation for Content and Digital Asset Management
 
DICTIONARY-BASED CONCEPT MINING: AN APPLICATION FOR TURKISH
DICTIONARY-BASED CONCEPT MINING: AN APPLICATION FOR TURKISHDICTIONARY-BASED CONCEPT MINING: AN APPLICATION FOR TURKISH
DICTIONARY-BASED CONCEPT MINING: AN APPLICATION FOR TURKISH
 
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey GraingerHaystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
Haystack 2019 - Natural Language Search with Knowledge Graphs - Trey Grainger
 
Semantic Search_ NLP_ ML.pdf
Semantic Search_ NLP_ ML.pdfSemantic Search_ NLP_ ML.pdf
Semantic Search_ NLP_ ML.pdf
 
Deep misconceptions and the myth of data driven NLU
Deep misconceptions and the myth of data driven NLUDeep misconceptions and the myth of data driven NLU
Deep misconceptions and the myth of data driven NLU
 
Text Analytics for Dummies 2010
Text Analytics for Dummies 2010Text Analytics for Dummies 2010
Text Analytics for Dummies 2010
 

Dernier

Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysismanisha194592
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...amitlee9823
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...shambhavirathore45
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 

Dernier (20)

Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 
April 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's AnalysisApril 2024 - Crypto Market Report's Analysis
April 2024 - Crypto Market Report's Analysis
 
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
Call Girls Bannerghatta Road Just Call 👗 7737669865 👗 Top Class Call Girl Ser...
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...Determinants of health, dimensions of health, positive health and spectrum of...
Determinants of health, dimensions of health, positive health and spectrum of...
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 

Metaphic or the art of looking another way.

  • 1. “I meant what I said and I said what I meant. An elephant's faithful one-hundred percent!” Dr. Seuss, Horton Hears a Who! Due thanks and apologies to Dr. Seuss and to the illustrator. Apart from the ones above, all the words are mine. As for me, I’m neither a linguist nor a programmer. I’m a writer, interested in words, awed by their expressive and impressive powers.
  • 2. The Conversation Company presents Programming: “Joos” Rajagopal Logic: Suresh Manian Arthur Koestler Noam ChomskyMax MuellerAristotle Tim Berners Lee Inspired by the words and ideas of: Metaphic A Grammatically Different Approach to Text Analysis. <createdBy> Dedicated to my children, family, friends and foafs. Suresh Manian, Bangalore 2014. The Act of CreationThe Science of ThoughtPrior Analytics Context Free Grammar Semantic Web
  • 3. One day, almost three years ago, I realized that for all intents and purposes, we are our words. Our actions and attitudes, behaviours and beliefs, emotions and experiences are all captured in our words; in the verbs, nouns, adjectives, adverbs… even the punctuations of our sentences. Grammar, not mathematics or statistics, makes the rules in the world of text communication. The Premise When it comes to text and sentiment analysis, grammar knows best.
  • 4. Words can make elephants fly, or turn lamb into sheep. Words, like people, are known by the company they keep. Words are not numbers, with precise values ruled by ratios. But words are alive in a way numbers are not. A word has an inherent awareness of its significance and position in relation to its immediate neighbours. It has a dynamic and analogous relationship with other words, governed by a set of universal rules and conventions. In the final analysis, words are anything but ambiguous.
  • 5. Supple. Stable. Scalable. - The natural properties of English grammar. While nouns and a few new verbs get added from time to time, the “universals” of the language are predetermined and inviolable. No new preposition has been added to the English language in the last 300 years. Words are kept few, common and shared to make communication possible. Unlike data or content, words grow rather slowly @25,000 words per year as per Wiki. The entirety of English words is estimated to be over just a million words, give or take a few thousands. Basically, the world of words seemed large but finite, and based on predictable and observable patterns that could be easily retrieved, analysed and interpreted, given our syntax and library based approach, smileys, emoticons and wtf included. So we set forth in our pursuit of meaning. The Program Logic Syntax are the rails semantics ride on to reach meaning. People use same words to mean same things and different words to mean different things. Sometimes they use different words to mean same things and same words to mean different things. In general, people are quite precise with their words. More or less.
  • 6. Our Content-Context framework Subtext is to Text what Metadata is to Data, and other assumptions. What is text? What is content? What is context? What is meaning? There were many questions that we had to ask and answer ourselves during the development. We just “reasoned up” as Elon Musk of Tesla says, based on first principles and a natural order of questioning, led by language, logic and StackOverflow. The program will return true or false before we could say good or bad was the thinking. The Process Attitude Reason up. Follow the Word. Focus on output, not outcome. What is Data? We hold the view that “All is data” or “Sarvam Datum”, as we say in Sanskrit-Latin. And that all data can be captured and categorized under four basic types of metadata: Time|Place|Topic|Person, responding to a Who|Where|What|When query framework. And grammar is the gopher. Content can be text/sound/image but for our purposes, it is restricted to text.
  • 7. The Proofs A grammar book and a calculator can take you a long way in making sense of Google and Twitter. It works!
  • 8. From Search to Find: In Trouve, we employ a simple and practical framework for relevant content discovery, with categorization, entity/concept extraction, and even publishing. You can use it like you would Google. In fact, you are using Google, with a little helper attached to it. There’s a lot of cleaning up to do still, but the hypothesis seemed to work. For instance, an input like “text mining” is likely to give you Text Mining 101 as a category result, among others. Makes research simpler. You can access the content in multiple ways. Helped us make a knowledge base out of our own history data. But that’s another localized demo. The difference between “Search” and “Find” is one of expectation, so to speak. Input term: ibm quarter. Feel free to try anything. And wait just a bit.
  • 9. Topics, entities, people, places, questions, requests, demands, interests, opinions, comparisons, superlatives… The combination of grammar tags and regexes gives us full control over the text to mine all the information we need. And with colour and sentiment coded inline graphs like “Sentipede” and features like Summarizer, Querifier etc., there are many facets we can show of the data. This approach is ideal to create customized tl;dr templates. Add regular metadata information into the mix and the possibilities increase substantially. Demo url: http://metaphic.in/monitor.php?term=coke What kind of messages are they? Where are they talking from? The advantage? The questions you can ask of the data increase significantly. And it is easy to customize it to brand or domain. Who is talking? What are they talking about? Who else are they talking about? How many are talking? When are they talking mostly? Where are they from? Are they saying good things or bad things? Are they being funny or sarcastic?... The questions are but natural. Shouldn’t the answers be too? Case in point: With Parts of Speech tagging, there’s no such thing as unstructured data. The “sentipede” and “message type” categorizations are early stages yet but hopefully proves the point. It is easy to build and train the libraries.
  • 10. If you really want to know how your customers or employees are feeling, just follow the adjectives. We analysed the employee survey results using our algorithm to assess needs and aspirations of the employees of a very large Business Unit of a very large IT major, and mapped it to their employee engagement programs. Extremely good results. This was when we analysed Nokia’s social data over three months. http://metaphic.in/nokia/ With this “whole-grained” approach to analysis, the data bends to yield insights.
  • 11. The metadata is much richer now, providing greater flexibility to query and retrieve relevant information. Linked text, with a syntactic grid that underpins it, is perhaps a way to go. Is perhaps the key to linked data and the dream of the semantic web. Try a url/term/text at http://metaphic.in/home/index.php? The program reads at approx. 4000 word per minute so request a little patience, especially if you want classification. The screenshots are from a link analysis from The Guardian. Nice article. We generate a subject cloud which looks at the word in context and doesn’t just look at frequency. Apologies for the lack of UI/UX and also please note that the categorizations and the library work is not complete.
  • 12. If meaning is conveyed in the flow of words, then a preposition is where the flow bends at. Biographer employs grammar and phrase patterns to make 2 minute dossiers of people. This is Arvind Kejriwal’s life story published by our algorithm. On a related note, we’ve also developed a “job description to resume match” candidate finder (RecoDex, we call it). http://metaphic.in/peop/e/
  • 13. http://metaphic.in/home/whysguy.html? Please keep the questions simple. For now. What is the meaning of life? Ask the Whysguy. Be patient. Words are parts of speech. Literally, grammatically, programmatically and pragmatically speaking.
  • 15. Adjectives and verbs reveal attitudes and behaviour that statistics can never aspire to. Proper Nouns and simple capitalization rules capture topics and people like no hashtag can. And prepositions can access insights like metadata can't. With so much emphasis on social media and unstructured text, with so much content being generated that needs to be analysed and interpreted, text analytics is one of the hottest focus areas of the technology investor. Text Analytics Market is worth $4.90 Billion by 2019 say the pundits. Metaphic is one way of looking at the opportunity. Mine the language to mind the business. Data is the new oil, and text makes up nearly 90% of it. The Potential
  • 16. Lack of money, expertise and infrastructure to bring it to market readiness and revenue. The Problem These are all proofs of concept that now need to be tested, validated and applied to answering real business questions. We have more or less reached the end of the road on this owing to lack of resources and I’m looking for an angel that might see merit in taking the development further. With a couple of projects, and a recently developed API, we have managed to and will continue to generate some revenue but it is not enough. In my estimation, there is at least another six months of work left before this can start generating steady revenue. And in two years, working overtime, this can be of some serious value.
  • 17. Grammatically Different Tel: +919845057220 mail: manian@conversation.co.in Thank you for your time and hope to hear back. Metaphic