About the webinar:
Social media is one of the richest sources of data for brands. According to Domo's 'Data never sleeps' report, every single minute 456,000 tweets are posted on Twitter, 46,740 photos are uploaded on Instagram and 510,000 comments & 293,000 statuses are updated on Facebook.
This data contains valuable information like product feedback or reviews and information that can be used to better understand users or find valuable insights. However, traditional ways struggle to analyze the unstructured data and this is where sentiment analysis using machine learning comes to the rescue!, Machine learning can help to understand the text and extract the sentiment using Natural Language Processing. Sentiment analysis can be applied in a range of business applications like - social media channel analysis, 360-degree customer insights, user reviews, competitive analysis, and many more.
What you will learn
- How businesses are leveraging sentiment analysis to their advantage
- Best practice to automate machine learning models in hours not months
- Demo: How to build a twitter sentiment analysis model
2. Technology enthusiast with 13+ years of experience working
in the information technology and services industry. Leads
cutting-edge solutions for businesses using Machine Learning
and Artificial Intelligence.
Areas of expertise includes Architecture design, Solutioning,
Data Engineering and Deep Learning.Mohit Juneja
Solutions Architect
The Speaker
3. Extensive experience building future tech products using
Machine Learning and Artificial Intelligence.
Areas of expertise includes Deep Learning, Data Analysis,
full stack development and building world class products
in ecommerce, travel and healthcare sector.
Shruti Tanwar
Lead - Data Science
The Speaker
4. Bikash Sharma
CTO and Co-founder at
Skyl.ai
CTO & Software Architect with 15 years of experience
working at the forefront of cutting-edge technology
leading innovative projects
Areas of expertise include Architecture design, rapid
product development, Deep Learning and Data Analysis
The Panelist
5. Getting familiar with ‘Zoom’
All dial-in participants will be muted to enable the presenters
to speak without interruption
Questions can be submitted via Zoom Questions chat
window and will be addressed at the end during Q&A
The recording will be emailed to you after the webinar
Please familiarize yourself with the Zoom ‘Control Panel’ on your screen
6. Live Demo of Twitter
sentiment analysis
in 10 minutes
Diving into
Sentiment Analysis
...In the next 45 minutes
1 2 3
How to quickly
overcome the
challenges in
building ML models
7. Machine Learning automation platform for unstructured data
A quick intro about Skyl.ai
Guided Machine Learning Workflow
Build & deploy ML models faster on
unstructured data
Collaborative Data Collection & Labelling
Easy-to-use & scalable AI SaaS platform
8. POLL #1
At what stage of Machine learning adoption your
organization is at?
⊚ Exploring - Curious about it
⊚ Planning - Creating AI/ML strategy
⊚ Experimenting - Building proof of concepts
⊚ Scaling up - Some departments are using it
⊚ In production - Using it in product features
⊚ Transforming - AI/Ml driven business
10. Opinion Mining i.e. advanced text mining techniques to analyze the
sentiment of the text in the form of positive, negative and neutral
Sentiment = Opinion = Emotions = Attitude
Sentiment Analysis
11. Why do we need it?
Traditional BI tools cannot
⊚ Capture sarcasm or learn new slangs
⊚ Extract insights from large volume of unstructured data
⊚ Help brands gauge reactions of customers
14. 500 million tweets
Sent every day globally
80% of Twitter users
mentioned a brand in a tweet.
77% of users
Feel more positive about a brand
when their Tweet has been
replied to
Twitter Statistics , Twitter Blog, New Holiday Research
15. Applications of sentiment Analysis
Marketing
Social media
monitoring
Ecommerce
Product/Service
Reviews
Politics
Election results,
perception of
policies
Public Actions
social phenomena,
red alert situations
19. Skyl.ai - as ML automation platform
Efficient
Data Management
Solve your data issues; collect and manage data
efficiently
Accuracy
& Quality
Maintain accuracy and quality; train and test faster;
monitor quality
Effective
Collaboration
Collaborate and manage projects efficiently
Early
Visibility
Get early visibility; visualize and affirm correctness
on every step of the way
Scalable
High - Performance
Access on-demand and scalable, high-performance
infrastructure
Reduce
Cost
Reduce cost of implementation; do it with less
specialized resources
20. POLL #2
Some challenges that you are facing while
implementing AI & Machine Learning
⊚ Not started yet, so no challenges
⊚ Data collection
⊚ Data Labeling
⊚ Large volumes of data
⊚ Identifying the right data set to
train
⊚ Data Security
⊚ Lack of knowledge of ML tools
⊚ Lack of end to end platform
⊚ Lack of expertise
⊚ Choosing the right algorithms
21. Overcoming the AI / ML
challenges with the right tools
and technologies03
22. Best Practices for Data Collection
⊚ Use relevant data sources
for data collection
⊚ Establish proper data
collection mechanisms
⊚ Do not stop with too-small
data sample size
Data Collection
Data Quality
Data SecurityData Security
Data SecurityData Labeling
23. Best Practices for Data Quality
⊚ Do validate your data and data
sources
⊚ Clean up your data regularly -
“garbage out”
⊚ Data correction - remove
duplicates, missing data, etc
⊚ Check the consistency of data while
data acquisition
Data Collection
Data Quality
Data SecurityData Security
Data SecurityData Labeling
24. Best Practices for Data Security
⊚ Monitor data processes
continuously to mitigate risks
⊚ Increase data security with
encryption and tokenization
⊚ Controlled access flows with
different organizational roles
Data Quality
Data Collection
Data SecurityData Security
Data SecurityData Labeling
25. Best Practices for Data Labeling
⊚ Define the problem you want to solve
and use relevant labels inline with the
entities you want to predict
⊚ Analyse trends and progress of your
data labeling in real time - to find biases
⊚ Do not add new entity types midway
⊚ Use short tag lists and annotationsData Labeling
Data Quality
Data SecurityData Security
Data Collection
26. Challenges
⊚ Requisite Infrastructure
⊚ Cost of Infrastructure
⊚ Data and ML pipeline
⊚ Model at scale for
inference
Best Practices
⊚ Use SaaS Model (Pay as you go) -
reliable, scalable and secure
⊚ The right software tuned and
optimized to fit the underlying hardware
⊚ A flexible infrastructure that can be
deployed in the cloud or in an on-premise
data center to optimize performance
Technology issues and solutions
27. Best Practices
⊚ Train existing employees with
education related to AI and ML
⊚ Use Saas products with good
documentation, support and
implementation that alleviates the
need to have highly skilled data
scientists and resources with multiple
skills.
40%
Lack of skilled talent
Source: Techrepublic
Barrier in adopting AI
⊚ Companies face
shortage of necessary
in-house talent.
Specialized skills and knowledge
28. Challenges
⊚ Long implementation time
⊚ Measure ROI of the AI
deployment
Best Practices
⊚ AI implementation results in
increased process efficiency and
automation.
⊚ Create own AI KPIs and analyze
the difference in the
measurements before and after AI
deployment.
TechRepublic claims that 56%
of global CEOs expect it to take
3-5 years to see any real ROI on
their AI investment.
Speed and time to market
29. Collect
Feedback
Monitor the
model
Process
Feedback
Deploy the
changes
Train and
Evaluate
Continuous
Improvement
Best Practices
⊚ Perform incremental and
measurable improvements
⊚ Monitor your deployed models
and analyse inference count,
accuracy and execution time.
⊚ Check model performance in
real time
Monitoring and continuous improvement
30. AI Project Management
More Challenges and
Concerns
⊚ Project Cost
⊚ Return on Investment
⊚ On-demand scalability
⊚ Iterative corrections in
AI project
Source: AI for People and Business: A Framework for Better Human Experiences and Business Success
DATA
Time Cost
Performance Requirements
The TCPR Model
31. Skyl.ai - as ML automation platform
Efficient
Data Management
Solve your data issues; collect and manage data
efficiently
Accuracy
& Quality
Maintain accuracy and quality; train and test faster;
monitor quality
Effective
Collaboration
Collaborate and manage projects efficiently
Early
Visibility
Get early visibility; visualize and affirm correctness
on every step of the way
Scalable
High - Performance
Access on-demand and scalable, high-performance
infrastructure
Reduce
Cost
Reduce cost of implementation; do it with less
specialized resources
32. ⊚ Free 1 month Trial + POC
⊚ Complimentary 30 min consultation
⊚ AI Implementation Playbook
www.skyl.ai contact@skyl.ai
Special offer for you...
34. 85 Broad Street, New York, NY, 10004
+1 718 300 2104, +1 646 202 9343
contact@skyl.ai
We hope to hear from you soon
Thank you for joining!
Notes de l'éditeur
Hello everyone and welcome. Thank you for joining today’s webinar on Twitter Sentiment Analysis in 10 minutes with Machine Learning. My name is Edwin Martinez and I’ll be your host today. First off, I’d like to introduce 3 speakers for today’s webinar, who are experts in the field of AI.
First we have Mohit Juneja, Mohit is a Solutions Architect and Technology supporter with over 13 years of experience in the IT and Service industry.
He leads cutting-edge solutions for businesses using Machine Learning and AI. He’s an expert in Architect design, Data Engineering, and Deep Learning. Welcome Mohit!
Next we have Shruti Tanwar - Shruti is an expert in data science who is a veteran in building SaaS products using Machine Learning and AI.
Her expertise includes Deep Learning and Data Analysis, as well as full stack development and building tech products in various different fields such as ecommerce, travel, and healthcare. Welcome, Shruti!
Finally, we have Bikash Sharma, joining us as a panelist.
Bikash is CTO and Software Architect with 15 years of experience in leading innovative software projects and solutions.
He’s co-founded Skyl with his expert knowledge in AI and Machine Learning. Welcome, Bikash!
Before we begin, I’d like to briefly talk about some Zoom features that will be relevant to us.
All participants in the webinar will be muted to avoid any interruptions during the session.
Any questions you might have can be submitted to the Zoom Questions chat window in the control panel, located on the bottom of the screen.
We’ll make sure to address your questions during the Q&A session.
Also, the recording of the webinar will be emailed to you afterwards, just in case you’ve missed any talking points or wish to view it again.
So that’s all for the introduction - let’s bring in our first speaker Mohit for more on this topic
Exploring - Curious about it
Planning - Creating AI/ML strategy
Experimenting - Building proof of concepts
Scaling up - Some departments are using it
In production - Using it in product features
Transforming - AI/Ml driven business
One of the first studies on Twitter data for sentiment was to study public perception of Obama’s performance as President. Another example could be the to explore the variation of sentiment regarding the TV series “Game of Thrones.” The unpredictable episode “The Rains of Castamere” resulted in a lot of negative tweets and a peak in the sentiment score.
In the past, many companies have used traditional business intelligence tools to monitor social media. However, this is not efficient because traditional BI tools cannot handle true sentiment analysis, capture sarcasm, or process and learn new slang.
Any guess? What this graph is about?
This is Donald Trump’s tweet activity from 2009 onwards. You can see the pattern changing from 2013
Traditionally, we would run surveys to gather data and do statistical analysis. With Twitter, it works by extracting tweets containing references to the desired topic, computing the sentiment polarity and strength of each tweet, and then aggregating the results for all such tweets. Companies use this information to gather public opinion on their products and services, and make data-informed decisions.
Twitter is one of the top social media platforms for information and interaction with brands and influential people across the world. Approximately 321 million active users send about 500 million tweets daily. Therefore, this platform is a great channel for customer service and marketing strategy. Twitter allows the mining of data of any user through Twitter API or Tweepy.
Monitoring Twitter enables companies to know their audience, be on top of what is being said about their brand, discover new trends, and analyze the competition. But while analyzing Twitter data, just the quantitative metrics like the number of mentions or retweets are not enough, what matters is being able to grasp the effect of those mentions on the brand, whether they create a positive or negative effect. In the case of negative content going viral, social listening and monitoring of conversation/feedback become even more necessary as they can harm a brand’s reputation, leading up to an unexpected PR crisis
Marketing - helps in understanding customer feelings towards a brand or product. I Social media monitoring Review - Uber had used sentiment analysis and social media monitoring tools to find out whether users are liking the new version of their app.
2 Ecommerce - explains how people respond to a certain product or campaign in a certain way.
3. Politics - It highlights inconsistencies between actions and statements at the government level and can also be used to predict election results. The sentiment analysis tool was used during the 2012 US presidential elections by the Obama administration to analyze the reception of policy announcements.
4. Public actions - Social phenomenon can be tracked with the help of Twitter sentiment analysis. It could help identify dangerous situations or determine the general mood of an environment. It can help in crisis prevention by analyzing negative mentions in real-time, which allows reacting in the nick of time and nipping the problem in the bud.
How
5 minutes intro - 10 industry awareness - 15 min demo - 20 minutes QnA
Define problem - Features model - How this model is built using skyl.ai
Add slide of Pneumonia detection
Not started yet, so no challenges
Data collection
Data Labeling
Data Bias
Large volumes of data
Identifying the right data set to train
Lack of knowledge of ML tools
Lack of end to end platform
Lack of expertise
Choosing the right algorithms
Monitoring the model performance
How
Data is one of the most valuable resources today’s businesses have. The more information you have about your customers, the better you can understand their interests, wants and needs.
Use of relevant data sources - to get consistent and accurate data relevant for the problem you want to solve
Collection mechanisms - A formal data collection process is necessary as it ensures that the data gathered are both defined and accurate.
Small sample size - do not give the distribution of the data to the edge cases and will not train the model for exceptional cases.
Keep in mind that machine learning is a process of induction. The model can only capture what it has seen. If your training data does not include edge cases, they will very likely not be supported by the model.
The best practices can be achieved by Data Cleaning: Applying a detailed data analysis at the initial phase for recognizing which sorts of irregularities and errors must be expelled. Notwithstanding a manual assessment of the information or data samples, analytic programs are frequently expected to pick up metadata about the data resources and distinguish the issues of data quality.
Don’t let bad data or records go unresolved - remove duplicates and fill missing data;
For missing data, you should flag and fill the values.
Flag the observation with an indicator variable of missingness.
Incorrect or inconsistent data leads to false conclusions. And so, how well you clean and understand the data has a high impact on the quality of the results.
Encrypted data sources
All data sources are encrypted; thus giving users an additional layer of security, making sure your data stays safe and protected.
Access controlled flow
Defined and controlled access flows with different organizational roles like business owner, project lead, collaborators etc. allow for selective restriction so that you have full command to regulate who can view or use resources in your ML projects.
Adding tags midway - For example, the set of tags for a pizza chatbot might start with the tags “Size” “topping” and “drink” before someone realizes that you also need a “Side Dish” tag to capture Garlic Bread and Chicken Wings. Simply adding these tags and continuing work on the documents that haven’t been labeled yet poses a danger to the project. The new tags will be missing from all of the documents annotated before the new tags were added This means that your test set will be wrong for those tags, and your training data won’t contain the new tags leading to a model that won’t capture them.
<https://towardsdatascience.com/four-mistakes-you-make-when-labeling-data-7e431c4438a2>
In an annotation process, increasing the number of choices the annotator needs to make slows them down and leads to poor data quality.
Requisite Infrastructure: When launching a machine learning initiative, organizations can easily underestimate the resources they need for infrastructure. There can be substantial infrastructure requirements for machine learning, especially in the cases of image, video, and audio processing.
Cost of Infrastructure: Training and deploying a scalable infrastructure to support machine learning can be expensive and difficult to maintain.
Having a cloud approach allows experimentation with machine learning at scale without the overhead of physical hardware acquisition, configuration, and deployment.
Data & ML pipeline: AI, Machine learning and deep learning solutions require a high degree of computation speeds offered.
Model at Scale for inference: Deploying a scalable infrastructure to support machine learning can be expensive and difficult to maintain. Things get tedious and difficult to maintain at scale compared to having single server deployment or in a developer’s environment which is not a usual case.
Skills challenges: Choice of right ML algorithm - ML, DL, RL
AI product management - Dealing with Cold start, managing data labeling project, keeping transparency in the project; Keeping the model up to date.
Adoption of AI technologies requires specialists like data scientists, data engineers, infrastructure engineers and other SMEs (Subject Matter Experts).
Even with long implementation time, AI has potential to cut the expenses.
TechRepublic claims that 56% of global CEOs expect it to take 3-5 years to see any real ROI on their AI investment.
Machine automation produces quality products faster and more efficiently, while providing critical information to help managers make more informed business decisions.
Making continuous improvement part of company culture is an excellent and cost-effective approach to tackling an organization’s most difficult challenges. When supported by improvement technology, results can be achieved quickly and success can be sustained over time.
On-demand scalabilty: The truth that it’s better to have a working prototype of a smaller product, rather than an unfinished large one, still stands here with machine learning products. New ML MVPs should be prioritized based on the speed of delivery and their value to the company. If you can deliver products, even those which may be smaller, with speed, it can be a good, quick win for the whole team—you should prioritize these products first.
Organizations need to keep in mind that machine learning is an iterative process, and modifications to models might happen over time to support changing requirements.
TCPR Model: The TCPR model represents an indeterminate system—one in which more than one solution exists. In this way, it’s like a four-legged table. Engineers know that unless a four-legged table is perfectly made, and the floor on which it rests is completely flat, it’s impossible to calculate the simultaneous forces on all four legs. Moreover, the table is unbalanced and is likely resting mainly on three legs, which causes it to wobble as a result.
Link: TCPR
Thank you Mohit and Shruti, for the wonderful presentation and demo.
As mentioned earlier, the recording of the webinar will be emailed to you afterwards. [pause]
Before we get to the Q&A, I want to mention some of the offers Skyl has for those of you that are curious about incorporating Machine Learning to your business.
Skyl offers a free 1 month trial, plus Proof of Concept.
You’ll be able to interact with real data on the screen, just like we showed in the demo. You’ll experience the process of going from collecting & labeling the data… all the way to deploying a model!
Skyl also offers a complimentary 30 min consultation and an AI Implementation Playbook to go along.
This is a great opportunity to see how Skyl can provide Machine Learning solutions to your challenges.
If you’re interested in finding out more, please visit the skyl.ai website or you can send an email directly to contact@skyl.ai.
Alright, now it’s Q&A time!
As a reminder, if you have any questions, go to the question box in your control panel - located on the bottom of your Zoom screen.
We’ll try to answer as many questions as possible in the time that we have left.
So let’s answer some questions.
Sample questions:
Shruti
-(Julie) If I build a lot of models, how do I handle model deployment in that case?
-How do you avoid creating a biased model and if you detect one, how do you rebuild it?
Mohit
How can Skyl help me with my data labelling needs if I have data privacy issues?
How can I build a labelled dataset using Skyl?
Ok, that’s all the time we have for questions today, but feel free to contact us with your specific questions and we’ll make sure to get them answered.
All right, so that’s it for today’s webinar, I hope you enjoyed it
We have a lot more webinars coming up on different machine learning topics and how they can be implemented into different businesses and industries,
So don’t miss out and make sure you sign up for upcoming ones as well
Thank you for joining and I hope you have a wonderful day.