2. Training Content
• Brief history Machine Learning
• What is Machine Learning
• What is
• Statistics
• Data Science
• Data Engineering
• Data Analysis
• Real world scenarios to understand the
perspective
• Types of Machine Learning
• Unsupervised Learning
• Supervised Learning
• Reinforced learning
• What are open source technologies
• Examine data types – quantitative, qualitative,
continuous, discrete, ordinal, nominal etc.
• What is Missing and Outliers values 2
• Definition of Artificial intelligence
• PAST of AI
• History
• The Original 7 Aspects of AI (1955)
• What is Intelligence
• Types of AI
• PRESENT of AI
• How to Teach Machines?
• Maturity Level
• Applications of AI in terms of Analytics
(six)
• Miscellaneous related to AI in terms of
Analytics
• Future of AI
• Next Generation AI
• Next Generation Applications (< 5 years)
Day1 Day1
3. Training Content cont
Text Analytics
•Text analysis steps
•Tokenization
•Part of speech tagging
•Lemmatization
•Named entity recognition
•Sentiment analysis
Time Series Analysis
•What is Time Series Analysis
•Periodic models: monthly,
weekly, and daily averages
•Anomaly detection
•Predictions
•Trend Analysis
Clustering/
Grouping/Segmentation
•Principle Component Analysis
•What is clustering and use cases
•Proximity Matrices – find
dissimilarity between two
observation
•Choice of attributes
•Units of measure of attributes
•Determining number of clusters
•K-means clustering
•Practical Issues with K-means
clustering
•Measure performance of clustering
•Hierarchical clustering
•Practical Issues in clustering
3
Regression and Classifications
•What is Regression
•What can be predicted
•How to make sure given result is
good enough
•What are classification problems
and types of classification
algorithms
•Decision Tree Classification
•Performance metrics –
confusion matrix
•Receiver Operating
Characteristic AUC (ROC AUC)
and Precision Recall AUC (PR
AUC)
Day1 Day2 Day2
4. Training Content cont
• Introduction to statistics
• What is Statistics
• Population and Sample
• Descriptive and Inferential
Statistics
• Parameters and Statistics
• Measures of Central Tendency:
Mean, median, mode
• Measures of Dispersion: Range,
quartile deviation, mean deviation ,
standard deviation
• Measure of Shape: Skewness,
Kurtosis
• Sampling Procedure: Probability &
Non-Probability
• Normal Distribution
• The Histograms
• Hypothesis Testing
• The Testing Process
• Sample Averages
• Confidence Intervals
• Hypothesis Tests
• The Null Hypothesis
• The p-Value
• Interpreting the Test
Results
• One & Two sided Tests
• Type I & II Errors and
Power
• Deciding on the Sample
Size 4
• Representing Data -
Graphical /Tabular
• XY Graphs
• Scatter Graphs
• Correlation
• Box Plots
• Box Plots for
Comparison
• Grouped Data
• Cumulative Frequency
• Percentiles
• Pareto Charts
• Stem and Leaf Plots
• Multi variance Charts
Day2
Day2
Day2