Prediction and Analysis of Mood Disorders Based On Physical and Social Health Indicators

Prediction and Analysis
of Mood Disorders Based
On Physical and Social
Health Indicators
Findings from the CCHS-2014 survey
IMAT5314 PROJECT 2019
P16233152

Problem
Statement
 Although anxiety and mood
disorders are commonly found in
many communities, there is little
empirical evidence of one single
concrete cause of these illnesses
 In fact, mental illnesses typically
have multiple causes that can
stem from factors such as
individual emotional experiences,
state of living, addiction or/and
upbringing.
 How can we understand which
factors influence, cause or
deepen anxiety and mood
disorders?

Solution Scope
 By extracting results from the CCHS
(Canadian Community Health Survey), it
was possible to perform an exploratory
data analysis on physical, social and mental
health factors.
 The extracted data showed an opportunity
to apply machine learning techniques to
attempt to uncover patterns and to
attempt to understand the relationships
between physical and social health factors
on mood and anxiety disorders.
 This research focuses on the underlying
influences of physical health (such as onset
of physical illnesses, level of exercise,
smoking habits) and social factors
(including sense of belonging, individual
income) and their relationship with anxiety
and mood disorders.

Project Goal and Objectives
Goal:
 To understand the relationships between physical health and
social aspects and whether they coincide with anxiety or mood
disorders.
Objectives:
1. To achieve a deeper general understanding of the physical and
social factors that potentially influence or are influenced by
mental health
2. To understand identified relationships and patterns from a
technical perspective in the data
3. To transform the data using techniques so that it is a suitable
input for the models being used.
4. To create the basis for a machine learning model that can be
used to predict the onset of mental disease and to ultimately
answer the question of whether mental illness can be predicted
based on a set of physical and social factors

Project Approach
Literature Review
Methodology
Results
Comparison
Conclusions

Literature Review: Short Summary of Findings:
Relationships between physical/social health
aspects and mental illnesses
 Relationships have been previously researched and observed between:
 Poverty, social cohesiveness, identity, self-esteem  Anxiety, Depression
 Anxiety  (Stress  Heart rhythm)  Blood pressure
 Alcohol Anxiety, Depression
 Anxiety  Smoking
 Diabetes / Cancer  (low number of cases) Anxiety, Depression
 Arthritis (RA) / Asthma  Depression
 Physical Activity  Anxiety
Note: All references are included in the Project Documentation

The wider view of data analytics

Literature Review: Short Summary of
Findings: Technical understanding of
analysis methods suitable for the dataset
 Exploring case studies from existing research projects that applied
machine learning to health data and digital health:
 Many used ML methods to predict the onset of mental illnesses using
different techniques such as random forests, neural networks and
naïve bayes
 Data Analytics and ML go hand in hand, where DA attempts to construct
hypothesis through investigation, and ML attempts to answer these
hypothesis through training and testing data
 Understanding Machine Learning methods:
 Classification VS Regression
 Supervised VS Unsupervised
 Random Forests, Regression, Ensemble Methods, Pattern Mining
 Class imbalance
 Feature selection and reduction
 Performance Metrics (confusion matrix, AUC, F score)

Methodology
 First, conduct exploratory pattern analysis of the data and
extract meaningful findings, thus addressing research objectives
1 and 2.
 Transform, resample and normalize the data to address research
objective 3, which is also a prerequisite for objective 4
 Apply machine learning models in order to build a prototype for
a mood disorder prediction model, thus referencing objective 4.
Tools:
 Python & various libraries for Machine Learning
Objectives
revisited:
1. Deeper general understanding
2. Understand patterns from a
technical perspective
3. Transform the data
4. Apply/Configure a machine
learning model to predict the
onset of mental disease

Methodology cont.
 Data pre-processing and sampling (DA)
 Splitting between nominal and continuous
 Deriving count, mean, ranges, standard deviation and
distribution
 Correlation analysis to determine whether features would
need to be stripped
 Comparative analysis
Tools:
 Data exploration was done using Microsoft Excel
 Python was used with packages for statistical analysis

Methodology cont.
 Data pre-processing and sampling (ML)
 Normalisation
 Splitting between test and training data 70%/30%
 Tackling class imbalance with SMOTE
 Applying classification models
 Measuring performance
 Selecting the top performing models were selected based on the highest scores
Tools:
 Data exploration was done using Microsoft Excel
 Python & various libraries for Machine Learning

Data Analysis Findings
 The comparative analysis exercise did in fact justify that mental illness was, in
general, more present in those suffering from physical illness - although the
differences were not significant.
 Due to the complexity of the relationships that each variable has, which cannot simply
be explained directly with correlation, this gives a further reason of why machine
learning is a suitable candidate to analyse this sort of data.
 Some of the results:
 Alcohol drinkers experience more mood disorders and anxiety. Although the difference is
minimal.
 Out of the segment of smokers that smoke at least 31 cigarettes a day, 25.52% are classified
as suffering from a mood disorder, an increase of 16.86% from the general sample.
 Active people that engage in regular physical exercise show a lower proportion of people
diagnosed with (1.42% lower) anxiety and (2.69% lower) mood disorder
 Association results demonstrated that the strongest physical illness links with mental
illnesses were arthritis, followed by high blood pressure and asthma.

Machine
Learning Results
 SVMs were observed to
be the most effective
predictor for mood
disorders and anxiety
as in terms of accuracy

Conclusions & Lessons
Learnt
• Data availability is the true bottleneck for DA and ML projects
• Pre-processing was possibly the most important step in this project.
Throughout the first phases, lots of experimentation and research was
done in order to fine tune and prepare the data in the best way possible
• Machine Learning can prove to be a reliable predictor for classification
problems and can be applied in many ways as long as the data is
available
• Tools and learning resources are prevalent, updates are frequent,
techniques are evolving continuously

Future steps for mental illness classification?
Other data types could be
explored, such as images stemming
from brain scans (MRI, PET) that
show brain activity for individuals
experiencing mood disorders or
anxiety.
Better infrastructure allows for
heavier algorithms, which means
that there can be better results
Eventually, a more refined version
of this model can be used as a
back-end structure to an app or
website that raises awareness for
individuals to be able to gauge how
their lifestyle, habits and physical
factors could potentially affect
their mental health.

Prediction and Analysis of Mood Disorders Based On Physical and Social Health Indicators

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (19)

Similaire à Prediction and Analysis of Mood Disorders Based On Physical and Social Health Indicators

Similaire à Prediction and Analysis of Mood Disorders Based On Physical and Social Health Indicators (20)

Dernier

Dernier (20)

Prediction and Analysis of Mood Disorders Based On Physical and Social Health Indicators

Notes de l'éditeur