SlideShare a Scribd company logo
1 of 27
The ACP
MS Repository
A Case Study

tranSMART Community Meeting, Nov. 2013

Stephen Wicks, Ph.D.
The ACP Repository: a Case Study

What is Multiple Sclerosis?









Chronic inflammatory/demyelination disorder affecting
the CNS. (about 0.1%)
Leading cause of neurological disability in young
adults.
Symptoms are variable and significant. They include
vision, cognition, locomotion, pain, disorientation,
dexterity, mood, bowel/bladder control, others.
Generally progressive, but progression is idiosyncratic.
(CISRRMSSPMS, vs. CISPPMS etc.)
Complex etiology
The ACP Repository: a Case Study

What is the cost of MS?
 Difficult

and costly to diagnose (MRI, symptom
variability leads to extensive differential diagnosis)
 Treatments can slow progression, but are expensive.
 Many different drugs exist, but patient stratification
for maximum efficacy and minimum side effects is
non-existent. “Role the dice”
 Often strikes early in life, and is a life-long disability.
 Average Diagnosis at about 30. 5% before 16.
Orion Bionetworks
 ACP

is a founding member of Orion.
 Orion seeks to cure MS by harnessing the power of
computational modeling of disease progression.
 ACP will provide its data to Orion in tranSMART to
facilitate this goal.
 Rancho BioSciences will curate and harmonize the
ACP data for Orion
The ACP Repository: a Case Study

ACP and the MS Repository
 Founded

in 2001 by an MIT entrepreneur with MS
 ACP MS Repository started in 2006. The goal was
to identify the cause of MS.
 ACP MS Repository enrollment shut down this
year. Approximately 3200 participants enrolled.
 Biosamples, demographics, medical history etc.
 Research data
 OPT-UP
The ACP Repository: a Case Study

Repository Enrollment Status (6/21/2013)
• 3,220 subjects enrolled; 467 longitudinal visits completed
Subject Breakdown

Treatment Status

Subjects: 3220

TX+

TX
Naïve

TX-

Time Since Diagnosis
Pending

< 5 yrs

5-10 yrs

> 10 yrs

Pending

RRMS:1436

146

229

116

675

388

291

82

SPMS:234

164

53

13

4

27

58

149

0

PPMS:117

46

38

31

2

48

37

32

0

CIS:88

19

2

62

5

74

5

5

4

TM:175

12

17

131

15

106

32

24

13

NMO:383

200

66

39

78

252

52

13

66

ON:16

1

1

12

2

12

1

0

3

ADEM:32

MS:
1787

945

1

6

20

5

24

5

1

2

Cases:
2481

Controls: 739

• DNA, RNA, Plasma, Serum, PBMCs + data from 52 page CRF
6
The ACP Repository: a Case Study

The ACP Engine
“Matchmaker”
Database Graphical
User Interface

MS Discovery Forum
Reviewing Developments in the MS Field
Communicating with MS Researchers

Allowing MS Researchers
Worldwide to Explore the
ACP Repository Database

ACP Repository
$13 million
Invested

3200+ participants
Biosamples & Datasets

77 sets of biosamples+data

(b,m)illions of datapoints,
From 36 studies, so far

MS Researchers
Worldwide
Academia & Industry

Insights and Results

Mechanisms

Diagnostics

Causes

Treatments
The ACP Repository: a Case Study

ACP MS Repository
Open-access collection of highly annotated bloodderived samples plus data from MS, related diseases, &
control subjects gathered from 2006-2013.
 Requirement for research data derived from samples to
be with them (ACP) allowed us to obtain IP protection).
“Workingdeposited (with a provision forcritical samples and confirm our
results for only $20,000. If I had to obtain these samples from scratch, it would
 Contributes to MS+ research in many ways:
have cost $1 million and added 5 years to the project.”
 - Enables studies that might not be conducted School of
Thomas M. Aune, PhD, Molecular Biology, Vanderbilt University
Medicine otherwise (academic & commercial)
(from Scientific American)
 Creates a common results database for studies from
multiple bio-analytical techniques on overlapping sets
of subjects.
 Approximately 3200 participants.

The ACP Repository: a Case Study

Case Report Form (CRF)
Curation challenges
The ACP Repository: a Case Study

ACP Case Report Form
48 Page (first visit) and 38 page (second visit)
complete clinical workup
 Form completed with the assistance of a
clinical research associate over a several hour
interview (with sample draw and lab workup)
 Broad data: 80 distinct tables in an SQL
database
 Deep data: in flat data files, more than 20
million cells

The ACP Repository: a Case Study

CRF Sample Fields

Study drugs Drug”measurelethaly so) drug meaningless.
103 distinct textual responses.“CS-0777”, orordinal with betaseron, BETASERON, etc.
Illustrates some Nothe problems associated order was units dataset.
Inappropriate (sometimesdrugfrequency.
of consistent etc…
of trail enrollment
“First Drug”, “Second “Betseron”, beta-seron, curating this
“BG00012 (FUMARATE) OR PLACEBO”
The ACP Repository: a Case Study

DMD Curation Solutions
 We

applied drug ontologies and mapping
vocabularies where needed.
 We repaired and consolidated dose,
frequency, etc. to a single measure with 3
values (high, standard, low)
 We re-formatted the data to eliminate the
ambiguous cardinal ordering of reporting
The ACP Repository: a Case Study

CRF Sample Fields

Multiple Drugs (Observations) were addressed with…
The ACP Repository: a Case Study

VISIT_NAME application
The ACP Repository: a Case Study

Controlled Vocabularies
(sports)
 ~5000

responses
 779 distinct sports reported
 When filtered by “ski”,29
 “gym”, 45
 “walk”, 30; “jog”, 17, “run”, 40
The ACP Repository: a Case Study

Controlled Vocabularies (sports)

All sports mapped to a 29 term vocabulary.
The ACP Repository: a Case Study

Controlled Vocabularies (pets)
 ~6500

pets reported
 600 distinct pets reported
 When filtered by “dog”, 112, however, this misses
mispellings (“diog”, “dot”, “pubs”, dog-like pets
“wolf”, “half-wolf”, “mutt”, and breeds (“poddle”,
“poodle”, “Afghan Hound”, etc.)
 59 additional dog-like entries
The ACP Repository: a Case Study

Controlled Vocabularies (pets)

All pets mapped to a 31 category controlled vocabulary
The ACP Repository: a Case Study

Medication Curation Challenges









Amitriptaline
>10,000 medications listed.
Amitriptylin
Amitriptyline
2703 distinct medications listed.
Amitriptyline HCL
Mapped these to 614 real medications (e.g.
Amitroptyline
Amitriptyline)
Amitryetyline
This was split into twoAmitrypatiline
tables:
Amitryptailine
 Continuing Medications (541 entities)
Amitryptaline
 Stopped Medications (317 entities)
Amitryptilin
VISIT_NAME was used Amitryptiline distinct observations
to represent
across the whole study
Amitryptilline
Amitryptylene
Truly longitudinal measures were reified in the tree
hierarchy in the data Amitryptylinefile.
mapping
The ACP Repository: a Case Study

ACP Repository
Tree
The ACP Repository: a Case Study

Date and Time Coding
 All
Dates in multiple formats:
15/12/2001
15/Dec/2001
Dec-2001
2001
Dec./2001
12/2001
--/--/-----/2001
------------/12/2001

dates converted to
periods (Months, Years,
or Days) prior to the
relevant blood draw
date.
 Dates were represented
by International
Standard ISO 8601. i.e.
YYYY-MM-DD (e.g. 200112-15)
The ACP Repository: a Case Study

Repository Usage




77 studies ongoing or completed
36 studies have returned data to ACP
Data types:








Low-D biomarker (antibodies, metabolites, serum
markers of inflammation, etc.)
Low-D genotype data
High-D SNP/GWAS data
Gene-expression studies
Whole-genome sequencing (2 distinct studies)

Study types:




Etiology
Diagnostics
Disease activity biomarkers
The ACP Repository: a Case Study

Repository Usage
The ACP Repository: a Case Study

Research Data Curation Challenges
 Few

guidelines provided to researchers for data
formatting or treatment
 Often little or no documentation describing how
the data was generated or handled (raw vs.
normalized, transformations e.g.)
 Load study meta-data (contact info, description,
etc. at the node level)
The ACP Repository: a Case Study

Sample Study Results
Biogen gene expression study:
Designed to identify gene-expression
profiles that discriminate progressive
forms of MS from relapsing-remitting
forms of the disease.
The ACP Repository: a Case Study

Future Directions








Rancho BioSciences is providing guidance to ACP
for data-collection practices going forward (e.g.
OPT-UP)
We loaded the clinical data and 6 sample study
datasets into an Oracle-based tranSMART instance
that we host in-house for QC purposes.
The full dataset is slated to be loaded into a 1.1
postgreSQL-based tranSMART instance (hosted by
Recombinant by Deloitte for Orion).
This and other data sources (Inst. For Neuroscience
at B&W) will be analyzed and modeled by Orion
The ACP Repository: a Case Study

Thanks for your time!
Questions?

More Related Content

Viewers also liked

Viewers also liked (8)

tranSMART Community Meeting 5-7 Nov 13 - Session 5: Advancing tranSMART Analy...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Advancing tranSMART Analy...tranSMART Community Meeting 5-7 Nov 13 - Session 5: Advancing tranSMART Analy...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Advancing tranSMART Analy...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART a Data Warehous...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart’s application t...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Characterization of the c...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: eTRIKS - Science Driven D...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: eTRIKS - Science Driven D...tranSMART Community Meeting 5-7 Nov 13 - Session 5: eTRIKS - Science Driven D...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: eTRIKS - Science Driven D...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Simulation in tranSMART
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Simulation in tranSMARTtranSMART Community Meeting 5-7 Nov 13 - Session 3: Simulation in tranSMART
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Simulation in tranSMART
 
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
tranSMART Community Meeting 5-7 Nov 13 - Session 5: Recent tranSMART Lessons ...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Pfizer’s Recent Use of tr...
 

Similar to tranSMART Community Meeting 5-7 Nov 13 - Session 5: The Accelerated Cure Project MS Repository Dataset as a Case Study

20100509 bioinformatics kapushesky_lecture05_0
20100509 bioinformatics kapushesky_lecture05_020100509 bioinformatics kapushesky_lecture05_0
20100509 bioinformatics kapushesky_lecture05_0
Computer Science Club
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
Ian Foster
 
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and ChallengesSingle-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
inside-BigData.com
 
Genome feature optimization and coronary artery disease prediction using cuck...
Genome feature optimization and coronary artery disease prediction using cuck...Genome feature optimization and coronary artery disease prediction using cuck...
Genome feature optimization and coronary artery disease prediction using cuck...
CSITiaesprime
 
Heart disease prediction by using novel optimization algorithm_ A supervised ...
Heart disease prediction by using novel optimization algorithm_ A supervised ...Heart disease prediction by using novel optimization algorithm_ A supervised ...
Heart disease prediction by using novel optimization algorithm_ A supervised ...
BASMAJUMAASALEHALMOH
 
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-applicationAcute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Cemal Ardil
 

Similar to tranSMART Community Meeting 5-7 Nov 13 - Session 5: The Accelerated Cure Project MS Repository Dataset as a Case Study (20)

Data analytics challenges in genomics
Data analytics challenges in genomicsData analytics challenges in genomics
Data analytics challenges in genomics
 
Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management Evolution of Knowledge Discovery and Management
Evolution of Knowledge Discovery and Management
 
Jax bio dataworldcongress.ngs.20181128finalwithoutbu
Jax bio dataworldcongress.ngs.20181128finalwithoutbuJax bio dataworldcongress.ngs.20181128finalwithoutbu
Jax bio dataworldcongress.ngs.20181128finalwithoutbu
 
20100509 bioinformatics kapushesky_lecture05_0
20100509 bioinformatics kapushesky_lecture05_020100509 bioinformatics kapushesky_lecture05_0
20100509 bioinformatics kapushesky_lecture05_0
 
Basics of Data Analysis in Bioinformatics
Basics of Data Analysis in BioinformaticsBasics of Data Analysis in Bioinformatics
Basics of Data Analysis in Bioinformatics
 
Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009Quantitative Medicine Feb 2009
Quantitative Medicine Feb 2009
 
Challenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical researchChallenges and opportunities for machine learning in biomedical research
Challenges and opportunities for machine learning in biomedical research
 
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and ChallengesSingle-Cell Sequencing for Drug Discovery: Applications and Challenges
Single-Cell Sequencing for Drug Discovery: Applications and Challenges
 
Genome feature optimization and coronary artery disease prediction using cuck...
Genome feature optimization and coronary artery disease prediction using cuck...Genome feature optimization and coronary artery disease prediction using cuck...
Genome feature optimization and coronary artery disease prediction using cuck...
 
COMPARISON AND EVALUATION DATA MINING TECHNIQUES IN THE DIAGNOSIS OF HEART DI...
COMPARISON AND EVALUATION DATA MINING TECHNIQUES IN THE DIAGNOSIS OF HEART DI...COMPARISON AND EVALUATION DATA MINING TECHNIQUES IN THE DIAGNOSIS OF HEART DI...
COMPARISON AND EVALUATION DATA MINING TECHNIQUES IN THE DIAGNOSIS OF HEART DI...
 
AAPM Foster July 2009
AAPM Foster July 2009AAPM Foster July 2009
AAPM Foster July 2009
 
Heart disease prediction by using novel optimization algorithm_ A supervised ...
Heart disease prediction by using novel optimization algorithm_ A supervised ...Heart disease prediction by using novel optimization algorithm_ A supervised ...
Heart disease prediction by using novel optimization algorithm_ A supervised ...
 
Heart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining TechniquesHeart Disease Prediction Using Data Mining Techniques
Heart Disease Prediction Using Data Mining Techniques
 
Survey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease predictionSurvey on data mining techniques in heart disease prediction
Survey on data mining techniques in heart disease prediction
 
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
EUSFLAT 2019: explainable neuro fuzzy recurrent neural network to predict col...
 
AETIONOMY Overview AD/PD Conference 2015 Nice
AETIONOMY Overview AD/PD Conference 2015 NiceAETIONOMY Overview AD/PD Conference 2015 Nice
AETIONOMY Overview AD/PD Conference 2015 Nice
 
C0344023028
C0344023028C0344023028
C0344023028
 
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-applicationAcute coronary-syndrome-prediction-using-data-mining-techniques--an-application
Acute coronary-syndrome-prediction-using-data-mining-techniques--an-application
 
BioData World Basel 2018
BioData World Basel 2018BioData World Basel 2018
BioData World Basel 2018
 
Bioinformatics final
Bioinformatics finalBioinformatics final
Bioinformatics final
 

More from David Peyruc

More from David Peyruc (9)

tranSMART Community Meeting 5-7 Nov 13 - Session 3: Modularization (Plug‐Ins,...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Modularization (Plug‐Ins,...tranSMART Community Meeting 5-7 Nov 13 - Session 3: Modularization (Plug‐Ins,...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Modularization (Plug‐Ins,...
 
Community
CommunityCommunity
Community
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...
tranSMART Community Meeting 5-7 Nov 13 - Session 3: tranSMART and the One Min...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart-data
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart-datatranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart-data
tranSMART Community Meeting 5-7 Nov 13 - Session 3: transmart-data
 
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Clinical Biomarker Discovery
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Clinical Biomarker DiscoverytranSMART Community Meeting 5-7 Nov 13 - Session 3: Clinical Biomarker Discovery
tranSMART Community Meeting 5-7 Nov 13 - Session 3: Clinical Biomarker Discovery
 
tranSMART Community Meeting 5-7 Nov 13 - Session 2: Creating a Comprehensive ...
tranSMART Community Meeting 5-7 Nov 13 - Session 2: Creating a Comprehensive ...tranSMART Community Meeting 5-7 Nov 13 - Session 2: Creating a Comprehensive ...
tranSMART Community Meeting 5-7 Nov 13 - Session 2: Creating a Comprehensive ...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Translational Drug Disco...
tranSMART Community Meeting 5-7 Nov 13 - Session 1:  Translational Drug Disco...tranSMART Community Meeting 5-7 Nov 13 - Session 1:  Translational Drug Disco...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Translational Drug Disco...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Chilly-Mazarin Meeting Ob...
 
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Amsterdam Community Meeti...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Amsterdam Community Meeti...tranSMART Community Meeting 5-7 Nov 13 - Session 1: Amsterdam Community Meeti...
tranSMART Community Meeting 5-7 Nov 13 - Session 1: Amsterdam Community Meeti...
 

Recently uploaded

Cytoskeleton and Cell Inclusions - Dr Muhammad Ali Rabbani - Medicose Academics
Cytoskeleton and Cell Inclusions - Dr Muhammad Ali Rabbani - Medicose AcademicsCytoskeleton and Cell Inclusions - Dr Muhammad Ali Rabbani - Medicose Academics
Cytoskeleton and Cell Inclusions - Dr Muhammad Ali Rabbani - Medicose Academics
MedicoseAcademics
 

Recently uploaded (20)

Video capsule endoscopy (VCE ) in children
Video capsule endoscopy (VCE ) in childrenVideo capsule endoscopy (VCE ) in children
Video capsule endoscopy (VCE ) in children
 
ANAPHYLAXIS BY DR.SOHAN BISWAS,MBBS,DNB(INTERNAL MEDICINE) RESIDENT.pptx
ANAPHYLAXIS BY DR.SOHAN BISWAS,MBBS,DNB(INTERNAL MEDICINE) RESIDENT.pptxANAPHYLAXIS BY DR.SOHAN BISWAS,MBBS,DNB(INTERNAL MEDICINE) RESIDENT.pptx
ANAPHYLAXIS BY DR.SOHAN BISWAS,MBBS,DNB(INTERNAL MEDICINE) RESIDENT.pptx
 
Tips and tricks to pass the cardiovascular station for PACES exam
Tips and tricks to pass the cardiovascular station for PACES examTips and tricks to pass the cardiovascular station for PACES exam
Tips and tricks to pass the cardiovascular station for PACES exam
 
Hemodialysis: Chapter 1, Physiological Principles of Hemodialysis - Dr.Gawad
Hemodialysis: Chapter 1, Physiological Principles of Hemodialysis - Dr.GawadHemodialysis: Chapter 1, Physiological Principles of Hemodialysis - Dr.Gawad
Hemodialysis: Chapter 1, Physiological Principles of Hemodialysis - Dr.Gawad
 
The Clean Living Project Episode 24 - Subconscious
The Clean Living Project Episode 24 - SubconsciousThe Clean Living Project Episode 24 - Subconscious
The Clean Living Project Episode 24 - Subconscious
 
Premium ℂall Girls In Mumbai👉 Dail ℂALL ME: 📞9833325238 📲 ℂall Richa VIP ℂall...
Premium ℂall Girls In Mumbai👉 Dail ℂALL ME: 📞9833325238 📲 ℂall Richa VIP ℂall...Premium ℂall Girls In Mumbai👉 Dail ℂALL ME: 📞9833325238 📲 ℂall Richa VIP ℂall...
Premium ℂall Girls In Mumbai👉 Dail ℂALL ME: 📞9833325238 📲 ℂall Richa VIP ℂall...
 
TEST BANK For Lewis's Medical Surgical Nursing in Canada, 4th Edition by Jane...
TEST BANK For Lewis's Medical Surgical Nursing in Canada, 4th Edition by Jane...TEST BANK For Lewis's Medical Surgical Nursing in Canada, 4th Edition by Jane...
TEST BANK For Lewis's Medical Surgical Nursing in Canada, 4th Edition by Jane...
 
Gallbladder Double-Diverticular: A Case Report المرارة مزدوجة التج: تقرير حالة
Gallbladder Double-Diverticular: A Case Report  المرارة مزدوجة التج: تقرير حالةGallbladder Double-Diverticular: A Case Report  المرارة مزدوجة التج: تقرير حالة
Gallbladder Double-Diverticular: A Case Report المرارة مزدوجة التج: تقرير حالة
 
Tips to Choose the Best Psychiatrists in Indore
Tips to Choose the Best Psychiatrists in IndoreTips to Choose the Best Psychiatrists in Indore
Tips to Choose the Best Psychiatrists in Indore
 
Treatment Choices for Slip Disc at Gokuldas Hospital
Treatment Choices for Slip Disc at Gokuldas HospitalTreatment Choices for Slip Disc at Gokuldas Hospital
Treatment Choices for Slip Disc at Gokuldas Hospital
 
PYODERMA, IMPETIGO, FOLLICULITIS, FURUNCLES, CARBUNCLES.pdf
PYODERMA, IMPETIGO, FOLLICULITIS, FURUNCLES, CARBUNCLES.pdfPYODERMA, IMPETIGO, FOLLICULITIS, FURUNCLES, CARBUNCLES.pdf
PYODERMA, IMPETIGO, FOLLICULITIS, FURUNCLES, CARBUNCLES.pdf
 
VIP Pune 7877925207 WhatsApp: Me All Time Serviℂe Available Day and Night
VIP Pune 7877925207 WhatsApp: Me All Time Serviℂe Available Day and NightVIP Pune 7877925207 WhatsApp: Me All Time Serviℂe Available Day and Night
VIP Pune 7877925207 WhatsApp: Me All Time Serviℂe Available Day and Night
 
Cytoskeleton and Cell Inclusions - Dr Muhammad Ali Rabbani - Medicose Academics
Cytoskeleton and Cell Inclusions - Dr Muhammad Ali Rabbani - Medicose AcademicsCytoskeleton and Cell Inclusions - Dr Muhammad Ali Rabbani - Medicose Academics
Cytoskeleton and Cell Inclusions - Dr Muhammad Ali Rabbani - Medicose Academics
 
Unlocking Holistic Wellness: Addressing Depression, Mental Well-Being, and St...
Unlocking Holistic Wellness: Addressing Depression, Mental Well-Being, and St...Unlocking Holistic Wellness: Addressing Depression, Mental Well-Being, and St...
Unlocking Holistic Wellness: Addressing Depression, Mental Well-Being, and St...
 
Results For Love Spell Is Guaranteed In 1 Day +27834335081 [BACK LOST LOVE SP...
Results For Love Spell Is Guaranteed In 1 Day +27834335081 [BACK LOST LOVE SP...Results For Love Spell Is Guaranteed In 1 Day +27834335081 [BACK LOST LOVE SP...
Results For Love Spell Is Guaranteed In 1 Day +27834335081 [BACK LOST LOVE SP...
 
Let's Talk About It: Ovarian Cancer (The Emotional Toll of Treatment Decision...
Let's Talk About It: Ovarian Cancer (The Emotional Toll of Treatment Decision...Let's Talk About It: Ovarian Cancer (The Emotional Toll of Treatment Decision...
Let's Talk About It: Ovarian Cancer (The Emotional Toll of Treatment Decision...
 
Sell pmk powder cas 28578-16-7 from pmk supplier Telegram +85297504341
Sell pmk powder cas 28578-16-7 from pmk supplier Telegram +85297504341Sell pmk powder cas 28578-16-7 from pmk supplier Telegram +85297504341
Sell pmk powder cas 28578-16-7 from pmk supplier Telegram +85297504341
 
Signs It’s Time for Physiotherapy Sessions Prioritizing Wellness
Signs It’s Time for Physiotherapy Sessions Prioritizing WellnessSigns It’s Time for Physiotherapy Sessions Prioritizing Wellness
Signs It’s Time for Physiotherapy Sessions Prioritizing Wellness
 
Integrated Neuromuscular Inhibition Technique (INIT)
Integrated Neuromuscular Inhibition Technique (INIT)Integrated Neuromuscular Inhibition Technique (INIT)
Integrated Neuromuscular Inhibition Technique (INIT)
 
Cas 28578-16-7 PMK ethyl glycidate ( new PMK powder) best suppler
Cas 28578-16-7 PMK ethyl glycidate ( new PMK powder) best supplerCas 28578-16-7 PMK ethyl glycidate ( new PMK powder) best suppler
Cas 28578-16-7 PMK ethyl glycidate ( new PMK powder) best suppler
 

tranSMART Community Meeting 5-7 Nov 13 - Session 5: The Accelerated Cure Project MS Repository Dataset as a Case Study

  • 1. The ACP MS Repository A Case Study tranSMART Community Meeting, Nov. 2013 Stephen Wicks, Ph.D.
  • 2. The ACP Repository: a Case Study What is Multiple Sclerosis?      Chronic inflammatory/demyelination disorder affecting the CNS. (about 0.1%) Leading cause of neurological disability in young adults. Symptoms are variable and significant. They include vision, cognition, locomotion, pain, disorientation, dexterity, mood, bowel/bladder control, others. Generally progressive, but progression is idiosyncratic. (CISRRMSSPMS, vs. CISPPMS etc.) Complex etiology
  • 3. The ACP Repository: a Case Study What is the cost of MS?  Difficult and costly to diagnose (MRI, symptom variability leads to extensive differential diagnosis)  Treatments can slow progression, but are expensive.  Many different drugs exist, but patient stratification for maximum efficacy and minimum side effects is non-existent. “Role the dice”  Often strikes early in life, and is a life-long disability.  Average Diagnosis at about 30. 5% before 16.
  • 4. Orion Bionetworks  ACP is a founding member of Orion.  Orion seeks to cure MS by harnessing the power of computational modeling of disease progression.  ACP will provide its data to Orion in tranSMART to facilitate this goal.  Rancho BioSciences will curate and harmonize the ACP data for Orion
  • 5. The ACP Repository: a Case Study ACP and the MS Repository  Founded in 2001 by an MIT entrepreneur with MS  ACP MS Repository started in 2006. The goal was to identify the cause of MS.  ACP MS Repository enrollment shut down this year. Approximately 3200 participants enrolled.  Biosamples, demographics, medical history etc.  Research data  OPT-UP
  • 6. The ACP Repository: a Case Study Repository Enrollment Status (6/21/2013) • 3,220 subjects enrolled; 467 longitudinal visits completed Subject Breakdown Treatment Status Subjects: 3220 TX+ TX Naïve TX- Time Since Diagnosis Pending < 5 yrs 5-10 yrs > 10 yrs Pending RRMS:1436 146 229 116 675 388 291 82 SPMS:234 164 53 13 4 27 58 149 0 PPMS:117 46 38 31 2 48 37 32 0 CIS:88 19 2 62 5 74 5 5 4 TM:175 12 17 131 15 106 32 24 13 NMO:383 200 66 39 78 252 52 13 66 ON:16 1 1 12 2 12 1 0 3 ADEM:32 MS: 1787 945 1 6 20 5 24 5 1 2 Cases: 2481 Controls: 739 • DNA, RNA, Plasma, Serum, PBMCs + data from 52 page CRF 6
  • 7. The ACP Repository: a Case Study The ACP Engine “Matchmaker” Database Graphical User Interface MS Discovery Forum Reviewing Developments in the MS Field Communicating with MS Researchers Allowing MS Researchers Worldwide to Explore the ACP Repository Database ACP Repository $13 million Invested 3200+ participants Biosamples & Datasets 77 sets of biosamples+data (b,m)illions of datapoints, From 36 studies, so far MS Researchers Worldwide Academia & Industry Insights and Results Mechanisms Diagnostics Causes Treatments
  • 8. The ACP Repository: a Case Study ACP MS Repository Open-access collection of highly annotated bloodderived samples plus data from MS, related diseases, & control subjects gathered from 2006-2013.  Requirement for research data derived from samples to be with them (ACP) allowed us to obtain IP protection). “Workingdeposited (with a provision forcritical samples and confirm our results for only $20,000. If I had to obtain these samples from scratch, it would  Contributes to MS+ research in many ways: have cost $1 million and added 5 years to the project.”  - Enables studies that might not be conducted School of Thomas M. Aune, PhD, Molecular Biology, Vanderbilt University Medicine otherwise (academic & commercial) (from Scientific American)  Creates a common results database for studies from multiple bio-analytical techniques on overlapping sets of subjects.  Approximately 3200 participants. 
  • 9. The ACP Repository: a Case Study Case Report Form (CRF) Curation challenges
  • 10. The ACP Repository: a Case Study ACP Case Report Form 48 Page (first visit) and 38 page (second visit) complete clinical workup  Form completed with the assistance of a clinical research associate over a several hour interview (with sample draw and lab workup)  Broad data: 80 distinct tables in an SQL database  Deep data: in flat data files, more than 20 million cells 
  • 11. The ACP Repository: a Case Study CRF Sample Fields Study drugs Drug”measurelethaly so) drug meaningless. 103 distinct textual responses.“CS-0777”, orordinal with betaseron, BETASERON, etc. Illustrates some Nothe problems associated order was units dataset. Inappropriate (sometimesdrugfrequency. of consistent etc… of trail enrollment “First Drug”, “Second “Betseron”, beta-seron, curating this “BG00012 (FUMARATE) OR PLACEBO”
  • 12. The ACP Repository: a Case Study DMD Curation Solutions  We applied drug ontologies and mapping vocabularies where needed.  We repaired and consolidated dose, frequency, etc. to a single measure with 3 values (high, standard, low)  We re-formatted the data to eliminate the ambiguous cardinal ordering of reporting
  • 13. The ACP Repository: a Case Study CRF Sample Fields Multiple Drugs (Observations) were addressed with…
  • 14. The ACP Repository: a Case Study VISIT_NAME application
  • 15. The ACP Repository: a Case Study Controlled Vocabularies (sports)  ~5000 responses  779 distinct sports reported  When filtered by “ski”,29  “gym”, 45  “walk”, 30; “jog”, 17, “run”, 40
  • 16. The ACP Repository: a Case Study Controlled Vocabularies (sports) All sports mapped to a 29 term vocabulary.
  • 17. The ACP Repository: a Case Study Controlled Vocabularies (pets)  ~6500 pets reported  600 distinct pets reported  When filtered by “dog”, 112, however, this misses mispellings (“diog”, “dot”, “pubs”, dog-like pets “wolf”, “half-wolf”, “mutt”, and breeds (“poddle”, “poodle”, “Afghan Hound”, etc.)  59 additional dog-like entries
  • 18. The ACP Repository: a Case Study Controlled Vocabularies (pets) All pets mapped to a 31 category controlled vocabulary
  • 19. The ACP Repository: a Case Study Medication Curation Challenges       Amitriptaline >10,000 medications listed. Amitriptylin Amitriptyline 2703 distinct medications listed. Amitriptyline HCL Mapped these to 614 real medications (e.g. Amitroptyline Amitriptyline) Amitryetyline This was split into twoAmitrypatiline tables: Amitryptailine  Continuing Medications (541 entities) Amitryptaline  Stopped Medications (317 entities) Amitryptilin VISIT_NAME was used Amitryptiline distinct observations to represent across the whole study Amitryptilline Amitryptylene Truly longitudinal measures were reified in the tree hierarchy in the data Amitryptylinefile. mapping
  • 20. The ACP Repository: a Case Study ACP Repository Tree
  • 21. The ACP Repository: a Case Study Date and Time Coding  All Dates in multiple formats: 15/12/2001 15/Dec/2001 Dec-2001 2001 Dec./2001 12/2001 --/--/-----/2001 ------------/12/2001 dates converted to periods (Months, Years, or Days) prior to the relevant blood draw date.  Dates were represented by International Standard ISO 8601. i.e. YYYY-MM-DD (e.g. 200112-15)
  • 22. The ACP Repository: a Case Study Repository Usage    77 studies ongoing or completed 36 studies have returned data to ACP Data types:       Low-D biomarker (antibodies, metabolites, serum markers of inflammation, etc.) Low-D genotype data High-D SNP/GWAS data Gene-expression studies Whole-genome sequencing (2 distinct studies) Study types:    Etiology Diagnostics Disease activity biomarkers
  • 23. The ACP Repository: a Case Study Repository Usage
  • 24. The ACP Repository: a Case Study Research Data Curation Challenges  Few guidelines provided to researchers for data formatting or treatment  Often little or no documentation describing how the data was generated or handled (raw vs. normalized, transformations e.g.)  Load study meta-data (contact info, description, etc. at the node level)
  • 25. The ACP Repository: a Case Study Sample Study Results Biogen gene expression study: Designed to identify gene-expression profiles that discriminate progressive forms of MS from relapsing-remitting forms of the disease.
  • 26. The ACP Repository: a Case Study Future Directions     Rancho BioSciences is providing guidance to ACP for data-collection practices going forward (e.g. OPT-UP) We loaded the clinical data and 6 sample study datasets into an Oracle-based tranSMART instance that we host in-house for QC purposes. The full dataset is slated to be loaded into a 1.1 postgreSQL-based tranSMART instance (hosted by Recombinant by Deloitte for Orion). This and other data sources (Inst. For Neuroscience at B&W) will be analyzed and modeled by Orion
  • 27. The ACP Repository: a Case Study Thanks for your time! Questions?

Editor's Notes

  1. Ethos of ACP = removing barriers to research. The first challenge was to create a new type of infrastructure for research to get conducted within. They say the journey of a thousand miles begins with a single step – for us the beginning of the journey was with a single study.