SlideShare une entreprise Scribd logo
1  sur  1
Télécharger pour lire hors ligne
www.helmholtz-hzi.de
Combination of informative
biomarkers in small pilot studies and estimation
of sample size for extended studies
Amani Al-Mekhlafi1,2,
Frank Klawonn1,3
Figure 1. HAUCA Curve starting from 0.85 AUC value for hip
infection dataset
Aims:
• Finding the optimal combination of biomarkers to maximize the AUC
• Estimating the sample size for extended studies
1Department of Biostatistics, Helmholtz Centre for Infection Research
2PhD student Epidemiology, Braunschweig-Hannover
3Department of Computer Science, Ostfalia University of Applied Sciences
Method:
Data:
A pilot study by Omar et al.5 has a total of 24 patients, 12 patients with chronic
periprosthetic hip infection and 12 patients with aseptic hip prosthesis loosening, and
50,416 biomarker candidates (Hip infection dataset)
Feature Selection Approach:
The classification criterion is based on area under the receiver operating characteristic
curve (AUC)
Calculation of p-value:
Based on the same statistic that is used for the Wilcoxon-Mann-Whitney-U test 6
Correction of p-value:
Holm- Bonferroni correction
Background:
Biomarker candidates are defined as measurable molecules found in biological media. According to Biomarkers Definitions Working Group, 20011, biomarkers cover a rather
wide range of parameters. Recently, biomarkers are used widely in medical researches, where single biomarkers may not possess the desired cause-effect association for
disease classification and outcome prediction2. Therefore the efforts of the researchers currently is to combine biomarkers. By new technologies like microarrays, next
generation sequencing and mass spectrometry, researchers can obtain many biomarker candidates that can exceed tens of thousands3. To avoid wasting money and time, it
is suggested to control the number of patients strictly. However, pilot studies usually have low statistical power which reduces the chance of detecting a true effect 4.
Step I:
HAUCA Curve:
A method indicating how many good biomarkers a
data set contains compared to pure random effects7
• Calculate the number of biomarkers that exceed
specific values of AUC:
 in the real dataset
 in a random dataset
• Compute 95% quantile of the binomial distribution
of each AUC value to obtain a confidence interval
In the hip infection data, there is more than random
association between the biomarker candidates and
the disease. Moreover, the study is worthwhile for
further studies.
Biomarker AUC p-value Corrected p-value
1 Bio. with high AUC 0.951 3.328e-05 1.678
2 Bio. with high AUC 0.944 4.955e-05 2.498
5 Bio. with high AUC 0.931 1.028e-04 5.183
4 Bio. with high AUC 0.924 1.442e-04 7.271
6 Bio. with high AUC 0.917 2.012e-04 10.142
9 Bio. with high AUC 0.910 2.744e-04 13.834
12 Bio. with high AUC 0.903 3.713e-04 18.718
Table 1. Top Biomarkers with highest AUC values, their p-value, and corrected p-value
Step III:
Estimate the Sample size:
• Specify the AUC value which is wanted to be validated
• Specify the prevalence of the positive cases
• Specify the number of hypothesis tests
n and n+ are increased gradually until the wanted AUC
value with a significant corrected p-value is achieved
In the hip infection data, a sample size of 60 is needed
to validate the 0.85 AUC value
Step II:
Combination of Biomarkers:
• Select the top k features according to the AUC
• Calculate within the groups:
 The difference in means for each feature
 The variance-covariance matrices between combined
features
• Calculate the AUC of the combination of possibly correlated
biomarkers according to Demler et al. 8
• Measure the lower confidence interval for this combination by
bootstrapping with different levels (0.025, 0.05, 0.1)
In the same dataset, we can notice that when 10 biomarkers are
combined, AUC value becomes close to 1 and the different lower
confidence intervals are not less than 0.95.
NO.
COMBINED
BIOMARKERS
AUC
VALUE
1 0.906186
2 0.952129
3 0.935715
4 0.955792
5 0.944372
6 0.958933
7 0.965342
8 0.978783
9 0.986358
10 0.993131
Figure 2. Curve of AUCs of the combination of the
top 20 biomarkers
Figure 3. Sample Size to validate each AUC valueTable 2. Top 10 combined
AUC values
The LEGaTO project has received funding from the European Union’s Horizon 2020 research and innovation programme under the grant agreement No 780681.
References
1. Biomarkers Definitions Working Group: Biomarkers and surrogate endpoints: Preferred definitions and conceptual framework. Clin. Pharmacol. Ther,2001; 69, 89–95.
2. Yan, L., Tian, L., and Liu, S. Combining large number of weak biomarkers based on AUC. Stat Med, 2015; 34(29): 3811-3830.
3. Soon, W.W., Hariharan, M., and Snyder, M.P. High-throughput sequencing for biology and medicine. Molecular systems biology,2013; 9:640.
4. Button, KS., Ioannidis, JP., Mokrysz C., Nosek BA., Flint J., Robinson ES., and Munafo MR. Power failure: why small sample size undermines the reliability of neuroscience. Nat Rev Neurosci. 2013;.14(5):365-76.
5. Omar, M., Klawonn, F., Brand, S., Stiesch, M. Krettek, C., and Eberhard, J.Transcriptome wide high-density microarray analysis reveals differential gene transcription in periprosthetic tissue from hips with low-grade
infection versus aseptic loosening. Journal of Arthroplasty, 2017;32: 234-240,2016.
6. Mason, S.J., Graham, N.E.: Areas beneath the relative operating characteristics (ROC) and relative operating levels (ROL) curves: Statistical significance and interpretation. Quarterly Journal of the Royal Meteorological
Society 128(584) (2002) 2145–2166
7. Klawonn, F., Wang, J., Koch, I., Eberhard, J., and Omar, M. HAUCA curves for the evaluation of biomarker pilot studies with small sample sizes and large numbers of features. Advances in Intelligent Data Analysis XV,
2016; 356–367.
8. Demler, O., Pencina, M., D’Agostino, R.S.: Impact of correlation on predictive ability of biomarkers. Statistics in Medicine 32 (2013) 4196–421
Conclusion:
• AUC performance measurement has been used not just because it is well established but also it yields closed form solutions of the required calculations and therefore to fast
computation. However, other measurement like entropy, misclassification rate or mutual information might be very good alternatives to AUC.
• The correlation between biomarkers may influence the performance of their combination. Therefore it has been taken into consideration and the variance-covariance matrices
have been calculated to minimize the overoptimistic performance of the combination when independence is assumed.
• In order to statistically validate biomarker candidates from pilot studies it is necessary to estimate the required larger sample size
Result:

Contenu connexe

Similaire à Biomarker study estimates sample size

Slides for st judes
Slides for st judesSlides for st judes
Slides for st judesSean Ekins
 
Automated hematology analyzer as a cost effective aid to screen and monitor s...
Automated hematology analyzer as a cost effective aid to screen and monitor s...Automated hematology analyzer as a cost effective aid to screen and monitor s...
Automated hematology analyzer as a cost effective aid to screen and monitor s...nisaiims
 
Assay Standardisation - how this leads to improved patient results
Assay Standardisation - how this leads to improved patient resultsAssay Standardisation - how this leads to improved patient results
Assay Standardisation - how this leads to improved patient resultsWalt Whitman
 
Effective strategies to monitor clinical risks using biostatistics - Pubrica.pdf
Effective strategies to monitor clinical risks using biostatistics - Pubrica.pdfEffective strategies to monitor clinical risks using biostatistics - Pubrica.pdf
Effective strategies to monitor clinical risks using biostatistics - Pubrica.pdfPubrica
 
Measuring clinical utility: uncertainty in Net Benefit
Measuring clinical utility: uncertainty in Net BenefitMeasuring clinical utility: uncertainty in Net Benefit
Measuring clinical utility: uncertainty in Net BenefitLaure Wynants
 
Point of Care Microfluidics Device for hr-HPV Detection
Point of Care Microfluidics Device for hr-HPV DetectionPoint of Care Microfluidics Device for hr-HPV Detection
Point of Care Microfluidics Device for hr-HPV DetectionCFTCC
 
Antimicrobial stewardship ppt.pptx
Antimicrobial stewardship ppt.pptxAntimicrobial stewardship ppt.pptx
Antimicrobial stewardship ppt.pptxssuser62f0ca
 
Collaborative Database and Computational Models for Tuberculosis Drug Discovery
Collaborative Database and Computational Models for Tuberculosis Drug DiscoveryCollaborative Database and Computational Models for Tuberculosis Drug Discovery
Collaborative Database and Computational Models for Tuberculosis Drug DiscoverySean Ekins
 
Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...
Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...
Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...Fundación Ramón Areces
 
ResQu: A Framework for Automatic Evaluation of Knowledge-Driven Automatic Sum...
ResQu: A Framework for Automatic Evaluation of Knowledge-Driven Automatic Sum...ResQu: A Framework for Automatic Evaluation of Knowledge-Driven Automatic Sum...
ResQu: A Framework for Automatic Evaluation of Knowledge-Driven Automatic Sum...Nishita Jaykumar
 
POSTER_JIANYU_LIU
POSTER_JIANYU_LIUPOSTER_JIANYU_LIU
POSTER_JIANYU_LIUJianyu Liu
 
In silico prediction of novel therapeutic targets using gene - disease associ...
In silico prediction of novel therapeutic targets using gene - disease associ...In silico prediction of novel therapeutic targets using gene - disease associ...
In silico prediction of novel therapeutic targets using gene - disease associ...Enrico Ferrero
 
Valuing Trial Designs from a Pharmaceutical Perspective using Value Based Pri...
Valuing Trial Designs from a Pharmaceutical Perspective using Value Based Pri...Valuing Trial Designs from a Pharmaceutical Perspective using Value Based Pri...
Valuing Trial Designs from a Pharmaceutical Perspective using Value Based Pri...ScHARR HEDS
 
Saccharomyces boulardii in the prevention of antibiotic-associated diarrhoea
Saccharomyces boulardii in the prevention of antibiotic-associated diarrhoeaSaccharomyces boulardii in the prevention of antibiotic-associated diarrhoea
Saccharomyces boulardii in the prevention of antibiotic-associated diarrhoeaUtai Sukviwatsirikul
 
Systematic review with meta-analysis: Saccharomyces boulardii in the preventi...
Systematic review with meta-analysis: Saccharomyces boulardii in the preventi...Systematic review with meta-analysis: Saccharomyces boulardii in the preventi...
Systematic review with meta-analysis: Saccharomyces boulardii in the preventi...Utai Sukviwatsirikul
 
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906Mark Gusack
 
E-book Thesis Sara Carvalho
E-book Thesis  Sara CarvalhoE-book Thesis  Sara Carvalho
E-book Thesis Sara CarvalhoSara Carvalho
 
Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...
Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...
Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...Loan Pham
 

Similaire à Biomarker study estimates sample size (20)

Slides for st judes
Slides for st judesSlides for st judes
Slides for st judes
 
Automated hematology analyzer as a cost effective aid to screen and monitor s...
Automated hematology analyzer as a cost effective aid to screen and monitor s...Automated hematology analyzer as a cost effective aid to screen and monitor s...
Automated hematology analyzer as a cost effective aid to screen and monitor s...
 
Assay Standardisation - how this leads to improved patient results
Assay Standardisation - how this leads to improved patient resultsAssay Standardisation - how this leads to improved patient results
Assay Standardisation - how this leads to improved patient results
 
Effective strategies to monitor clinical risks using biostatistics - Pubrica.pdf
Effective strategies to monitor clinical risks using biostatistics - Pubrica.pdfEffective strategies to monitor clinical risks using biostatistics - Pubrica.pdf
Effective strategies to monitor clinical risks using biostatistics - Pubrica.pdf
 
Measuring clinical utility: uncertainty in Net Benefit
Measuring clinical utility: uncertainty in Net BenefitMeasuring clinical utility: uncertainty in Net Benefit
Measuring clinical utility: uncertainty in Net Benefit
 
Point of Care Microfluidics Device for hr-HPV Detection
Point of Care Microfluidics Device for hr-HPV DetectionPoint of Care Microfluidics Device for hr-HPV Detection
Point of Care Microfluidics Device for hr-HPV Detection
 
Antimicrobial stewardship ppt.pptx
Antimicrobial stewardship ppt.pptxAntimicrobial stewardship ppt.pptx
Antimicrobial stewardship ppt.pptx
 
Flacs vs mcs
Flacs vs mcsFlacs vs mcs
Flacs vs mcs
 
Collaborative Database and Computational Models for Tuberculosis Drug Discovery
Collaborative Database and Computational Models for Tuberculosis Drug DiscoveryCollaborative Database and Computational Models for Tuberculosis Drug Discovery
Collaborative Database and Computational Models for Tuberculosis Drug Discovery
 
Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...
Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...
Bertrand de Meulder-El impacto de las ciencias ómicas en la medicina, la nutr...
 
Plos
PlosPlos
Plos
 
ResQu: A Framework for Automatic Evaluation of Knowledge-Driven Automatic Sum...
ResQu: A Framework for Automatic Evaluation of Knowledge-Driven Automatic Sum...ResQu: A Framework for Automatic Evaluation of Knowledge-Driven Automatic Sum...
ResQu: A Framework for Automatic Evaluation of Knowledge-Driven Automatic Sum...
 
POSTER_JIANYU_LIU
POSTER_JIANYU_LIUPOSTER_JIANYU_LIU
POSTER_JIANYU_LIU
 
In silico prediction of novel therapeutic targets using gene - disease associ...
In silico prediction of novel therapeutic targets using gene - disease associ...In silico prediction of novel therapeutic targets using gene - disease associ...
In silico prediction of novel therapeutic targets using gene - disease associ...
 
Valuing Trial Designs from a Pharmaceutical Perspective using Value Based Pri...
Valuing Trial Designs from a Pharmaceutical Perspective using Value Based Pri...Valuing Trial Designs from a Pharmaceutical Perspective using Value Based Pri...
Valuing Trial Designs from a Pharmaceutical Perspective using Value Based Pri...
 
Saccharomyces boulardii in the prevention of antibiotic-associated diarrhoea
Saccharomyces boulardii in the prevention of antibiotic-associated diarrhoeaSaccharomyces boulardii in the prevention of antibiotic-associated diarrhoea
Saccharomyces boulardii in the prevention of antibiotic-associated diarrhoea
 
Systematic review with meta-analysis: Saccharomyces boulardii in the preventi...
Systematic review with meta-analysis: Saccharomyces boulardii in the preventi...Systematic review with meta-analysis: Saccharomyces boulardii in the preventi...
Systematic review with meta-analysis: Saccharomyces boulardii in the preventi...
 
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
[Typ]Poster[Sbj]1593Synoptics[Dte]20150906
 
E-book Thesis Sara Carvalho
E-book Thesis  Sara CarvalhoE-book Thesis  Sara Carvalho
E-book Thesis Sara Carvalho
 
Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...
Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...
Clinical Trial Simulation to Evaluate the Pharmacokinetics of an Abuse-Deterr...
 

Plus de LEGATO project

Scrooge Attack: Undervolting ARM Processors for Profit
Scrooge Attack: Undervolting ARM Processors for ProfitScrooge Attack: Undervolting ARM Processors for Profit
Scrooge Attack: Undervolting ARM Processors for ProfitLEGATO project
 
A practical approach for updating an integrity-enforced operating system
A practical approach for updating an integrity-enforced operating systemA practical approach for updating an integrity-enforced operating system
A practical approach for updating an integrity-enforced operating systemLEGATO project
 
TEEMon: A continuous performance monitoring framework for TEEs
TEEMon: A continuous performance monitoring framework for TEEsTEEMon: A continuous performance monitoring framework for TEEs
TEEMon: A continuous performance monitoring framework for TEEsLEGATO project
 
secureTF: A Secure TensorFlow Framework
secureTF: A Secure TensorFlow FrameworksecureTF: A Secure TensorFlow Framework
secureTF: A Secure TensorFlow FrameworkLEGATO project
 
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...LEGATO project
 
LEGaTO: Machine Learning Use Case
LEGaTO: Machine Learning Use CaseLEGaTO: Machine Learning Use Case
LEGaTO: Machine Learning Use CaseLEGATO project
 
Smart Home AI at the edge
Smart Home AI at the edgeSmart Home AI at the edge
Smart Home AI at the edgeLEGATO project
 
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the projectLEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the projectLEGATO project
 
LEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming ModelsLEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming ModelsLEGATO project
 
LEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGATO project
 
LEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGATO project
 
LEGaTO: Low-Energy Heterogeneous Computing Workshop
LEGaTO: Low-Energy Heterogeneous Computing WorkshopLEGaTO: Low-Energy Heterogeneous Computing Workshop
LEGaTO: Low-Energy Heterogeneous Computing WorkshopLEGATO project
 
TZ4Fabric: Executing Smart Contracts with ARM TrustZone
TZ4Fabric: Executing Smart Contracts with ARM TrustZoneTZ4Fabric: Executing Smart Contracts with ARM TrustZone
TZ4Fabric: Executing Smart Contracts with ARM TrustZoneLEGATO project
 
Infection Research with Maxeler Dataflow Computing
Infection Research with Maxeler Dataflow ComputingInfection Research with Maxeler Dataflow Computing
Infection Research with Maxeler Dataflow ComputingLEGATO project
 
Smart Home - AI at the edge
Smart Home - AI at the edgeSmart Home - AI at the edge
Smart Home - AI at the edgeLEGATO project
 
FPGA Undervolting and Checkpointing for Energy-Efficiency and Error-Resiliency
FPGA Undervolting and Checkpointing for Energy-Efficiency and Error-ResiliencyFPGA Undervolting and Checkpointing for Energy-Efficiency and Error-Resiliency
FPGA Undervolting and Checkpointing for Energy-Efficiency and Error-ResiliencyLEGATO project
 
Device Data Directory and Asynchronous execution: A path to heterogeneous com...
Device Data Directory and Asynchronous execution: A path to heterogeneous com...Device Data Directory and Asynchronous execution: A path to heterogeneous com...
Device Data Directory and Asynchronous execution: A path to heterogeneous com...LEGATO project
 
Scheduling Task-parallel Applications in Dynamically Asymmetric Environments
Scheduling Task-parallel Applications in Dynamically Asymmetric EnvironmentsScheduling Task-parallel Applications in Dynamically Asymmetric Environments
Scheduling Task-parallel Applications in Dynamically Asymmetric EnvironmentsLEGATO project
 

Plus de LEGATO project (20)

Scrooge Attack: Undervolting ARM Processors for Profit
Scrooge Attack: Undervolting ARM Processors for ProfitScrooge Attack: Undervolting ARM Processors for Profit
Scrooge Attack: Undervolting ARM Processors for Profit
 
A practical approach for updating an integrity-enforced operating system
A practical approach for updating an integrity-enforced operating systemA practical approach for updating an integrity-enforced operating system
A practical approach for updating an integrity-enforced operating system
 
TEEMon: A continuous performance monitoring framework for TEEs
TEEMon: A continuous performance monitoring framework for TEEsTEEMon: A continuous performance monitoring framework for TEEs
TEEMon: A continuous performance monitoring framework for TEEs
 
secureTF: A Secure TensorFlow Framework
secureTF: A Secure TensorFlow FrameworksecureTF: A Secure TensorFlow Framework
secureTF: A Secure TensorFlow Framework
 
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
 
LEGaTO: Machine Learning Use Case
LEGaTO: Machine Learning Use CaseLEGaTO: Machine Learning Use Case
LEGaTO: Machine Learning Use Case
 
Smart Home AI at the edge
Smart Home AI at the edgeSmart Home AI at the edge
Smart Home AI at the edge
 
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the projectLEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
 
LEGaTO Integration
LEGaTO IntegrationLEGaTO Integration
LEGaTO Integration
 
LEGaTO: Use cases
LEGaTO: Use casesLEGaTO: Use cases
LEGaTO: Use cases
 
LEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming ModelsLEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming Models
 
LEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGaTO: Software Stack Runtimes
LEGaTO: Software Stack Runtimes
 
LEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous Hardware
 
LEGaTO: Low-Energy Heterogeneous Computing Workshop
LEGaTO: Low-Energy Heterogeneous Computing WorkshopLEGaTO: Low-Energy Heterogeneous Computing Workshop
LEGaTO: Low-Energy Heterogeneous Computing Workshop
 
TZ4Fabric: Executing Smart Contracts with ARM TrustZone
TZ4Fabric: Executing Smart Contracts with ARM TrustZoneTZ4Fabric: Executing Smart Contracts with ARM TrustZone
TZ4Fabric: Executing Smart Contracts with ARM TrustZone
 
Infection Research with Maxeler Dataflow Computing
Infection Research with Maxeler Dataflow ComputingInfection Research with Maxeler Dataflow Computing
Infection Research with Maxeler Dataflow Computing
 
Smart Home - AI at the edge
Smart Home - AI at the edgeSmart Home - AI at the edge
Smart Home - AI at the edge
 
FPGA Undervolting and Checkpointing for Energy-Efficiency and Error-Resiliency
FPGA Undervolting and Checkpointing for Energy-Efficiency and Error-ResiliencyFPGA Undervolting and Checkpointing for Energy-Efficiency and Error-Resiliency
FPGA Undervolting and Checkpointing for Energy-Efficiency and Error-Resiliency
 
Device Data Directory and Asynchronous execution: A path to heterogeneous com...
Device Data Directory and Asynchronous execution: A path to heterogeneous com...Device Data Directory and Asynchronous execution: A path to heterogeneous com...
Device Data Directory and Asynchronous execution: A path to heterogeneous com...
 
Scheduling Task-parallel Applications in Dynamically Asymmetric Environments
Scheduling Task-parallel Applications in Dynamically Asymmetric EnvironmentsScheduling Task-parallel Applications in Dynamically Asymmetric Environments
Scheduling Task-parallel Applications in Dynamically Asymmetric Environments
 

Dernier

Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSSLeenakshiTyagi
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisDiwakar Mishra
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxpradhanghanshyam7136
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...jana861314
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxjana861314
 

Dernier (20)

Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
DIFFERENCE IN BACK CROSS AND TEST CROSS
DIFFERENCE IN  BACK CROSS AND TEST CROSSDIFFERENCE IN  BACK CROSS AND TEST CROSS
DIFFERENCE IN BACK CROSS AND TEST CROSS
 
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral AnalysisRaman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
Raman spectroscopy.pptx M Pharm, M Sc, Advanced Spectral Analysis
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Cultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptxCultivation of KODO MILLET . made by Ghanshyam pptx
Cultivation of KODO MILLET . made by Ghanshyam pptx
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
Traditional Agroforestry System in India- Shifting Cultivation, Taungya, Home...
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptxBroad bean, Lima Bean, Jack bean, Ullucus.pptx
Broad bean, Lima Bean, Jack bean, Ullucus.pptx
 

Biomarker study estimates sample size

  • 1. www.helmholtz-hzi.de Combination of informative biomarkers in small pilot studies and estimation of sample size for extended studies Amani Al-Mekhlafi1,2, Frank Klawonn1,3 Figure 1. HAUCA Curve starting from 0.85 AUC value for hip infection dataset Aims: • Finding the optimal combination of biomarkers to maximize the AUC • Estimating the sample size for extended studies 1Department of Biostatistics, Helmholtz Centre for Infection Research 2PhD student Epidemiology, Braunschweig-Hannover 3Department of Computer Science, Ostfalia University of Applied Sciences Method: Data: A pilot study by Omar et al.5 has a total of 24 patients, 12 patients with chronic periprosthetic hip infection and 12 patients with aseptic hip prosthesis loosening, and 50,416 biomarker candidates (Hip infection dataset) Feature Selection Approach: The classification criterion is based on area under the receiver operating characteristic curve (AUC) Calculation of p-value: Based on the same statistic that is used for the Wilcoxon-Mann-Whitney-U test 6 Correction of p-value: Holm- Bonferroni correction Background: Biomarker candidates are defined as measurable molecules found in biological media. According to Biomarkers Definitions Working Group, 20011, biomarkers cover a rather wide range of parameters. Recently, biomarkers are used widely in medical researches, where single biomarkers may not possess the desired cause-effect association for disease classification and outcome prediction2. Therefore the efforts of the researchers currently is to combine biomarkers. By new technologies like microarrays, next generation sequencing and mass spectrometry, researchers can obtain many biomarker candidates that can exceed tens of thousands3. To avoid wasting money and time, it is suggested to control the number of patients strictly. However, pilot studies usually have low statistical power which reduces the chance of detecting a true effect 4. Step I: HAUCA Curve: A method indicating how many good biomarkers a data set contains compared to pure random effects7 • Calculate the number of biomarkers that exceed specific values of AUC:  in the real dataset  in a random dataset • Compute 95% quantile of the binomial distribution of each AUC value to obtain a confidence interval In the hip infection data, there is more than random association between the biomarker candidates and the disease. Moreover, the study is worthwhile for further studies. Biomarker AUC p-value Corrected p-value 1 Bio. with high AUC 0.951 3.328e-05 1.678 2 Bio. with high AUC 0.944 4.955e-05 2.498 5 Bio. with high AUC 0.931 1.028e-04 5.183 4 Bio. with high AUC 0.924 1.442e-04 7.271 6 Bio. with high AUC 0.917 2.012e-04 10.142 9 Bio. with high AUC 0.910 2.744e-04 13.834 12 Bio. with high AUC 0.903 3.713e-04 18.718 Table 1. Top Biomarkers with highest AUC values, their p-value, and corrected p-value Step III: Estimate the Sample size: • Specify the AUC value which is wanted to be validated • Specify the prevalence of the positive cases • Specify the number of hypothesis tests n and n+ are increased gradually until the wanted AUC value with a significant corrected p-value is achieved In the hip infection data, a sample size of 60 is needed to validate the 0.85 AUC value Step II: Combination of Biomarkers: • Select the top k features according to the AUC • Calculate within the groups:  The difference in means for each feature  The variance-covariance matrices between combined features • Calculate the AUC of the combination of possibly correlated biomarkers according to Demler et al. 8 • Measure the lower confidence interval for this combination by bootstrapping with different levels (0.025, 0.05, 0.1) In the same dataset, we can notice that when 10 biomarkers are combined, AUC value becomes close to 1 and the different lower confidence intervals are not less than 0.95. NO. COMBINED BIOMARKERS AUC VALUE 1 0.906186 2 0.952129 3 0.935715 4 0.955792 5 0.944372 6 0.958933 7 0.965342 8 0.978783 9 0.986358 10 0.993131 Figure 2. Curve of AUCs of the combination of the top 20 biomarkers Figure 3. Sample Size to validate each AUC valueTable 2. Top 10 combined AUC values The LEGaTO project has received funding from the European Union’s Horizon 2020 research and innovation programme under the grant agreement No 780681. References 1. Biomarkers Definitions Working Group: Biomarkers and surrogate endpoints: Preferred definitions and conceptual framework. Clin. Pharmacol. Ther,2001; 69, 89–95. 2. Yan, L., Tian, L., and Liu, S. Combining large number of weak biomarkers based on AUC. Stat Med, 2015; 34(29): 3811-3830. 3. Soon, W.W., Hariharan, M., and Snyder, M.P. High-throughput sequencing for biology and medicine. Molecular systems biology,2013; 9:640. 4. Button, KS., Ioannidis, JP., Mokrysz C., Nosek BA., Flint J., Robinson ES., and Munafo MR. Power failure: why small sample size undermines the reliability of neuroscience. Nat Rev Neurosci. 2013;.14(5):365-76. 5. Omar, M., Klawonn, F., Brand, S., Stiesch, M. Krettek, C., and Eberhard, J.Transcriptome wide high-density microarray analysis reveals differential gene transcription in periprosthetic tissue from hips with low-grade infection versus aseptic loosening. Journal of Arthroplasty, 2017;32: 234-240,2016. 6. Mason, S.J., Graham, N.E.: Areas beneath the relative operating characteristics (ROC) and relative operating levels (ROL) curves: Statistical significance and interpretation. Quarterly Journal of the Royal Meteorological Society 128(584) (2002) 2145–2166 7. Klawonn, F., Wang, J., Koch, I., Eberhard, J., and Omar, M. HAUCA curves for the evaluation of biomarker pilot studies with small sample sizes and large numbers of features. Advances in Intelligent Data Analysis XV, 2016; 356–367. 8. Demler, O., Pencina, M., D’Agostino, R.S.: Impact of correlation on predictive ability of biomarkers. Statistics in Medicine 32 (2013) 4196–421 Conclusion: • AUC performance measurement has been used not just because it is well established but also it yields closed form solutions of the required calculations and therefore to fast computation. However, other measurement like entropy, misclassification rate or mutual information might be very good alternatives to AUC. • The correlation between biomarkers may influence the performance of their combination. Therefore it has been taken into consideration and the variance-covariance matrices have been calculated to minimize the overoptimistic performance of the combination when independence is assumed. • In order to statistically validate biomarker candidates from pilot studies it is necessary to estimate the required larger sample size Result: