SlideShare une entreprise Scribd logo
1  sur  18
Perl cures coronary heart disease (well, sort of) Spiros Denaxas, @fruit90210 London BioGeeks, 24 th  Feb. 2011
Talk outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Hi, I am Spiros. ,[object Object],[object Object],[object Object],[object Object],[object Object]
Bioinformatics vs. epidemiology ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Bioinformatics vs. epidemiology ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Epidemiology now ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Life before Perl
At least he’s happy!
What did I do ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
NHS numbers ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
International Classification of Diseases (ICD10) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Standards? What's that? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
 
Introduced Perl ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Life after Perl ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Life after Perl
Please help out! ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Thank you. ,[object Object]

Contenu connexe

En vedette

Disrupting the Customers Journey: Gartner Customer 360 Summit 09092015
Disrupting the Customers Journey: Gartner Customer 360 Summit 09092015Disrupting the Customers Journey: Gartner Customer 360 Summit 09092015
Disrupting the Customers Journey: Gartner Customer 360 Summit 09092015Andy Yasutake
 
Mastery journey timeline presentation
Mastery journey timeline presentationMastery journey timeline presentation
Mastery journey timeline presentationSteve Young
 
Curriculum vitate - Ankit Kumar Jain
Curriculum vitate - Ankit Kumar JainCurriculum vitate - Ankit Kumar Jain
Curriculum vitate - Ankit Kumar JainAnkit Jain
 
2015-12-07 Matematica felice a 4-5 anni
2015-12-07 Matematica felice a 4-5 anni2015-12-07 Matematica felice a 4-5 anni
2015-12-07 Matematica felice a 4-5 anniGIOVANNI LARICCIA
 
ANÁLISIS DE UN NOVELA: La tía julia y el escribidor.
ANÁLISIS DE UN NOVELA: La tía julia y el escribidor.ANÁLISIS DE UN NOVELA: La tía julia y el escribidor.
ANÁLISIS DE UN NOVELA: La tía julia y el escribidor.Fabiola Martinez
 
Operations Strategy-A Literature Review
Operations Strategy-A Literature ReviewOperations Strategy-A Literature Review
Operations Strategy-A Literature ReviewMatthew Morris
 

En vedette (8)

Disrupting the Customers Journey: Gartner Customer 360 Summit 09092015
Disrupting the Customers Journey: Gartner Customer 360 Summit 09092015Disrupting the Customers Journey: Gartner Customer 360 Summit 09092015
Disrupting the Customers Journey: Gartner Customer 360 Summit 09092015
 
Mastery journey timeline presentation
Mastery journey timeline presentationMastery journey timeline presentation
Mastery journey timeline presentation
 
Curriculum vitate - Ankit Kumar Jain
Curriculum vitate - Ankit Kumar JainCurriculum vitate - Ankit Kumar Jain
Curriculum vitate - Ankit Kumar Jain
 
Enterprise Gamification Taxonomy
Enterprise Gamification Taxonomy Enterprise Gamification Taxonomy
Enterprise Gamification Taxonomy
 
CAIC Prof. Mariano Costa
CAIC Prof. Mariano CostaCAIC Prof. Mariano Costa
CAIC Prof. Mariano Costa
 
2015-12-07 Matematica felice a 4-5 anni
2015-12-07 Matematica felice a 4-5 anni2015-12-07 Matematica felice a 4-5 anni
2015-12-07 Matematica felice a 4-5 anni
 
ANÁLISIS DE UN NOVELA: La tía julia y el escribidor.
ANÁLISIS DE UN NOVELA: La tía julia y el escribidor.ANÁLISIS DE UN NOVELA: La tía julia y el escribidor.
ANÁLISIS DE UN NOVELA: La tía julia y el escribidor.
 
Operations Strategy-A Literature Review
Operations Strategy-A Literature ReviewOperations Strategy-A Literature Review
Operations Strategy-A Literature Review
 

Similaire à Perl cures coronary heart disease

Big data and machine learning: opportunità per la medicina di precisione e i ...
Big data and machine learning: opportunità per la medicina di precisione e i ...Big data and machine learning: opportunità per la medicina di precisione e i ...
Big data and machine learning: opportunità per la medicina di precisione e i ...Fondazione Giannino Bassetti
 
Usage of open source software for Real World Data Analysis in pharmaceutical ...
Usage of open source software for Real World Data Analysis in pharmaceutical ...Usage of open source software for Real World Data Analysis in pharmaceutical ...
Usage of open source software for Real World Data Analysis in pharmaceutical ...Kees van Bochove
 
Standards & Coding Systems in Biomedical and Health Informatics
Standards & Coding Systems in Biomedical and Health InformaticsStandards & Coding Systems in Biomedical and Health Informatics
Standards & Coding Systems in Biomedical and Health InformaticsNawanan Theera-Ampornpunt
 
The researcher perspective, Jean-Fred Fontaine, MDC Berlin
The researcher perspective, Jean-Fred Fontaine, MDC BerlinThe researcher perspective, Jean-Fred Fontaine, MDC Berlin
The researcher perspective, Jean-Fred Fontaine, MDC BerlinLIBER Europe
 
Big Data & ML for Clinical Data
Big Data & ML for Clinical DataBig Data & ML for Clinical Data
Big Data & ML for Clinical DataPaul Agapow
 
Big Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesBig Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesJosef Scheiber
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsJTADrexel
 
[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...
[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...
[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...DataScienceConferenc1
 
OSS 2011 Multi-Level Modelling Presentation
OSS 2011 Multi-Level Modelling PresentationOSS 2011 Multi-Level Modelling Presentation
OSS 2011 Multi-Level Modelling PresentationTimothy Cook
 
Leroy Hood biomedical challenges at Skolkovo
Leroy Hood biomedical challenges at SkolkovoLeroy Hood biomedical challenges at Skolkovo
Leroy Hood biomedical challenges at Skolkovoigorod
 
Waterloo September 00 Presentations
Waterloo September 00 PresentationsWaterloo September 00 Presentations
Waterloo September 00 Presentationsbrighteyes
 
BiTeM / SIBTex @ TREC CDS 2014
BiTeM / SIBTex @ TREC CDS 2014BiTeM / SIBTex @ TREC CDS 2014
BiTeM / SIBTex @ TREC CDS 2014Julien Gobeill
 
Health research, clinical registries, electronic health records – how do they...
Health research, clinical registries, electronic health records – how do they...Health research, clinical registries, electronic health records – how do they...
Health research, clinical registries, electronic health records – how do they...Koray Atalag
 
The Many Lives of Data
The Many Lives of DataThe Many Lives of Data
The Many Lives of Dataljmcneill33
 
RoleOfTerminologies
RoleOfTerminologiesRoleOfTerminologies
RoleOfTerminologiesguest66dc5f
 
Digital Health 101 for Hospital Executives (October 4, 2021)
Digital Health 101 for Hospital Executives (October 4, 2021)Digital Health 101 for Hospital Executives (October 4, 2021)
Digital Health 101 for Hospital Executives (October 4, 2021)Nawanan Theera-Ampornpunt
 
Nlp for the precision medicine
Nlp for the precision medicineNlp for the precision medicine
Nlp for the precision medicineVishwas N
 
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...Koray Atalag
 
Digital Health Transformation for Health Executives (January 18, 2022)
Digital Health Transformation for Health Executives (January 18, 2022)Digital Health Transformation for Health Executives (January 18, 2022)
Digital Health Transformation for Health Executives (January 18, 2022)Nawanan Theera-Ampornpunt
 

Similaire à Perl cures coronary heart disease (20)

Big data and machine learning: opportunità per la medicina di precisione e i ...
Big data and machine learning: opportunità per la medicina di precisione e i ...Big data and machine learning: opportunità per la medicina di precisione e i ...
Big data and machine learning: opportunità per la medicina di precisione e i ...
 
Usage of open source software for Real World Data Analysis in pharmaceutical ...
Usage of open source software for Real World Data Analysis in pharmaceutical ...Usage of open source software for Real World Data Analysis in pharmaceutical ...
Usage of open source software for Real World Data Analysis in pharmaceutical ...
 
Standards & Coding Systems in Biomedical and Health Informatics
Standards & Coding Systems in Biomedical and Health InformaticsStandards & Coding Systems in Biomedical and Health Informatics
Standards & Coding Systems in Biomedical and Health Informatics
 
The researcher perspective, Jean-Fred Fontaine, MDC Berlin
The researcher perspective, Jean-Fred Fontaine, MDC BerlinThe researcher perspective, Jean-Fred Fontaine, MDC Berlin
The researcher perspective, Jean-Fred Fontaine, MDC Berlin
 
Big Data & ML for Clinical Data
Big Data & ML for Clinical DataBig Data & ML for Clinical Data
Big Data & ML for Clinical Data
 
Big Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use CasesBig Data in Pharma - Overview and Use Cases
Big Data in Pharma - Overview and Use Cases
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...
[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...
[DSC Europe 23][DigiHealth] Anja Baresic 0- Croatian digital Healthcare ecosy...
 
OSS 2011 Multi-Level Modelling Presentation
OSS 2011 Multi-Level Modelling PresentationOSS 2011 Multi-Level Modelling Presentation
OSS 2011 Multi-Level Modelling Presentation
 
Leroy Hood biomedical challenges at Skolkovo
Leroy Hood biomedical challenges at SkolkovoLeroy Hood biomedical challenges at Skolkovo
Leroy Hood biomedical challenges at Skolkovo
 
Waterloo September 00 Presentations
Waterloo September 00 PresentationsWaterloo September 00 Presentations
Waterloo September 00 Presentations
 
BiTeM / SIBTex @ TREC CDS 2014
BiTeM / SIBTex @ TREC CDS 2014BiTeM / SIBTex @ TREC CDS 2014
BiTeM / SIBTex @ TREC CDS 2014
 
Making Terminology Work
Making Terminology WorkMaking Terminology Work
Making Terminology Work
 
Health research, clinical registries, electronic health records – how do they...
Health research, clinical registries, electronic health records – how do they...Health research, clinical registries, electronic health records – how do they...
Health research, clinical registries, electronic health records – how do they...
 
The Many Lives of Data
The Many Lives of DataThe Many Lives of Data
The Many Lives of Data
 
RoleOfTerminologies
RoleOfTerminologiesRoleOfTerminologies
RoleOfTerminologies
 
Digital Health 101 for Hospital Executives (October 4, 2021)
Digital Health 101 for Hospital Executives (October 4, 2021)Digital Health 101 for Hospital Executives (October 4, 2021)
Digital Health 101 for Hospital Executives (October 4, 2021)
 
Nlp for the precision medicine
Nlp for the precision medicineNlp for the precision medicine
Nlp for the precision medicine
 
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
Linkages to EHRs and Related Standards. What can we learn from the Parallel U...
 
Digital Health Transformation for Health Executives (January 18, 2022)
Digital Health Transformation for Health Executives (January 18, 2022)Digital Health Transformation for Health Executives (January 18, 2022)
Digital Health Transformation for Health Executives (January 18, 2022)
 

Dernier

The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 

Dernier (20)

The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 

Perl cures coronary heart disease

Notes de l'éditeur

  1. I come from both a sciency and a commercial environment where large datasets were used from multiple stakeholders and sharing was a good thing.
  2. Bioinformatics, - essentially computer science and molecular biology dealing with DNA, RNA etc. They are used to extremely large datasets, the raw human genome was 30TB big There has been substantial innovation in both hardware and software and established standards for storing, searchin, visualizing information The bioinformatics community is international and collaborative and data is shared amongst peers. The bioperl project is an excellent example of this A big collection of perl modules for doing many operatios on bioinformatics data, An international collaboration with many people working on it, cross platform and a plethora tools based on it Very good documentation and there’s even a O’Reilly book on it.
  3. Epidemiology on the other hand, clinical epidemiology, is all about collecting and analyzing clinical data on patients Traditionally it is very expencive to follow up people with medical exams, questionnaires etc and the typical study size would have less than 5000 individuals Paper is king and everything is based on it, slowly doing the transition to electronic format for data gathering Times however have changed, there’s a bunch if NHS IT projects going on to bring medical data together, electronic health records have come into play etc There is more and more data available from multiple sources such as GP surgeries, hospitals, office of national statistics and government data sources.
  4. So what are people doing. When I first joined my new job I saw that people were not happy. The size of the data is every increasing, I am dealing with a 6m patient database with over 5 bn rows Of course it was delivered as text files Of course I had to sign 40 page forms to obtain the data Data is well kept secret. There is very little sharing going on. Researchers are struggling to actually manage the data rather than analyze it. Data cleaning, formatting, specifications (lack of) Statistical packages are used to manage the data which in my head is not entirely appropriate Only very recently did funding organizations start requiring research teams to actually hire somebody dedicated for managing and curating these data sets. Some common patterns emerged which I examined in an academic fashion.
  5. Fear leads to hate, hate leads to anger, anger is the path to suffering. And only one person is happy with all those.
  6. So what did I try to do. I took a small step for man and created the medical namespace after emailing the dev list I started thinking of similar ways to create something like bioperl but for medical-specific modules. There already are several modules on CPAN which are of interest. DICOM is a image format widely used and UMLS is a structured ontology used in biomedical sciences The main issue is to expose these, and others, to non-Perl people, aka normal people
  7. The NHS deals with 1m patients per 36 hours The nhs number,is a ten digit UID essentially that everybody gets assigned and is based on the mod11 algorithm Of course, this is the NHS so there are 21 different formats of old school NHS numbers floating around I looked on CPAN and could not find anything, but its no problem, I just created medical::nhsnumber which was the first module for the medical namespace
  8. The ICD10 coding system is basically one huge ontology for coding diseases, signs, symptoms , test results etc Everytime you visit the hospital, you get a series of codes according to what the problem was, its very very widely used. What do most people do? They try to open it in Excel… And how do you take all the parents of the term if you want to? Weeeeelll, we use this search function and paste results into another spreadsheet and then we use stata to check it… Ok ok stop. Medical::icd10, a very simple module doing very simple things saving people time. Coupled with a very basic web interface.
  9. Another thing I looked at was standards used to describe the data Or perhaps more appropriately, the lack of standards to describe the data. Documentation is delivered as a excel file or a email or a word document with cryptic variable names and all that fun So I said, there is an established data documentation standard called the DDI, why don’t we use it and make our lifes easier? I created two modules and a bunch of scripts and turned a flat excel file of little usability into something better, much better. Similarly, for study registration in the interest of transparency, we use clinicaltrials.gov all the time. I created a module for people to use.
  10. It turns out people do want to do their lives easier. We got activestate and we have the excellent resource of learning perl so I t Finally I introduced perl to people in my team. Most of them got scared away but one of them was happy with Perl. She even considered Python. Still in my book it’s a win.
  11. So life after perl, what did I do: I itrodiuced a new namespace I created several modules internal and external I created better data documentation using perl and promoted standards And I introduced perl to normal people? Was all this technically complicated? Probably not, it was very straightforward in the majority of cases. Was it worth it? This is how my work was after Perl.
  12. Please help out! Introduce perl to your academic group Contribute to the medical namespace Help design and implement medperl Use more standards at work if you are not already using them And finally, shameless, please join the UCL perl users group if you are from UCL Thanks!