SlideShare une entreprise Scribd logo
1  sur  22
Télécharger pour lire hors ligne
Visual Programming
Environments for
Science and Business
MITCH MILLER
SCIENTIFIC THINKING
CODE CAMP 2015
SEPTEMBER 19, 2015
Disclaimer
 This talk represents my opinion and personal experience using 2 fine
software systems developed by third parties
 The software systems shown are very complex and have hundreds
of components. I have only worked with a small number.
 Every task shown today can be accomplished in multiple ways. I’m
only showing of those ways.
Overview
 Introduction: first demo
 What is a ‘visual programming environment’
 The two systems we’ll look at today
 What are these systems capable of?
 Second set demos (in-depth)
Demo 1: set-up
 Task: produce report of all compounds registered during January
Visual Programming: informal
definition
 Drag functional components onto canvas to create program
 Configure most components by setting parameters
 Connect components to route data from one to another
 Run and observe data traveling down the lines
Component types
 File I/O
 Read/write text files
 Read/write MS Office documents
 XML
 JSON
 PDF
 Database access
 Connect
 Query
 Update
Component types (continued)
 Web service consumption
 Domain-specific processing
 Chemical structure I/O
 Chemical structure processing and analysis
 Sequence processing
 Extensibility
 Add your own libraries for more sophisticated processing
Component types (continued)
 Visualization
 Graphing
 Statistical calculations
 Scripting
 Tip: aim for brief scripts
 Data transformation
 If/else processing
 Filtering
 Column selection
 And many more…
KNIME
 Originally a production of the University of Konstanz, Germany 2004
 Currently produced by KNIME.com AG, a company in Zurich,
Switzerland
 KNIME stands for KoNstanz Information MinEr
 Pronounced “Nighm”
 A general purpose data analytics platform
 Free version available for download
 For-sale version available with added extensions
KNIME (continued)
 Java based
 Written in Java
 Scripted, extensible in Java
 URL: https://www.knime.org/
Pipeline Pilot
 Developed and sold by BIOVIA, San Diego, CA
 Originally developed by Scitegic, San Diego in 1999
 Designed for scientists to “rapidly create, test and publish scientific
services that automate the process of accessing, analyzing and
reporting scientific data”
(http://accelrys.com/products/collaborative-science/biovia-
pipeline-pilot/)
 Client-server system
 Commercial product
 Extensible using .NET and Java
 Scripted using an original language, ‘PilotScript’
KNIME Terminology
 Components are called “Nodes”
 Programs are “Workflows”
 Reusable sets of Nodes are “Metanodes”
 Groups of related Nodes are “Extensions”
Pipeline Pilot Terminology
 Components are called “Components”
 Programs are “Protocols”
 Reusable sets of Components are “Subprotocols”
 Groups of related Components are “Packages”
 Different protocols can be combined
 One protocol provides initial UI –including a Web form
 A second protocol handles form data processing (‘work protocol’)
Different systems shown today
serve different populations
 KNIME can be used ad hoc on the desktop of a power user. It is also
used by companies in a variety of industries
 Pipeline Pilot is geared towards scientists and is part of an enterprise
system and requires a server installation
Programs can be deployed outside
the development client
 Give users a URL to access your program
 Users of BIOVIA Electronic Lab Notebook and other software can access
Pipeline Pilot protocols outside the Pipeline Pilot UI
 Users access a Web application that shows them the data they’re
looking for in a purpose-built user interface
 The application does not look like the system with which it was built
 For-sale version of KNIME Server provides similar functionality
Server Features
 User access configuration
 Shared data sources
 Automatic jobs
 Etc.
Second demo
 Exploration of data set using KNIME and Pipeline Pilot
 Data set comes from National Cancer Institute (NCI)’s Developmental
Therapeutics Program (DTP)
 Results of laboratory tests for activity against 60 types of human cancer
cell lines
 Data freely available:
https://dtp.cancer.gov/discovery_development/nci-60/default.htm
Additional demos
 Pipeline Pilot Web Port sample
Suggestions for getting started
 Download the KNIME software(knime.org)
 Install on your computer
 Look at the sample workflows
 Start simple; build up
Types of applications
 Reporting
 Data set comparisons
 ETL
 Data Analysis
References
 Scholarly article on KNIME and Pipeline Pilot
 https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3414708/
 www.knime.org
 https://www.youtube.com/user/KNIMETV
 http://accelrys.com/products/collaborative-science/biovia-
pipeline-pilot/
 https://dtp.cancer.gov/
Who is your speaker?
 Mitch Miller, Ph.D. in Chemistry and 20+ years of IT experience
 Independent consultant: Scientific Thinking, LLC
 mitch.miller@thinkscience.us
 Some recent projects
 Ongoing custodian of one chemical database implementation for
ChemIDplus project within the National Library of Medicine
 Upgraded 10-year-old Java Servlet lab workflow application to latest
version of JDK, Internet Explorer 11 and implemented enhancements
 Windows service to handle communication between 2 legacy
applications
 Import wizard for chemical array designer
 Merged a set of chemical databases and harmonized data

Contenu connexe

Similaire à Code camp 2015 visual programming mm

Kallio Chipster Bosc2008
Kallio Chipster Bosc2008Kallio Chipster Bosc2008
Kallio Chipster Bosc2008bosc_2008
 
Part 2 improving your software development v1.0
Part 2   improving your software development v1.0Part 2   improving your software development v1.0
Part 2 improving your software development v1.0Jasmine Conseil
 
Running Head WINDOWS AND LINUX 1WINDOWS AND LINUX12.docx
Running Head WINDOWS AND LINUX     1WINDOWS AND LINUX12.docxRunning Head WINDOWS AND LINUX     1WINDOWS AND LINUX12.docx
Running Head WINDOWS AND LINUX 1WINDOWS AND LINUX12.docxjeffsrosalyn
 
OSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications databaseOSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications databaseOpen Science Fair
 
Ohio LinuxFest: Crash Course in Open Source Cloud Computing
Ohio LinuxFest:  Crash Course in Open Source Cloud ComputingOhio LinuxFest:  Crash Course in Open Source Cloud Computing
Ohio LinuxFest: Crash Course in Open Source Cloud ComputingMark Hinkle
 
Fast, Secure Deployments with Docker on AWS
Fast, Secure Deployments with Docker on AWSFast, Secure Deployments with Docker on AWS
Fast, Secure Deployments with Docker on AWSAmazon Web Services
 
COMPRO- WEB ALBUM & MOTION ANALYZER
COMPRO- WEB ALBUM  & MOTION ANALYZERCOMPRO- WEB ALBUM  & MOTION ANALYZER
COMPRO- WEB ALBUM & MOTION ANALYZERAshish Tanwer
 
Applying Linux to the Civil Infrastructure
Applying Linux to the Civil InfrastructureApplying Linux to the Civil Infrastructure
Applying Linux to the Civil InfrastructureYoshitake Kobayashi
 
Frequently Used Off Host Developer Toolsl
Frequently Used Off Host Developer ToolslFrequently Used Off Host Developer Toolsl
Frequently Used Off Host Developer ToolslSharan S
 
Linuxcon 2011 Crash Course in Open Source Cloud Computing
Linuxcon 2011   Crash Course in Open Source Cloud ComputingLinuxcon 2011   Crash Course in Open Source Cloud Computing
Linuxcon 2011 Crash Course in Open Source Cloud ComputingMark Hinkle
 
Decoder Open Research Webinar
Decoder Open Research WebinarDecoder Open Research Webinar
Decoder Open Research WebinarDecoder Project
 
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docxSimulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docxedgar6wallace88877
 
The path to an hybrid open source paradigm
The path to an hybrid open source paradigmThe path to an hybrid open source paradigm
The path to an hybrid open source paradigmJonathan Challener
 
Unit Testing Essay
Unit Testing EssayUnit Testing Essay
Unit Testing EssayDani Cox
 
generic-software-process-models.ppt
generic-software-process-models.pptgeneric-software-process-models.ppt
generic-software-process-models.pptAayush847388
 

Similaire à Code camp 2015 visual programming mm (20)

Kallio Chipster Bosc2008
Kallio Chipster Bosc2008Kallio Chipster Bosc2008
Kallio Chipster Bosc2008
 
Internship msc cs
Internship msc csInternship msc cs
Internship msc cs
 
Crime security.
Crime security.Crime security.
Crime security.
 
Part 2 improving your software development v1.0
Part 2   improving your software development v1.0Part 2   improving your software development v1.0
Part 2 improving your software development v1.0
 
Running Head WINDOWS AND LINUX 1WINDOWS AND LINUX12.docx
Running Head WINDOWS AND LINUX     1WINDOWS AND LINUX12.docxRunning Head WINDOWS AND LINUX     1WINDOWS AND LINUX12.docx
Running Head WINDOWS AND LINUX 1WINDOWS AND LINUX12.docx
 
Documentation
DocumentationDocumentation
Documentation
 
OSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications databaseOSFair2017 Workshop | EGI applications database
OSFair2017 Workshop | EGI applications database
 
Ohio LinuxFest: Crash Course in Open Source Cloud Computing
Ohio LinuxFest:  Crash Course in Open Source Cloud ComputingOhio LinuxFest:  Crash Course in Open Source Cloud Computing
Ohio LinuxFest: Crash Course in Open Source Cloud Computing
 
Fast, Secure Deployments with Docker on AWS
Fast, Secure Deployments with Docker on AWSFast, Secure Deployments with Docker on AWS
Fast, Secure Deployments with Docker on AWS
 
COMPRO- WEB ALBUM & MOTION ANALYZER
COMPRO- WEB ALBUM  & MOTION ANALYZERCOMPRO- WEB ALBUM  & MOTION ANALYZER
COMPRO- WEB ALBUM & MOTION ANALYZER
 
Applying Linux to the Civil Infrastructure
Applying Linux to the Civil InfrastructureApplying Linux to the Civil Infrastructure
Applying Linux to the Civil Infrastructure
 
Frequently Used Off Host Developer Toolsl
Frequently Used Off Host Developer ToolslFrequently Used Off Host Developer Toolsl
Frequently Used Off Host Developer Toolsl
 
Linuxcon 2011 Crash Course in Open Source Cloud Computing
Linuxcon 2011   Crash Course in Open Source Cloud ComputingLinuxcon 2011   Crash Course in Open Source Cloud Computing
Linuxcon 2011 Crash Course in Open Source Cloud Computing
 
Nt1320 Unit 6
Nt1320 Unit 6Nt1320 Unit 6
Nt1320 Unit 6
 
Proposal with sdlc
Proposal with sdlcProposal with sdlc
Proposal with sdlc
 
Decoder Open Research Webinar
Decoder Open Research WebinarDecoder Open Research Webinar
Decoder Open Research Webinar
 
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docxSimulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
Simulation Modelling Practice and Theory 47 (2014) 28–45Cont.docx
 
The path to an hybrid open source paradigm
The path to an hybrid open source paradigmThe path to an hybrid open source paradigm
The path to an hybrid open source paradigm
 
Unit Testing Essay
Unit Testing EssayUnit Testing Essay
Unit Testing Essay
 
generic-software-process-models.ppt
generic-software-process-models.pptgeneric-software-process-models.ppt
generic-software-process-models.ppt
 

Dernier

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 

Dernier (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 

Code camp 2015 visual programming mm

  • 1. Visual Programming Environments for Science and Business MITCH MILLER SCIENTIFIC THINKING CODE CAMP 2015 SEPTEMBER 19, 2015
  • 2. Disclaimer  This talk represents my opinion and personal experience using 2 fine software systems developed by third parties  The software systems shown are very complex and have hundreds of components. I have only worked with a small number.  Every task shown today can be accomplished in multiple ways. I’m only showing of those ways.
  • 3. Overview  Introduction: first demo  What is a ‘visual programming environment’  The two systems we’ll look at today  What are these systems capable of?  Second set demos (in-depth)
  • 4. Demo 1: set-up  Task: produce report of all compounds registered during January
  • 5. Visual Programming: informal definition  Drag functional components onto canvas to create program  Configure most components by setting parameters  Connect components to route data from one to another  Run and observe data traveling down the lines
  • 6. Component types  File I/O  Read/write text files  Read/write MS Office documents  XML  JSON  PDF  Database access  Connect  Query  Update
  • 7. Component types (continued)  Web service consumption  Domain-specific processing  Chemical structure I/O  Chemical structure processing and analysis  Sequence processing  Extensibility  Add your own libraries for more sophisticated processing
  • 8. Component types (continued)  Visualization  Graphing  Statistical calculations  Scripting  Tip: aim for brief scripts  Data transformation  If/else processing  Filtering  Column selection  And many more…
  • 9. KNIME  Originally a production of the University of Konstanz, Germany 2004  Currently produced by KNIME.com AG, a company in Zurich, Switzerland  KNIME stands for KoNstanz Information MinEr  Pronounced “Nighm”  A general purpose data analytics platform  Free version available for download  For-sale version available with added extensions
  • 10. KNIME (continued)  Java based  Written in Java  Scripted, extensible in Java  URL: https://www.knime.org/
  • 11. Pipeline Pilot  Developed and sold by BIOVIA, San Diego, CA  Originally developed by Scitegic, San Diego in 1999  Designed for scientists to “rapidly create, test and publish scientific services that automate the process of accessing, analyzing and reporting scientific data” (http://accelrys.com/products/collaborative-science/biovia- pipeline-pilot/)  Client-server system  Commercial product  Extensible using .NET and Java  Scripted using an original language, ‘PilotScript’
  • 12. KNIME Terminology  Components are called “Nodes”  Programs are “Workflows”  Reusable sets of Nodes are “Metanodes”  Groups of related Nodes are “Extensions”
  • 13. Pipeline Pilot Terminology  Components are called “Components”  Programs are “Protocols”  Reusable sets of Components are “Subprotocols”  Groups of related Components are “Packages”  Different protocols can be combined  One protocol provides initial UI –including a Web form  A second protocol handles form data processing (‘work protocol’)
  • 14. Different systems shown today serve different populations  KNIME can be used ad hoc on the desktop of a power user. It is also used by companies in a variety of industries  Pipeline Pilot is geared towards scientists and is part of an enterprise system and requires a server installation
  • 15. Programs can be deployed outside the development client  Give users a URL to access your program  Users of BIOVIA Electronic Lab Notebook and other software can access Pipeline Pilot protocols outside the Pipeline Pilot UI  Users access a Web application that shows them the data they’re looking for in a purpose-built user interface  The application does not look like the system with which it was built  For-sale version of KNIME Server provides similar functionality
  • 16. Server Features  User access configuration  Shared data sources  Automatic jobs  Etc.
  • 17. Second demo  Exploration of data set using KNIME and Pipeline Pilot  Data set comes from National Cancer Institute (NCI)’s Developmental Therapeutics Program (DTP)  Results of laboratory tests for activity against 60 types of human cancer cell lines  Data freely available: https://dtp.cancer.gov/discovery_development/nci-60/default.htm
  • 18. Additional demos  Pipeline Pilot Web Port sample
  • 19. Suggestions for getting started  Download the KNIME software(knime.org)  Install on your computer  Look at the sample workflows  Start simple; build up
  • 20. Types of applications  Reporting  Data set comparisons  ETL  Data Analysis
  • 21. References  Scholarly article on KNIME and Pipeline Pilot  https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3414708/  www.knime.org  https://www.youtube.com/user/KNIMETV  http://accelrys.com/products/collaborative-science/biovia- pipeline-pilot/  https://dtp.cancer.gov/
  • 22. Who is your speaker?  Mitch Miller, Ph.D. in Chemistry and 20+ years of IT experience  Independent consultant: Scientific Thinking, LLC  mitch.miller@thinkscience.us  Some recent projects  Ongoing custodian of one chemical database implementation for ChemIDplus project within the National Library of Medicine  Upgraded 10-year-old Java Servlet lab workflow application to latest version of JDK, Internet Explorer 11 and implemented enhancements  Windows service to handle communication between 2 legacy applications  Import wizard for chemical array designer  Merged a set of chemical databases and harmonized data