SlideShare a Scribd company logo
1 of 28
Download to read offline
Instant JChemInstant JChem -- enabling newenabling new
ways of working with data andways of working with data andways of working with data andways of working with data and
access to new data to work withaccess to new data to work with
Dana Vanderwall
Bristol-Myers Squibb
Research Information Technology & Automation
Chemaxon US UGM, Sept 2014
1
Initial State in Chemistry AnalyticsInitial State in Chemistry Analyticsy yy y
CDR
SI FormsSI Forms
KnowledgeKnowledge
•Annotation
•Folks-onomies
Additional dataAdditional data
Manual copyManual copy
& paste,& paste,
typingtyping
SI FormsSI Forms
HPLC log P vs. rat Vds
y = 0.0344x + 3.886
R2
= 0.2737
4.00
4.50
5.00
5.50
logP
ExcelExcel
•Folks-onomies
VisualizationVisualization
ExportExport
2.00
2.50
3.00
3.50
0.00 5.00 10.00 15.00 20.00 25.00 30.00 35.00 40.00
Vds
HPLC
Scatter PlotScatter Plot
Master Spreadsheet (Excel, Word)Master Spreadsheet (Excel, Word)
Additional chemical structure analyses:
Rat Pct BoundRat Pct Bound
ExportExport
Manual copy & pasteManual copy & paste
2
Additional chemical structure analyses:
•SAR R-group analysis
•Clustering (CADD and in-house solutions)
•Predictive models (HERG, Solubility, Permeability; FACT)
compoundcompound
structures, IDsstructures, IDs
The DARE ProjectThe DARE Project
(Data & Analytics for Research)(Data & Analytics for Research)(Data & Analytics for Research)(Data & Analytics for Research)
Simplify.
• Replace legacy app/workflow… with integrated tools for analytics
• Decrease stand alone docs/reports
• Put any needed calculations & predicted properties where they’re
neededneeded
Modernize
• A new product that maintains the functionality of form view…
• Plus a richer set of views, tables, in-place conditional formatting,
graphs, & more chemistry functionality
• Learn by doing; established base camp in 1st yr, then ramped up
Phased approach to dev. & migration
y g y
• Gradually phasing in IJC over 2013-2014
3
DARE technology mapDARE technology map
User interface
gy pgy p
Drill down: web service for
conc. response curves &
secondary results
SOLR Index for text queries
(IBM Patent DB only)
Data Alerts
Annotations
Data Marts:
New data layer
for access &
integration
Lead
Evaluation
PAMPACellular CYP Inh
DWG A
Enzyme
DWG A
Cellular
DWG B
Receptor
DWG B
Cellular
Data common to most DWGs Data unique to a DWG
ss
Evaluation
Profiling:
Enzymes
MetStab CYP
Induction
DWG A
selectivity
DWG B
Selectivity
InformaticaInformatica
Operational
Screening
BioBook
calculated fields
Chemical
structures,
properties,
calculated fields
Meta DataAnnotation
WebWeb
servicesservices
Central Data
Repository (CDR)
4
DARE TimelinesDARE Timelines
H1-2012 H2-2012 H1-2013 H2-2013 H1-2014 H2-2014
Phase 1:
Informatica &
ChemAxon
Set-up ,
Prototyping
and Build
Deliverable
5 DWGs deployed Deliverableand Build 5 DWGs deployed Deliverable
~40 DWGs
deployed
Deliverables
~20 DWGs
deployed
Phase 2:
GUI & Datamart –
Prototyping and
B ild
Phase 3:
GUI & Mart – Deployment
deployed
Phase 4:
Build
Phase 4:
GUI & Mart – Deployment
Decommissions
5
Start with the basics & build upStart with the basics & build uppp
Foundation
• Program Specific Forms and use cases
• Universal Forms (profiling platforms or compilations of data commonly
used)
Extended use cases
• Use cases requiring bespoke data structures, scripting, or visualization
• Unique data sources, combinations of data, all biological data
• Hooks into internal web services: drill down for curves/secondary data
Extended functionality
6
Hooks into internal web services: drill down for curves/secondary data
• Query to SOLR index
• Data Alerts
DatamartDatamart InfrastructureInfrastructure
General
• ETL from primarily CDR, some additional sources
• Provides environment to create tables & other data structures for IJC
• Tables in IJC not enormously popular with users
• Comfort and orientation with data in text box, fixed in position on form
‘Cell Factory’
, p
• Cell = entity in oracle that effectively provides the data for one assay;
CDR queries sometimes require complex set of conditions
• Captures metadata associated with cell creation, keeps them unique,
etcetc
Incremental updates
7
• Via Informatica Power Center, 15-30 min incremental updates
• Gentle failure in face of long running jobs
Data management v1Data management v1gg
BA catalogs data required
for new project teamfor new project team
Passes it to DB
Manually:
• New tables/entities
promoted to IJC
• New data tree created
• Build Form
• Add newPasses it to DB
developer to define
new ETL
New data tree created
• Build edges cells/columns to form
IJCIJC
FormsForms
DAREDARE
D t tD t t
IJC SchemaIJC SchemaCDRCDR
ETL
FormsForms
Data martData mart
Manual coding/scripting
Rate determining step
8
Rate determining step
DB development not self documenting
Automated data managementAutomated data managementgg
UserUser
• UI to search/define/create cells, tables, calc. fields
• Consumes metadata & creates meta data ‘cell’
definitioncreatescreates
cell/tablecell/table
definition
• Promotes the new table / new fields into IJC
• If it’s a new entity then
o Creates a new data Tree using a data tree Template
Add th T bl t th d t t
MetadataMetadata
UIUI
RepositoryRepository
MetadataMetadata
RepositoryRepository
o Adds the new Table to the new data tree
o Create a new form on the new data tree
• Creates edges
Auto PromotionAuto PromotionETL
IJCIJC
FormsForms
DAREDARE
Data martData mart
IJC SchemaIJC SchemaCDRCDR
ETL
Promote QueuePromote Queue
Data martData mart
• Creates tables & columns
immediately upon cell ‘activation’ 9
ScaleScale
Instant JChem
• 1455 forms + Grids
• 288 saved queries
Data
• 211 data trees
• 526 ‘entities’
Traffic
• 631 users (to date)
• 1000-2000 db
• 474 saved lists
• 10 scripts
• 8 schema
• 2400 assays
• 41,571 ‘fields’
connections daily
10
The flexibility of [datamart + IJC] have enabled
solutions well beyond the standard ‘program’ formsolutions well beyond the standard program form
IBM
PatentExternal data source
Novel & multiple data
structures & presentationsPatent
DB
HTMetabolite
structures & presentations
IJC
MutagenesisDB
Visualizations;
Integration of custom
scripts & calculations
Datamarts
Chiral
Alliance
D t
scripts & calculations
Chiral
Separations
Drug Safety
Data
AccessIntegration active &
historical of BMS data
Drug Safety
Warehouse
Integration of BMS data not in the CDR
11
HighHigh--Throughput Mutagenesis: SAR, but differentThroughput Mutagenesis: SAR, but differentgg g p gg p g
• Lead Evaluation Applied Genomics Research IT & Automation ComputerLead Evaluation, Applied Genomics, Research IT & Automation, Computer
Aided Drug Design designed & built cloning and screening platform
• >150 mutants, testing >30 compounds
12
A Different DataA Different Data ScaleScale
For each cmpd compare WT to 150 mutants For 30 compounds
13
Endpoint variation over mutants by compoundEndpoint variation over mutants by compound
Datamart
• Mutagenesis datamart created drawing on data from 2 operational data sources
• ‘Mart generation automated & refreshed as new data is available
• DataMart structure is heavily augmented based on the need of Instant JChem(IJC)y g ( )
• Utilize IJC’s flexible entity relationship model & charting fxns to aid data visualization
14
All compounds per endpoint variation over mutantsAll compounds per endpoint variation over mutantsAll compounds per endpoint variation over mutantsAll compounds per endpoint variation over mutants
• Offered summary birds-eye view on all compounds by each individual
result type (EC50, WTRATIO, KBWTRATIO etc) to identify trendsresult type (EC50, WTRATIO, KBWTRATIO etc) to identify trends
• Compound as column header- a novel pivot
15
Shift workload from queryquery & discovering todiscovering to
alertingalerting & reportingreportinggg p gp g
 Define what the teams want to monitor
 Automate the delivery of new data packages
Base case: Go find the data & construct analysis
Open SI
Forms
Open form query
SelectSelect
data
Export
data
Import
data
Table or
visualization
Table or
visualization
Is my data there yet?
Q d t
New capability: Push data alerts
Is my data there yet?
Is my data….
Instant
JChem with
new data
Spreadsheet &
link to open
project-form-list
in IJC
Datamart
Query data
source
in IJCin IJC
Automated email to user
when new data
User data alert
parameters
16
Alert manager (internal GWT), 2Alert manager (internal GWT), 2--wayway
integration with IJCintegration with IJCgg
• Grab active data tree ID and bring it to alert tool
• Take all the assays under the data tree as selection
source for data alert
St thi i f ti d t h it i t DMART• Store this information and match it against DMART
• Create hit list using compound ID/lot ID as
‘permanent list’.
• Send the link to subscribers
17
What do the users think about all this?What do the users think about all this?
• Change is never easyg y
• Sub-populations are attracted to new capabilities and
adopt new tools and practices
• Others need more encouragement; stability is critical
• Maintaining the capabilities of the familiar and well
understood in the new environment a pre requisite forunderstood in the new environment a pre-requisite for
complete migration
• We’re getting thereg g
18
Legacy application usage vs. Instant JChem
Unique Users per Month
Announcement of SI
Forms retirement
700
800
900
500
600
DARE
SI
200
300
400
SI
0
100
200
Mar 2014 Apr 2014 May 2014 Jun 2014Mar 2014 Apr 2014 May 2014 Jun 2014
19
Reduced number of data sets exported for analysis
Number of Data Exports per month
2600
2700
Number of Data Exports per month
2400
2500
2300
2400
2100
2200
1900
2000
2014 - MAR 2014 - APR 2014 - May 2014 - Jun
20
Monitor URL Sharing in IJC
70
80
Launched URLs
50
60
70
30
40 Total Form URL
List URLs
Query URLs
10
20
0
1 2 3 4 5 6
2014
21
a moment for reflectiona moment for reflection
cause for dancing
• Conditional formatting!
what we learned
• Train just in time
coaching
• More thorough regressionConditional formatting!
• Grid view
• Query builder
• Query/browse
performance!
Train just in time
• Listen; listen some more
• STOP the presses if it’s not
right- they’d rather wait
• Simple >> rich
More thorough regression
testing
• Clearer release notes
• Login/start-up
performanceperformance!
• Tabbed panes
• URL sharing*
• Help from CXN!!
• Simple >> rich
• Provide a thread of
continuity to lead through
new tools
• Don’t disrupt the
performance
• List query result retains
original order
• Cleaner Excel export, keep
structure orientation• Don t disrupt the
workflow, let it evolve
structure orientation
• More conversations!
• Web services
• Plexus! 22
The DARE teamThe DARE team
Heather Artman Dong Li
Acknowledgements
Scientific Computing
Core Team
Heather Artman
Dawn Cohen
John Duncan
Dong Li
Mark Manfredi
Minimol Mathew
Scientific Computing
Services
Ray Reichard
Padma Vellanki
Ramesh Durvasula
James Ewen
Lisa Johnson
Christa Musial
Matthias Nolte
Anusha Ramanathan
Padma Vellanki
Thomas Curneal
Mike Beluch
Lisa Johnson
Sangeet Khullar
Anusha Ramanathan
David Vanderbrooke
Dana Vanderwall
Nelly Masias
Mahesh Nawade
BMS Internal 23
End user supportEnd user support
Support email group
User Community SharePoint
 Training and reference, FAQs, External links, contact info
 All reported issues and status All reported issues and status
– [open, in progress, scheduled fix/improvement, resolved]
Internal BMS User Group Meeting
1 h thl i t d t ti & t i i f 2 3 i l 1 hr. monthly session to cover demonstration & training for 2-3 special
topics or features
 Topics drawn from suggestions and requests for more info or training;
topics covered to date:topics covered to date:
– IJC: Query Builder; Visualization; Sharing by URL; exporting; working list (pick
list); R group decomposition; Markush draw/search
– JChem4XL- patent doc creation
BMS Internal
p
– IBM Patent Database; Metabolite Database
24
Assay meta dataAssay meta datayy
• Describe assay protocol & conditions
in controlled vocabulary
Biological description
 Targety
• Protocols would have a minimum set
of fields that would have to be
populated before going into production
 Gene name (look-up, and
capture locus link)
 Species
C ll t
• Opportunity for business rules that guide the
protocol registration
• All downstream systems would utilize
th f k & t d t
 Cell type
Assay description
 Assay type
A dthe same framework & meta-data
• Propose adopting established
standard, aligning/collaborating with
 Assay mode
 Detection method
Results
R lt t
NIH BARD Project & BioAssay
Ontology (BAO)
• Requires process & roles for
 Result type
 Modifier
 Units
t
Requires process & roles for
maintaining up to date dictionaries and
governance
 etc
25
BAO scope and purposeBAO scope and purposep p pp p p
• BAO to describe assays and screening results
• Defines relevant assays and result annotations
• Provides controlled terminology
Formalizes knowledge of assays and screening results• Formalizes knowledge of assays and screening results
• Describes and formalizes screening campaigns, i.e.
relationship between assays in terms of their use
• BAO addresses problems with using data and
facilitates
• Leveraging existing data in discovery projects
• Global analysis across diverse data sets
I t ti f d t f diff t• Integration of data from different resources
26
What do we need to describe assaysWhat do we need to describe assaysyy
27
28

More Related Content

What's hot

Introduction to ETL process
Introduction to ETL process Introduction to ETL process
Introduction to ETL process Omid Vahdaty
 
Migrating Data Warehouse Solutions from Oracle to non-Oracle Databases
Migrating Data Warehouse Solutions from Oracle to non-Oracle DatabasesMigrating Data Warehouse Solutions from Oracle to non-Oracle Databases
Migrating Data Warehouse Solutions from Oracle to non-Oracle DatabasesJade Global
 
Data Warehouse By Piyush
Data Warehouse By PiyushData Warehouse By Piyush
Data Warehouse By Piyushastronish
 
Data Warehouse (ETL) testing process
Data Warehouse (ETL) testing processData Warehouse (ETL) testing process
Data Warehouse (ETL) testing processRakesh Hansalia
 
ADaM datasets for graphs (paper)
ADaM datasets for graphs (paper)ADaM datasets for graphs (paper)
ADaM datasets for graphs (paper)Kevin Lee
 
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...OSTHUS
 
Online retail a look at data consulting approach
Online retail   a look at data consulting approachOnline retail   a look at data consulting approach
Online retail a look at data consulting approachShesha R
 
Data analytcis-first-steps
Data analytcis-first-stepsData analytcis-first-steps
Data analytcis-first-stepsShesha R
 
Interpreting CDISC ADaM IG through Users Interpretation
Interpreting CDISC ADaM IG through Users InterpretationInterpreting CDISC ADaM IG through Users Interpretation
Interpreting CDISC ADaM IG through Users InterpretationAngelo Tinazzi
 
Etl process in data warehouse
Etl process in data warehouseEtl process in data warehouse
Etl process in data warehouseKomal Choudhary
 
What is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data WharehouseWhat is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data WharehouseBugRaptors
 
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterReinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterOSTHUS
 
Etl testing contents
Etl testing contentsEtl testing contents
Etl testing contentsManoj Jagtap
 
Metadata and ADaM
Metadata and ADaMMetadata and ADaM
Metadata and ADaMKevin Lee
 
Classification and prediction in data mining
Classification and prediction in data miningClassification and prediction in data mining
Classification and prediction in data miningEr. Nawaraj Bhandari
 
Software for Qualitative and Quantitative Data Analysis
Software for Qualitative and Quantitative Data AnalysisSoftware for Qualitative and Quantitative Data Analysis
Software for Qualitative and Quantitative Data AnalysisAlexandru Caratas Ghenea
 
OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingPrithwis Mukerjee
 

What's hot (20)

Introduction to ETL process
Introduction to ETL process Introduction to ETL process
Introduction to ETL process
 
Migrating Data Warehouse Solutions from Oracle to non-Oracle Databases
Migrating Data Warehouse Solutions from Oracle to non-Oracle DatabasesMigrating Data Warehouse Solutions from Oracle to non-Oracle Databases
Migrating Data Warehouse Solutions from Oracle to non-Oracle Databases
 
Data Warehouse By Piyush
Data Warehouse By PiyushData Warehouse By Piyush
Data Warehouse By Piyush
 
Data Warehouse (ETL) testing process
Data Warehouse (ETL) testing processData Warehouse (ETL) testing process
Data Warehouse (ETL) testing process
 
ADaM datasets for graphs (paper)
ADaM datasets for graphs (paper)ADaM datasets for graphs (paper)
ADaM datasets for graphs (paper)
 
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...Revolutionizing Laboratory  Instrument Data for the  Pharmaceutical Industry:...
Revolutionizing Laboratory Instrument Data for the Pharmaceutical Industry:...
 
Online retail a look at data consulting approach
Online retail   a look at data consulting approachOnline retail   a look at data consulting approach
Online retail a look at data consulting approach
 
Data analytcis-first-steps
Data analytcis-first-stepsData analytcis-first-steps
Data analytcis-first-steps
 
Interpreting CDISC ADaM IG through Users Interpretation
Interpreting CDISC ADaM IG through Users InterpretationInterpreting CDISC ADaM IG through Users Interpretation
Interpreting CDISC ADaM IG through Users Interpretation
 
Etl process in data warehouse
Etl process in data warehouseEtl process in data warehouse
Etl process in data warehouse
 
What is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data WharehouseWhat is ETL testing & how to enforce it in Data Wharehouse
What is ETL testing & how to enforce it in Data Wharehouse
 
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & FasterReinventing Laboratory Data To Be Bigger, Smarter & Faster
Reinventing Laboratory Data To Be Bigger, Smarter & Faster
 
Etl testing contents
Etl testing contentsEtl testing contents
Etl testing contents
 
MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018MOCHA 2018 Challenge @ ESWC2018
MOCHA 2018 Challenge @ ESWC2018
 
Sas demo
Sas demoSas demo
Sas demo
 
Metadata and ADaM
Metadata and ADaMMetadata and ADaM
Metadata and ADaM
 
Classification and prediction in data mining
Classification and prediction in data miningClassification and prediction in data mining
Classification and prediction in data mining
 
Software for Qualitative and Quantitative Data Analysis
Software for Qualitative and Quantitative Data AnalysisSoftware for Qualitative and Quantitative Data Analysis
Software for Qualitative and Quantitative Data Analysis
 
OLAP Cubes in Datawarehousing
OLAP Cubes in DatawarehousingOLAP Cubes in Datawarehousing
OLAP Cubes in Datawarehousing
 
E05312426
E05312426E05312426
E05312426
 

Similar to USUGM 2014 - Dana Vanderwall (Bristol-Myers Squibb): Instant JChem

The challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&DThe challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&DLaura Berry
 
Predictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing MeetupPredictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing MeetupCaserta
 
Alten calsoft labs analytics service offerings
Alten calsoft labs   analytics service offeringsAlten calsoft labs   analytics service offerings
Alten calsoft labs analytics service offeringsSandeep Vyas
 
Data Ops at TripActions
Data Ops at TripActionsData Ops at TripActions
Data Ops at TripActionsRob Winters
 
20150814 Wrangling Data From Raw to Tidy vs
20150814 Wrangling Data From Raw to Tidy vs20150814 Wrangling Data From Raw to Tidy vs
20150814 Wrangling Data From Raw to Tidy vsIan Feller
 
Applying linear regression and predictive analytics
Applying linear regression and predictive analyticsApplying linear regression and predictive analytics
Applying linear regression and predictive analyticsMariaDB plc
 
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedCrossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedRobert Grossman
 
Reports & Analysis_Katalyst HLS
Reports & Analysis_Katalyst HLSReports & Analysis_Katalyst HLS
Reports & Analysis_Katalyst HLSKatalyst HLS
 
rough-work.pptx
rough-work.pptxrough-work.pptx
rough-work.pptxsharpan
 
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...Big Data Value Association
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionProvectus
 
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Denodo
 
Big Data Day LA 2015 - Scalable and High-Performance Analytics with Distribut...
Big Data Day LA 2015 - Scalable and High-Performance Analytics with Distribut...Big Data Day LA 2015 - Scalable and High-Performance Analytics with Distribut...
Big Data Day LA 2015 - Scalable and High-Performance Analytics with Distribut...Data Con LA
 
Building Custom Big Data Integrations
Building Custom Big Data IntegrationsBuilding Custom Big Data Integrations
Building Custom Big Data IntegrationsPat Patterson
 
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...RTTS
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?RTTS
 
Analysis of economic data using big data
Analysis of economic data using big data Analysis of economic data using big data
Analysis of economic data using big data Shivu Manjesh
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationDenodo
 

Similar to USUGM 2014 - Dana Vanderwall (Bristol-Myers Squibb): Instant JChem (20)

The challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&DThe challenges of Analytical Data Management in R&D
The challenges of Analytical Data Management in R&D
 
Predictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing MeetupPredictive Analytics - Big Data Warehousing Meetup
Predictive Analytics - Big Data Warehousing Meetup
 
Analytics&IoT
Analytics&IoTAnalytics&IoT
Analytics&IoT
 
Alten calsoft labs analytics service offerings
Alten calsoft labs   analytics service offeringsAlten calsoft labs   analytics service offerings
Alten calsoft labs analytics service offerings
 
Data Ops at TripActions
Data Ops at TripActionsData Ops at TripActions
Data Ops at TripActions
 
20150814 Wrangling Data From Raw to Tidy vs
20150814 Wrangling Data From Raw to Tidy vs20150814 Wrangling Data From Raw to Tidy vs
20150814 Wrangling Data From Raw to Tidy vs
 
Applying linear regression and predictive analytics
Applying linear regression and predictive analyticsApplying linear regression and predictive analytics
Applying linear regression and predictive analytics
 
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed DeployedCrossing the Analytics Chasm and Getting the Models You Developed Deployed
Crossing the Analytics Chasm and Getting the Models You Developed Deployed
 
Reports & Analysis_Katalyst HLS
Reports & Analysis_Katalyst HLSReports & Analysis_Katalyst HLS
Reports & Analysis_Katalyst HLS
 
rough-work.pptx
rough-work.pptxrough-work.pptx
rough-work.pptx
 
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
BDVe Webinar Series - Designing Big Data pipelines with Toreador (Ernesto Dam...
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
 
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
Bridging the Last Mile: Getting Data to the People Who Need It (APAC)
 
Big Data Day LA 2015 - Scalable and High-Performance Analytics with Distribut...
Big Data Day LA 2015 - Scalable and High-Performance Analytics with Distribut...Big Data Day LA 2015 - Scalable and High-Performance Analytics with Distribut...
Big Data Day LA 2015 - Scalable and High-Performance Analytics with Distribut...
 
Building Custom Big Data Integrations
Building Custom Big Data IntegrationsBuilding Custom Big Data Integrations
Building Custom Big Data Integrations
 
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...
Big Data Testing : Automate theTesting of Hadoop, NoSQL & DWH without Writing...
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 
What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?What is a Data Warehouse and How Do I Test It?
What is a Data Warehouse and How Do I Test It?
 
Analysis of economic data using big data
Analysis of economic data using big data Analysis of economic data using big data
Analysis of economic data using big data
 
Advanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data VirtualizationAdvanced Analytics and Machine Learning with Data Virtualization
Advanced Analytics and Machine Learning with Data Virtualization
 

More from ChemAxon

Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?ChemAxon
 
Chemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive modelsChemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive modelsChemAxon
 
Translating data to predictive models
Translating data to predictive modelsTranslating data to predictive models
Translating data to predictive modelsChemAxon
 
Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...ChemAxon
 
Biomolecule structural data management
Biomolecule structural data managementBiomolecule structural data management
Biomolecule structural data managementChemAxon
 
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first releaseCheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first releaseChemAxon
 
Enhanced stereochemistry representation
Enhanced stereochemistry representation Enhanced stereochemistry representation
Enhanced stereochemistry representation ChemAxon
 
Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...ChemAxon
 
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...ChemAxon
 
Patent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug DiscoveryPatent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug DiscoveryChemAxon
 
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...ChemAxon
 
Research data management on the cloud
Research data management on the cloudResearch data management on the cloud
Research data management on the cloudChemAxon
 
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound RegistrationCheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound RegistrationChemAxon
 
Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction ChemAxon
 
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...ChemAxon
 
Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology ChemAxon
 
JChem Microservices
JChem MicroservicesJChem Microservices
JChem MicroservicesChemAxon
 
Migration from joc to jpc or choral
Migration from joc to jpc or choralMigration from joc to jpc or choral
Migration from joc to jpc or choralChemAxon
 
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5ChemAxon
 
Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5ChemAxon
 

More from ChemAxon (20)

Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
Akos Tarcsay (ChemAxon): How fast is Chemaxon RDBMS Search?
 
Chemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive modelsChemaxon EU UGM 2022 | Translating data to predictive models
Chemaxon EU UGM 2022 | Translating data to predictive models
 
Translating data to predictive models
Translating data to predictive modelsTranslating data to predictive models
Translating data to predictive models
 
Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...Efficient biomolecular structural data handling and analysis - Webinar with D...
Efficient biomolecular structural data handling and analysis - Webinar with D...
 
Biomolecule structural data management
Biomolecule structural data managementBiomolecule structural data management
Biomolecule structural data management
 
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first releaseCheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
Cheminfo Stories 2021 | Virtual UGM | Marvin Pro: The first release
 
Enhanced stereochemistry representation
Enhanced stereochemistry representation Enhanced stereochemistry representation
Enhanced stereochemistry representation
 
Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...Intellectual property (IP) intelligence solutions designed for the way resear...
Intellectual property (IP) intelligence solutions designed for the way resear...
 
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
GPS for Chemical Space - Digital Assistants to Support Molecule Design - Chem...
 
Patent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug DiscoveryPatent Data for Artificial Intelligence based Drug Discovery
Patent Data for Artificial Intelligence based Drug Discovery
 
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
Cheminfo Stories APAC 2020 - Chemical Descriptors & Standardizers for Machine...
 
Research data management on the cloud
Research data management on the cloudResearch data management on the cloud
Research data management on the cloud
 
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound RegistrationCheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
Cheminfo Stories APAC 2020 - Introducing Design Hub & Compound Registration
 
Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction Cheminfo Stories APAC 2020 - JChem Engines introduction
Cheminfo Stories APAC 2020 - JChem Engines introduction
 
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
Cheminfo Stories APAC 2020 - Database management on desktop with JChem for Of...
 
Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology Cheminfo Stories APAC 2020 -- Markush technology
Cheminfo Stories APAC 2020 -- Markush technology
 
JChem Microservices
JChem MicroservicesJChem Microservices
JChem Microservices
 
Migration from joc to jpc or choral
Migration from joc to jpc or choralMigration from joc to jpc or choral
Migration from joc to jpc or choral
 
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
ChemAxon's Compliance Checker - Cheminfo Stories 2020 Day 5
 
Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5Chemicalize Pro - Cheminfo Stories 2020 Day 5
Chemicalize Pro - Cheminfo Stories 2020 Day 5
 

Recently uploaded

HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comFatema Valibhai
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...harshavardhanraghave
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfkalichargn70th171
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️Delhi Call girls
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...Health
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...panagenda
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Modelsaagamshah0812
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...MyIntelliSource, Inc.
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...ICS
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...OnePlan Solutions
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsAlberto González Trastoy
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsJhone kinadey
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxComplianceQuest1
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Steffen Staab
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AIABDERRAOUF MEHENNI
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...kellynguyen01
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️anilsa9823
 

Recently uploaded (20)

HR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.comHR Software Buyers Guide in 2024 - HRSoftware.com
HR Software Buyers Guide in 2024 - HRSoftware.com
 
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
Reassessing the Bedrock of Clinical Function Models: An Examination of Large ...
 
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdfLearn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
Learn the Fundamentals of XCUITest Framework_ A Beginner's Guide.pdf
 
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICECHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
CHEAP Call Girls in Pushp Vihar (-DELHI )🔝 9953056974🔝(=)/CALL GIRLS SERVICE
 
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
call girls in Vaishali (Ghaziabad) 🔝 >༒8448380779 🔝 genuine Escort Service 🔝✔️✔️
 
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
+971565801893>>SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHAB...
 
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
W01_panagenda_Navigating-the-Future-with-The-Hitchhikers-Guide-to-Notes-and-D...
 
Unlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language ModelsUnlocking the Future of AI Agents with Large Language Models
Unlocking the Future of AI Agents with Large Language Models
 
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
Steps To Getting Up And Running Quickly With MyTimeClock Employee Scheduling ...
 
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
The Real-World Challenges of Medical Device Cybersecurity- Mitigating Vulnera...
 
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS LiveVip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
Vip Call Girls Noida ➡️ Delhi ➡️ 9999965857 No Advance 24HRS Live
 
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
Tech Tuesday-Harness the Power of Effective Resource Planning with OnePlan’s ...
 
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time ApplicationsUnveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
Unveiling the Tech Salsa of LAMs with Janus in Real-Time Applications
 
Right Money Management App For Your Financial Goals
Right Money Management App For Your Financial GoalsRight Money Management App For Your Financial Goals
Right Money Management App For Your Financial Goals
 
A Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docxA Secure and Reliable Document Management System is Essential.docx
A Secure and Reliable Document Management System is Essential.docx
 
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
Shapes for Sharing between Graph Data Spaces - and Epistemic Querying of RDF-...
 
Microsoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdfMicrosoft AI Transformation Partner Playbook.pdf
Microsoft AI Transformation Partner Playbook.pdf
 
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AISyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
SyndBuddy AI 2k Review 2024: Revolutionizing Content Syndication with AI
 
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
Short Story: Unveiling the Reasoning Abilities of Large Language Models by Ke...
 
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online  ☂️
CALL ON ➥8923113531 🔝Call Girls Kakori Lucknow best sexual service Online ☂️
 

USUGM 2014 - Dana Vanderwall (Bristol-Myers Squibb): Instant JChem

  • 1. Instant JChemInstant JChem -- enabling newenabling new ways of working with data andways of working with data andways of working with data andways of working with data and access to new data to work withaccess to new data to work with Dana Vanderwall Bristol-Myers Squibb Research Information Technology & Automation Chemaxon US UGM, Sept 2014 1
  • 2. Initial State in Chemistry AnalyticsInitial State in Chemistry Analyticsy yy y CDR SI FormsSI Forms KnowledgeKnowledge •Annotation •Folks-onomies Additional dataAdditional data Manual copyManual copy & paste,& paste, typingtyping SI FormsSI Forms HPLC log P vs. rat Vds y = 0.0344x + 3.886 R2 = 0.2737 4.00 4.50 5.00 5.50 logP ExcelExcel •Folks-onomies VisualizationVisualization ExportExport 2.00 2.50 3.00 3.50 0.00 5.00 10.00 15.00 20.00 25.00 30.00 35.00 40.00 Vds HPLC Scatter PlotScatter Plot Master Spreadsheet (Excel, Word)Master Spreadsheet (Excel, Word) Additional chemical structure analyses: Rat Pct BoundRat Pct Bound ExportExport Manual copy & pasteManual copy & paste 2 Additional chemical structure analyses: •SAR R-group analysis •Clustering (CADD and in-house solutions) •Predictive models (HERG, Solubility, Permeability; FACT) compoundcompound structures, IDsstructures, IDs
  • 3. The DARE ProjectThe DARE Project (Data & Analytics for Research)(Data & Analytics for Research)(Data & Analytics for Research)(Data & Analytics for Research) Simplify. • Replace legacy app/workflow… with integrated tools for analytics • Decrease stand alone docs/reports • Put any needed calculations & predicted properties where they’re neededneeded Modernize • A new product that maintains the functionality of form view… • Plus a richer set of views, tables, in-place conditional formatting, graphs, & more chemistry functionality • Learn by doing; established base camp in 1st yr, then ramped up Phased approach to dev. & migration y g y • Gradually phasing in IJC over 2013-2014 3
  • 4. DARE technology mapDARE technology map User interface gy pgy p Drill down: web service for conc. response curves & secondary results SOLR Index for text queries (IBM Patent DB only) Data Alerts Annotations Data Marts: New data layer for access & integration Lead Evaluation PAMPACellular CYP Inh DWG A Enzyme DWG A Cellular DWG B Receptor DWG B Cellular Data common to most DWGs Data unique to a DWG ss Evaluation Profiling: Enzymes MetStab CYP Induction DWG A selectivity DWG B Selectivity InformaticaInformatica Operational Screening BioBook calculated fields Chemical structures, properties, calculated fields Meta DataAnnotation WebWeb servicesservices Central Data Repository (CDR) 4
  • 5. DARE TimelinesDARE Timelines H1-2012 H2-2012 H1-2013 H2-2013 H1-2014 H2-2014 Phase 1: Informatica & ChemAxon Set-up , Prototyping and Build Deliverable 5 DWGs deployed Deliverableand Build 5 DWGs deployed Deliverable ~40 DWGs deployed Deliverables ~20 DWGs deployed Phase 2: GUI & Datamart – Prototyping and B ild Phase 3: GUI & Mart – Deployment deployed Phase 4: Build Phase 4: GUI & Mart – Deployment Decommissions 5
  • 6. Start with the basics & build upStart with the basics & build uppp Foundation • Program Specific Forms and use cases • Universal Forms (profiling platforms or compilations of data commonly used) Extended use cases • Use cases requiring bespoke data structures, scripting, or visualization • Unique data sources, combinations of data, all biological data • Hooks into internal web services: drill down for curves/secondary data Extended functionality 6 Hooks into internal web services: drill down for curves/secondary data • Query to SOLR index • Data Alerts
  • 7. DatamartDatamart InfrastructureInfrastructure General • ETL from primarily CDR, some additional sources • Provides environment to create tables & other data structures for IJC • Tables in IJC not enormously popular with users • Comfort and orientation with data in text box, fixed in position on form ‘Cell Factory’ , p • Cell = entity in oracle that effectively provides the data for one assay; CDR queries sometimes require complex set of conditions • Captures metadata associated with cell creation, keeps them unique, etcetc Incremental updates 7 • Via Informatica Power Center, 15-30 min incremental updates • Gentle failure in face of long running jobs
  • 8. Data management v1Data management v1gg BA catalogs data required for new project teamfor new project team Passes it to DB Manually: • New tables/entities promoted to IJC • New data tree created • Build Form • Add newPasses it to DB developer to define new ETL New data tree created • Build edges cells/columns to form IJCIJC FormsForms DAREDARE D t tD t t IJC SchemaIJC SchemaCDRCDR ETL FormsForms Data martData mart Manual coding/scripting Rate determining step 8 Rate determining step DB development not self documenting
  • 9. Automated data managementAutomated data managementgg UserUser • UI to search/define/create cells, tables, calc. fields • Consumes metadata & creates meta data ‘cell’ definitioncreatescreates cell/tablecell/table definition • Promotes the new table / new fields into IJC • If it’s a new entity then o Creates a new data Tree using a data tree Template Add th T bl t th d t t MetadataMetadata UIUI RepositoryRepository MetadataMetadata RepositoryRepository o Adds the new Table to the new data tree o Create a new form on the new data tree • Creates edges Auto PromotionAuto PromotionETL IJCIJC FormsForms DAREDARE Data martData mart IJC SchemaIJC SchemaCDRCDR ETL Promote QueuePromote Queue Data martData mart • Creates tables & columns immediately upon cell ‘activation’ 9
  • 10. ScaleScale Instant JChem • 1455 forms + Grids • 288 saved queries Data • 211 data trees • 526 ‘entities’ Traffic • 631 users (to date) • 1000-2000 db • 474 saved lists • 10 scripts • 8 schema • 2400 assays • 41,571 ‘fields’ connections daily 10
  • 11. The flexibility of [datamart + IJC] have enabled solutions well beyond the standard ‘program’ formsolutions well beyond the standard program form IBM PatentExternal data source Novel & multiple data structures & presentationsPatent DB HTMetabolite structures & presentations IJC MutagenesisDB Visualizations; Integration of custom scripts & calculations Datamarts Chiral Alliance D t scripts & calculations Chiral Separations Drug Safety Data AccessIntegration active & historical of BMS data Drug Safety Warehouse Integration of BMS data not in the CDR 11
  • 12. HighHigh--Throughput Mutagenesis: SAR, but differentThroughput Mutagenesis: SAR, but differentgg g p gg p g • Lead Evaluation Applied Genomics Research IT & Automation ComputerLead Evaluation, Applied Genomics, Research IT & Automation, Computer Aided Drug Design designed & built cloning and screening platform • >150 mutants, testing >30 compounds 12
  • 13. A Different DataA Different Data ScaleScale For each cmpd compare WT to 150 mutants For 30 compounds 13
  • 14. Endpoint variation over mutants by compoundEndpoint variation over mutants by compound Datamart • Mutagenesis datamart created drawing on data from 2 operational data sources • ‘Mart generation automated & refreshed as new data is available • DataMart structure is heavily augmented based on the need of Instant JChem(IJC)y g ( ) • Utilize IJC’s flexible entity relationship model & charting fxns to aid data visualization 14
  • 15. All compounds per endpoint variation over mutantsAll compounds per endpoint variation over mutantsAll compounds per endpoint variation over mutantsAll compounds per endpoint variation over mutants • Offered summary birds-eye view on all compounds by each individual result type (EC50, WTRATIO, KBWTRATIO etc) to identify trendsresult type (EC50, WTRATIO, KBWTRATIO etc) to identify trends • Compound as column header- a novel pivot 15
  • 16. Shift workload from queryquery & discovering todiscovering to alertingalerting & reportingreportinggg p gp g  Define what the teams want to monitor  Automate the delivery of new data packages Base case: Go find the data & construct analysis Open SI Forms Open form query SelectSelect data Export data Import data Table or visualization Table or visualization Is my data there yet? Q d t New capability: Push data alerts Is my data there yet? Is my data…. Instant JChem with new data Spreadsheet & link to open project-form-list in IJC Datamart Query data source in IJCin IJC Automated email to user when new data User data alert parameters 16
  • 17. Alert manager (internal GWT), 2Alert manager (internal GWT), 2--wayway integration with IJCintegration with IJCgg • Grab active data tree ID and bring it to alert tool • Take all the assays under the data tree as selection source for data alert St thi i f ti d t h it i t DMART• Store this information and match it against DMART • Create hit list using compound ID/lot ID as ‘permanent list’. • Send the link to subscribers 17
  • 18. What do the users think about all this?What do the users think about all this? • Change is never easyg y • Sub-populations are attracted to new capabilities and adopt new tools and practices • Others need more encouragement; stability is critical • Maintaining the capabilities of the familiar and well understood in the new environment a pre requisite forunderstood in the new environment a pre-requisite for complete migration • We’re getting thereg g 18
  • 19. Legacy application usage vs. Instant JChem Unique Users per Month Announcement of SI Forms retirement 700 800 900 500 600 DARE SI 200 300 400 SI 0 100 200 Mar 2014 Apr 2014 May 2014 Jun 2014Mar 2014 Apr 2014 May 2014 Jun 2014 19
  • 20. Reduced number of data sets exported for analysis Number of Data Exports per month 2600 2700 Number of Data Exports per month 2400 2500 2300 2400 2100 2200 1900 2000 2014 - MAR 2014 - APR 2014 - May 2014 - Jun 20
  • 21. Monitor URL Sharing in IJC 70 80 Launched URLs 50 60 70 30 40 Total Form URL List URLs Query URLs 10 20 0 1 2 3 4 5 6 2014 21
  • 22. a moment for reflectiona moment for reflection cause for dancing • Conditional formatting! what we learned • Train just in time coaching • More thorough regressionConditional formatting! • Grid view • Query builder • Query/browse performance! Train just in time • Listen; listen some more • STOP the presses if it’s not right- they’d rather wait • Simple >> rich More thorough regression testing • Clearer release notes • Login/start-up performanceperformance! • Tabbed panes • URL sharing* • Help from CXN!! • Simple >> rich • Provide a thread of continuity to lead through new tools • Don’t disrupt the performance • List query result retains original order • Cleaner Excel export, keep structure orientation• Don t disrupt the workflow, let it evolve structure orientation • More conversations! • Web services • Plexus! 22
  • 23. The DARE teamThe DARE team Heather Artman Dong Li Acknowledgements Scientific Computing Core Team Heather Artman Dawn Cohen John Duncan Dong Li Mark Manfredi Minimol Mathew Scientific Computing Services Ray Reichard Padma Vellanki Ramesh Durvasula James Ewen Lisa Johnson Christa Musial Matthias Nolte Anusha Ramanathan Padma Vellanki Thomas Curneal Mike Beluch Lisa Johnson Sangeet Khullar Anusha Ramanathan David Vanderbrooke Dana Vanderwall Nelly Masias Mahesh Nawade BMS Internal 23
  • 24. End user supportEnd user support Support email group User Community SharePoint  Training and reference, FAQs, External links, contact info  All reported issues and status All reported issues and status – [open, in progress, scheduled fix/improvement, resolved] Internal BMS User Group Meeting 1 h thl i t d t ti & t i i f 2 3 i l 1 hr. monthly session to cover demonstration & training for 2-3 special topics or features  Topics drawn from suggestions and requests for more info or training; topics covered to date:topics covered to date: – IJC: Query Builder; Visualization; Sharing by URL; exporting; working list (pick list); R group decomposition; Markush draw/search – JChem4XL- patent doc creation BMS Internal p – IBM Patent Database; Metabolite Database 24
  • 25. Assay meta dataAssay meta datayy • Describe assay protocol & conditions in controlled vocabulary Biological description  Targety • Protocols would have a minimum set of fields that would have to be populated before going into production  Gene name (look-up, and capture locus link)  Species C ll t • Opportunity for business rules that guide the protocol registration • All downstream systems would utilize th f k & t d t  Cell type Assay description  Assay type A dthe same framework & meta-data • Propose adopting established standard, aligning/collaborating with  Assay mode  Detection method Results R lt t NIH BARD Project & BioAssay Ontology (BAO) • Requires process & roles for  Result type  Modifier  Units t Requires process & roles for maintaining up to date dictionaries and governance  etc 25
  • 26. BAO scope and purposeBAO scope and purposep p pp p p • BAO to describe assays and screening results • Defines relevant assays and result annotations • Provides controlled terminology Formalizes knowledge of assays and screening results• Formalizes knowledge of assays and screening results • Describes and formalizes screening campaigns, i.e. relationship between assays in terms of their use • BAO addresses problems with using data and facilitates • Leveraging existing data in discovery projects • Global analysis across diverse data sets I t ti f d t f diff t• Integration of data from different resources 26
  • 27. What do we need to describe assaysWhat do we need to describe assaysyy 27
  • 28. 28