2. Content
The data is in my head!
Unsafe Ad hoc data loads
Insert->Verify->Publish->Insight
Implementing the cycle in Pentaho
Customization is required
Challenges
3. The Data Is In My Head
Data Sources
Providing Data
Driving Analysis
Gaining Insight
4. Common Ad Hoc Data Use Case
User Mastered Data
Excel Sheet of Right Format
Placed in Right Folder
Placed at Right Time
Picked up by Batch Job
Loaded into Warehouse Overnight
Used to run Batch Canned Reports
[Figure labels: 500KG, 100KG, 50KG]
Accident waiting to happen
6. Insert: the process
Steps: Select file to insert, Upload file, ETL to staging, Log new upload, Run report
Stores: Staging Area, Batch Log
XML config
•File Type
•To Staging ETL
•To Publish ETL
•To Unpublish ETL
•Verify Report
•Final Report
•(Batch Log JNDI)
Upload xAction: File Uploader
Insert and Report xAction: PDI, Report
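The XML config on this slide might look something like the sketch below. The deck only names the fields, not their syntax, so every element name, attribute name, and file name here is a hypothetical stand-in, as is the `load_config` helper. Python is used only to keep the sketch short and runnable.

```python
import xml.etree.ElementTree as ET

# Hypothetical config: one <fileType> entry per kind of uploadable file.
# All element, attribute, and file names are illustrative assumptions.
SAMPLE_CONFIG = """
<uploadConfig batchLogJndi="BatchLog">
  <fileType name="sales_targets">
    <toStagingEtl>load_targets_to_staging.ktr</toStagingEtl>
    <toPublishEtl>publish_targets.ktr</toPublishEtl>
    <toUnpublishEtl>unpublish_targets.ktr</toUnpublishEtl>
    <verifyReport>verify_targets.prpt</verifyReport>
    <finalReport>targets_final.prpt</finalReport>
  </fileType>
</uploadConfig>
"""

def load_config(xml_text):
    """Return the batch log JNDI name and {file type: {setting: value}}."""
    root = ET.fromstring(xml_text)
    file_types = {}
    for ft in root.findall("fileType"):
        # Each child element becomes one setting for this file type.
        file_types[ft.get("name")] = {child.tag: child.text.strip() for child in ft}
    return root.get("batchLogJndi"), file_types

jndi, file_types = load_config(SAMPLE_CONFIG)
print(jndi, file_types["sales_targets"]["toStagingEtl"])
```

Keeping one entry per file type means a new kind of upload needs only a new config entry plus its ETL and report files.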
7. File Uploader
File Upload Service
•File to Upload
•Target Web Page
•Redirect page to Target
•All parameters repeated
xAction
Apache FileUpload library
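The "redirect to target, all parameters repeated" behaviour can be sketched as below. This covers only the redirect step, not the multipart parsing that the Apache FileUpload library handles in the deck's Java setting. The target path, the `uploadedFile` parameter name, and the `build_redirect_url` helper are hypothetical.

```python
from urllib.parse import urlencode

def build_redirect_url(target_page, form_params, uploaded_path):
    """Build the redirect URL, repeating every incoming form parameter.

    `uploadedFile` is a hypothetical name for the stored file's path.
    """
    params = dict(form_params)
    params["uploadedFile"] = uploaded_path
    # Append to an existing query string if the target already has one.
    separator = "&" if "?" in target_page else "?"
    return target_page + separator + urlencode(params)

url = build_redirect_url(
    "/target?action=insert.xaction",            # hypothetical target page
    {"fileType": "sales_targets", "user": "jane"},
    "/tmp/uploads/targets.xls",
)
print(url)
```

Repeating the parameters lets the target xAction see the same request context as the upload form, plus the location of the stored file.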
8. Publish and Unpublish
Steps: Select batch, Update batch log, ETL staging to DW, Run report
Stores: DW, Batch Log
XML config
•File Type
•To Staging ETL
•To Publish ETL
•To Unpublish ETL
•Verify Report
•Final Report
•(Batch Log JNDI)
Publish xAction: PDI, Report
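A rough sketch of publish and unpublish against a batch log, using an in-memory SQLite database in place of the real staging area and warehouse. The table names, columns, and status values are assumptions made for illustration. In the deck the work is done by PDI transformations, not by Python.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE batch_log (batch_id INTEGER PRIMARY KEY, status TEXT);
    CREATE TABLE staging (batch_id INTEGER, item TEXT, qty INTEGER);
    CREATE TABLE dw (batch_id INTEGER, item TEXT, qty INTEGER);
    INSERT INTO batch_log VALUES (1, 'UPLOADED');
    INSERT INTO staging VALUES (1, 'widget', 500), (1, 'gadget', 100);
""")

def publish(conn, batch_id):
    """Copy one staged batch into the warehouse and mark it published."""
    with conn:  # commits on success, rolls back on error
        conn.execute(
            "INSERT INTO dw SELECT batch_id, item, qty FROM staging WHERE batch_id = ?",
            (batch_id,),
        )
        conn.execute(
            "UPDATE batch_log SET status = 'PUBLISHED' WHERE batch_id = ?", (batch_id,)
        )

def unpublish(conn, batch_id):
    """Remove a batch from the warehouse; the staged copy is kept."""
    with conn:
        conn.execute("DELETE FROM dw WHERE batch_id = ?", (batch_id,))
        conn.execute(
            "UPDATE batch_log SET status = 'UPLOADED' WHERE batch_id = ?", (batch_id,)
        )

publish(conn, 1)
published_rows = conn.execute("SELECT COUNT(*) FROM dw").fetchone()[0]
unpublish(conn, 1)
remaining_rows = conn.execute("SELECT COUNT(*) FROM dw").fetchone()[0]
print(published_rows, remaining_rows)
```

Tagging every row with its batch id is what makes unpublish a simple delete by batch.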
9. Challenges
Parameters from XML configuration file
– Examples online
File Upload Service
– Actually not challenging
Passing parameters from xAction into PDI
– Some workarounds
• Using a parameter in the Table Input select
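One reading of the Table Input workaround is that the value handed over by the xAction is substituted into the select statement before it runs. The sketch below imitates that with a `${BATCH_ID}` placeholder. The query, the variable name, and the `render_select` helper are hypothetical, and the sketch does not call any Pentaho API.

```python
from string import Template

# Hypothetical select as it might be typed into a Table Input step;
# ${BATCH_ID} marks where the value passed from the xAction goes.
SELECT_TEMPLATE = Template(
    "SELECT item, qty FROM staging WHERE batch_id = ${BATCH_ID}"
)

def render_select(batch_id):
    """Substitute the parameter, forcing it to an integer first."""
    # int() rejects anything that is not a plain number, which guards
    # against SQL fragments being smuggled in through the parameter.
    return SELECT_TEMPLATE.substitute(BATCH_ID=int(batch_id))

sql = render_select("42")
print(sql)
```

Because the value is pasted into the SQL text rather than bound as a query parameter, validating it before substitution matters.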
10. Summary
End users to insert data
Instant feedback and verification
Greater flexibility in analysis
All within a reporting dashboard
Insert->Verify->Publish->Insight