Data plays a big role at Tableau, not just for our customers but throughout our company. Using our own products is one of our fundamental company values, and the analyses and discoveries we make shape our development processes and influence our day-to-day decisions. In this talk, we present and analyze a variety of data visualizations based on Perforce data from our development organization and share how that analysis has influenced our infrastructure and development practices.
Using Perforce Data in Development at Tableau
1. Using Perforce Data in
Development at Tableau
Ed Mack
Staff Systems Software Engineer
Robert Orr
Systems Software Engineer
2. 2
Tableau Software
Award-winning data analytics software that helps people see and
understand data.
More than 35,000 customer accounts get rapid results with Tableau
in the office and on-the-go, and tens of thousands of people use
Tableau Public to share data in their blogs and websites.
Check out our products by downloading a free trial at
www.tableau.com/trial.
3. 3
Using Tableau to Analyze Perforce Data
We use data to help answer questions in our day-to-day
work and to help us make decisions for the future.
Perforce provides a lot of data -- beyond basic changelist
and file activity.
Let’s take a look at our data sources and the Tableau
visualizations (vizes) we’ve created for analysis.
4. 4
Data Sources
Direct calls to Perforce servers
• Current data
• Historical data
• Calls to ‘p4’ or P4Python API
P4toDB
• A read-only replica feeds a Postgres database
5. 5
Data Sources
Custom views, tables, and databases
• “ChangesByHumans” view added to P4toDB database to filter out
background users
• Join table between P4toDB and TFS
• Integrations table
• Table with server names, types, and locations
• CSV files
• Tableau Extracts and Data Sources
6. 6
P4toDB
Supported by Perforce -- no new development
since 2012.1
Lessons learned
• List tables to include rather than exclude (the default)
• Upgrade rather than rebuild (when possible)
- Metadata table is P4TODB_CFG
7. 7
Areas of Data Tracking
General health & monitoring
Codeline queries & analysis
Infrastructure planning
Historical analysis
8. 8
General Health & Monitoring
Usual hardware monitors
• Disk space, available RAM, CPU usage, etc.
Perforce-specific monitors
• Number of active processes (“p4 monitor show”)
• Long-running processes (“p4 monitor show -ale”)
• Number & age of workspaces
- Number/Growth: “p4 files //spec/client/...@date”
- Age (last accessed): “p4 clients” (P4Python)
• Replica lag (“p4 pull”)
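
The monitors above are mostly thin wrappers around `p4` output. As a minimal sketch, here is how the "p4 monitor show" checks might parse the command's output; the line format (pid, status, user, elapsed time, command) matches typical server output, but treat it as an illustrative assumption and adjust for your server version.

```python
def parse_monitor_show(output):
    """Return (pid, status, user, elapsed_seconds, command) tuples
    parsed from assumed 'p4 monitor show' output lines."""
    rows = []
    for line in output.strip().splitlines():
        parts = line.split(None, 4)
        if len(parts) < 5:
            continue
        pid, status, user, elapsed, command = parts
        h, m, s = (int(x) for x in elapsed.split(":"))
        rows.append((int(pid), status, user, h * 3600 + m * 60 + s, command))
    return rows

def long_running(rows, threshold_seconds=600):
    """Processes running longer than the threshold (candidates for alerts)."""
    return [r for r in rows if r[3] > threshold_seconds]

sample = """\
8764 R builder 00:00:09 sync
9123 R alice 00:45:02 integrate
9188 I bob 00:00:01 IDLE
"""
rows = parse_monitor_show(sample)
print(len(rows))                 # 3 processes reported
print(long_running(rows)[0][0])  # pid 9123 has been running 45 minutes
```

Counting the parsed rows gives the "number of active processes" metric, and the threshold filter drives the long-running-command notifications.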
9. 9
Replica Lag
How the process works
• Script runs every 30 seconds
- Running p4 pull -ls and p4 pull -lj commands against our
servers
• Returned data is parsed & stored in a database table
• Data is currently:
- About 9 million records, back to 06-2014
- Total size 1.4 GB
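
The parsing step of the 30-second poller above can be sketched as follows. The "p4 pull -lj" output format shown here is illustrative; the exact wording varies by server version, so check yours before relying on the regex.

```python
import re

# Assumed (illustrative) 'p4 pull -lj' state lines for replica and master.
STATE_RE = re.compile(
    r"Current (replica|master) journal state is:\s+Journal (\d+),\s+Sequence (\d+)")

def journal_lag(pull_lj_output):
    """Return (journal, byte_lag) parsed from 'p4 pull -lj' output,
    or None if the journal numbers differ (lag spans a rotation)."""
    state = {}
    for role, journal, seq in STATE_RE.findall(pull_lj_output):
        state[role] = (int(journal), int(seq))
    rj, rs = state["replica"]
    mj, ms = state["master"]
    if rj != mj:
        return None
    return rj, ms - rs

sample = """\
Current replica journal state is:  Journal 1237,  Sequence 2680510.
Current master journal state is:   Journal 1237,  Sequence 2681120.
"""
print(journal_lag(sample))  # (1237, 610)
```

Each poll stores one such record per replica, which is what accumulates into the ~9 million rows mentioned above.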
17. 17
Integration Tracking
Extensive mining of the P4ToDB database
• Tracks the path of integrations through multiple branches
• On a changelist level instead of individual file level
2-part process
18. 18
Integration Tracking: Part 1
Script runs against our p4todb database every
5 minutes, looking for new integration records.
- Ignores streams
- Generates a de-duped list of ‘from cl’, ‘from branch’, ‘to cl’,
‘to branch’ tuples, for each file in each integration
- Most integrations are small, so processing takes only a few
seconds per changelist.
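
The de-duplication step above can be sketched like this; the field names are hypothetical, but P4toDB's integration rows carry similar per-file information.

```python
def dedupe_integrations(file_records):
    """Collapse per-file integration records into unique
    (from_cl, from_branch, to_cl, to_branch) tuples, preserving order."""
    seen = set()
    tuples = []
    for rec in file_records:
        key = (rec["from_cl"], rec["from_branch"], rec["to_cl"], rec["to_branch"])
        if key not in seen:
            seen.add(key)
            tuples.append(key)
    return tuples

records = [
    {"from_cl": 100, "from_branch": "main", "to_cl": 140, "to_branch": "teamA", "file": "a.cpp"},
    {"from_cl": 100, "from_branch": "main", "to_cl": 140, "to_branch": "teamA", "file": "b.cpp"},
    {"from_cl": 101, "from_branch": "main", "to_cl": 141, "to_branch": "teamB", "file": "a.cpp"},
]
print(dedupe_integrations(records))
# [(100, 'main', 140, 'teamA'), (101, 'main', 141, 'teamB')]
```

Because most integrations touch the same small set of branch pairs, the de-duped list is tiny compared to the per-file records, which is why processing takes only seconds per changelist.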
19. 19
Integration Tracking: Part 2
A web application accepts queries for
integration changelists
• Searches backwards recursively through pre-computed records for
changelists whose ‘to cl’ is the target
• Typically, the recursive search goes back only a few levels;
sometimes it can be many.
• Response time is variable, but usable in most cases.
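
The backward search above amounts to a graph walk over the pre-computed tuples: follow every 'to cl' edge back to its 'from cl', repeatedly. A minimal sketch, with hypothetical data (the real service queries the database table):

```python
def upstream_changelists(tuples, target_cl):
    """Return the set of changelists reachable by walking
    'to_cl' -> 'from_cl' edges backwards from target_cl."""
    by_to = {}
    for from_cl, _from_branch, to_cl, _to_branch in tuples:
        by_to.setdefault(to_cl, []).append(from_cl)
    found = set()
    stack = [target_cl]
    while stack:
        cl = stack.pop()
        for src in by_to.get(cl, []):
            if src not in found:
                found.add(src)
                stack.append(src)  # keep walking further upstream
    return found

edges = [
    (100, "main", 140, "teamA"),
    (140, "teamA", 200, "teamB"),
    (150, "main", 200, "teamB"),
]
print(sorted(upstream_changelists(edges, 200)))  # [100, 140, 150]
```

The visited-set check is what keeps response time bounded when the integration history is deep or contains cycles.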
21. 21
Infrastructure Planning
Size and growth of server db files
Number of Perforce clients
• “p4 files //spec/client/*@date” (ignore deleted revisions)
License usage
• Commands
- “p4 files //spec/user/*@date” (ignore deleted revisions)
• Determine “active” users
- Working backwards, draw a decreasing graph (assumes no reduction in head
count)
32. 32
Where to go from here?
We’ve given you ideas for analyzing your own Perforce data
What are your ideas?
Hint: Identify and collect data that helps you answer
questions you have now and that’s broad enough to help
you answer new questions.
P4toDB is a great source!
SPEAKER: Ed
Tableau’s mission is to help people see and understand their data ... including our own. It is one of Tableau’s core values that each of us uses our products as part of our job.
We use data—with our products and others—to monitor our development infrastructure and processes, to discover trends and outliers, and to validate changes made to our systems and processes.
SPEAKER: Ed
- Mostly, data visualizations created in Tableau. There are other products you can use to analyze your data, including Perforce Insights.
- Tableau products make it easy to create powerful, insightful, and delightful data visualizations. But what is important is the data: asking questions of it and making decisions based on it.
- Perforce produces a lot of data, even beyond the simple questions of “How many changelists are submitted each day?” or “Who submitted the most changelists last month?”
- We believe we collect interesting Perforce data and that we analyze and present it in interesting ways. We’re going to show you what we’ve done and, hopefully, we will give you some ideas for what you might do with your data.
SPEAKER: Robert
We use several sources for Perforce data:
Direct calls to the servers
Either by using ‘p4’ calls or the P4Python API.
P4toDB
- A read-only replica of our commit server populates a Postgres database
SPEAKER: Robert
We have created custom views, tables, and databases
We’ve added views in P4toDB, like “ChangesByHumans”, to filter out background users, like our automated build user.
We created a join table between P4toDB and TFS to support our Perforce/TFS integration.
An integrations table that is used to report where a changelist has been integrated to. We will describe this in more detail later in this presentation.
A table for server names, types, and locations is used in our system health monitoring.
We use a few CSV files for selected data, populated by ‘p4’ or P4Python.
A Tableau Extract is a compressed snapshot of data stored on disk or in memory. It is a database optimized for Tableau vizes.
SPEAKER: Robert
- There has been no new P4toDB development since 2012.1; Perforce Insights may be intended to replace it.
- We had to rebuild the database from scratch several times, taking a few days to complete.
- The default configuration lists tables to exclude, which will halt the process if an upgrade adds new tables. We recommend instead listing the tables you want included.
- The metadata table in P4toDB is P4TODB_CFG. The “TABLES” column contains the list of replicated tables, with the version-level of each table.
SPEAKER: Ed
General Health and Monitoring
This is the usual monitoring of disk space, CPU & RAM usage, etc., along with some custom monitors: replica lag, number of active commands, and notification of long-running commands.
Codeline Queries & Analysis
- Velocity: changes/day, lines changed/day, etc.
- What has been integrated where
Planning
- Projecting growth in data or license usage
Historical Analysis
- Analysis of past trends
- Analysis of past decisions
SPEAKER: Ed
We use Zabbix to monitor the health of our systems.
This includes the usual monitoring of disk space, CPU & RAM usage, etc. and some custom monitors:
replica lag
number of active commands
notification of long-running commands
SPEAKER: Robert
After deploying forwarding replicas, we periodically experienced replica lag, especially with our most geographically remote development office. For this data source, we capture the output of “p4 pull -lj” and “p4 pull -ls” in 30-second intervals, for each replica and edge server. This data is used more for historical analysis than for real-time monitoring, but it did influence our decision to deploy edge servers.
SPEAKER: Robert
IMAGE OR VIZ: replica-lag-1
SPEAKER: Robert
IMAGE OR VIZ: replica-lag-2
Largest integration lag since we started recording lag data.
SPEAKER: Ed
What’s in a branch?
- Analysis of third-party vs. non-third-party content in a branch.
Branch changelists & planning
- Planning retirement of branches, moving toward a single mainline.
Edited backports
- Tracking backports that may lead to code loss.
Who knows the code?
- A viz that helps people find code experts.
Integration tracking
- Reporting where changelists have integrated into downstream branches.
SPEAKER: Ed
VIZ OR IMAGE: What’s in a branch?
- Breakdown of major code modules, by number of files, in a product branch.
- 3rd-party components are integrated into product branches.
- Question: “How much of a branch is populated by third-party files, compared to other product components?”
- Answer: More than half the branch is made up of 3rd-party files.
- Revision Graph: shows one revision of one unchanged component version; nothing but integrations.
- Lower-right shows relative number of changes to 3rd-party vs. product code.
- This dashboard helped motivate us to move 3rd-party files from product branches to a shared dependency scheme, reducing branch size—saving disk space and sync time.
SPEAKER: Ed
VIZ OR IMAGE: Team Branch Breakdown
Tableau Development is moving toward working on a single mainline branch. We have been retiring “team” branches, combining teams into the remaining branches. One of the factors in deciding which branch is retired next, and where its developers will end up, is the changelist activity in each branch.
In this viz, we can see that “Branch E” is the least active branch, with an average of 13.3 changelists per day. And it looks like “Branch D” may be a good candidate to receive the bulk of those developers.
SPEAKER: Ed
VIZ OR IMAGE: Edited Backports
We have a problematic integration pattern that we must monitor. It stems from a few practices that may be unusual: 1) we have multiple “team” branches as children of “main”; 2) we use “null integrations” (a resolve accepting “yours”) when a fix doesn’t apply to the next downstream branch; and 3) developers sometimes backport from a team branch, creating a parallel merge path that can be a problem if the null integration reaches the mainline first.
To monitor this issue, we created a dashboard that uses the INTEGED table data from P4toDB to report backports, which are matched simply by from/to depot path patterns.
When a new one appears, we investigate and enter the details in a spreadsheet which feeds into the dashboard.
SPEAKER: Robert
VIZ OR IMAGE: Who knows about X?
This dashboard accepts a file or directory pattern and returns a list of people who have worked on the matching items. One can filter by the age of the changes, by file extension, and by branch.
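
The query behind this dashboard is essentially an aggregation over change records. A minimal sketch, with a hypothetical record shape of (user, depot path, year) and the pattern and extension filters mentioned above:

```python
from collections import Counter
from fnmatch import fnmatch

def code_experts(changes, path_pattern, extension=None, since_year=None):
    """Rank users by number of changes touching files that match the
    pattern, optionally filtered by file extension and change age."""
    counts = Counter()
    for user, path, year in changes:
        if not fnmatch(path, path_pattern):
            continue
        if extension and not path.endswith("." + extension):
            continue
        if since_year and year < since_year:
            continue
        counts[user] += 1
    return counts.most_common()

changes = [
    ("alice", "//depot/main/render/draw.cpp", 2014),
    ("alice", "//depot/main/render/draw.h", 2014),
    ("bob",   "//depot/main/render/draw.cpp", 2012),
    ("carol", "//depot/main/ui/panel.cpp", 2014),
]
print(code_experts(changes, "//depot/main/render/*"))
# [('alice', 2), ('bob', 1)]
```

A branch filter works the same way as the extension filter; the dashboard just exposes these as interactive controls over the underlying table.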
SPEAKER: Robert
This also connects to TFS through Perforce triggers that capture TFS item IDs from the changelist description and store them in a database, allowing us to display the changelists and integrations associated with a given TFS item.
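
The ID-capture step in the trigger can be sketched with a regular expression. The “TFS12345” token format below is a hypothetical convention for illustration; match whatever convention your changelist descriptions actually use.

```python
import re

# Hypothetical work-item token format: 'TFS' followed by digits.
TFS_ID_RE = re.compile(r"\bTFS(\d+)\b")

def tfs_ids(description):
    """Return all TFS work-item IDs referenced in a changelist description."""
    return [int(m) for m in TFS_ID_RE.findall(description)]

desc = "Fix crash on resize. TFS4821, also closes TFS4990."
print(tfs_ids(desc))  # [4821, 4990]
```

The trigger stores each (changelist, item ID) pair, which is what lets the dashboard join Perforce activity to TFS items.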
SPEAKER: Ed
Size and growth of server db files
- From an offline instance that is rebuilt daily
Number of Perforce clients
- Uses the spec depot history
License Usage
- Also uses the spec depot history
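
Both spec-depot counts above come down to counting “p4 files” output lines while skipping deleted revisions. A minimal sketch; the output line format below matches typical “p4 files” output, but verify it against your server.

```python
import re

# Assumed 'p4 files' line shape: //path#rev - action change N (type)
FILES_RE = re.compile(r"^(\S+)#\d+ - (\S+) change \d+")

def count_live_specs(p4_files_output):
    """Count spec revisions whose head action is not a delete."""
    live = 0
    for line in p4_files_output.strip().splitlines():
        m = FILES_RE.match(line)
        if m and "delete" not in m.group(2):
            live += 1
    return live

sample = """\
//spec/client/ws1.p4s#3 - edit change 1201 (text)
//spec/client/ws2.p4s#1 - add change 1188 (text)
//spec/client/old.p4s#5 - delete change 1150 (text)
"""
print(count_live_specs(sample))  # 2
```

Running this against “//spec/client/*@date” (or “//spec/user/*@date”) for a series of dates yields the growth curves used in the planning vizes.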
SPEAKER: Ed
VIZ OR IMAGE: db file size
- Files can be filtered out, to change the vertical axis scaling
- We can see when clients were deleted or moved to an edge server.
- Does anyone know what caused the big drop in db.have data?
SPEAKER: Ed
VIZ OR IMAGE : db-size-change
- The change in size by percentage
- I use this viz to spot unusual changes.
- Note where we migrated labels from traditional to “smart” labels.
SPEAKER: Ed
VIZ OR IMAGE : clients-by-server
Here you can see a split between our commit server, two build farm edge servers, and an end-user edge server. As we migrate more users to edge servers, we will see the commit server count decrease.
SPEAKER: Ed
VIZ OR IMAGE : clients-by-user
In this viz, we can filter by server, by location, and/or by last access date.
SPEAKER: Ed
VIZ OR IMAGE: active-license-growth
Using the spec history, we create charts of the number of users, using different timelines to generate different forecasts.
We use Tableau’s forecasting analysis to help determine how many new licenses we need for the next quarter. Tableau has been growing fast, and it has been challenging to stay ahead of that growth without having to order new licenses mid-quarter.
While building this viz, I discovered a lot of ups and downs in the graphs, even though Tableau was only growing. The drops reflected times when we were close to our license limit, discovered licenses that weren’t being used, and deleted them. Because those licenses were never really in use, we created an algorithm that eliminates them from the data generated by querying the spec depot.
The resulting data reflects “active” users, and the lines only increase. Surprisingly, the forecasted line was almost identical to the one from the raw data, but the confidence bands for the “active” user visualization are tighter. Note that this “active” algorithm only works while head count is growing.
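
The “active users” cleanup described above can be sketched as a backward pass over the observed counts: walking from the latest point backwards, cap each value at the value that follows it. Drops caused by deleting unused licenses disappear, leaving a non-decreasing series. This is a sketch under the stated assumption that head count never actually shrinks.

```python
def active_users(observed):
    """Backward pass: active[i] = min(observed[i], active[i+1]),
    producing a non-decreasing 'active users' series."""
    active = list(observed)
    for i in range(len(active) - 2, -1, -1):
        active[i] = min(active[i], active[i + 1])
    return active

# Illustrative monthly license counts with cleanup-induced dips.
observed = [410, 430, 425, 440, 455, 450, 470]
print(active_users(observed))  # [410, 425, 425, 440, 450, 450, 470]
```

The smoothed series feeds the forecast, which is why its confidence bands come out tighter than those of the raw counts.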
SPEAKER: Robert
Changelist History
- A dashboard that displays changelist history in different ways
Edge Server Migration
- A look at the data behind our migration to edge servers
SPEAKER: Ed
VIZ OR IMAGE: changelist-history
This was my first Perforce viz. I originally built it from a CSV file, using Perl to format the output of “p4 changes”; now it’s backed by our P4toDB database.
It’s a basic dashboard, but selecting marks in any one pane immediately filters the others, so it’s fun and interesting to play with.
SPEAKER: Ed
VIZ OR IMAGE: edge-server-arrival-1
This viz shows the typical replica lag we experienced before we had an edge server, and when we didn’t have very many build machines syncing against our servers. We had forwarding replicas, but all were referencing a single commit server.
Our Zabbix notification threshold was 100MB for five minutes. You can see the threshold line in the visualization. Our Fremont replica experienced occasional spikes and Menlo Park—a more distant office—experienced these more frequently, but they usually cleared fairly quickly.
SPEAKER: Ed
VIZ OR IMAGE: edge-server-arrival-2
The second viz reflects a doubling of the number of build machines. Many “syncs” were into fresh workspaces (and our CI app would frequently refresh workspaces using “sync -f”).
You can see that the number of spikes has increased at all locations, with the Menlo Park office suffering from serious and extended replica lag. The largest gap lasted almost an hour.
We deployed an edge server within a few days ....
SPEAKER: Ed
VIZ OR IMAGE: edge-server-arrival-3
This is what lag looked like after we deployed the build farm edge server. You can see that over this 24-hour period, neither Seattle nor Kirkland had lag that spiked above the threshold, and although Menlo Park’s server had broken the threshold, the spikes were fewer and smaller than before we added the new build machines.
SPEAKER: Ed
We showed you some of what we have done with Perforce data. We hope we have given you some ideas for what you might do with your data.
One thing we suggest is to collect a broad enough set of data not only to answer the questions you have now, but to allow you to ask new questions of the data.
P4toDB has been a great data source for us. We—who manage the Perforce servers—aren’t the only ones using this data. Many others in Development have created visualizations to answer their questions.