PRE0184
Requirements
64-bit compatible hardware
64-bit operating system
64-bit designed software (not the same as 32-bit software that will run
on a 64-bit computer)
Benefits
Access to larger amounts of memory, 32-bit OS can only read 4 GB of
RAM (maximum accessible RAM is closer to 3.4 GB due to other
hardware limitations)
More data can be read in at once
More RAM space available for multiple processes requiring memory space
More stable
Less memory management needed to prevent out of memory errors
32-bit operating systems limit the kernel-mode virtual address space to 2
GB, the 64-bit limit is 8 TB
64-bit Computing
PRE0184
Multi-Core / Multi-Threaded Processing
00:00:00
00:28:48
00:57:36
01:26:24
01:55:12
02:24:00
02:52:48
03:21:36
03:50:24
04:19:12
04:48:00
04:31:59
00:25:14
HH:MM:SS
MARS Auto-Filter Benchmark Results
Single Processor Multi Processor
Multiprocessing with
16 processors yielding
a 91% time savings
PRE0184
Multi-Processing Results (single large file vs. small tiles)
00:00:00
00:02:53
00:05:46
00:08:38
00:11:31
00:14:24
00:17:17
00:20:10
00:23:02
00:21:33
00:04:27
00:15:04
00:02:49
HH:MM:SS
MARS Grid Export Multi-Processing Benchmark Results
Single File Export Single Processor Single File Export Multi Processor
Tiled Export Single Processor Tiled Export Multi Processor
Multiprocessing with 16 processors yielding
a 79% time savings for single file export
Multiprocessing with 16 processors yielding
an 81% time savings for tiled export
PRE0184
Multi-Processing # of CPUs Performance Curve
00:00:00
00:14:24
00:28:48
00:43:12
00:57:36
01:12:00
01:26:24
01:40:48
01:55:12
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16
HH:MM:SS
Number of Processors
MARS Multi-Processing Times (Runtime per # of CPUs)
Processor Intensive = High
Processor Intensive = Medium
PRE0184
CPUs are designed to handle a variety of different
applications and operating system needs (currently
maxed at 16 cores)
GPUs were originally designed to handle video
rendering and screen display
GP-GPUs can be used for massively parallel
processing with the use of SDKs like NVIDIA’s
CUDA (Compute Unified Device Architecture)
GP-GPU processing system enhancements can be
accomplished by the use of one or more graphics
cards in a workstation or rack mounted GPU server
clusters (thousands of processor cores)
General-Purpose Graphics Processing Unit (GP-GPU)
PRE0184
CPU vs. GPU Speeds
Source: NVIDIA webinar: http://www.nvidia.com/content/webinar/Tesla_Fermi_Webinar_Jan13_10_v1_1.pdf
August 24, 2009 - Nvidia’s CEO predicted that GPU computing will experience a rapid performance boost over the next six
years. According to Jen-Hsun Huang, GPU computing is likely to increase its current capabilities by 570x, while 'pure'
CPU performance will progress by a limited 3x. This would require tripling the speed of the GPU every year.
PRE0184
Internal Computer Hard Drives
Technologies
Solid state (no moving parts)
Pros
Faster start up
Faster read/write
Fragmentation has little effect
Silent operations
More reliable
Can endure shock, high altitude, extreme temperatures
Cons (currently)
Lower capacities
More expensive
Spinning (most widely used)
Typical types
SAS (Serial Attached SCSI) – successor to SCSI
SATA (Serial AT Attachment) – successor to ATA
Performance
SAS – faster 15,000 RPMs
SATA – slower 9,200 RPMs
Size
SAS – up to 500 GB
SATA – up to 2 TB
RAID levels
10 (RAID 1+0) – highest performance
50 (RAID 5+0) - larger space, more redundancy
Connector types
iSCSI – quick install, less hardware
Fiber – complex install, more hardware
PRE0184
Writing Data to Temporary Local Drive Space
00:00:00
00:28:48
00:57:36
01:26:24
01:55:12
02:24:00
02:52:48
03:21:36
03:50:24
04:19:12
04:48:00
04:22:59
00:55:27
00:47:47
HH:MM:SS
MARS Grid Export Testing Network and Local Drive I/O
Export to grid across network (input and output on network)
Export to grid across network (input on network, output to local drive temp space and then moved to network)
Export to grid with system drive (input and output on local drive)
80% export time savings
writing to local drive then
moving product to
network drive
PRE0184
High Speed Local Area Networks (LAN)
00:00:00
00:00:43
00:01:26
00:02:10
00:02:53
00:03:36
00:03:30
00:02:08
HH:MM:SS
Gigabit vs. 10 Gigabit Ethernet Network File Copy Times
Gigabit 10 Gigabit
22.66 GB of varying files sized
from 6 kb to 18 GB (files read
and written using Windows
Server 2008 R2)
39% disc I/O time savings
using 10 Gbps as
compared to 1 Gbps
PRE0184
Disc I/O Improvements in Windows Operating Systems
00:00:00
00:00:43
00:01:26
00:02:10
00:02:53
00:03:36
00:04:19
00:05:02
00:05:46
00:05:17
00:01:56 00:01:36
HH:MM:SS
File Copy Times For Windows Operating System
Windows XP (Windows 2003) Windows Vista (Windows 2008) Windows 7 (Windows 2008 R2)
70% disc I/O time savings
using Windows 7 as
compared to Windows XP
4.66 GB of varying files
sized from 6 kb to 1.2 GB
PRE0184
Network – 10GE
Processing server(s)
Fast processor
technology – Nehalem
microarchitecture
Multi-processor CPUs,
8 or more cores
Fast local HDD (does
not have to be huge)
GP-GPU cluster
File server(s)
Scalable
Tiered storage for best
performance
SSD – for temporary,
unfragmented files
SAS – for fast
processing
SATA – for large
storage
Server-side Processing
PRE0184
Distributed Processing
Typically designed to work within a LAN environment
Highly scalable
Scheduled processes
Harvest free CPU clock cycles
Very configurable
Resource limits
Priorities
Time limits
Cloud Computing
Shared computer processing resources via the Internet by
renting usage from a third-party provider
Data and software is usually stored on remote servers
Key features
Clustering
Agility
Cost
Device and location
independence
Multi-tenancy
Reliability
Scalability
Security
Maintenance
Metering
High performance
Source: Wikipedia http://en.wikipedia.org/wiki/Cloud_computing
PRE0184
Technologies Recommendation
Processor microarchitecture Nehalem
32/64 bit 64-bit*
Multi-processing Multi-Core / Multi-Thread*
Number of CPUs 8 or more*
Amount of RAM 8GB or more*
Beyond CPU processing
GP-GPU processing*
(higher end Nvidia card is worth the money)
Internal Hard Drives
SAS with iSCSI connection
(SSD when price drops and size increases)
Read/Write “trick” File server local internal HDD file server*
LAN 10 Gigabit Ethernet (10GE)
Operating System (if using Windows) Windows 7 / Server 2008 R2*
Processing architecture Server-side
Clustering Distributed processing or Cloud computing*
Summary
* If software supports this functionality