Publicité
Publicité

Contenu connexe

Publicité

High Performance Computing for LiDAR Data Production

  1. March 3, 2010 High Performance Computing for LiDAR Data Production
  2. PRE0184 Processor Technologies and Speeds
  3. PRE0184  Requirements  64-bit compatible hardware  64-bit operating system  64-bit designed software (not the same as 32-bit software that will run on a 64-bit computer)  Benefits  Access to larger amounts of memory, 32-bit OS can only read 4 GB of RAM (maximum accessible RAM is closer to 3.4 GB due to other hardware limitations)  More data can be read in at once  More RAM space available for multiple processes requiring memory space  More stable  Less memory management needed to prevent out of memory errors  32-bit operating systems limit the kernel-mode virtual address space to 2 GB, the 64-bit limit is 8 TB 64-bit Computing
  4. PRE0184 CPU Trends
  5. PRE0184 Use of Multiple Processors and Large Amounts of RAM
  6. PRE0184 Multi-Core / Multi-Threaded Processing 00:00:00 00:28:48 00:57:36 01:26:24 01:55:12 02:24:00 02:52:48 03:21:36 03:50:24 04:19:12 04:48:00 04:31:59 00:25:14 HH:MM:SS MARS Auto-Filter Benchmark Results Single Processor Multi Processor Multiprocessing with 16 processors yielding a 91% time savings
  7. PRE0184 Multi-Processing Results (single large file vs. small tiles) 00:00:00 00:02:53 00:05:46 00:08:38 00:11:31 00:14:24 00:17:17 00:20:10 00:23:02 00:21:33 00:04:27 00:15:04 00:02:49 HH:MM:SS MARS Grid Export Multi-Processing Benchmark Results Single File Export Single Processor Single File Export Multi Processor Tiled Export Single Processor Tiled Export Multi Processor Multiprocessing with 16 processors yielding a 79% time savings for single file export Multiprocessing with 16 processors yielding an 81% time savings for tiled export
  8. PRE0184 Multi-Processing # of CPUs Performance Curve 00:00:00 00:14:24 00:28:48 00:43:12 00:57:36 01:12:00 01:26:24 01:40:48 01:55:12 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 HH:MM:SS Number of Processors MARS Multi-Processing Times (Runtime per # of CPUs) Processor Intensive = High Processor Intensive = Medium
  9. PRE0184  CPUs are designed to handle a variety of different applications and operating system needs (currently maxed at 16 cores)  GPUs were originally designed to handle video rendering and screen display  GP-GPUs can be used for massively parallel processing with the use of SDKs like NVIDIA’s CUDA (Compute Unified Device Architecture)  GP-GPU processing system enhancements can be accomplished by the use of one or more graphics cards in a workstation or rack mounted GPU server clusters (thousands of processor cores) General-Purpose Graphics Processing Unit (GP-GPU)
  10. PRE0184 CPU vs. GPU Speeds Source: NVIDIA webinar: http://www.nvidia.com/content/webinar/Tesla_Fermi_Webinar_Jan13_10_v1_1.pdf August 24, 2009 - Nvidia’s CEO predicted that GPU computing will experience a rapid performance boost over the next six years. According to Jen-Hsun Huang, GPU computing is likely to increase its current capabilities by 570x, while 'pure' CPU performance will progress by a limited 3x. This would require tripling the speed of the GPU every year.
  11. PRE0184 Internal Computer Hard Drives  Technologies  Solid state (no moving parts)  Pros  Faster start up  Faster read/write  Fragmentation has little effect  Silent operations  More reliable  Can endure shock, high altitude, extreme temperatures  Cons (currently)  Lower capacities  More expensive  Spinning (most widely used)  Typical types  SAS (Serial Attached SCSI) – successor to SCSI  SATA (Serial AT Attachment) – successor to ATA  Performance  SAS – faster 15,000 RPMs  SATA – slower 9,200 RPMs  Size  SAS – up to 500 GB  SATA – up to 2 TB  RAID levels  10 (RAID 1+0) – highest performance  50 (RAID 5+0) - larger space, more redundancy  Connector types  iSCSI – quick install, less hardware  Fiber – complex install, more hardware
  12. PRE0184 Writing Data to Temporary Local Drive Space 00:00:00 00:28:48 00:57:36 01:26:24 01:55:12 02:24:00 02:52:48 03:21:36 03:50:24 04:19:12 04:48:00 04:22:59 00:55:27 00:47:47 HH:MM:SS MARS Grid Export Testing Network and Local Drive I/O Export to grid across network (input and output on network) Export to grid across network (input on network, output to local drive temp space and then moved to network) Export to grid with system drive (input and output on local drive) 80% export time savings writing to local drive then moving product to network drive
  13. PRE0184 High Speed Local Area Networks (LAN) 00:00:00 00:00:43 00:01:26 00:02:10 00:02:53 00:03:36 00:03:30 00:02:08 HH:MM:SS Gigabit vs. 10 Gigabit Ethernet Network File Copy Times Gigabit 10 Gigabit 22.66 GB of varying files sized from 6 kb to 18 GB (files read and written using Windows Server 2008 R2) 39% disc I/O time savings using 10 Gbps as compared to 1 Gbps
  14. PRE0184 Disc I/O Improvements in Windows Operating Systems 00:00:00 00:00:43 00:01:26 00:02:10 00:02:53 00:03:36 00:04:19 00:05:02 00:05:46 00:05:17 00:01:56 00:01:36 HH:MM:SS File Copy Times For Windows Operating System Windows XP (Windows 2003) Windows Vista (Windows 2008) Windows 7 (Windows 2008 R2) 70% disc I/O time savings using Windows 7 as compared to Windows XP 4.66 GB of varying files sized from 6 kb to 1.2 GB
  15. PRE0184  Network – 10GE  Processing server(s)  Fast processor technology – Nehalem microarchitecture  Multi-processor CPUs, 8 or more cores  Fast local HDD (does not have to be huge)  GP-GPU cluster  File server(s)  Scalable  Tiered storage for best performance  SSD – for temporary, unfragmented files  SAS – for fast processing  SATA – for large storage Server-side Processing
  16. PRE0184  Distributed Processing  Typically designed to work within a LAN environment  Highly scalable  Scheduled processes  Harvest free CPU clock cycles  Very configurable  Resource limits  Priorities  Time limits  Cloud Computing  Shared computer processing resources via the Internet by renting usage from a third-party provider  Data and software is usually stored on remote servers  Key features Clustering  Agility  Cost  Device and location independence  Multi-tenancy  Reliability  Scalability  Security  Maintenance  Metering  High performance Source: Wikipedia http://en.wikipedia.org/wiki/Cloud_computing
  17. PRE0184 Technologies Recommendation Processor microarchitecture Nehalem 32/64 bit 64-bit* Multi-processing Multi-Core / Multi-Thread* Number of CPUs 8 or more* Amount of RAM 8GB or more* Beyond CPU processing GP-GPU processing* (higher end Nvidia card is worth the money) Internal Hard Drives SAS with iSCSI connection (SSD when price drops and size increases) Read/Write “trick” File server  local internal HDD  file server* LAN 10 Gigabit Ethernet (10GE) Operating System (if using Windows) Windows 7 / Server 2008 R2* Processing architecture Server-side Clustering Distributed processing or Cloud computing* Summary * If software supports this functionality
  18. PRE0184 Thank you! Any questions?
Publicité