Game Worlds from Polygon Soup: Visibility, Spatial Connectivity and Rendering

2. Game Worlds from Polygon Soup: Visibility, Spatial Connectivity and Rendering

3. Talk Outline Game Environment Previous Approach New Approach Implementation Demo

4. Halo Reach

5. More than pretty pixels Path-finding Collision/Physics Visibility Spatial Connectivity Rendering AI perception Activation Object Attachment Audibility Caching/Paging

6. More than pretty pixels Path-finding Collision/Physics Visibility Spatial Connectivity Rendering AI perception Activation Object Attachment Audibility Caching/Paging

8. Potentially Visible Sets (PVS)

9. Occluderrasterization

10. Software rasterization

11. Hardware occlusion queries

12. GPGPU solutions

13. Spatial Connectivity

14. Watershed Transform

16. Pros Unified visibility/spatial connectivity Precise spatial decomposition Inside/outside test Great for indoor spaces with natural portals

17. Cons Manual portalization is non-trivial! Watertightness is painful for content authoring Force early level design decision Optimized for indoor scene only.

18. Portalization Example

19. Polygon Soup!

20. Polygon Soup Just pieces jammed together No water-tightness No manual portals Incremental construction/fast iteration. Allow late design changes

21. General Idea Sub-divide the scene Voxelize the subdivided volume Segment the voxels into regions Build a connectivity graph between regions Build simplified volumes from voxel regions

22. 2D Example –Path Finding Input Scene [Recast Library: MikkoMononenhttp://code.google.com/p/recastnavigation/]

23. 2D Example –Path Finding Voxelization

24. 2D Example –Path Finding “Walk-able” voxels

25. 2D Example –Path Finding Distance Field

26. 2D Example –Path Finding 2D Watershed Transform

27. 2D Example –Path Finding Contour

28. 2D Example –Path Finding NavMesh

29. 3D Watershed Bungie & Zhe Jiang UniversityZhefengWu, Xinguo Liu

30. 3D Watershed Transform

31. Problems 3D is considerably harder/slower Over-segmentation (small regions) Sensitive to scene changes Simplified representation non-trivial What about visibility?

32. Collaboration with Umbra Automatic portal generation Incremental/local updates CPU based solution, low latency Same solution for visibility and spatial connectivity Handle doors and elevators Precise around user placed portals Fast run time / low memory fooprint

33. Umbra Solution Polygon soup Automatic cell and portal generation Preprocess Visibility and connectivity queries Runtime

34. Preprocess Overview Polygon soup Automatic cell and portal generation Preprocess Visibility and connectivity queries Runtime

36. Determine voxel connectivity with respect to input geometry

37. Propagate connectivity to find connected components

39. Localizes computation

40. Distributed computing

42. Subdivide to discretization level

43. Skip empty space

46. Connect voxels

47. Local connected components represent view cells

49. Connect voxels

52. Connect voxels

55. Connect voxels

58. Connect voxels

61. Connect voxels

64. Connect voxels

67. Connect voxels

70. Build voxel connections

73. Build voxel connections

77. Runtime algorithms are graph traversals

80. Connectivity queries are just graph traversals

81. Examples:

82. Find connected region (region == set of view cells)

83. Find shortest 3D path

84. Intersection queries

85. Ray casts

86. Combinations: Ray cast -> connected region -> objects in region

88. Axis aligned portals enable effective algorithms

89. From point visibility queries

90. From region visibility queries

91. Volumetric visibility

93. Computation time is directly related to culling efficiency

94. Every solution is useful

95. Sampling based visibility solvers can take long time to converge

96. Additional use cases:

97. Identify visibility hotspots

98. Cull always hidden triangles

100. Recursive algorithm does not scale

101. Many paths to one cell – combinatorial explosion

102. Rasterization based approach

103. BSP-style front-to-back traversal

104. Update coverage buffer on entry and exit

106. Halo Reach Game Loop

107. Halo Reach Game Loop Coarse-grain parallelism System on a thread

108. Halo Reach Game Loop Coarse-grain parallelism System on a thread Explicit synchronization through state mirroring

109. Halo Reach Game Loop Coarse-grain parallelism System on a thread Explicit synchronization through state mirroring Mostly manual load-balancing

110. Halo Reach Game Loop Simulation loop @ 30 hz HW Thread 0 Job kernel HW Thread 1 HW Thread 2 Render loop @ 30 hz HW Thread 3 Audio loop HW Thread 4 Job kernel, debug logging Async tasks, I/O, misc. HW Thread 5

113. Halo Reach: Simulation Thread (MP) Simulation loop: 75-100% HW Thread 0 Job kernel: 20-30% HW Thread 1 HW Thread 2 Render loop: 70-100% HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

114. Halo Reach: Simulation Thread (MP) Simulation loop: 75-100% HW Thread 0 Object Update Job kernel: 20-30% HW Thread 1 HW Thread 2 Render loop: 70-100% HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

115. Halo Reach: Simulation Thread (MP) Simulation loop: 75-100% HW Thread 0 Object Update Havok Update Obj move Job kernel: 20-30% HW Thread 1 HW Thread 2 Render loop: 70-100% HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

118. Halo Reach: Simulation Thread (MP) Simulation loop: 75-100% HW Thread 0 Object Update Havok Update Obj move Job kernel: 20-30% HW Thread 1 game state mirror frame is published for rendering HW Thread 2 Render loop: 70-100% HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

119. Halo Reach: Render Thread (MP) Simulation loop: 75-100% HW Thread 0 Object Update Havok Update Obj move Job kernel: 20-30% HW Thread 1 HW Thread 2 Render loop: 70-100% Player Viewport 1 Player Viewport 2 HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

120. Halo Reach: Render Thread (MP) Simulation loop: 75-100% HW Thread 0 Object Update Havok Update Obj move Job kernel: 20-30% HW Thread 1 HW Thread 2 Render loop: 70-100% Player Viewport 1 Player Viewport 2 PV1: Visib PV1: Submission PV2: Visib PV2: Submission HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

121. Halo Reach: Thread Utilization Simulation loop: 75-100% utilized HW Thread 0 Job kernel: 20-30% utilized HW Thread 1 HW Thread 2 Render loop: 70-100% utilized HW Thread 3 Audio loop: 50-80% utilized HW Thread 4 Job kernel, debug logging: 20-30% utilized Async tasks, I/O, misc: 10-30% with bursts of 100% utilization HW Thread 5

122. Halo Reach: Thread Utilization Simulation loop: 75-100% utilized HW Thread 0 Job kernel: 20-30% utilized HW Thread 1 HW Thread 2 Render loop: 70-100% utilized HW Thread 3 Audio loop: 50-80% utilized HW Thread 4 Job kernel, debug logging: 20-30% utilized Async tasks, I/O, misc: 10-30% with bursts of 100% utilization HW Thread 5

123. Can We Do Better? Observation #1: We don’t need the entire game state for rendering

124. Gamestate and Visibility In Reach, game state extraction happens before we do visibility That’s why we have to copy the entire game state Expensive (in ms and memory footprint)

125. Reduce CPU Latency Visibility is a large chunk of CPU time on render thread Yet we have CPU time is under utilized Underutilized HW threads And not to mention other platforms!

126. Gamestate and Visibility But we can invert that operation Only copy data for visible objects out of game state Only extract data for objects that will be rendered

127. Extract Post Visibility Better: Drive game extraction and processing based on results of visibility Only extract data for visible objects (both static and dynamic) No need to double buffer the entire game state Only buffer game data for the per-frame transient state for visible objects Smaller memory footprint

128. Better Load Balancing Start by splitting off visibility computation into jobs per view This includes visibility computations for player, shadow, reflection views Visibility jobs can have viewport-to-viewport dependencies Can reuse results of one visibility job computation as input to another

129. Reducing Input Latency Stagger visibility computation at the same time as game object update Start static visibility early with predictive camera early in the frame Start this before we do object update

130. Improve CPU Latency Run expensive CPU render operations for visible objects only Just make sure to run this after visibility These would be render-only operations (skinning, cloth sim, polygon sorting) – they do not affect game play

131. An Improved Game Loop Simulation loop: 75-100% HW Thread 0 Predict cameraenvelope Job kernel: 20-30% HW Thread 1 HW Thread 2 Render loop: 70-100% HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

132. An Improved Game Loop Simulation loop: 75-100% HW Thread 0 Job kernel: 20-30% HW Thread 1 Determine render views next HW Thread 2 Render loop: 70-100% HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

133. An Improved Game Loop Simulation loop: 75-100% HW Thread 0 Object Update/Move Job kernel: 20-30% HW Thread 1 Start static objects’ (environment) visibility and broadphase dynamic objects visibility for render views as jobson available threads:player, shadows, etc. HW Thread 2 Render loop: 70-100% HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

134. An Improved Game Loop Simulation loop: 75-100% HW Thread 0 Object Update/Move Job kernel: 20-30% HW Thread 1 Execute object update jobs HW Thread 2 Render loop: 70-100% HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

135. An Improved Game Loop Simulation loop: 75-100% HW Thread 0 Object Update/Move Job kernel: 20-30% HW Thread 1 Run render prepare for static environment rendering jobs (bake precompiled command buffers, etc.) HW Thread 2 Render loop: 70-100% HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

136. An Improved Game Loop Simulation loop: 75-100% HW Thread 0 Object Update/Move Job kernel: 20-30% HW Thread 1 Finalize camera (poll input) HW Thread 2 Render loop: 70-100% HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

137. An Improved Game Loop Simulation loop: 75-100% HW Thread 0 Object Update/Move Job kernel: 20-30% HW Thread 1 Compute narrow phase dynamic objects visibility (as jobs) HW Thread 2 Render loop: 70-100% HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

138. An Improved Game Loop Simulation loop: 75-100% HW Thread 0 Object Update/Move Job kernel: 20-30% HW Thread 1 Preparing visible dynamic objects and extracting game state data for them (in jobs) HW Thread 2 Render loop: 70-100% HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

139. An Improved Game Loop Simulation loop: 75-100% HW Thread 0 Object Update/Move Job kernel: 20-30% HW Thread 1 HW Thread 2 Render loop: 70-100% Execute final prepare jobs to finalize frame packet data HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

140. An Improved Game Loop Simulation loop: 75-100% HW Thread 0 br />Object Update/Move Job kernel: 20-30% HW Thread 1 Publish the frame for rendering HW Thread 2 Render loop: 70-100% HW Thread 3 Audio loop: 50-80% HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

141. Streamlined Submission Thread Havok Update Simulation loop: 75-100% HW Thread 0 Obj move Object Update Job kernel: 20-30% HW Thread 1 HW Thread 2 Render loop: 70-100% HW Thread 3 Audio loop: 50-80% Render submission job HW Thread 4 Job kernel, debug logging: 20-30% Async tasks, socket polling, misc: 10-30% with bursts of 100% utilization HW Thread 5

142. Benefits Decouple game-state traversal from drawing Better CPU utilization with staggered visibility computation Earlier results for each frame mean reduced input latency

143. Benefits Decouple game-state traversal from drawing Better CPU utilization with staggered visibility computation Render thread becomes a streamlined kernel processor

144. A Simple Little Job Tree

145. Future Work Predict dynamic objects visibility with temporal bounding volume Fixup after final camera and object positions are known

Game Worlds from Polygon Soup: Visibility, Spatial Connectivity and Rendering

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (20)

Similaire à Game Worlds from Polygon Soup: Visibility, Spatial Connectivity and Rendering

Similaire à Game Worlds from Polygon Soup: Visibility, Spatial Connectivity and Rendering (20)

Dernier

Dernier (20)

Game Worlds from Polygon Soup: Visibility, Spatial Connectivity and Rendering

Notes de l'éditeur