Welcome to OGeek Q&A Community for programmer and developer-Open, Learning and Share
Welcome To Ask or Share your Answers For Others


cuda - What are "Other" Issue Stall Reasons displayed by the Nsight profiler?

I have a kernel that performs poorly on CC 3.0 (Kepler) compared with CC 2.0 (Fermi). In the Nsight profiler, the Warp Issue Efficiency chart shows that there were no eligible warps 60% of the time, and the Issue Stall Reasons chart shows that 60% of these stalls are due to "Other".

I'm wondering what the Other issue stall reasons are and what I might do to reduce them.

CUDA 5.0 / Nsight 3.0 RC / CC 3.0

[Image: Issue Stall Reasons chart]


1 Reply


In the Nsight Visual Studio Edition 3.0 CUDA Profiler, the Issue Efficiency view displays a pie chart of the warp stall reasons. The stall reasons are Instruction Fetch, Execution Dependency, Data Requests, Texture, Synchronization, and Other.

For Compute Capability 3.* devices the Other category is the percentage of time that active warps are stalled due to the following reasons:

  • execution unit is busy (reduce use of low throughput integer operations)
  • register bank conflicts (compiler issue that can sometimes be made worse by heavy use of vector data types)
  • too few warps per scheduler
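
As one illustration of the first bullet: integer division and modulo are among the costlier integer operations on the GPU, and the CUDA C Programming Guide notes that when the divisor is a power of two they can be replaced with a shift and a mask (the compiler already does this when the divisor is a literal). The sketch below is plain C++ so it can be checked at compile time; `div_pow2` and `mod_pow2` are hypothetical helper names, and in a kernel they would be marked `__device__`. Whether this actually lowers the Other stall share for a given kernel is something to confirm in the profiler.

```cpp
#include <cstdint>

// Hypothetical helpers, not part of any CUDA API.
// In device code these would carry the __device__ qualifier.

// Quotient of i divided by 2^log2_n, using a shift instead of a division.
constexpr std::uint32_t div_pow2(std::uint32_t i, std::uint32_t log2_n) {
    return i >> log2_n;
}

// Remainder of i divided by n. Only valid when n is a power of two.
constexpr std::uint32_t mod_pow2(std::uint32_t i, std::uint32_t n) {
    return i & (n - 1u);
}

// Compile-time checks that the rewrites agree with / and %.
static_assert(div_pow2(100u, 5u) == 100u / 32u, "shift matches division by 32");
static_assert(mod_pow2(100u, 32u) == 100u % 32u, "mask matches modulo 32");
```

A typical use would be splitting a linear thread index into a warp index and a lane index, for example `div_pow2(tid, 5)` and `mod_pow2(tid, 32)`.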

For Compute Capability 5.* and 6.* devices the Other category is the percentage of time that active warps are stalled due to the following reasons:

  • register bank conflicts (compiler issue that can sometimes be made worse by heavy use of vector data types)
  • warps waiting to resolve branches
  • warps that are lower priority and are not currently being considered for scheduling

For 5.* and 6.* devices, especially GP100, the last reason can be very high (~75%) if the kernel reaches 32 warps per warp scheduler.
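
To see where a launch sits relative to "too few warps per scheduler" or the 32 warps per scheduler mentioned above, the arithmetic can be sketched as below. This is a rough estimate under stated assumptions, not a profiler formula: the function names are hypothetical, `resident_blocks_per_sm` is an input you would take from an occupancy calculation (it depends on register and shared memory usage), and the scheduler counts used in the checks (4 per SM on Kepler, 2 per SM on GP100) come from NVIDIA's architecture documentation rather than from this answer.

```cpp
// Warps in one block: threads rounded up to whole 32-thread warps.
constexpr int warps_per_block(int threads_per_block) {
    return (threads_per_block + 31) / 32;
}

// Average number of resident warps each scheduler on one SM has to choose from.
// resident_blocks_per_sm and schedulers_per_sm are assumed inputs (see lead-in).
constexpr double warps_per_scheduler(int threads_per_block,
                                     int resident_blocks_per_sm,
                                     int schedulers_per_sm) {
    return static_cast<double>(warps_per_block(threads_per_block) *
                               resident_blocks_per_sm) /
           schedulers_per_sm;
}

// Kepler-style SM (assumed 4 schedulers): 2 resident blocks of 128 threads
// leave each scheduler with only 2 warps.
static_assert(warps_per_scheduler(128, 2, 4) == 2.0, "few warps per scheduler");

// GP100-style SM (assumed 2 schedulers): 2 resident blocks of 1024 threads
// reach the 32 warps per scheduler case.
static_assert(warps_per_scheduler(1024, 2, 2) == 32.0, "32 warps per scheduler");
```

A low result points at the "too few warps per scheduler" case on CC 3.*, while a result of 32 on GP100 is the situation where most warps are reported as lower priority and not currently considered for scheduling.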

These stall reasons are grouped into the Other category because it is hard to identify actions that a developer can take to resolve them.

