Home
last modified time | relevance | path

Searched defs:elements_per_thread (Results 1 – 2 of 2) sorted by relevance

/aosp_15_r20/external/pytorch/aten/src/ATen/native/cuda/
H A DShape.cu57 constexpr unsigned int elements_per_thread = 8; in getCatGridRocm() local
82 unsigned int elements_per_thread = ALIGNED_VEC_LOAD_BYTES / sizeof(T) * in getCatGridContig() local
H A DTriangularOps.cu119 constexpr int elements_per_thread = sizeof(scalar_t) < 8 ? 8 / sizeof(scalar_t) : 1; in triu_tril_cuda_template() local