|
DiagnosedSilenceableFailure | mlir::transform::gpu::mapForallToBlocksImpl (RewriterBase &rewriter, TransformOpInterface transformOp, scf::ForallOp forallOp, SmallVectorImpl< int64_t > &gridDims, const GpuIdBuilder &gpuIdBuilder) |
| Map the top-level scf.forall op to GPU blocks.
|
|
DiagnosedSilenceableFailure | mlir::transform::gpu::mapOneForallToThreadsImpl (RewriterBase &rewriter, std::optional< TransformOpInterface > transformOp, scf::ForallOp forallOp, ArrayRef< int64_t > blockDims, int64_t warpSize, bool syncAfterDistribute) |
| Search scf.forall ops nested under target and map each such op to an explicit GPU implementation along blockDims.
|
|
DiagnosedSilenceableFailure | mlir::transform::gpu::mapNestedForallToThreadsImpl (RewriterBase &rewriter, std::optional< TransformOpInterface > transformOp, Operation *target, ArrayRef< int64_t > blockDims, int64_t warpSize, bool syncAfterDistribute) |
| Search scf.forall ops nested under target and map each such op to an explicit GPU implementation along blockDims.
|
|
void | mlir::gpu::registerTransformDialectExtension (DialectRegistry &registry) |
|
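The thread-mapping functions listed above distribute the iterations of an scf.forall over the threads of a block along blockDims. The following is a conceptual sketch of that idea in plain C++, not MLIR code: Id3, mapThreadToIteration, and countActiveThreads are hypothetical names introduced only for illustration. It assumes a one-to-one mapping in which thread id i becomes induction variable i, and that threads outside the loop's extent are simply masked off. The real functions rewrite IR rather than executing anything.

```cpp
#include <array>
#include <cassert>
#include <cstdint>
#include <optional>

// Hypothetical type, not part of MLIR: stands in for a 3-D (x, y, z) index
// such as the thread id a GPU thread would observe.
using Id3 = std::array<int64_t, 3>;

// Returns the forall iteration handled by `threadId`, or std::nullopt when
// the thread lies outside the loop's extent and must be masked off.
// Assumption for this sketch: forallDims[i] <= blockDims[i] in every dimension.
std::optional<Id3> mapThreadToIteration(const Id3 &threadId,
                                        const Id3 &forallDims,
                                        const Id3 &blockDims) {
  for (int i = 0; i < 3; ++i) {
    assert(forallDims[i] <= blockDims[i] && "forall does not fit in the block");
    assert(threadId[i] >= 0 && threadId[i] < blockDims[i]);
    if (threadId[i] >= forallDims[i])
      return std::nullopt; // Inactive thread: no iteration to execute.
  }
  // One-to-one mapping: induction variable i takes the value of thread id i.
  return threadId;
}

// Walks every thread of the block and counts those that execute an iteration.
int64_t countActiveThreads(const Id3 &forallDims, const Id3 &blockDims) {
  int64_t active = 0;
  for (int64_t z = 0; z < blockDims[2]; ++z)
    for (int64_t y = 0; y < blockDims[1]; ++y)
      for (int64_t x = 0; x < blockDims[0]; ++x)
        if (mapThreadToIteration({x, y, z}, forallDims, blockDims))
          ++active;
  return active;
}
```

With a 4x2x1 loop mapped onto an 8x4x1 block, only the threads whose id lies inside the loop's extent receive an iteration; the others do no work in this sketch.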
GET_OP_CLASSES