Go to the source code of this file.
|
| DiagnosedSilenceableFailure | mlir::transform::gpu::mapForallToBlocksImpl (RewriterBase &rewriter, TransformOpInterface transformOp, scf::ForallOp forallOp, SmallVectorImpl< int64_t > &gridDims, const GpuIdBuilder &gpuIdBuilder) |
| | Map the top level scf.forall op to GPU blocks. More...
|
| |
| DiagnosedSilenceableFailure | mlir::transform::gpu::mapOneForallToThreadsImpl (RewriterBase &rewriter, std::optional< TransformOpInterface > transformOp, scf::ForallOp forallOp, ArrayRef< int64_t > blockSizes, int64_t warpSize, bool syncAfterDistribute) |
| | Search scf.forall ops nested under target and map each such op to an explicit GPU implementation along blockDims. More...
|
| |
| DiagnosedSilenceableFailure | mlir::transform::gpu::mapNestedForallToThreadsImpl (RewriterBase &rewriter, std::optional< TransformOpInterface > transformOp, Operation *target, ArrayRef< int64_t > blockDims, int64_t warpSize, bool syncAfterDistribute) |
| | Search scf.forall ops nested under target and map each such op to an explicit GPU implementation along blockDims. More...
|
| |
| void | mlir::gpu::registerTransformDialectExtension (DialectRegistry ®istry) |
| |
◆ GET_OP_CLASSES