|
void | mlir::populateGpuGlobalIdPatterns (RewritePatternSet &patterns) |
| Collect a set of patterns to rewrite GlobalIdOp op within the GPU dialect. More...
|
|
void | mlir::populateGpuSubgroupIdPatterns (RewritePatternSet &patterns) |
| Collect a set of patterns to rewrite SubgroupIdOp op within the GPU dialect. More...
|
|
void | mlir::populateGpuShufflePatterns (RewritePatternSet &patterns) |
| Collect a set of patterns to rewrite shuffle ops within the GPU dialect. More...
|
|
void | mlir::populateGpuAllReducePatterns (RewritePatternSet &patterns) |
| Collect a set of patterns to rewrite all-reduce ops within the GPU dialect. More...
|
|
void | mlir::populateGpuBreakDownSubgroupReducePatterns (RewritePatternSet &patterns, unsigned maxShuffleBitwidth=32, PatternBenefit benefit=1) |
| Collect a set of patterns to break down subgroup_reduce ops into smaller ones supported by the target of size <= maxShuffleBitwidth , where size is the subgroup_reduce value bitwidth. More...
|
|
void | mlir::populateGpuLowerSubgroupReduceToShufflePatterns (RewritePatternSet &patterns, unsigned subgroupSize, unsigned shuffleBitwidth=32, PatternBenefit benefit=1) |
| Collect a set of patterns to lower gpu.subgroup_reduce into gpu.shuffle ops over shuffleBitwidth scalar types. More...
|
|
void | mlir::populateGpuLowerClusteredSubgroupReduceToShufflePatterns (RewritePatternSet &patterns, unsigned subgroupSize, unsigned shuffleBitwidth=32, PatternBenefit benefit=1) |
| Disjoint counterpart of populateGpuLowerSubgroupReduceToShufflePatterns that only matches gpu.subgroup_reduce ops with a cluster_size . More...
|
|
void | mlir::populateGpuLowerSubgroupReduceToDPPPatterns (RewritePatternSet &patterns, unsigned subgroupSize, amdgpu::Chipset chipset, PatternBenefit benefit=1) |
| Collect a set of patterns to lower gpu.subgroup_reduce into amdgpu.dpp ops over scalar types. More...
|
|
void | mlir::populateGpuLowerClusteredSubgroupReduceToDPPPatterns (RewritePatternSet &patterns, unsigned subgroupSize, amdgpu::Chipset chipset, PatternBenefit benefit=1) |
| Disjoint counterpart of populateGpuLowerSubgroupReduceToDPPPatterns that only matches gpu.subgroup_reduce ops with a cluster_size . More...
|
|
void | mlir::populateGpuRewritePatterns (RewritePatternSet &patterns) |
| Collect all patterns to rewrite ops within the GPU dialect. More...
|
|
LogicalResult | mlir::gpu::transformGpuModulesToBinaries (Operation *op, OffloadingLLVMTranslationAttrInterface handler=nullptr, const gpu::TargetOptions &options={}) |
| Searches for all GPU modules in op and transforms them into GPU binary operations. More...
|
|
void | mlir::populateGpuDecomposeMemrefsPatterns (RewritePatternSet &patterns) |
| Collect a set of patterns to decompose memrefs ops. More...
|
|
void | mlir::populateGpuEliminateBarriersPatterns (RewritePatternSet &patterns) |
| Erase barriers that do not enforce conflicting memory side effects. More...
|
|