
Conversation

@affifboudaoud

Extended Batched Matrix Multiplication Support

Summary

Extend batched matrix multiplication to support N-dimensional tensors with NumPy-style broadcasting across all implementations (Pure, MKL, OpenBLAS, cuBLAS).

Changes

  • N-D tensor support: Handles tensors with arbitrary batch dimensions (e.g., [12, 2, 64, 64] @ [12, 2, 64, 128])
  • Broadcasting: Supports broadcasting patterns like [b, m, k] @ [k, n] and [m, k] @ [b, k, n]
  • Batch flattening: Multi-dimensional batches are flattened internally for efficient BLAS operations
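The flattening strategy described above can be sketched in plain NumPy. This is an illustrative model of the approach, not the DaCe implementation: the function name `batched_matmul_broadcast` and the explicit Python loop (a stand-in for a strided-batched BLAS call) are assumptions for the sketch.

```python
import numpy as np

def batched_matmul_broadcast(a, b):
    # The last two dimensions are the matrix dims; everything before
    # them is treated as batch dimensions.
    a_batch, b_batch = a.shape[:-2], b.shape[:-2]
    batch = np.broadcast_shapes(a_batch, b_batch)  # NumPy-style broadcasting
    m, k = a.shape[-2:]
    k2, n = b.shape[-2:]
    assert k == k2, "inner dimensions must match"
    # Broadcast each operand to the full batch shape, then flatten the
    # batch dimensions so a single loop of 2D GEMMs (standing in for a
    # strided-batched BLAS call) can process them.
    a_full = np.broadcast_to(a, batch + (m, k)).reshape(-1, m, k)
    b_full = np.broadcast_to(b, batch + (k, n)).reshape(-1, k, n)
    out = np.empty((a_full.shape[0], m, n), dtype=np.result_type(a, b))
    for i in range(a_full.shape[0]):
        out[i] = a_full[i] @ b_full[i]
    return out.reshape(batch + (m, n))
```

For example, multiplying a `[12, 2, 64, 8]` tensor by an `[8, 16]` matrix broadcasts the second operand across all 24 batch entries and yields a `[12, 2, 64, 16]` result.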

New Capabilities

# 3D broadcasting
[b, m, k] @ [k, n] → [b, m, n]
[m, k] @ [b, k, n] → [b, m, n]

# 4D batched matmul
[12, 2, 64, 64] @ [12, 2, 64, 128] → [12, 2, 64, 128]
[12, 2, m, k] @ [k, n] → [12, 2, m, n]  # with broadcasting
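These shape rules mirror NumPy's `np.matmul` broadcasting semantics, so they can be checked directly against NumPy (smaller sizes used here for speed; the concrete sizes are arbitrary):

```python
import numpy as np

# 3D broadcasting: [b, m, k] @ [k, n] -> [b, m, n]
a = np.ones((4, 5, 6))
w = np.ones((6, 7))
assert np.matmul(a, w).shape == (4, 5, 7)

# [m, k] @ [b, k, n] -> [b, m, n]
x = np.ones((5, 6))
y = np.ones((4, 6, 7))
assert np.matmul(x, y).shape == (4, 5, 7)

# 4D batched matmul: [12, 2, 64, 64] @ [12, 2, 64, 128] -> [12, 2, 64, 128]
p = np.ones((12, 2, 64, 64))
q = np.ones((12, 2, 64, 128))
assert np.matmul(p, q).shape == (12, 2, 64, 128)
```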

Files Modified

  • dace/libraries/blas/nodes/matmul.py: Extended _get_batchmm_opts() for N-D tensors and broadcasting
  • dace/libraries/blas/nodes/batched_matmul.py: Updated validation and Pure expansion for dynamic dimensions
  • dace/frontend/python/replacements/linalg.py: Removed 3D tensor check
  • tests/library/batched_matmul_test.py: Added tests for newly supported 3D/4D batched matmuls with broadcasting

@affifboudaoud affifboudaoud marked this pull request as ready for review October 17, 2025 09:23

@phschaad phschaad left a comment


Nice addition, looks good to me

@phschaad phschaad added this pull request to the merge queue Oct 18, 2025
Merged via the queue into main with commit d32f51d Oct 18, 2025
10 checks passed
@phschaad phschaad deleted the batched_matmul_improvements branch October 18, 2025 09:11
sophieblock pushed a commit to sophieblock/dace that referenced this pull request Oct 20, 2025
