Skip to content

Tags: pytorch/pytorch

Tags

viable/strict/1765628856

Toggle viable/strict/1765628856's commit message
[xpu][fix] Fix UT test_fuse_mix_order_reductions_combo_kernels (#170297)

Fixes #170296

Pull Request resolved: #170297
Approved by: https://github.com/EikanWang, https://github.com/jansel

viable/strict/1765621526

Toggle viable/strict/1765621526's commit message
[dynamo][DebugMode] make ModTracker a no-op in compiled regions (#170124

)

ModTracker causes a graph break in compiled regions: #169995, so this makes it a no-op by introducing the `torch._dynamo.eval_frame._is_in_compiled_region()` check.

The next PR in the stack makes nn.Module tracking work for DebugMode by introducing an interpreter.

Pull Request resolved: #170124
Approved by: https://github.com/tugsbayasgalan

trunk/5058132088b93b3cd507b6cb258c4fc91f4b0530

Toggle trunk/5058132088b93b3cd507b6cb258c4fc91f4b0530's commit message
[ROCm] Enable group gemm on gfx90a (#169356)

Fix concurrency race condition for group gemm and enable group gemm support on
gfx90a architecture.

Test command:
PYTORCH_TEST_WITH_ROCM=1 pytest test/test_matmul_cuda.py -v -k "test_grouped_gemm_2d_2d or test_grouped_gemm_2d_3d or or test_grouped_gemm_3d_3d or test_grouped_gemm_3d_2d"

Pull Request resolved: #169356
Approved by: https://github.com/slayton58, https://github.com/jeffdaily

Co-authored-by: Jeff Daily <[email protected]>

trunk/51109071a502e65c490e551a8e029e51e1d20133

Toggle trunk/51109071a502e65c490e551a8e029e51e1d20133's commit message
Fix: torch.view_as_complex() does not work on memory layout produced …

…by torch.contiguous() after transpose (#169780)

Fixes #150050, by ignoring the stride divisibility requirement for singleton dimensions (since stride is irrelevant for singleton dimensions).

Pull Request resolved: #169780
Approved by: https://github.com/soulitzer

trunk/0213024dcfc49947a656b4cc2fad6ed74304905d

Toggle trunk/0213024dcfc49947a656b4cc2fad6ed74304905d's commit message
[codemod][lowrisk] Remove unused exception parameter from caffe2 (#17…

…0325)

Summary:
`-Wunused-exception-parameter` has identified an unused exception parameter. This diff removes it.

This:
```
try {
    ...
} catch (exception& e) {
    // no use of e
}
```
should instead be written as
```
} catch (exception&) {
```

If the code compiles, this is safe to land.

Test Plan: Sandcastle

Reviewed By: dmm-fb

Differential Revision: D89071569

Pull Request resolved: #170325
Approved by: https://github.com/malfet

trunk/21705cd350d737151b2d51d55fa77b7d61137b47

Toggle trunk/21705cd350d737151b2d51d55fa77b7d61137b47's commit message
[18/N] Use Python 3.10 typing (#170280)

This PR uses Python 3.10 typing to some files.

Pull Request resolved: #170280
Approved by: https://github.com/Lucaskabela

trunk/877a3a56f359106275216ad2152c7196f77d1b06

Toggle trunk/877a3a56f359106275216ad2152c7196f77d1b06's commit message
[CI] Update update_expected.py to skip cuda-13 results (#170348)

Summary: Since cuda-13 runs skip more tests, we should only use cuda-12 runs to update the expected result files.

Pull Request resolved: #170348
Approved by: https://github.com/eellison

trunk/798f4201688023f447a01f218a137c0bfee1cde8

Toggle trunk/798f4201688023f447a01f218a137c0bfee1cde8's commit message
[Release 2.11] Version Bump (#170346)

Same as #162526
Compatibility matrix will be updated separately

Pull Request resolved: #170346
Approved by: https://github.com/huydhn

trunk/674a97c503eebc30bfd48a1dc839bdfeb2d5648c

Toggle trunk/674a97c503eebc30bfd48a1dc839bdfeb2d5648c's commit message
[ROCm] enable fastSpecializedAtomicAdd for gfx950 (#170330)

Use standard HIP headers for unsafeAtomicAdd. Cannot remove copy/paste of unsafeAtomicAdd as "preview" implementation until ROCm 6.2 can be fully deprecated.

Re-land of #167661

Co-author: @jeffdaily

Pull Request resolved: #170330
Approved by: https://github.com/jeffdaily

trunk/94d012b1cfa22451c1b8edd2a9ca0220b573129f

Toggle trunk/94d012b1cfa22451c1b8edd2a9ca0220b573129f's commit message
Don't call str when in redistribute hotpath (#170366)

Signed-off-by: Edward Z. Yang <[email protected]>
Pull Request resolved: #170366
Approved by: https://github.com/wconstab