Commit a9fada6
authored
[SYCL] Fix NVPTX compilation with the new offload driver (#19039)
A typical SYCL compilation for NVPTX with the default
`nvptx(64)?-nvidia-cuda` triple compiles for an older SM_50
architecture, relying on forward compatibility and JIT to run on newer
devices. Thus compilation for NVPTX with the new offload driver relies
on us generating a fat binary containing the textual PTX as well as the
compiled object. This is a kind of LTO though not officially considered
so.
A recent pulldown broke the generation of textual assembly with the `-S`
flag, and so we were later passing a compiled ELF binary into `ptxas`
which predictably went wrong.
Fixes #18432.1 parent b0523f4 commit a9fada6
5 files changed
Lines changed: 3 additions & 10 deletions
File tree
- clang/tools/clang-linker-wrapper
- sycl/test-e2e
- DeviceImageDependencies/NewOffloadDriver
- NewOffloadDriver
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1691 | 1691 | | |
1692 | 1692 | | |
1693 | 1693 | | |
1694 | | - | |
| 1694 | + | |
1695 | 1695 | | |
1696 | 1696 | | |
1697 | 1697 | | |
| |||
1724 | 1724 | | |
1725 | 1725 | | |
1726 | 1726 | | |
1727 | | - | |
| 1727 | + | |
1728 | 1728 | | |
1729 | 1729 | | |
1730 | 1730 | | |
| |||
Lines changed: 1 addition & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | 1 | | |
2 | 2 | | |
3 | | - | |
4 | | - | |
| 3 | + | |
5 | 4 | | |
6 | 5 | | |
7 | 6 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
5 | 5 | | |
6 | 6 | | |
7 | 7 | | |
8 | | - | |
9 | | - | |
10 | 8 | | |
11 | 9 | | |
12 | 10 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
2 | | - | |
3 | 1 | | |
4 | 2 | | |
5 | 3 | | |
| |||
Lines changed: 0 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
1 | | - | |
2 | | - | |
3 | 1 | | |
4 | 2 | | |
5 | 3 | | |
| |||
0 commit comments