Commit c38c6f9

Merge pull request NVIDIA#1390 from allisonvacanti/changelog_1.12.0-rc0
Update metafiles for 1.12.0 release.
2 parents 6b116a6 + 6eafe08 commit c38c6f9

3 files changed

Lines changed: 74 additions & 4 deletions

File tree

CHANGELOG.md

Lines changed: 70 additions & 1 deletion
@@ -1,4 +1,73 @@
1-
# Thrust 1.11.0
1+
# Thrust 1.12.0 (NVIDIA HPC SDK 21.3, CUDA Toolkit 11.4)
2+
3+
## Summary
4+
5+
Thrust 1.12.0 is the major release accompanying the NVIDIA HPC SDK 21.3
6+
and the CUDA Toolkit 11.4.
7+
8+
It includes a new `thrust::universal_vector`, which holds data that is
9+
accessible from both host and device. This allows users to easily leverage
10+
CUDA's unified memory with Thrust.
11+
New asynchronous `thrust::async::exclusive_scan` and `inclusive_scan` algorithms
12+
have been added, and the synchronous versions of these have been updated to
13+
use `cub::DeviceScan` directly.
14+
CUB radix sort for floating point types is now stable when both +0.0 and -0.0
15+
are present in the input. This affects some usages of `thrust::sort` and
16+
`thrust::stable_sort`.
17+
Many compilation warnings and subtle overflow bugs were fixed in the device
18+
algorithms, including a long-standing bug that returned invalid temporary
19+
storage requirements when `num_items` was close to (but not
20+
exceeding) `INT32_MAX`.
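
The note about +0.0 and -0.0 can be illustrated with a minimal sketch in plain C++. It shows the common bit-flipping trick that turns a `float` into an unsigned radix key, and why the two zeros, which compare equal, end up with different keys unless they are normalized first. The function names are hypothetical and this is not CUB's actual source; it only illustrates the general technique under the assumption of IEEE 754 single-precision floats.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical helper: map a float to an unsigned key whose integer order
// matches the float order (negative values get all bits flipped, non-negative
// values get only the sign bit flipped).
inline std::uint32_t float_to_radix_key(float value) {
    std::uint32_t bits;
    std::memcpy(&bits, &value, sizeof bits);
    const std::uint32_t mask = (bits & 0x80000000u) ? 0xFFFFFFFFu : 0x80000000u;
    return bits ^ mask;
}

// Hypothetical helper: same mapping, but -0.0f is folded into +0.0f first,
// so both zeros share one key and a stable sort keeps their input order.
inline std::uint32_t float_to_radix_key_zero_normalized(float value) {
    if (value == 0.0f) {   // true for both +0.0f and -0.0f
        value = 0.0f;      // overwrite with +0.0f
    }
    return float_to_radix_key(value);
}
```

Without normalization, -0.0f maps to a smaller key than +0.0f, so a radix sort would move every -0.0f ahead of every +0.0f even though the two values compare equal, which is what breaks stability.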
21+
22+
This release deprecates support for Clang < 7.0 and MSVC < 2019 (aka
23+
19.20/16.0/14.20).
24+
25+
## Breaking Changes
26+
27+
- NVIDIA/thrust#1372: Deprecate Clang < 7 and MSVC < 2019.
28+
- NVIDIA/thrust#1376: Standardize `thrust::scan_by_key` functors / accumulator
29+
types. This may change the results from `scan_by_key` when input, output, and
30+
initial value types are not the same type.
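
To see why the accumulator type matters when input and output types differ, here is a minimal sketch in plain C++. It is a simplified, keyless inclusive scan with a hypothetical name (`inclusive_scan_as`), not Thrust's implementation, and it makes no claim about which accumulator type Thrust selects; it only shows that the choice changes the results.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical illustration: scan `double` inputs into `int` outputs while
// keeping the running sum in a caller-chosen Accumulator type.
template <typename Accumulator>
std::vector<int> inclusive_scan_as(const std::vector<double>& input) {
    std::vector<int> output(input.size());
    Accumulator running = Accumulator();  // value-initialized to zero
    for (std::size_t i = 0; i < input.size(); ++i) {
        // The sum is truncated here if Accumulator is an integer type.
        running = static_cast<Accumulator>(running + input[i]);
        output[i] = static_cast<int>(running);
    }
    return output;
}
```

For the input `{0.5, 0.5, 0.5}`, `inclusive_scan_as<int>` yields `{0, 0, 0}` because each partial sum is truncated before the next addition, while `inclusive_scan_as<double>` yields `{0, 1, 1}` because truncation happens only when writing the output.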
31+
32+
## New Features
33+
34+
- NVIDIA/thrust#1251: Add two new `thrust::async::` algorithms: `inclusive_scan`
35+
and `exclusive_scan`.
36+
- NVIDIA/thrust#1334: Add `thrust::universal_vector`, `universal_ptr`,
37+
and `universal_allocator`.
38+
39+
## Bug Fixes
40+
41+
- NVIDIA/thrust#1347: Qualify calls to `make_reverse_iterator`.
42+
- NVIDIA/thrust#1359: Enable stricter warning flags. This fixes several
43+
outstanding issues:
44+
- NVIDIA/cub#221: Overflow in `temp_storage_bytes` when `num_items` close to
45+
(but not over) `INT32_MAX`.
46+
- NVIDIA/cub#228: CUB uses non-standard C++ extensions that break strict
47+
compilers.
48+
- NVIDIA/cub#257: Warning when compiling `GridEvenShare` with unsigned
49+
offsets.
50+
- NVIDIA/thrust#974: Conversion warnings in `thrust::transform_reduce`.
51+
- NVIDIA/thrust#1091: Conversion warnings in `thrust::counting_iterator`.
52+
- NVIDIA/thrust#1373: Fix compilation error when a standard library type is
53+
wrapped in `thrust::optional`. Thanks to Vukasin Milovanovic for this
54+
contribution.
55+
- NVIDIA/thrust#1388: Fix `signbit(double)` implementation on MSVC.
56+
- NVIDIA/thrust#1389: Support building Thrust tests without CUDA enabled.
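
The overflow class behind NVIDIA/cub#221 can be sketched generically in plain C++. The changelog does not describe the exact expression that overflowed, so the functions below are hypothetical and only show how a byte count computed in 32-bit arithmetic can wrap even when the item count itself still fits below `INT32_MAX`. Unsigned types are used so the wraparound is well defined.

```cpp
#include <cstdint>

// Hypothetical: byte count computed entirely in 32 bits.
// The product wraps modulo 2^32 when it does not fit.
inline std::uint32_t bytes_needed_32(std::uint32_t num_items,
                                     std::uint32_t item_bytes) {
    return num_items * item_bytes;
}

// Hypothetical: same computation after widening both operands to 64 bits.
inline std::uint64_t bytes_needed_64(std::uint64_t num_items,
                                     std::uint64_t item_bytes) {
    return num_items * item_bytes;
}
```

With `num_items = 2147483646` (one below `INT32_MAX`) and 4-byte items, the 64-bit version returns 8589934584, while the 32-bit version wraps to 4294967288, a size that is too small for the data.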
57+
58+
## Other Enhancements
59+
60+
- NVIDIA/thrust#1304: Use `cub::DeviceScan` to implement
61+
`thrust::exclusive_scan` and `thrust::inclusive_scan`.
62+
- NVIDIA/thrust#1362, NVIDIA/thrust#1370: Update smoke test naming.
63+
- NVIDIA/thrust#1380: Fix typos in `set_operation` documentation. Thanks to
64+
Hongyu Cai for this contribution.
65+
- NVIDIA/thrust#1383: Include FreeBSD license in LICENSE.md for
66+
`thrust::complex` implementation.
67+
- NVIDIA/thrust#1384: Add missing precondition to `thrust::gather`
68+
documentation.
69+
70+
# Thrust 1.11.0 (CUDA Toolkit 11.3)
271

372
## Summary
473

README.md

Lines changed: 3 additions & 2 deletions
@@ -146,8 +146,9 @@ See the [changelog](CHANGELOG.md) for details about specific releases.
146146

147147
| Thrust Release | Included In |
148148
| ----------------- | --------------------------------------- |
149-
| 1.11.0 | |
150-
| 1.10.0 | NVIDIA HPC SDK 20.9 |
149+
| 1.12.0 | NVIDIA HPC SDK 21.3 & CUDA Toolkit 11.4 |
150+
| 1.11.0 | CUDA Toolkit 11.3 |
151+
| 1.10.0 | NVIDIA HPC SDK 20.9 & CUDA Toolkit 11.2 |
151152
| 1.9.10-1 | NVIDIA HPC SDK 20.7 & CUDA Toolkit 11.1 |
152153
| 1.9.10 | NVIDIA HPC SDK 20.5 |
153154
| 1.9.9 | CUDA Toolkit 11.0 |

dependencies/cub

Submodule cub updated from 47d71d9 to fef1b9c
