This repository was archived by the owner on Mar 21, 2024. It is now read-only.
CUB 1.17.0 #475
alliepiper
announced in
Announcements
CUB 1.17.0
Summary
CUB 1.17.0 is the final minor release of the 1.X series. It provides a variety of bug fixes and miscellaneous enhancements, detailed below.
Known Issues
“Run-to-run” Determinism Broken
Several CUB device algorithms are documented to provide deterministic results (per device) for non-associative reduction operators (e.g. floating-point addition). Unfortunately, the implementations of these algorithms contain performance optimizations that violate this guarantee. The DeviceReduce::ReduceByKey and DeviceScan algorithms are known to be affected. We’re currently evaluating the scope and impact of correcting this in a future CUB release. See NVIDIA/cub#471 for details.
Bug Fixes
- Fixed DeviceSelect to work with discard iterators and mixed input/output types.
- Fixed an install issue when CMAKE_INSTALL_LIBDIR contained nested directories. Thanks to @robertmaynard for this contribution.
- Fixed a bug that produced incorrect results from DeviceSegmentedSort on sm_61 and sm_70.
- Fixed DeviceSelect::Flagged so that flags are normalized to 0 or 1.
- Fixed overflow issues in DeviceRadixSort given num_items close to 2^32. Thanks to @canonizer for this contribution.
Other Enhancements
- Removed the device-sync in DeviceSegmentedSort when launched via CDP.
- BlockDiscontinuity: Replaced recursive-template loop unrolling with #pragma unroll. Thanks to @kshitij12345 for this contribution.
- Replaced the deprecated TexRefInputIterator implementation with an alias to TexObjInputIterator. This fully removes all usages of the deprecated CUDA texture reference APIs from CUB.
- BlockAdjacentDifference: Replaced recursive-template loop unrolling with #pragma unroll. Thanks to @kshitij12345 for this contribution.
- The cub::DeviceAdjacentDifference API has been updated to use the new OffsetT deduction approach described in "Transparent support for 64-bit indexing in device algorithms" (#212).
This discussion was created from the release CUB 1.17.0.
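As background for the determinism issue listed under Known Issues: floating-point addition is not associative, so a reduction that groups its operands differently from one run to the next can return different results. The following is a minimal host-side sketch in plain C++, not CUB code. The function names and input values are made up for illustration, and it assumes IEEE 754 single-precision float.

```cpp
// Illustration only: these helpers are hypothetical and are not part of CUB.
// They add the same three floats with two different groupings.

// Computes (a + b) + c.
float sum_grouped_left(float a, float b, float c) {
    // volatile forces the intermediate to be rounded to float,
    // even on targets that evaluate expressions in higher precision.
    volatile float partial = a + b;
    return partial + c;
}

// Computes a + (b + c).
float sum_grouped_right(float a, float b, float c) {
    volatile float partial = b + c;
    return a + partial;
}

// Returns true when the two groupings disagree for the given inputs.
bool groupings_differ(float a, float b, float c) {
    return sum_grouped_left(a, b, c) != sum_grouped_right(a, b, c);
}
```

With a = 1.0e8f, b = -1.0e8f and c = 1.0f, the left grouping yields 1.0f, while the right grouping yields 0.0f, because -1.0e8f + 1.0f rounds back to -1.0e8f (adjacent floats near 1e8 are 8 apart). A parallel reduction whose grouping depends on scheduling can therefore vary between runs unless the implementation fixes the order of operations.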