Skip to content

Commit 4f97597

Browse files
committed
Merge branch 'cpu_optimizations_v2' of github.com:vthumbe1503/TransformerEngine into cpu_optimizations_v2
Signed-off-by: Varun Thumbe <vthumbe@nvidia.com>
2 parents a9a9746 + 4cd6a67 commit 4f97597

File tree

1 file changed

+1
-1
lines changed
  • transformer_engine/pytorch/csrc/extensions

1 file changed

+1
-1
lines changed

transformer_engine/pytorch/csrc/extensions/cast.cpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1098,7 +1098,7 @@ std::vector<py::object> split_quantize(const at::Tensor &tensor,
10981098
uint8_t *input_dptr = reinterpret_cast<uint8_t *>(input_py.data_ptr());
10991099
auto input_dtype = GetTransformerEngineDType(input_py.scalar_type());
11001100
NVTEShape input_shape;
1101-
input_shape.ndim=0;
1101+
input_shape.ndim = 0;
11021102
size_t input_size = 1;
11031103
for (const auto &d : input_py.sizes()) {
11041104
input_shape.data[input_shape.ndim++] = static_cast<size_t>(d);

0 commit comments

Comments
 (0)