[Fix] issues mentioned in comments of #22472 by ChiragSW · Pull Request #22595 · keras-team/keras

ChiragSW · 2026-03-31T09:39:07Z

Fixes issues mentioned in comments of #22472

Actually, we have discovered that many APIs in the keras.ops module exhibit the issue described above: they lack validity checks when accepting keras.Input as an input.
These include:

keras.ops.flip
keras.ops.log_softmax
keras.ops.sparse_categorical_crossentropy
keras.ops.roll
keras.ops.sparsemax
keras.ops.trace
keras.ops.image.pad_images
keras.ops.image.crop_images

Root cause

keras.ops functions branch like this:
if any_symbolic_tensors(inputs): call Operation(...).symbolic_call(), which relies on compute_output_spec() for validation
else: run the backend/eager code path, where the real argument checks happen
So any check that existed only in the eager helper/backend path was skipped for keras.Input.
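
A toy model of that dispatch may make the gap easier to see. Everything here is a hypothetical stand-in (`SymbolicTensor`, `flip_unchecked`, `_eager_flip`, `_symbolic_flip`), not the real Keras classes or functions; the sketch only assumes the two-branch structure described above.

```python
class SymbolicTensor:
    """Hypothetical stand-in for a symbolic placeholder such as keras.Input."""

    def __init__(self, shape):
        self.shape = shape


def _eager_flip(x, axis):
    # The argument check lives only here, in the eager path.
    if axis is not None and not isinstance(axis, int):
        raise TypeError(f"`axis` must be an int or None. Received: axis={axis}")
    return x[::-1]


def _symbolic_flip(x, axis):
    # Shape inference only: nothing in this branch inspects `axis`.
    return SymbolicTensor(x.shape)


def flip_unchecked(x, axis=None):
    # Mirrors the branch structure: symbolic inputs never reach _eager_flip.
    if isinstance(x, SymbolicTensor):
        return _symbolic_flip(x, axis)
    return _eager_flip(x, axis)
```

With this layout, `flip_unchecked([1, 2, 3], axis="bad")` raises a `TypeError`, while the same bad `axis` passed with a `SymbolicTensor` is accepted silently.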

Fix

keras.ops.image.pad_images / crop_images (`keras/src/ops/image.py`)

pad_images(): added the missing “must specify exactly two of …” argument validation in the public wrapper so it runs for both eager and symbolic inputs.
crop_images(): added the same “exactly two of …” validation in the wrapper.
CropImages.compute_output_spec(): added validation by inferring missing crop amounts when input_height/input_width are known, and raising if the inferred values would be negative.
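
As a rough sketch of the two checks listed above, assuming one dimension at a time and hypothetical helper names (`check_exactly_two`, `infer_crop_amount`) that do not come from the PR:

```python
def check_exactly_two(**kwargs):
    """Raise unless exactly one of the given keyword arguments is None."""
    if list(kwargs.values()).count(None) != 1:
        names = ", ".join(kwargs)
        raise ValueError(
            f"Must specify exactly two of {names}. Received: {kwargs}"
        )


def infer_crop_amount(input_size, leading, trailing, target):
    """Fill in whichever of leading/trailing/target is None.

    `input_size` may be None when the dimension is unknown at trace time,
    in which case nothing can be inferred or range-checked yet.
    """
    check_exactly_two(leading=leading, trailing=trailing, target=target)
    if input_size is None:
        return leading, trailing, target
    if target is None:
        target = input_size - leading - trailing
    elif leading is None:
        leading = input_size - target - trailing
    else:
        trailing = input_size - target - leading
    if min(leading, trailing, target) < 0:
        raise ValueError(
            "Inferred crop values must be non-negative. Received: "
            f"leading={leading}, trailing={trailing}, target={target}, "
            f"input_size={input_size}"
        )
    return leading, trailing, target
```

For example, `infer_crop_amount(10, 2, None, 5)` fills in a trailing crop of 3, while `infer_crop_amount(10, 8, None, 5)` raises because the inferred trailing crop would be -3.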

keras.ops.flip, keras.ops.roll, keras.ops.trace (`keras/src/ops/numpy.py`)

flip(): validate axis is None, int, or a sequence of ints.
roll(): validate axis/shift compatibility.
trace(): validate axis1 != axis2 during symbolic shape inference.

keras.ops.log_softmax, keras.ops.sparsemax, keras.ops.sparse_categorical_crossentropy (`keras/src/ops/nn.py`)

log_softmax() and sparsemax(): validate axis bounds against rank when rank is known (so axis=-3 for a rank-2 input raises for symbolic inputs too).
sparse_categorical_crossentropy(): enforce the existing constraint axis == -1 in the wrapper so symbolic inputs don’t bypass it.

I am a human, and not a bot.
I will be responsible for responding to review comments in a timely manner.
I will work with the maintainers to push this PR forward until submission.

gemini-code-assist

Code Review

This pull request introduces input validation and axis canonicalization for several image, neural network, and numpy operations to improve API reliability. However, the logic used to determine tensor rank is fragile, as it fails for Python lists and causes crashes for KerasTensor instances with unknown rank. Additionally, the roll operation incorrectly validates the shift argument by not supporting broadcasting, and some manual checks are redundant with existing utility functions. The feedback highlights the need for more robust tensor conversion and consistent use of backend utilities to handle all valid input types and edge cases without crashing.

gemini-code-assist · 2026-03-31T09:42:23Z

+    ndim = len(getattr(x, "shape", []))
+    if isinstance(axis, int):
+        if axis < -ndim or axis >= ndim:
+            raise ValueError(
+                f"axis {axis} is out of bounds for array of dimension {ndim}"
+            )
+        axis = axis if axis >= 0 else axis + ndim
+    elif isinstance(axis, tuple):
+        canonical_axis = []
+        for a in axis:
+            if not isinstance(a, int):
+                raise TypeError(
+                    "Argument `axis` must be an integer or tuple of integers. "
+                    f"Received: axis={axis}"
+                )
+            if a < -ndim or a >= ndim:
+                raise ValueError(
+                    f"axis {a} is out of bounds for array of dimension {ndim}"
+                )
+            a = a if a >= 0 else a + ndim
+            canonical_axis.append(a)
+        axis = tuple(canonical_axis)


Using len(getattr(x, "shape", [])) to determine rank is problematic for two reasons:

It returns 0 for Python lists (which lack a .shape attribute), causing valid calls like log_softmax([1.0, 2.0]) to fail with an out-of-bounds error.

It will crash with a TypeError if x.shape is None, which is the case for KerasTensor instances with unknown rank.

It is recommended to use backend.convert_to_tensor(x) first and then check if x.shape is not None before validating the axis.
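
The two failure modes can be reproduced without Keras. In this sketch, `UnknownRank` is a hypothetical stand-in for a symbolic tensor whose shape is None, and `static_ndim` is an illustrative helper in which counting list nesting plays the role of `convert_to_tensor`; neither name comes from the PR.

```python
class UnknownRank:
    """Hypothetical stand-in for a symbolic tensor with unknown rank."""

    shape = None


def fragile_ndim(x):
    # The pattern from the diff above.
    return len(getattr(x, "shape", []))


def static_ndim(x):
    """Return the rank when it is statically known, else None."""
    if isinstance(x, (int, float)):
        return 0
    if isinstance(x, (list, tuple)):
        # Stand-in for convert_to_tensor: count the nesting depth.
        depth = 0
        while isinstance(x, (list, tuple)):
            depth += 1
            if not x:
                break
            x = x[0]
        return depth
    shape = getattr(x, "shape", None)
    return None if shape is None else len(shape)
```

Here `fragile_ndim([1.0, 2.0])` reports rank 0 for a rank-1 list and `fragile_ndim(UnknownRank())` raises a `TypeError`, whereas `static_ndim` returns 1 and None respectively, so a caller can skip axis validation when the rank is unknown.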

References

APIs should be consistent with established conventions (NumPy) and handle common input types like lists gracefully. ^(link)

gemini-code-assist · 2026-03-31T09:42:23Z

+    ndim = len(getattr(x, "shape", []))
+    if not isinstance(axis, int):
+        raise TypeError(
+            f"Argument `axis` must be an integer. Received: axis={axis}"
+        )
+    if axis < -ndim or axis >= ndim:
+        raise ValueError(
+            f"axis {axis} is out of bounds for array of dimension {ndim}"
+        )
+    axis = axis if axis >= 0 else axis + ndim


This block has the same ndim detection issue as log_softmax. It will fail for Python lists and crash for tensors with unknown rank. Please convert x to a tensor and verify x.shape is not None before proceeding with validation.

gemini-code-assist · 2026-03-31T09:42:23Z

+        ndim = len(getattr(x, "shape", []))
+        if isinstance(axis, int):
+            canonicalize_axis(axis, ndim)
+        elif isinstance(axis, (tuple, list)):
+            for a in axis:
+                if not isinstance(a, int):
+                    raise TypeError(
+                        "Argument `axis` must be an integer or a sequence of "
+                        f"integers. Received: axis={axis}"
+                    )
+                canonicalize_axis(a, ndim)
+        else:
+            raise TypeError(
+                "Argument `axis` must be an integer, a sequence of integers, "
+                f"or `None`. Received: axis={axis}"
+            )


The ndim detection here fails for Python lists and crashes for tensors with unknown rank. Additionally, this manual validation logic is redundant with backend_utils.canonicalize_axis. Consider converting x to a tensor first and using the utility function safely.

gemini-code-assist · 2026-03-31T09:42:23Z

+    ndim = len(getattr(x, "shape", []))
+    if axis is None:
+        if isinstance(shift, (tuple, list)):
+            raise ValueError(
+                "When `axis` is `None`, `shift` must be an integer. "
+                f"Received: shift={shift}"
+            )
+    elif isinstance(axis, int):
+        canonicalize_axis(axis, ndim)
+        if isinstance(shift, (tuple, list)):
+            raise ValueError(
+                "When `axis` is an integer, `shift` must be an integer. "
+                f"Received: shift={shift}"
+            )
+    elif isinstance(axis, (tuple, list)):
+        for a in axis:
+            if not isinstance(a, int):
+                raise TypeError(
+                    "Argument `axis` must be an integer or a sequence of "
+                    f"integers. Received: axis={axis}"
+                )
+            canonicalize_axis(a, ndim)
+        if not isinstance(shift, (tuple, list)) or len(shift) != len(axis):
+            raise ValueError(
+                "`shift` and `axis` must have the same size. "
+                f"Received: shift={shift}, axis={axis}"
+            )
+        for s in shift:
+            if not isinstance(s, int):
+                raise TypeError(
+                    "Argument `shift` must be an integer or a sequence of "
+                    f"integers. Received: shift={shift}"
+                )
+    else:
+        raise TypeError(
+            "Argument `axis` must be an integer, a sequence of integers, "
+            f"or `None`. Received: axis={axis}"
+        )


There are two issues in this block:

The ndim detection fails for lists and crashes on unknown rank tensors.

The logic for shift validation when axis is a sequence (lines 7054-7058) is incorrect. It currently raises a ValueError if shift is a single integer but axis is a tuple/list. NumPy allows a single integer shift to be broadcast across all specified axes.

Suggested fix for the shift logic:

    if isinstance(shift, (tuple, list)):
        if len(shift) != len(axis):
            raise ValueError(
                "`shift` and `axis` must have the same size. "
                f"Received: shift={shift}, axis={axis}"
            )
        for s in shift:
            if not isinstance(s, int):
                raise TypeError(
                    "Argument `shift` must be an integer or a sequence of "
                    f"integers. Received: shift={shift}"
                )
    elif not isinstance(shift, int):
        raise TypeError(
            "Argument `shift` must be an integer or a sequence of "
            f"integers. Received: shift={shift}"
        )
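
One way to read the broadcasting rule described in this comment is as a pairing step: each axis gets its own shift, and a single integer shift is repeated for every axis. The helper name `pair_shifts_with_axes` is hypothetical, and the sketch covers only the case where `axis` is a sequence.

```python
def pair_shifts_with_axes(shift, axis):
    """Pair each axis with its shift, broadcasting a single int shift."""
    axes = tuple(axis)
    if isinstance(shift, int):
        # A lone integer applies to every listed axis.
        shifts = (shift,) * len(axes)
    elif isinstance(shift, (tuple, list)):
        shifts = tuple(shift)
        if len(shifts) != len(axes):
            raise ValueError(
                "`shift` and `axis` must have the same size. "
                f"Received: shift={shift}, axis={axis}"
            )
    else:
        raise TypeError(
            "Argument `shift` must be an integer or a sequence of "
            f"integers. Received: shift={shift}"
        )
    for s in shifts:
        if not isinstance(s, int):
            raise TypeError(
                "Argument `shift` must be an integer or a sequence of "
                f"integers. Received: shift={shift}"
            )
    return tuple(zip(shifts, axes))
```

With this reading, `pair_shifts_with_axes(1, (0, 1))` yields `((1, 0), (1, 1))` instead of raising.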

gemini-code-assist · 2026-03-31T09:42:23Z

+    ndim = len(getattr(x, "shape", []))
+    axis1 = canonicalize_axis(axis1, ndim)
+    axis2 = canonicalize_axis(axis2, ndim)
+    if axis1 == axis2:
+        raise ValueError(
+            f"axis1 and axis2 must be different. Received: axis1={axis1}, "
+            f"axis2={axis2}"
+        )


Same ndim detection issue as other ops: this will fail for Python lists and crash for tensors with unknown rank. Please ensure x is converted to a tensor and its shape is verified before calling canonicalize_axis.

gemini-code-assist · 2026-03-31T09:42:23Z

+        ndim = len(getattr(x, "shape", []))
+        axis1 = canonicalize_axis(self.axis1, ndim)
+        axis2 = canonicalize_axis(self.axis2, ndim)
+        if axis1 == axis2:
+            raise ValueError(
+                f"axis1 and axis2 must be different. Received: "
+                f"axis1={self.axis1}, axis2={self.axis2}"
+            )


In compute_output_spec, x is a KerasTensor. If the rank is unknown (x.shape is None), len(getattr(x, "shape", [])) will crash. Since trace requires at least 2 dimensions, you should handle the None shape case by returning a KerasTensor with unknown shape or raising a more descriptive error if rank must be known.
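
A minimal sketch of the first option (propagating an unknown shape rather than raising) could look like the following. It works on plain shape tuples, and `trace_output_shape` is a hypothetical name; this is not the `Trace.compute_output_spec` from the PR.

```python
def trace_output_shape(shape, axis1=0, axis2=1):
    """Infer the output shape of a trace over axis1/axis2.

    `shape` is a tuple (entries may be None for unknown sizes) or None
    when the rank itself is unknown.
    """
    if shape is None:
        # Unknown rank in, unknown shape out: nothing to validate yet.
        return None
    ndim = len(shape)
    if ndim < 2:
        raise ValueError(
            f"trace requires at least 2 dimensions. Received: shape={shape}"
        )
    canonical = []
    for name, axis in (("axis1", axis1), ("axis2", axis2)):
        if not -ndim <= axis < ndim:
            raise ValueError(
                f"{name}={axis} is out of bounds for array of dimension {ndim}"
            )
        canonical.append(axis % ndim)
    if canonical[0] == canonical[1]:
        raise ValueError(
            "axis1 and axis2 must be different. Received: "
            f"axis1={axis1}, axis2={axis2}"
        )
    # The two traced axes are removed from the output shape.
    return tuple(d for i, d in enumerate(shape) if i not in canonical)
```

Note that comparing the canonical values catches pairs such as `axis1=0, axis2=-2` on a rank-2 input, which refer to the same axis.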

codecov-commenter · 2026-03-31T10:10:27Z

Codecov Report

❌ Patch coverage is 59.43396% with 86 lines in your changes missing coverage. Please review.
✅ Project coverage is 83.01%. Comparing base (a2e97e1) to head (94b9793).
⚠️ Report is 88 commits behind head on master.

Files with missing lines	Patch %	Lines
keras/src/ops/image.py	31.91%	16 Missing and 16 partials ⚠️
keras/src/ops/numpy.py	66.27%	16 Missing and 13 partials ⚠️
keras/src/ops/nn.py	64.81%	8 Missing and 11 partials ⚠️
keras/src/utils/rng_utils.py	60.00%	2 Missing and 2 partials ⚠️
keras/src/ops/operation_utils.py	81.81%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##           master   #22595      +/-   ##
==========================================
- Coverage   83.26%   83.01%   -0.26%     
==========================================
  Files         596      596              
  Lines       67828    69050    +1222     
  Branches    10562    10855     +293     
==========================================
+ Hits        56480    57322     +842     
- Misses       8605     8861     +256     
- Partials     2743     2867     +124

Flag	Coverage Δ
keras	`82.83% <59.43%> (-0.26%)`	⬇️
keras-jax	`59.01% <58.49%> (-0.81%)`	⬇️
keras-numpy	`54.85% <58.49%> (+0.41%)`	⬆️
keras-openvino	`59.32% <58.49%> (+7.62%)`	⬆️
keras-tensorflow	`60.57% <59.43%> (-0.57%)`	⬇️
keras-torch	`59.34% <58.49%> (-0.66%)`	⬇️


ChiragSW · 2026-04-01T05:28:26Z

There are some crashes and bugs that need to be fixed. I will look into them soon.

ChiragSW · 2026-04-02T15:47:43Z

Summary of new changes

Updated many ops to stop using len(getattr(x, "shape", [])) for rank detection, since it breaks on Python lists (no .shape) and can crash when x.shape is None for unknown-rank symbolic tensors.
log_softmax, sparsemax, flip, roll, and trace now convert x to a tensor and canonicalize the axis only when the rank is actually known, so those failures do not happen.
Also fixed roll’s validation so that a single integer shift is allowed when axis is a tuple or list.
Trace.compute_output_spec now handles unknown-rank symbolic inputs by raising an error instead of crashing.

ChiragSW · 2026-04-02T19:28:34Z

The changes resolve the issues. Please review @keerthanakadiri @hertschuh

hertschuh

I'm not following the context of this. It seems unrelated to the linked bug.

Are we missing some validation for the axis argument in some places?

If so, can you add tests with self.assertRaises to demonstrate the problem.

Also, there should never be any code in between if any_symbolic_tensors(...) and the call to symbolic_call. Some validation can happen before the if. But some validation has to happen in the backend-specific implementation after convert_to_tensor.
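
For illustration, a test of the kind requested here might be shaped like this. It uses plain `unittest` and a hypothetical `check_axis` validator so that it runs on its own; it is not one of the tests later added to the PR.

```python
import unittest


def check_axis(axis, ndim):
    """Hypothetical validator: reject an axis outside [-ndim, ndim)."""
    if not -ndim <= axis < ndim:
        raise ValueError(
            f"axis {axis} is out of bounds for array of dimension {ndim}"
        )
    return axis % ndim


class CheckAxisTest(unittest.TestCase):
    def test_out_of_bounds_axis_raises(self):
        # axis=-3 on a rank-2 input is the case cited in the PR description.
        with self.assertRaises(ValueError):
            check_axis(-3, ndim=2)

    def test_valid_negative_axis_is_canonicalized(self):
        self.assertEqual(check_axis(-1, ndim=2), 1)
```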

hertschuh · 2026-04-01T01:25:22Z

+    if isinstance(axis, int):
+        if axis < -ndim or axis >= ndim:
+            raise ValueError(
+                f"axis {axis} is out of bounds for array of dimension {ndim}"
+            )
+        axis = axis if axis >= 0 else axis + ndim


Use canonicalize_axis for this.

ChiragSW · 2026-04-04T09:47:20Z

The problem is that Keras ops have two execution paths, symbolic and eager. Validation checks lived only in the eager path, so invalid inputs via keras.Input would silently pass through without any error.

Fix:

Run shared validation before the any_symbolic_tensors branch when rank is statically knowable
Keep validation after convert_to_tensor on the eager path
For purely symbolic cases, put validation inside compute_output_spec

Tests added
Tests were added across nn_test.py, numpy_test.py, image_test.py, and operation_utils_test.py, all asserting that bad axis or rank inputs on keras.Input now correctly raise ValueError.

ChiragSW · 2026-04-09T15:46:34Z

The issue has been fixed. Please review @hertschuh

hertschuh

First, this PR is combining a few unrelated things into one. Please split this PR into at least 3 separate PRs:

the image padding / cropping validation
the axis verification / canonicalization
the RNG seed generator changes (and I don't understand the context of these changes)

Then, I see a lot of code duplication like this:

    if axis is not None:
        ndim = get_static_tensor_ndim(x)
        if isinstance(axis, int):
            if ndim is not None:
                canonicalize_axis(axis, ndim)
        elif isinstance(axis, (tuple, list)):
            for a in axis:
                if not isinstance(a, int):
                    raise TypeError(
                        "Argument `axis` must be an integer or a sequence "
                        f"of integers. Received: axis={axis}"
                    )
                if ndim is not None:
                    canonicalize_axis(a, ndim)
        else:
            raise TypeError(
                "Argument `axis` must be an integer, a sequence of "
                f"integers, or `None`. Received: axis={axis}"
            )

Please create a helper function called canonicalize_axes after canonicalize_axis. It will take an int or a list of ints and always return a tuple of ints after doing the validation.
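
A minimal sketch of such a helper, assuming the rank is already known: the local `canonicalize_axis` below is a stand-in written for this example rather than the Keras utility, and the error messages are illustrative.

```python
def canonicalize_axis(axis, num_dims):
    """Local stand-in: map a possibly negative axis into [0, num_dims)."""
    if not -num_dims <= axis < num_dims:
        raise ValueError(
            f"axis {axis} is out of bounds for array of dimension {num_dims}"
        )
    return axis + num_dims if axis < 0 else axis


def canonicalize_axes(axes, num_dims):
    """Accept an int or a sequence of ints; always return a tuple of ints."""
    if isinstance(axes, int):
        axes = (axes,)
    elif not isinstance(axes, (tuple, list)):
        raise TypeError(
            "Argument `axis` must be an integer or a sequence of "
            f"integers. Received: axis={axes}"
        )
    for a in axes:
        if not isinstance(a, int):
            raise TypeError(
                "Argument `axis` must be an integer or a sequence of "
                f"integers. Received: axis={axes}"
            )
    return tuple(canonicalize_axis(a, num_dims) for a in axes)
```

Call sites then shrink to a single line such as `axes = canonicalize_axes(axis, ndim)`, with the `axis is None` and unknown-rank cases decided by the caller.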

Next, I see this pattern a lot:

# Some validation
if any_symbolic_tensors(...):
   ...
x = convert_to_tensor(x)
# The same validation

This is not a pattern that we should use:

It creates a ton of code duplication
We should let the backend implementation do the x = convert_to_tensor(x) part; sometimes the backend implementation needs to look at the type or value of x before converting it to a tensor

If you can't do the validation because you don't fully know the shape of the input, it means it shouldn't be here. It's OK to move the validation into the backend-specific implementations (as long as it's factored as a one-liner) and into symbolic_call.
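
The layout described in this comment might be sketched as follows, again with hypothetical stand-ins (`SymbolicTensor`, `check_axis`, `_symbolic_flip`, `_backend_flip`) instead of real Keras code. Argument-only checks sit before the branch, shape-dependent checks are a shared one-liner called from the symbolic and backend paths, and nothing sits between the `if` and the symbolic call.

```python
class SymbolicTensor:
    """Hypothetical stand-in for a symbolic tensor."""

    def __init__(self, shape):
        self.shape = shape


def check_axis(axis, ndim):
    """Shared one-liner helper for shape-dependent validation."""
    if axis is not None and not -ndim <= axis < ndim:
        raise ValueError(
            f"axis {axis} is out of bounds for array of dimension {ndim}"
        )


def _symbolic_flip(x, axis):
    # Plays the role of the symbolic path: validate only if rank is known.
    if x.shape is not None:
        check_axis(axis, len(x.shape))
    return SymbolicTensor(x.shape)


def _backend_flip(x, axis):
    x = list(x)  # stand-in for convert_to_tensor, owned by the backend
    check_axis(axis, 1)  # this toy backend only handles rank-1 inputs
    return x[::-1]


def flip(x, axis=None):
    # Argument-only check: needs no shape, so it can sit before the branch.
    if axis is not None and not isinstance(axis, int):
        raise TypeError(f"`axis` must be an int or None. Received: axis={axis}")
    if isinstance(x, SymbolicTensor):
        return _symbolic_flip(x, axis)
    return _backend_flip(x, axis)
```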

hertschuh · 2026-04-15T17:35:17Z

+        and hasattr(images, "_keras_history")
+        and images._keras_history.operation.__class__.__name__ == "InputLayer"


I don't understand why you are looking at the keras history, you should never have to do that. Also, you shouldn't even care if it's a keras tensor or a normal tensor.

hertschuh · 2026-04-15T17:37:36Z

+import keras
 from keras.src import backend


Do not import keras in unit tests; please import the feature from keras.src.

ChiragSW · 2026-04-16T18:13:02Z

For now I will close this PR and put up 3 separate PRs. I will add details on what each PR fixes. Should I go ahead with this process, @hertschuh?
The 3 PRs will be:

image padding / cropping validation
axis verification / canonicalization
RNG seed generator changes

hertschuh · 2026-04-17T21:56:23Z

Yes, thank you!

[Fix] issues mentioned in comments of keras-team#22472

9f67f3c

google-ml-butler bot added the size:M label Mar 31, 2026

google-ml-butler bot assigned gbaned Mar 31, 2026

gemini-code-assist bot reviewed Mar 31, 2026

keerthanakadiri added the stat:awaiting response from contributor label Apr 2, 2026

ChiragSW added 2 commits April 2, 2026 21:09

resolved the critical issues

5ef5178

format changes

9d1e08d

google-ml-butler bot removed the stat:awaiting response from contributor label Apr 2, 2026

ChiragSW added 2 commits April 2, 2026 21:37

refactored ops

71b6627

changes to resolve errors

330918b

hertschuh requested changes Apr 3, 2026

hertschuh added the stat:awaiting response from contributor label Apr 3, 2026

ChiragSW added 2 commits April 4, 2026 15:10

adding tests and some fixes

8cef376

format changes

5052700

google-ml-butler bot removed the stat:awaiting response from contributor label Apr 4, 2026

keerthanakadiri added the stat:awaiting response from contributor label Apr 7, 2026

ChiragSW added 4 commits April 9, 2026 11:27

fixed the api error

8752b30

test case failure fixes

8be0425

reformats

1d0c1dc

numpy issues fixed

94b9793

google-ml-butler bot removed the stat:awaiting response from contributor label Apr 9, 2026

ChiragSW requested a review from hertschuh April 10, 2026 20:52

google-ml-butler bot added the awaiting review label Apr 10, 2026

hertschuh requested changes Apr 15, 2026

hertschuh added stat:awaiting response from contributor and removed awaiting review labels Apr 15, 2026

google-ml-butler bot removed the stat:awaiting response from contributor label Apr 16, 2026

hertschuh closed this Apr 17, 2026

This was referenced Apr 18, 2026

[Fix] Image padding / cropping validation #22707

Open

[Fix] Refactor axis validation and canonicalization for targeted review #22708

Open

[Fix] Isolate RNG seed generator changes and clarify documentation #22709

Open
