Use float32 for LoRA weights to avoid the risk of underflow and overflow. #22559
james77777778 wants to merge 3 commits into keras-team:master
Conversation
Code Review
This pull request updates the LoRA implementation across several layers—including Convolutional, Dense, EinsumDense, and Embedding—to ensure that LoRA weights are initialized as float32 to prevent numerical instability. It also introduces explicit casting to the appropriate variable or compute dtypes during kernel composition and forward passes. A critical issue was identified in the EinsumDense layer where a trailing comma incorrectly converts the LoRA update into a tuple, which will cause a TypeError during tensor operations.
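As a hypothetical illustration of the trailing-comma issue flagged above (variable names are illustrative, not the actual Keras internals): a stray trailing comma wraps the LoRA update in a 1-tuple instead of leaving it a tensor, which then fails in downstream tensor ops.

```python
import numpy as np

# Illustrative LoRA factors (names are made up for this sketch).
lora_kernel_a = np.ones((4, 2), dtype=np.float32)
lora_kernel_b = np.ones((2, 3), dtype=np.float32)

delta_ok = lora_kernel_a @ lora_kernel_b    # ndarray of shape (4, 3)
delta_bad = lora_kernel_a @ lora_kernel_b,  # trailing comma -> 1-tuple!

print(type(delta_ok).__name__)   # ndarray
print(type(delta_bad).__name__)  # tuple
```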
Codecov Report ✅ All modified and coverable lines are covered by tests.

```
@@            Coverage Diff            @@
##           master   #22559     +/-  ##
========================================
  Coverage   82.95%   82.96%
========================================
  Files         596      596
  Lines       69252    69259       +7
  Branches    10814    10814
========================================
+ Hits        57451    57458       +7
  Misses       8973     8973
  Partials     2828     2828
```
PR rebased. The openvino test failure should be unrelated to this PR.
Thanks for the PR; it's very well written. Please check my minor comment about this.
```python
        "lora is already enabled. This can only be done once per layer."
    )
self._tracker.unlock()
```
|
|
The PR itself is fine; it's just worth adding a note that users should merge the LoRA weights into the kernel before deploying for inference.
Sure. The comments have been updated:
```python
# LoRA weights should be float32 to avoid the risk of underflow or
# overflow during fine-tuning.
# When deploying the model, these weights should be merged with the
# original kernel while maintaining the original kernel's dtype.
```
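The deployment advice above can be sketched as follows (a minimal NumPy sketch, not Keras' actual merge code; `scale`, `lora_a`, and `lora_b` are illustrative): the LoRA factors live in float32, the update is composed in float32, and the merged result is cast back to the base kernel's dtype.

```python
import numpy as np

rng = np.random.default_rng(0)
kernel = rng.standard_normal((8, 8)).astype(np.float16)  # low-precision base kernel
lora_a = rng.standard_normal((8, 2)).astype(np.float32)  # LoRA factors kept in float32
lora_b = rng.standard_normal((2, 8)).astype(np.float32)
scale = 1.0 / 2  # alpha / rank, illustrative values

delta = scale * (lora_a @ lora_b)  # composed in float32 for stability
# Merge in float32, then cast back to the original kernel's dtype.
merged = (kernel.astype(np.float32) + delta).astype(kernel.dtype)
print(merged.dtype)  # float16
```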
Add notes for deploying with lora weights.
Description
As reported in keras-team/keras-hub#2629
We should use high precision (float32) for LoRA weights to stabilize fine-tuning.
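A quick illustration of why (illustrative only, not from the PR): gradient updates on the order of 1e-8 underflow to zero in float16, whose smallest positive subnormal is about 6e-8, but are representable in float32.

```python
import numpy as np

update = 1e-8  # a tiny fine-tuning update
print(np.float16(update))  # 0.0 -- the update vanishes in float16
print(np.float32(update))  # 1e-08 -- preserved in float32
```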
References:
- huggingface/peft: `autocast_adapter_dtype` (impl)

Contributor Agreement
Please check all boxes below before submitting your PR for review: