remove hardcoded torch.backends.cuda.matmul.allow_tf32 = True by Parskatt · Pull Request #18 · naver/croco

Parskatt · 2024-01-30T16:31:16Z

allow_tf32 can screw up stuff that relies on high precision, so don't hardcode it.
In my case, it messed up torch.cdist so that a bunch of small values went to 0 (not good).
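
To illustrate how this kind of failure can arise, here is a minimal sketch in plain Python, not taken from the PR. It assumes the distance is computed in the matmul-based form `||x||^2 + ||y||^2 - 2 x.y` (which `torch.cdist` may use for larger inputs) and that only the dot product sees TF32-rounded inputs. The helper names `to_tf32`, `sq_dist_direct`, and `sq_dist_via_matmul` are hypothetical, and TF32 is emulated in software by rounding float32 values to 10 explicit mantissa bits; real hardware behaviour may differ in detail.

```python
import struct


def to_tf32(x: float) -> float:
    """Emulate TF32 input rounding: keep 10 explicit mantissa bits of a float32.

    Assumption: round-to-nearest on the float32 bit pattern; intended for
    finite values only (no NaN/inf handling).
    """
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    bits = (bits + 0x1000) & 0xFFFFE000  # round, then drop the low 13 mantissa bits
    return struct.unpack("<f", struct.pack("<I", bits))[0]


def sq_dist_direct(x, y):
    """Squared Euclidean distance computed from coordinate differences."""
    return sum((a - b) ** 2 for a, b in zip(x, y))


def sq_dist_via_matmul(x, y, round_inputs: bool):
    """Squared distance as ||x||^2 + ||y||^2 - 2 x.y, clamped at zero.

    When round_inputs is True, only the dot product uses TF32-rounded inputs,
    mimicking a reduced-precision matmul next to full-precision norms.
    """
    cast = to_tf32 if round_inputs else (lambda v: v)
    dot = sum(cast(a) * cast(b) for a, b in zip(x, y))
    xx = sum(a * a for a in x)
    yy = sum(b * b for b in y)
    return max(xx + yy - 2.0 * dot, 0.0)


x = (1.0, 1.0)
y = (1.0009, 1.0009)  # a nearby point; the true squared distance is tiny but nonzero

print("direct:            ", sq_dist_direct(x, y))
print("matmul form, exact:", sq_dist_via_matmul(x, y, round_inputs=False))
print("matmul form, TF32: ", sq_dist_via_matmul(x, y, round_inputs=True))
```

In this sketch the rounded dot product slightly overshoots, the subtraction goes negative, and the clamp turns the small distance into exactly 0, while the direct and unrounded forms keep it positive.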

dabeschte · 2025-08-18T10:58:50Z

@yocabon It would be great if we could get this PR in, or at least the allow_tf32 part.

I spent more than a day finding out why one (unrelated) model worked with CUDA 11.8 but no longer with CUDA >12. It turned out to be caused by the low-precision matmuls that were enabled in croco.py.
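
Because the flag is process-global, setting it at import time affects every model in the same process. One possible alternative, sketched here and not part of this PR, is to scope the setting and restore the previous value afterwards. The name `tf32_matmul` is hypothetical; the only PyTorch attribute used is `torch.backends.cuda.matmul.allow_tf32`, the flag this PR is about.

```python
import contextlib

import torch


@contextlib.contextmanager
def tf32_matmul(enabled: bool):
    """Temporarily set the TF32 matmul flag, then restore the previous value.

    Note: the flag is still process-global while the block runs, so this
    isolates callers in time, not across threads.
    """
    previous = torch.backends.cuda.matmul.allow_tf32
    torch.backends.cuda.matmul.allow_tf32 = enabled
    try:
        yield
    finally:
        # Restore even if the body raises, so other code sees its original setting.
        torch.backends.cuda.matmul.allow_tf32 = previous


# Usage: opt in to TF32 only around code that tolerates reduced precision.
a = torch.randn(4, 4)
b = torch.randn(4, 4)
with tf32_matmul(True):
    c = a @ b  # on CPU the flag has no effect; on Ampere+ GPUs matmuls may use TF32
```

With this pattern a library can request TF32 for its own forward pass without changing the precision of unrelated code that runs before or after it.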

Parskatt and others added 4 commits January 30, 2024 17:30

remove hardcoded torch.backends.cuda.matmul.allow_tf32 = True

6488d27

allow_tf32 can screw up stuff that relies on high precision, don't hardcode.

Merge branch 'naver:master' into master

eb28ee0

torch native attention

6311881

add many to one / one to many cross att

ee35847

Parskatt wants to merge 4 commits into naver:master from Parskatt:master
