CVE-2025-40303

Source
https://cve.org/CVERecord?id=CVE-2025-40303
Import Source
https://storage.googleapis.com/osv-test-cve-osv-conversion/osv-output/CVE-2025-40303.json
JSON Data
https://api.test.osv.dev/v1/vulns/CVE-2025-40303
Downstream
Related
Published
2025-12-08T00:46:27.820Z
Modified
2026-03-20T12:43:15.418148Z
Summary
btrfs: ensure no dirty metadata is written back for an fs with errors
Details

In the Linux kernel, the following vulnerability has been resolved:

btrfs: ensure no dirty metadata is written back for an fs with errors

[BUG] During development of a minor feature (make sure all btrfsbio::endio() is called in task context), I noticed a crash in generic/388, where metadata writes triggered new works after btrfsstopall_workers().

It turns out that it can even happen without any code modification, just using RAID5 for metadata and the same workload from generic/388 is going to trigger the use-after-free.

[CAUSE] If btrfs hits an error, the fs is marked as error, no new transaction is allowed thus metadata is in a frozen state.

But there are some metadata modifications before that error, and they are still in the btree inode page cache.

Since there will be no real transaction commit, all those dirty folios are just kept as is in the page cache, and they can not be invalidated by invalidateinodepages2() call inside close_ctree(), because they are dirty.

And finally after btrfsstopall_workers(), we call iput() on btree inode, which triggers writeback of those dirty metadata.

And if the fs is using RAID56 metadata, this will trigger RMW and queue new works into rmwworkers, which is already stopped, causing warning from queuework() and use-after-free.

[FIX] Add a special handling for writeoneeb(), that if the fs is already in an error state, immediately mark the bbio as failure, instead of really submitting them.

Then during close_ctree(), iput() will just discard all those dirty tree blocks without really writing them back, thus no more new jobs for already stopped-and-freed workqueues.

The extra discard in writeoneeb() also acts as an extra safenet. E.g. the transaction abort is triggered by some extent/free space tree corruptions, and since extent/free space tree is already corrupted some tree blocks may be allocated where they shouldn't be (overwriting existing tree blocks). In that case writing them back will further corrupting the fs.

Database specific
{
    "osv_generated_from": "https://github.com/CVEProject/cvelistV5/tree/main/cves/2025/40xxx/CVE-2025-40303.json",
    "cna_assigner": "Linux"
}
References

Affected packages

Git / git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git

Affected ranges

Type
GIT
Repo
https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git
Events
Introduced
13e6c37b989859e70b0d73d3f2cb0aa022159b17
Fixed
066ee13f05fbd82ada01883e51f0695172f98dff
Fixed
e2b3859067bf012d53c49b3f885fef40624a2c83
Fixed
54a5b5a15588e3b0b294df31474d08a2678d4291
Fixed
2618849f31e7cf51fadd4a5242458501a6d5b315

Database specific

source
"https://storage.googleapis.com/osv-test-cve-osv-conversion/osv-output/CVE-2025-40303.json"

Linux / Kernel

Package

Name
Kernel

Affected ranges

Type
ECOSYSTEM
Events
Introduced
3.10.0
Fixed
6.6.117
Type
ECOSYSTEM
Events
Introduced
6.7.0
Fixed
6.12.58
Type
ECOSYSTEM
Events
Introduced
6.13.0
Fixed
6.17.8

Database specific

source
"https://storage.googleapis.com/osv-test-cve-osv-conversion/osv-output/CVE-2025-40303.json"