Fix file-corrupting bug, kernels 2.5.41 to 2.5.44

Post by Andrew Morto » Tue, 29 Oct 2002 06:20:05



This patch fixes a filesystem corrupting bug, present in 2.5.41 through
2.5.44.  It can cause ext2 indirect blocks to not be written out.  A
fsck will fix it up.

Under heavy memory pressure a PF_MEMALLOC task attempts to write out a
blockdev page whose buffers are already under writeback and which were
dirtied while under writeback.

The writepage call returns -EAGAIN but because the caller is
PF_MEMALLOC, the page was not being marked dirty again.

The page sits on mapping->clean_pages forever and is not written out.

The fix is to mark that page dirty again for all callers, regardless of
PF_MEMALLOC state.
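
The ordering problem can be sketched in user-space C. This is only a toy model, not kernel code: struct toy_page, toy_writepage_busy() and the two toy_write_one_*() helpers are hypothetical names, and the assumption that the local page pointer is set to NULL exactly on the PF_MEMALLOC path is taken from the description above rather than from code shown in the patch.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for struct page: only the dirty bit matters here. */
struct toy_page {
    bool dirty;
};

/*
 * Models a writepage call that cannot proceed because the page's buffers
 * are already under writeback: it writes nothing and asks for a retry.
 */
static int toy_writepage_busy(struct toy_page *page)
{
    (void)page;
    return -EAGAIN;
}

/*
 * Pre-fix ordering (assumed): the redirty check runs after the
 * PF_MEMALLOC branch has set the local pointer to NULL, so it is skipped
 * and the page stays clean even though nothing was written.
 */
static int toy_write_one_old(struct toy_page *page, bool pf_memalloc)
{
    int ret;

    assert(page != NULL);
    page->dirty = false;        /* stands in for test_clear_page_dirty() */
    ret = toy_writepage_busy(page);
    if (pf_memalloc)
        page = NULL;            /* page handed off; pointer dropped */
    if (ret == -EAGAIN && page) {
        page->dirty = true;     /* stands in for __set_page_dirty_nobuffers() */
        ret = 0;
    }
    return ret;
}

/*
 * Post-fix ordering: redirty immediately after writepage returns -EAGAIN,
 * before any caller-specific handling, so every caller gets the same result.
 */
static int toy_write_one_new(struct toy_page *page, bool pf_memalloc)
{
    int ret;

    assert(page != NULL);
    (void)pf_memalloc;          /* no longer influences the redirty step */
    page->dirty = false;
    ret = toy_writepage_busy(page);
    if (ret == -EAGAIN) {
        page->dirty = true;
        ret = 0;
    }
    return ret;
}
```

Calling toy_write_one_old() with pf_memalloc set leaves the toy page clean although no I/O was issued, which is the lost-write case; toy_write_one_new() leaves it dirty for both kinds of caller.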

 fs/mpage.c |    8 ++++----
 1 files changed, 4 insertions(+), 4 deletions(-)

--- 25/fs/mpage.c~set_page_dirty-pf_memalloc    Sun Oct 27 19:22:44 2002

                                        test_clear_page_dirty(page)) {
                        if (writepage) {
                                ret = (*writepage)(page);
+                               if (ret == -EAGAIN) {
+                                       __set_page_dirty_nobuffers(page);
+                                       ret = 0;
+                               }
                        } else {
                                bio = mpage_writepage(bio, page, get_block,

                                        pagevec_deactivate_inactive(&pvec);
                                page = NULL;
                        }
-                       if (ret == -EAGAIN && page) {
-                               __set_page_dirty_nobuffers(page);
-                               ret = 0;
-                       }
                        if (ret || (--(wbc->nr_to_write) <= 0))
                                done = 1;
                        if (wbc->nonblocking && bdi_write_congested(bdi)) {

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in

More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

Post by Jens Axboe » Tue, 29 Oct 2002 13:10:08



> This patch fixes a filesystem corrupting bug, present in 2.5.41 through
> 2.5.44.  It can cause ext2 indirect blocks to not be written out.  A
> fsck will fix it up.

> Under heavy memory pressure a PF_MEMALLOC task attempts to write out a
> blockdev page whose buffers are already under writeback and which were
> dirtied while under writeback.

*, it's odd how no one else seems to have noticed this?

> The writepage call returns -EAGAIN but because the caller is
> PF_MEMALLOC, the page was not being marked dirty again.

> The page sits on mapping->clean_pages forever and is not written out.

> The fix is to mark that page dirty again for all callers, regardless of
> PF_MEMALLOC state.

I can confirm that this fixes my 'losing data under vm pressure' bug,
both for the O_DIRECT case and sgio. It passed 1 iteration of both tests,
where it would not even get past the 10% mark before. Thanks!

BTW, 2.5.44-mm6 showed some funnies and corrupted data in other
interesting ways. I'm hesitant to report this as a bug right now, as it
may just have been that the target fs had not been fsck'ed after being
run under one of the buggy kernels. But it did crash in the end, dumping
lots of hot/cold warnings. The above verification was run under 2.5.44 +
sgio patches + your standalone __set_page_dirty() fix.

--
Jens Axboe
