2.4.15-pre7 Severe VM Bugs in 2.4.15-pre7

2.4.15-pre7 Severe VM Bugs in 2.4.15-pre7

Post by Jeff V. Merke » Thu, 22 Nov 2001 14:30:11



More Info.  The 3Ware driver appears to be corrupting memory.  I
did more checks.  This driver will bark if the memory passed
is not 4K aligned.  I do not know if this is specific to this
driver, or if it is related to the recent VM issues.

When I pass 4K aligned memory, the problem disappears.  The IDE and
standard SCSI drivers do not exhibit this behavior.  2.4.15-pre6
does not seem to exhibit this behavior.

Please advise,

Jeff



> > On Tue, 20 Nov 2001 23:14:49 -0700,

> > >ksymoops 2.4.0 on i686 2.4.15-pre7.  Options used
> > >     -m /boot/System.map-2.4.15-pre7 (default)
> > >Error (regular_file): read_system_map stat /boot/System.map-2.4.15-pre7 failed

> > Without a valid System.map, the decode is going to be very vague.  When
> > you see offset 5207 in code sized 32410 you know you don't have enough
> > detail.  Try the decode again with a valid System.map.

> Got it.  I need to check my build scripts and figure out why it did
> not copy the System.Map to /boot/.  I updated the file, and you
> were correct.  The oops looks more readable.  Getting a crash in
> the VM when a kfree() is attempted.  We are getting a page fault, it
> appears.

> Jeff

> ksymoops 2.4.0 on i686 2.4.15-pre7.  Options used
>      -V (default)
>      -k /proc/ksyms (default)
>      -l /proc/modules (default)
>      -o /lib/modules/2.4.15-pre7/ (default)
>      -m /boot/System.map-2.4.15-pre7 (default)

> Warning: You did not tell me where to find symbol information.  I will
> assume that the log matches the kernel and modules that are running
> right now and I'll use the default options above for symbol resolution.
> If the current kernel and/or modules do not match the log, you can get
> more accurate output by telling me the kernel version and where to find
> map, modules, ksyms etc.  ksymoops -h explains the options.

> No modules in ksyms, skipping objects
> Warning (read_lsmod): no symbols in lsmod, is /proc/modules a valid lsmod file?
> Nov 20 09:58:50 scimitar kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000008
> Nov 20 09:58:50 scimitar kernel: c0172f1b
> Nov 20 09:58:50 scimitar kernel: *pde = 00000000
> Nov 20 09:58:50 scimitar kernel: Oops: 0000
> Nov 20 09:58:50 scimitar kernel: CPU:    1
> Nov 20 09:58:50 scimitar kernel: EIP:    0010:[<c0172f1b>]    Not tainted
> Using defaults from ksymoops -t elf32-i386 -a i386
> Nov 20 09:58:50 scimitar kernel: EFLAGS: 00010246
> Nov 20 09:58:50 scimitar kernel: eax: 00000000   ebx: cea30000   ecx: cea30018   edx: 000002b8
> Nov 20 09:58:50 scimitar kernel: esi: 00000000   edi: 00000000   ebp: 0000002f   esp: cead5ab8
> Nov 20 09:58:50 scimitar kernel: ds: 0018   es: 0018   ss: 0018
> Nov 20 09:58:50 scimitar kernel: Process mount (pid: 720, stackpage=cead5000)
> Nov 20 09:58:50 scimitar kernel: Stack: cea30018 c0173e29 cea30018 00000080 00000009 cea30018 00000003 00000004
> Nov 20 09:58:50 scimitar kernel:        000000ae 09c23fff 00000000 c1556600 00000000 00000000 00000009 c019b9bb
> Nov 20 09:58:50 scimitar kernel:        cead5b28 00001000 cead5b5c 00001000 c030f4fc 00000009 c0172e39 c0172e7f
> Nov 20 09:58:50 scimitar kernel: Call Trace: [<c0173e29>] [<c019b9bb>] [<c0172e39>] [<c0172e7f>] [<c019a64a>]
> Nov 20 09:58:50 scimitar kernel:    [<c0124382>] [<c014463c>] [<c013c560>] [<c0295e70>] [<c01124ba>] [<c0176127>]
> Nov 20 09:58:50 scimitar kernel:    [<c0124d91>] [<c0124da2>] [<c0112320>] [<c010700c>] [<c0124804>] [<c0112320>]
> Nov 20 09:58:50 scimitar kernel:    [<c010700c>] [<c02949fb>] [<c014bf8c>] [<c014cf8f>] [<c013c560>] [<c012d9e1>]
> Nov 20 09:58:50 scimitar kernel:    [<c01241ba>] [<c0124214>] [<c0124382>] [<c012779c>] [<c01277c9>] [<c01124ba>]
> Nov 20 09:58:50 scimitar kernel:    [<c0128742>] [<c0183a39>] [<c01277c9>] [<c0154922>] [<c01125f0>] [<c0294922>]
> Nov 20 09:58:50 scimitar kernel:    [<c0137fe1>] [<c0138460>] [<c013894f>] [<c0148f53>] [<c0112320>] [<c010700c>]
> Nov 20 09:58:50 scimitar kernel:    [<c01491ec>] [<c014905c>] [<c01492b0>] [<c0106f1b>]
> Nov 20 09:58:50 scimitar kernel: Code: 8b 50 08 2b 51 f0 89 50 08 8b 41 f4 ff 48 04 8b 41 f8 83 f8

> >>EIP; c0172f1b <NWFSFree+b/e0>   <=====
> Trace; c0173e29 <ScanDiskDevices+449/460>
> Trace; c019b9bb <nwvp_part_scan+1b/4d0>
> Trace; c0172e39 <NWFSIOAlloc+19/70>
> Trace; c0172e7f <NWFSIOAlloc+5f/70>
> Trace; c019a64a <nwvp_scan_routine+8a/600>
> Trace; c0124382 <handle_mm_fault+62/d0>
> Trace; c014463c <dput+1c/150>
> Trace; c013c560 <cached_lookup+10/50>
> Trace; c0295e70 <rb_insert_color+70/f0>
> Trace; c01124ba <do_page_fault+19a/4e0>
> Trace; c0176127 <NWFSVolumeScan+57/180>
> Trace; c0124d91 <do_mmap_pgoff+421/4f0>
> Trace; c0124da2 <do_mmap_pgoff+432/4f0>
> Trace; c0112320 <do_page_fault+0/4e0>
> Trace; c010700c <error_code+34/3c>
> Trace; c0124804 <__vma_link+64/c0>
> Trace; c0112320 <do_page_fault+0/4e0>
> Trace; c010700c <error_code+34/3c>
> Trace; c02949fb <clear_user+2b/40>
> Trace; c014bf8c <padzero+1c/20>
> Trace; c014cf8f <load_elf_binary+90f/a50>
> Trace; c013c560 <cached_lookup+10/50>
> Trace; c012d9e1 <__alloc_pages+41/180>
> Trace; c01241ba <do_anonymous_page+8a/b0>
> Trace; c0124214 <do_no_page+34/140>
> Trace; c0124382 <handle_mm_fault+62/d0>
> Trace; c012779c <filemap_nopage+bc/200>
> Trace; c01277c9 <filemap_nopage+e9/200>
> Trace; c01124ba <do_page_fault+19a/4e0>
> Trace; c0128742 <read_cache_page+42/120>
> Trace; c0183a39 <nwfs_read_super+79/1b0>
> Trace; c01277c9 <filemap_nopage+e9/200>
> Trace; c0154922 <ext2_get_page+22/80>
> Trace; c01125f0 <do_page_fault+2d0/4e0>
> Trace; c0294922 <__generic_copy_from_user+32/60>
> Trace; c0137fe1 <read_super+b1/140>
> Trace; c0138460 <get_sb_nodev+30/60>
> Trace; c013894f <do_kern_mount+df/150>
> Trace; c0148f53 <do_add_mount+23/e0>
> Trace; c0112320 <do_page_fault+0/4e0>
> Trace; c010700c <error_code+34/3c>
> Trace; c01491ec <do_mount+13c/160>
> Trace; c014905c <copy_mount_options+4c/a0>
> Trace; c01492b0 <sys_mount+a0/100>
> Trace; c0106f1b <system_call+33/38>
> Code;  c0172f1b <NWFSFree+b/e0>
> 00000000 <_EIP>:
> Code;  c0172f1b <NWFSFree+b/e0>   <=====
>    0:   8b 50 08                  mov    0x8(%eax),%edx   <=====
> Code;  c0172f1e <NWFSFree+e/e0>
>    3:   2b 51 f0                  sub    0xfffffff0(%ecx),%edx
> Code;  c0172f21 <NWFSFree+11/e0>
>    6:   89 50 08                  mov    %edx,0x8(%eax)
> Code;  c0172f24 <NWFSFree+14/e0>
>    9:   8b 41 f4                  mov    0xfffffff4(%ecx),%eax
> Code;  c0172f27 <NWFSFree+17/e0>
>    c:   ff 48 04                  decl   0x4(%eax)
> Code;  c0172f2a <NWFSFree+1a/e0>
>    f:   8b 41 f8                  mov    0xfffffff8(%ecx),%eax
> Code;  c0172f2d <NWFSFree+1d/e0>
>   12:   83 f8 00                  cmp    $0x0,%eax

> 2 warnings issued.  Results may not be reliable.

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in

More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/
 
 
 

1. 2.4.15-pre7 bzImage fails

2.4.14 patched to 2.4.15-pre7, on "make bzImage", failed.  if anyone wants
additional details, let me know.  i'm not particularly interested in it, the
es1371 was enabled by accident in a new configuration.  i've already fixed
it and moved on.

---

ld -m elf_i386 -T /usr/src/linux-2.4.15-pre7/arch/i386/vmlinux.lds -e stext arch/i386/kernel/head.o arch/i386/kernel/init_task.o init/main.o init/version.o \
        --start-group \
        arch/i386/kernel/kernel.o arch/i386/mm/mm.o kernel/kernel.o mm/mm.o fs/fs.o ipc/ipc.o \
         drivers/acpi/acpi.o drivers/parport/driver.o drivers/char/char.o drivers/block/block.o drivers/misc/misc.o drivers/net/net.o drivers/media/media.o drivers/char/agp/agp.o drivers/char/drm/drm.o drivers/ide/idedriver.o drivers/scsi/scsidrv.o drivers/cdrom/driver.o drivers/sound/sounddrivers.o drivers/pci/driver.o drivers/pnp/pnp.o drivers/video/video.o drivers/usb/usbdrv.o \
        net/network.o \
        /usr/src/linux-2.4.15-pre7/arch/i386/lib/lib.a /usr/src/linux-2.4.15-pre7/lib/lib.a /usr/src/linux-2.4.15-pre7/arch/i386/lib/lib.a \
        --end-group \
        -o vmlinux
drivers/sound/sounddrivers.o: In function `es1371_probe':
drivers/sound/sounddrivers.o(.text+0x103d7): undefined reference to `gameport_register_port'
drivers/sound/sounddrivers.o: In function `es1371_remove':
drivers/sound/sounddrivers.o(.text+0x1051b): undefined reference to `gameport_unregister_port'
make: *** [vmlinux] Error 1

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in

More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/

2. Where's in.ftpd?

3. 2.4.15-pre7

4. Runaway Processes

5. Kernel error during "umount" on ext3 with 2.4.15-pre7

6. Partition problem in Red hat Linux 7.2

7. [REISERFS TESTING] new patches on ftp.namesys.com: 2.4.15-pre7

8. OT: Connecting IRC servers via http?

9. 2.4.15-pre7 kmem_cache_create invalid opcode

10. OOM killer very aggressive on 2.4.15.pre7

11. new patches on ftp.namesys.com: 2.4.15-pre7

12. Merge BUG in 2.4.15-pre4 serial.c

13. 2.4.15 and GNU 3.0.2