NFS crash (2.4; probably symlink-related)

From: Jan Kasprzak (kas@informatics.muni.cz)
Date: Mon Jun 12 2000 - 15:08:48 EST


        Hello NFS/VFS hackers,

        one of my NFS-servers crashes very often and the crash seems to be
NFS-related. The server keeps uptime of several days, usually no longer
than one week. I think it is because of my not-so-common setup:

        I have a directory /export/packages.run.linux, which is NFS-exported
(/export is a separate volume different from /). I have a
symbolic link /packages.run.linux -> /export/packages.run.linux
for historical reasons. The source of problems may be that some NFS clients
mount the directory under the name server:/packages.run.linux instead of
server:/export/packages.run.linux.

        The box is Athlon with 128M ram, IDE disks, EEpro100 NIC.
Kernel is 2.4.0-test1-ac9.

        Do you have any idea of what's the problem here?
Here is the relevant part of /var/log/messages:

Jun 12 19:53:12 server mountd[787]: authenticated unmount request from client.fi.muni.cz:942 for /export/packages.run.linux (/export/packages.run.linux)
Jun 12 19:53:27 server kernel: Negative d_count (-1) for //packages.run.linux
Jun 12 19:53:27 server kernel: kernel BUG at dcache.c:175!
Jun 12 19:53:27 server kernel: invalid operand: 0000
Jun 12 19:53:27 server kernel: CPU: 0
Jun 12 19:53:27 server kernel: EIP: 0010:[dput+351/368]
Jun 12 19:53:27 server kernel: EFLAGS: 00010286
Jun 12 19:53:27 server kernel: eax: 0000001c ebx: c70fed80 ecx: c0246a14 edx: c76e7f84
Jun 12 19:53:27 server kernel: esi: c1fe8380 edi: ffffffff ebp: 00000002 esp: c7f95f70
Jun 12 19:53:27 server kernel: ds: 0018 es: 0018 ss: 0018
Jun 12 19:53:27 server kernel: Process kswapd (pid: 2, stackpage=c7f95000)
Jun 12 19:53:27 server kernel: Stack: c0207ac4 c0207c22 000000af c70fed80 c1fe8380 c7e50e00 c0143edd c70fed80
Jun 12 19:53:27 server kernel: 00000004 0000003e ffffffff 00000004 c014414a 00000002 c012c0ec 0000003e
Jun 12 19:53:27 server kernel: 00000004 c0247b24 c0247a99 c02479e0 c0247a94 00000001 c012c228 00000004
Jun 12 19:53:27 server kernel: Call Trace: [tvecs+29280/57436] [tvecs+29630/57436] [prune_dcache+221/256] [shrink_dcache_memory+26/48] [do_try_to_free_pages+92/224] [kswapd+184/288] [empty_bad_page+0/4096]
Jun 12 19:53:27 server kernel: [kernel_thread+35/48]
Jun 12 19:53:27 server kernel: Code: 0f 0b 83 c4 0c 5b 5e 5f c3 90 8d b4 26 00 00 00 00 56 53 8b
Jun 12 19:53:27 server kernel: Negative d_count (-1) for //packages.run.linux
Jun 12 19:53:27 server kernel: kernel BUG at dcache.c:175!
Jun 12 19:53:27 server kernel: invalid operand: 0000
Jun 12 19:53:27 server kernel: CPU: 0
<Here the box crashed>
Jun 12 20:15:46 server syslogd 1.3-3: restart.

here are my bootup messages:

Linux version 2.4.0-test1-ac8 (root@server.fi.muni.cz) (gcc version egcs-2.91.66 19990314/Linux (egcs-1.1.2 release)) #2 Tue Jun 6 15:15:08 CEST 2000
BIOS-provided physical RAM map:
 e820: 000000000009fc00 @ 0000000000000000 (usable)
 e820: 0000000000000400 @ 000000000009fc00 (reserved)
 e820: 0000000000010000 @ 00000000000f0000 (reserved)
 e820: 0000000007ef0000 @ 0000000000100000 (usable)
 e820: 0000000000008000 @ 0000000007ff0000 (ACPI data)
 e820: 0000000000008000 @ 0000000007ff8000 (ACPI NVS)
 e820: 0000000000010000 @ 00000000ffff0000 (reserved)
On node 0 totalpages: 32752
zone(0): 4096 pages.
zone(1): 28656 pages.
zone(2): 0 pages.
Kernel command line: BOOT_IMAGE=linux ro root=1601
Initializing CPU#0
Detected 630016949 Hz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 1258.29 BogoMIPS
Memory: 126420k/131008k available (1291k kernel code, 4200k reserved, 87k data, 172k init, 0k highmem)
Dentry-cache hash table entries: 16384 (order: 5, 131072 bytes)
Buffer-cache hash table entries: 4096 (order: 2, 16384 bytes)
Page-cache hash table entries: 32768 (order: 5, 131072 bytes)
kmem_create: Poisoning requested, but con given - bdev_cache
Inode-cache hash table entries: 8192 (order: 4, 65536 bytes)
kmem_create: Poisoning requested, but con given - inode_cache
VFS: Diskquotas version dquot_6.4.0 initialized
CPU: L1 I Cache: 64K L1 D Cache: 64K
CPU: L2 Cache: 512K
CPU: AMD Athlon(tm) Processor stepping 01
Checking 386/387 coupling... OK, FPU using exception 16 error reporting.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.36 (20000221) Richard Gooch (rgooch@atnf.csiro.au)
PCI: PCI BIOS revision 2.10 entry at 0xfdaf1, last bus=1
PCI: Using configuration type 1
PCI: Probing PCI hardware
PCI: Using IRQ router default [1022/740b] at 00:07.3
Linux NET4.0 for Linux 2.3
Based upon Swansea University Computer Society NET3.039
kmem_create: Poisoning requested, but con given - skbuff_head_cache
NET4: Unix domain sockets 1.0/SMP for Linux NET4.0.
NET4: Linux TCP/IP 1.0 for NET4.0
IP Protocols: ICMP, UDP, TCP
IP: routing cache hash table of 512 buckets, 4Kbytes
TCP: Hash tables configured (established 8192 bind 8192)
Starting kswapd v1.6
pty: 256 Unix98 ptys configured
Uniform Multi-Platform E-IDE driver Revision: 6.30
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
AMD7409: IDE controller on PCI bus 00 dev 39
AMD7409: chipset revision 3
AMD7409: not 100% native mode: will probe irqs later
    ide0: BM-DMA at 0xf000-0xf007, BIOS settings: hda:DMA, hdb:pio
    ide1: BM-DMA at 0xf008-0xf00f, BIOS settings: hdc:DMA, hdd:DMA
hda: QUANTUM FIREBALLP LM10.2, ATA DISK drive
hdc: ST33210A, ATA DISK drive
hdd: CD-540E, ATAPI CDROM drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
ide1 at 0x170-0x177,0x376 on irq 15
hda: 20066251 sectors (10274 MB) w/1900KiB Cache, CHS=1249/255/63, UDMA(66)
hdc: 6346368 sectors (3249 MB) w/256KiB Cache, CHS=6296/16/63, UDMA(33)
Partition check:
 /dev/ide/host0/bus0/target0/lun0: p1 p2 p3
 /dev/ide/host0/bus1/target0/lun0: p1 p2
Serial driver version 5.01 (2000-05-29) with MANY_PORTS SHARE_IRQ SERIAL_PCI enabled
ttyS00 at 0x03f8 (irq = 4) is a 16550A
ttyS01 at 0x02f8 (irq = 3) is a 16550A
eepro100.c:v1.09j-t 9/29/99 Donald Becker http://cesdis.gsfc.nasa.gov/linux/drivers/eepro100.html
eepro100.c: $Revision: 1.33 $ 2000/05/24 Modified by Andrey V. Savochkin <saw@saw.sw.com.sg> and others
eth0: Intel Corporation 82557 [Ethernet Pro 100], 00:D0:B7:6B:4A:B2, IRQ 11.
  Receiver lock-up bug exists -- enabling work-around.
  Board assembly 721383-008, Physical connectors present: RJ45
  Primary interface chip i82555 PHY #1.
  General self-test: passed.
  Serial sub-system self-test: passed.
  Internal registers self-test: passed.
  ROM checksum self-test: passed (0x04f4518b).
eth1: Intel Corporation 82557 [Ethernet Pro 100] (#2), 00:D0:B7:6B:4A:DB, IRQ 12.
  Receiver lock-up bug exists -- enabling work-around.
  Board assembly 721383-008, Physical connectors present: RJ45
  Primary interface chip i82555 PHY #1.
  General self-test: passed.
  Serial sub-system self-test: passed.
  Internal registers self-test: passed.
  ROM checksum self-test: passed (0x04f4518b).
Linux agpgart interface v0.99 (c) Jeff Hartmann
agpgart: Maximum main memory to use for agp memory: 94M
agpgart: Detected AMD Irongate chipset
agpgart: AGP aperture is 64M @ 0xe0000000
devfs: v0.96 (20000430) Richard Gooch (rgooch@atnf.csiro.au)
devfs: boot_options: 0x2
kmem_create: Forcing size word alignment - nfs_fh
VFS: Mounted root (ext2 filesystem) readonly.
Freeing unused kernel memory: 172k freed
Adding Swap: 265032k swap-space (priority 10)
Adding Swap: 122936k swap-space (priority 10)
ip_conntrack (1023 buckets, 8184 max)

... and finally, here is my kernel config (grep '=[ym]' .config):

CONFIG_X86=y
CONFIG_ISA=y
CONFIG_UID16=y
CONFIG_EXPERIMENTAL=y
CONFIG_MK7=y
CONFIG_X86_WP_WORKS_OK=y
CONFIG_X86_INVLPG=y
CONFIG_X86_CMPXCHG=y
CONFIG_X86_BSWAP=y
CONFIG_X86_POPAD_OK=y
CONFIG_X86_TSC=y
CONFIG_X86_GOOD_APIC=y
CONFIG_X86_USE_3DNOW=y
CONFIG_X86_PGE=y
CONFIG_X86_USE_PPRO_CHECKSUM=y
CONFIG_X86_MSR=y
CONFIG_NOHIGHMEM=y
CONFIG_MTRR=y
CONFIG_MODULES=y
CONFIG_KMOD=y
CONFIG_NET=y
CONFIG_PCI=y
CONFIG_PCI_GOANY=y
CONFIG_PCI_BIOS=y
CONFIG_PCI_DIRECT=y
CONFIG_PCI_NAMES=y
CONFIG_SYSVIPC=y
CONFIG_SYSCTL=y
CONFIG_KCORE_ELF=y
CONFIG_BINFMT_AOUT=m
CONFIG_BINFMT_ELF=y
CONFIG_BINFMT_MISC=m
CONFIG_PNP=y
CONFIG_BLK_DEV_FD=m
CONFIG_BLK_DEV_LOOP=m
CONFIG_BLK_DEV_LVM=m
CONFIG_LVM_PROC_FS=y
CONFIG_BLK_DEV_RAM=m
CONFIG_PACKET=y
CONFIG_NETFILTER=y
CONFIG_FILTER=y
CONFIG_UNIX=y
CONFIG_INET=y
CONFIG_IP_ALIAS=y
CONFIG_SKB_LARGE=y
CONFIG_IP_NF_CONNTRACK=m
CONFIG_IP_NF_FTP=m
CONFIG_IP_NF_IPTABLES=m
CONFIG_IP_NF_MATCH_LIMIT=m
CONFIG_IP_NF_MATCH_MULTIPORT=m
CONFIG_IP_NF_FILTER=m
CONFIG_IP_NF_TARGET_REJECT=m
CONFIG_IP_NF_NAT=m
CONFIG_IP_NF_NAT_NEEDED=y
CONFIG_IP_NF_TARGET_MASQUERADE=m
CONFIG_IP_NF_TARGET_REDIRECT=m
CONFIG_IP_NF_TARGET_LOG=m
CONFIG_IP_NF_COMPAT_IPCHAINS=m
CONFIG_IP_NF_NAT_NEEDED=y
CONFIG_IDE=y
CONFIG_BLK_DEV_IDE=y
CONFIG_BLK_DEV_IDEDISK=y
CONFIG_IDEDISK_MULTI_MODE=y
CONFIG_BLK_DEV_IDECD=m
CONFIG_BLK_DEV_IDEPCI=y
CONFIG_IDEPCI_SHARE_IRQ=y
CONFIG_BLK_DEV_IDEDMA_PCI=y
CONFIG_IDEDMA_PCI_AUTO=y
CONFIG_BLK_DEV_IDEDMA=y
CONFIG_IDEDMA_PCI_WIP=y
CONFIG_IDEDMA_NEW_DRIVE_LISTINGS=y
CONFIG_BLK_DEV_AMD7409=y
CONFIG_AMD7409_OVERRIDE=y
CONFIG_IDEDMA_AUTO=y
CONFIG_IDEDMA_IVB=y
CONFIG_BLK_DEV_IDE_MODES=y
CONFIG_NETDEVICES=y
CONFIG_NET_ETHERNET=y
CONFIG_NET_PCI=y
CONFIG_EEPRO100=y
CONFIG_VT=y
CONFIG_VT_CONSOLE=y
CONFIG_SERIAL=y
CONFIG_SERIAL_CONSOLE=y
CONFIG_UNIX98_PTYS=y
CONFIG_MOUSE=y
CONFIG_PSMOUSE=y
CONFIG_WATCHDOG=y
CONFIG_SOFT_WATCHDOG=m
CONFIG_NVRAM=m
CONFIG_RTC=m
CONFIG_AGP=y
CONFIG_AGP_AMD=y
CONFIG_QUOTA=y
CONFIG_AUTOFS_FS=m
CONFIG_AUTOFS4_FS=y
CONFIG_ISO9660_FS=m
CONFIG_JOLIET=y
CONFIG_PROC_FS=y
CONFIG_DEVFS_FS=y
CONFIG_EXT2_FS=y
CONFIG_NFS_FS=y
CONFIG_NFS_V3=y
CONFIG_NFSD=y
CONFIG_NFSD_V3=y
CONFIG_SUNRPC=y
CONFIG_LOCKD=y
CONFIG_LOCKD_V4=y
CONFIG_MSDOS_PARTITION=y
CONFIG_NLS=y
CONFIG_NLS_ISO8859_2=m
CONFIG_VGA_CONSOLE=y
CONFIG_VIDEO_SELECT=y
CONFIG_USB=m
CONFIG_USB_DEVICEFS=y
CONFIG_USB_UHCI=m
CONFIG_USB_UHCI_ALT=m
CONFIG_MAGIC_SYSRQ=y

-Yenya

-- 
\ Jan "Yenya" Kasprzak <kas at fi.muni.cz>       http://www.fi.muni.cz/~kas/
\\ PGP: finger kas at aisa.fi.muni.cz   0D99A7FB206605D7 8B35FCDE05B18A5E //
\\\             Czech Linux Homepage:  http://www.linux.cz/              ///
/// Vite jak Microsoft vyrabi nezavirovana CD? ... ... ... Pouziva UNIX! \\\
//     http://support.microsoft.com/support/kb/articles/Q80/5/20.ASP      \\

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.rutgers.edu Please read the FAQ at http://www.tux.org/lkml/



This archive was generated by hypermail 2b29 : Thu Jun 15 2000 - 21:00:26 EST