2.2.8/2.3.0 disk I/O hang, appears to serialise following processes

David (david+nospam@killerlabs.com)
Fri, 14 May 1999 16:35:17 -0700


This is a multi-part message in MIME format.
--------------1E2BA2040E678E7B5DFD25CA
Content-Type: text/plain; charset=us-ascii
Content-Transfer-Encoding: 7bit

Ok, I've seen another thread on this but haven't seen patches.

This is how it happened to me. I was untarring X 3.3.3 source and
reached 100% disk capacity so i hit ctrl-c in the untar rxvt which
didn't get me anywhere. I went to another rxvt and started an rm of all
I had just untarred. This didn't yield anything. Now with two terms
hung, I went to a third (I couldn't start any more rxvts so I was now
being very cautious about what I ran) and started capturing /proc info.
Here's what I could grab. Sorry, by the end of this capture, all of my
terminals had hung either on disk i/o or a some form of tty i/o (i
assume).

Again, this input is somewhat sparse and lacking because I couldn't read
the disk or write to it.

Some things to note; update was not running as per notes w/ bdflush.
There is another problem with this system. I *must* disable APM, not
just in the kernel, but in CMOS as well. This machine is an AMD
K6-2/200 w/ 128megs, 3.5gig scsi/1.4g ide. It has a USB mouse on it
which I know is also subject to a variety of bad bugs, indeed, switching
out of X to console used to lockup the machine hard with no oops
reports.

To contrast this, I have the exact same kernel running on the machine
next to it, running the same set of programs w/ libs etc. It doesn't
have a USB mouse, *does* have APM compiled/enabled etc, etc. The
differences being the scsi disk/usb mouse and I haven't run it out of
space. It is also a pentium 166 overclocked to 200. It's been very
very stable.

Thankyou,
-d

--
 This is Linux Country. On a quiet night, you can hear Windows NT reboot!
  Do you remember how to -think- ? Do you remember how to experiment? Linux
__ is an operating system that brings back the fun and adventure in computing.
\/  for linux-kernel: please read linux/Documentation/* before posting problems

--------------1E2BA2040E678E7B5DFD25CA Content-Type: text/plain; charset=us-ascii; name="2.3.0-fs-full-err" Content-Transfer-Encoding: 7bit Content-Disposition: inline; filename="2.3.0-fs-full-err"

# df Filesystem 1k-blocks Used Available Use% Mounted on /dev/sd/c0b0t0u0p1 3765477 3561915 0 100% / ...ide/hd/c0b0t1u0p1 1470741 976773 493968 66% /old

$ ps axwl F UID PID PPID PRI NI VSZ RSS WCHAN STAT TTY TIME COMMAND 100 0 1 0 0 0 232 56 pause S ? 0:07 init [3] 040 0 2 1 0 0 0 0 wait_o DW ? 0:01 [kflushd] 040 0 3 1 0 0 0 0 kpiod SW ? 0:00 [kpiod] 040 0 4 1 0 0 0 0 kswapd SW ? 0:01 [kswapd] 040 0 31 1 0 0 1432 300 wait_o D ? 0:00 /usr/sbin/cron -l10 140 0 45 1 0 0 1760 0 do_sys SW ? 0:00 [klogd] 040 0 47 1 0 0 1464 232 do_sel S ? 0:00 /usr/sbin/syslogd 140 0 147 1 0 0 1428 72 do_sel S ? 0:00 /usr/sbin/inetd 140 0 153 1 0 0 1476 0 do_sel SW ? 0:00 [lpd] 040 0 154 1 0 0 0 0 end SW ? 0:00 [root-hub] 040 0 155 1 0 0 0 0 end SW ? 0:00 [khubd] 140 0 168 1 0 0 2304 476 do_sel S ? 0:03 sendmail: rejecting connections on port 25: load average: 14 040 0 261 1 0 0 2464 0 do_sel SW ? 0:00 [sshd] 100 0 273 1 0 0 2388 0 read_c SW tty1 0:00 [bash] 000 0 274 1 0 0 1392 0 read_c SW tty2 0:00 [agetty] 000 0 275 1 0 0 1392 0 read_c SW tty3 0:00 [agetty] 000 0 276 1 0 0 1392 0 read_c SW tty4 0:00 [agetty] 000 0 277 1 0 0 1392 0 read_c SW tty5 0:00 [agetty] 000 0 278 1 0 0 1392 0 read_c SW tty6 0:00 [agetty] 000 0 279 1 0 0 1392 0 read_c SW tty7 0:00 [agetty] 000 0 280 1 0 0 1392 0 read_c SW tty8 0:00 [agetty] 000 0 281 1 0 0 1392 0 read_c SW tty9 0:00 [agetty] 000 0 282 1 0 0 1392 28 read_c S tty10 0:00 /sbin/agetty 38400 tty10 linux 000 0 283 1 0 0 1392 64 read_c S tty11 0:00 /sbin/agetty 38400 tty11 linux 100 1000 284 1 0 0 2372 0 wait4 SW tty12 0:00 [bash] 000 1000 316 284 0 0 828 56 wait4 S tty12 0:00 sh /usr/X11/bin/startx 000 1000 318 316 0 0 2676 0 wait4 SW tty12 0:00 [xinit] 100 0 320 318 9 0 48300 21352 do_sel S tty12 6:55 X :0 000 1000 325 318 0 0 5692 1244 wait_o D tty12 0:06 gnome-session 000 1000 349 1 1 0 5448 2324 do_sel S tty12 0:50 enlightenment -theme /usr/local/enlightenment/themes/BrushedMetal-Tigert -smfile /home/david/.enlighte 000 1000 351 1 0 0 7644 2596 wait_o D tty12 0:11 panel --sm-config-prefix /panel.d/Session-KxpRwG/ --sm-client-id 11cfd50e02000092602140000000004140001 000 1000 382 1 0 0 2876 124 do_pol S ? 0:00 gnome-name-service 000 1000 399 1 1 0 6268 1532 do_pol S ? 0:28 gnomepager_applet --activate-goad-server gnomepager_applet 140 0 668 1 0 0 1392 68 do_sel S ? 0:41 gpm -R -t ps2 -m /dev/USB mouse -M -t ps2 -m /dev/psaux 000 1000 2604 1 0 0 57924 33528 wakeup D tty12 22:45 netscape 000 1000 2605 2604 0 0 17224 628 do_sel S tty12 0:00 (dns helper) 100 1000 2648 349 1 0 2972 868 do_sel S ? 0:04 rxvt 100 1000 2649 2648 9 0 2376 504 wait4 S ttyp0 0:00 -bash 100 0 2696 349 0 0 2664 632 down_f D ? 0:03 rxvt 100 1000 2697 2696 0 0 2368 0 wait4 SW ttyp1 0:00 [bash] 100 0 2709 2697 0 0 2400 716 wait4 S ttyp1 0:00 -su 100 0 3977 349 0 0 2696 608 down_f D ? 0:00 rxvt 100 1000 3978 3977 0 0 2392 476 wait4 S ttyp2 0:00 -bash 000 1000 7990 1 0 0 8492 3512 do_pol S tty12 0:02 gmc --sm-config-prefix /gmc-T7Trog/ --sm-client-id 11cfd50e02000092456800700000022690003 100 0 10934 261 0 0 2052 340 do_sel S ? 0:01 sshd -i -V SSH-1.5-1.2.26? 100 0 10936 10934 0 0 2384 276 wait4 S pts/0 0:00 -bash 100 0 10949 10936 0 0 2028 180 do_sel S pts/0 0:00 ssh -l MausEyes james 100 0 11603 261 0 0 2040 332 do_sel S ? 0:01 sshd -i -V SSH-1.5-1.2.26? 100 0 11605 11603 0 0 2384 276 wait4 S pts/1 0:00 -bash 100 0 11618 11605 0 0 2008 156 do_sel S pts/1 0:00 ssh -l MausEyes james 000 1000 11946 1 0 0 816 408 wait4 S tty12 0:00 sh /tmp/filex74vUp 000 1000 11947 11946 0 0 9620 3548 wait_o D tty12 0:03 gnumeric bills.gnumeric.backup 100 1000 14421 147 0 0 2500 1420 tcp_re S ? 0:00 imapd 100 0 14476 2709 0 0 1628 664 wakeup D ttyp1 0:10 tar xzf X333src-1.tgz 004 0 14477 14476 0 0 0 0 do_exi Z ttyp1 0:10 [gzip <defunct>] 000 1000 14479 1 0 0 6064 2028 down_f D tty12 0:00 session-properties-capplet --init-session-settings --warner --sm-client-id warner0 000 1000 14481 1 0 0 1384 396 wakeup D tty12 0:00 rm -r /home/david/.gnome//panel.d/Session-dTb5hM 100 0 14485 3978 0 0 2404 1088 wait4 S ttyp2 0:00 -su 000 0 14520 14485 0 0 1376 376 wakeup D ttyp2 0:00 rm -rf INSTALL.PS INSTALL.TXT Imakefile LABEL Makefile RELNOTES.PS RELNOTES.TXT bug-report config incl 100 0 14521 349 0 0 2540 1316 wakeup D ? 0:00 rxvt 100 1000 14522 14521 0 0 2372 1036 read_c S ttyp3 0:00 -bash 100 0 14534 349 0 0 2544 1320 down_f D ? 0:00 rxvt 100 1000 14535 14534 0 0 2372 1036 read_c S ttyp4 0:00 -bash 000 1000 14569 2649 4 0 1368 352 wait_o D ttyp0 0:00 sync 000 1000 14573 2649 10 0 2876 1024 - R ttyp0 0:00 ps axwl $ jobs [1]+ Running sync &

$ cat /proc/stat cpu 200968 0 83842 14055115 disk 289882 469 0 0 disk_rio 131146 467 0 0 disk_wio 158736 2 0 0 disk_rblk 266570 934 0 0 disk_wblk 320418 4 0 0 page 1025618 886619 swap 5187 4398 intr 21149413 14339925 41386 0 0 0 0 3 1 0 6651818 72328 0 42963 1 988 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 ctxt 7451730 btime 926573158 processes 14600

$ pwd;cat 0 1 /proc/scsi/sym53c8xx General information: Chip sym53c876, device id 0xf, revision id 0x14 IO port address 0xe400, IRQ number 9 Using memory mapped IO at virtual address 0xc8013000 Synchronous period factor 12, max commands per lun 32 Profiling information: num_fly = 287586 num_trans = 287588 num_disc = 309523 num_disc0 = 264647 num_break = 44877 num_int = 44890 num_kbytes = 1940502 General information: Chip sym53c876, device id 0xf, revision id 0x14 IO port address 0xe800, IRQ number 9 Using memory mapped IO at virtual address 0xc8015000 Synchronous period factor 12, max commands per lun 32 Profiling information: num_fly = 17 num_trans = 24 num_disc = 2 num_disc0 = 2 num_break = 2 num_int = 29 num_kbytes = 0

$ cat /proc/interrupts CPU0 0: 14365201 XT-PIC timer 1: 42389 XT-PIC keyboard 2: 0 XT-PIC cascade 8: 0 XT-PIC rtc 9: 6662499 XT-PIC bttv, sym53c8xx, sym53c8xx, eth0 10: 72734 XT-PIC ohci-usb 12: 42963 XT-PIC PS/2 Mouse 13: 1 XT-PIC fpu 14: 988 XT-PIC ide0 NMI: 0

$ pwd; for i in * ; do echo $i; cat $i; done /proc/sys/fs binfmt_misc cat: binfmt_misc: Invalid argument dentry-state 0 16367 45 0 0 0 dquot-max 1024 dquot-nr 0 0 file-max 4096 file-nr 794 13 4096 inode-max 16376 inode-nr 16380 1748 inode-state 16380 1748 0 0 0 0 0 super-max 256 super-nr 4

# pwd; for i in * ; do echo $i; cat $i; done /proc/sys/vm bdflush 40 500 64 256 5 3000 500 1884 2 buffermem 2 10 60 freepages 255 510 765 kswapd 512 32 32 overcommit_memory 0 page-cluster 4 pagecache 2 15 75 pagetable_cache 25 50

--------------1E2BA2040E678E7B5DFD25CA--

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.rutgers.edu Please read the FAQ at http://www.tux.org/lkml/