I'm seeing errors on one side of a GigE link. The link is a
point-to-point, just connecting two computers. As I move a lot of data,
the error count increases.
What can I do to fix/debug something like this? My understanding is that
there's no half-duplex GigE, so that cannot be the problem. The "errors"
that I see are RX errors, dropped, and overruns.
The kernel is 2.6.9-34.ELsmp. The driver for the NIC is the e1000.
ifconfig reports:
eth1 Link encap:Ethernet HWaddr 00:0D:60:83:78:33
inet addr:192.168.6.202 Bcast:192.168.6.203 Mask:255.255.255.252
inet6 addr: fe80::20d:60ff:fe83:7833/64 Scope:Link
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:71585966 errors:82025 dropped:82025 overruns:82025 frame:0
TX packets:47757031 errors:0 dropped:0 overruns:0 carrier:0
collisions:0 txqueuelen:1000
RX bytes:330120282 (314.8 MiB) TX bytes:1106484502 (1.0 GiB)
Base address:0x3400 Memory:d0240000-d0260000
/proc/net/dev contains:
Inter-| Receive | Transmit
face |bytes packets errs drop fifo frame compressed multicast|bytes packets errs drop fifo colls carrier compressed
lo: 18152 103 0 0 0 0 0 0 18152 103 0 0 0 0 0 0
eth0: 754856 8269 0 0 0 0 0 0 1559998 6480 0 0 0 0 0 0
eth1:330120388 71585967 82025 82025 82025 0 0 0 1106484672 47757033 0 0 0 0 0 0
sit0: 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0
The two controllers on this machine are:
02:01.0 Ethernet controller: Intel Corporation 82547GI Gigabit Ethernet Controller
04:03.0 Ethernet controller: Intel Corporation 82541GI/PI Gigabit Ethernet Controller
I'm confused about these overruns. The computer is doing nothing but a dd
reading from an iSCSI SAN (and writing to /dev/null). It contains a
3Ghz Pentium 4 (it's an IBM x306). Why would it be unable to
handle the traffic?
- Andrew