All of lore.kernel.org
 help / color / mirror / Atom feed
* System lockup.
@ 2002-10-21  0:02 Bill Leckey
  2002-10-21 11:30 ` Alan Cox
  0 siblings, 1 reply; 11+ messages in thread
From: Bill Leckey @ 2002-10-21  0:02 UTC (permalink / raw)
  To: linux-kernel

I have a terminal server that's supporting up to 240 lines.  It's a 
2.4.17 kernel, and is running squid, and using the reiser file system to 
store log files, squid cache and other data.  About every day or so, the 
machine locks up.  The screen is blank, keyboard doesn't respond, the 
serial console I set up shows no 'dying gasp' and there is nothing in 
any of the system logs.

This doesn't appear to be related to load as it has happened both during 
the busiest times and during the low times.

I'm still servicing interrupts from our serial devices (on IRQ 11), so 
it seems interrupts are still happening.

Beyond this, however, I have no idea where to go from here.  If anyone 
has any hints on what the problem might be, or even a way to gather more 
information, I would be grateful.

-- 
Bill Leckey - Senior Software Design Engineer
TPG Research and Development
Ph: +61 2 62851711
Fax: +61 2 62853939


^ permalink raw reply	[flat|nested] 11+ messages in thread
* system lockup
@ 2004-08-13 22:53 Mike Waychison
  2004-08-13 23:23 ` Ian Pratt
  0 siblings, 1 reply; 11+ messages in thread
From: Mike Waychison @ 2004-08-13 22:53 UTC (permalink / raw)
  To: xen-devel

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi All,


I've recently managed to get a 2.6.7 dom0 to boot without any major
problems.  I had seen the hwclock issues and the nosegfixup issues fly
by and these seem to be all worked out now.


Another issue though is still lingering for me, and I'm a little
clueless as to how to go about debugging it.


It seems that once I log into my gnome session, everything is aok for
the first little while, until eventually I get a segfault pop-up for
wnck-applet.  I've never seen this segfault before, and am not sure if
it is xen or 2.6.7 related (I'm still running a 2.6.1 variant).
However, a few seconds later, whether or not I click the 'ok' in the
segfault dialog, the machine seems to lock hard.


I've tested this w/ & w/o both a) removing the /lib/tls directory and b)
the nosegfixup kernel option.


Has anyone else seen this?  or possibly know of any other cause for this?

I will try updating to vanilla 2.6.7 tonight to see the issue remains.




Also:


A while ago, I tried building the xenolinux-2.6.7-dom0 kernel with a
pentium II cpu target.  The system appeared to boot up properly, however
X couldn't start as there seemed to be a mysterious SIGBUS being sent.
For now, I'm using a PIV build on this PII..


Thanks,

- --
Mike Waychison
Sun Microsystems, Inc.
1 (650) 352-5299 voice
1 (416) 202-8336 voice
http://www.sun.com

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
NOTICE:  The opinions expressed in this email are held by me,
and may not represent the views of Sun Microsystems, Inc.
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.2.4 (GNU/Linux)

iD8DBQFBHUZVdQs4kOxk3/MRAgDsAJ421NUq5IUMdmNKNTvoqsWz1TqeeQCeOU66
A5S+sHksUBrbjtBH2nTlFNk=
=mA/7
-----END PGP SIGNATURE-----


-------------------------------------------------------
SF.Net email is sponsored by Shop4tech.com-Lowest price on Blank Media
100pk Sonic DVD-R 4x for only $29 -100pk Sonic DVD+R for only $33
Save 50% off Retail on Ink & Toner - Free Shipping and Free Gift.
http://www.shop4tech.com/z/Inkjet_Cartridges/9_108_r285

^ permalink raw reply	[flat|nested] 11+ messages in thread
* system lockup
@ 2007-02-27  1:20 Roman Mashak
  0 siblings, 0 replies; 11+ messages in thread
From: Roman Mashak @ 2007-02-27  1:20 UTC (permalink / raw)
  To: netdev

Hello,

for learning device drivers I took 8139too.c to explore it. I
disrupted the code into several logical blocks and now I'm trying to
implement my own simplified version referring to original code from
time to time. I have rtl8139d NIC for experiments. So, by now I've
acomplished the following stages:
1) detected device
2) enable PCI device
3) memory mapped IO initialised
4) initialization of 'net_device' structure

And I'm stuck on the chip reset. Whenever I load driver and try to
enable interface (ifconfig eth1 up) my system just hangs, keyboard
locks up, I can't even use 'SysRq' shortcuts.

I figured out that problem occurs after I initialised chip, i.e. in
this routine called from 'net_device->open' method:


#define CmdTxEnb  (0x04)
...
#define RxOK   (0x01)
#define RxErr   (0x02)
#define TxOK   (0x04)
#define TxErr   (0x08)
#define RxOverFlow  (0x10)
#define RxUnderrun  (0x20)
#define RxFIFOOver  (0x40)
#define CableLen  (0x2000)
#define TimeOut   (0x4000)
#define SysErr   (0x8000)

#define INT_MASK (RxOK | RxErr | TxOK | TxErr | RxOverFlow | \
RxUnderrun | RxFIFOOver | CableLen | TimeOut | SysErr)


static void rtl8139_hw_start(struct net_device *dev)
{
   struct rtl8139_private *tp = dev->priv;
   void *ioaddr = tp->mmio_addr;
   ...
   writeb(CmdTxEnb, ioaddr + REG_COMMAND);
   writel(0x00000600, ioaddr + REG_TX_CONFIG);    /* DMA burst size 1024 */

/* init TX buffer DMA addresses */
for (i = 0; i < NUM_TX_DESC; i++) {
writel(tp->tx_bufs_dma + (tp->tx_buf[i] - tp->tx_bufs), ioaddr +
REG_TX_ADDR0 + (i * 4));
}


   /* enable all known interrupts by setting the interrupt mask */
   writew(INT_MASK, ioaddr + REG_INTR_MASK);


   netif_start_queue(dev);
   return;
}


static int rtl8139_open(struct net_device *dev)
{
   int retval;
   struct rtl8139_private *tp = dev->priv;
   ...
   retval = request_irq(dev->irq, rtl8139_interrupt, 0, dev->name, dev);
   if (retval)
       return retval;

/* Get memory for TX buffers. Memory must be DMA-able */
tp->tx_bufs = pci_alloc_consistent(tp->pci_dev, TOTAL_TX_BUF_SIZE,
&tp->tx_bufs_dma);
...
rtl8139_init_ring(dev);
rtl8139_hw_start(dev);

DPRINTK("init_ring() & hw_start() passed\n");


   return;
}

rtl8139_hw_start() is really invoked and returned, since I'm getting
printk output. Commenting 'rtl8139_hw_start(dev);' out brings the
interface up succesfully, that's why I came to conclusion the problem
is in chip initialization routine.

If anybody has any clue, I'd appreciate to hear it and get advice.
Thanks in advance.

-- 
Roman

^ permalink raw reply	[flat|nested] 11+ messages in thread

end of thread, other threads:[~2007-02-27  1:20 UTC | newest]

Thread overview: 11+ messages (download: mbox.gz follow: Atom feed
-- links below jump to the message on this page --
2002-10-21  0:02 System lockup Bill Leckey
2002-10-21 11:30 ` Alan Cox
2002-10-21 21:44   ` Bill Leckey
2002-10-27 22:26   ` Bill Leckey
  -- strict thread matches above, loose matches on Subject: below --
2004-08-13 22:53 system lockup Mike Waychison
2004-08-13 23:23 ` Ian Pratt
2004-08-13 23:59   ` Mike Waychison
2004-08-14  0:20     ` Ian Pratt
2004-08-14  8:32     ` Keir Fraser
2004-08-16 13:59       ` Mark Williamson
2007-02-27  1:20 Roman Mashak

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.