Re: [PATCH v5 2/5] can: kvaser_usb: Consolidate and unify state change handling

From: Andri Yngvason
Date: Wed Jan 21 2015 - 11:57:19 EST


Quoting Ahmed S. Darwish (2015-01-21 15:36:47)
> On Wed, Jan 21, 2015 at 03:00:15PM +0000, Andri Yngvason wrote:
> > Quoting Ahmed S. Darwish (2015-01-21 14:43:23)
> > > Hi!
>
> ...
>
> > > <-- Unplug the cable -->
> > >
> > > (000.009106) can0 20000080 [8] 00 00 00 00 00 00 08 00 ERRORFRAME
> > > bus-error
> > > error-counter-tx-rx{{8}{0}}
> > > (000.001872) can0 20000080 [8] 00 00 00 00 00 00 10 00 ERRORFRAME
> > > bus-error
> > > error-counter-tx-rx{{16}{0}}
> > [...]
> > > error-counter-tx-rx{{80}{0}}
> > > (000.001910) can0 20000080 [8] 00 00 00 00 00 00 58 00 ERRORFRAME
> > > bus-error
> > > error-counter-tx-rx{{88}{0}}
> > > (000.001753) can0 20000084 [8] 00 08 00 00 00 00 60 00 ERRORFRAME
> > > controller-problem{tx-error-warning}
> > Good.
> > > bus-error
> > > error-counter-tx-rx{{96}{0}}
>
> Nice.
>
> > > (000.001720) can0 20000080 [8] 00 00 00 00 00 00 68 00 ERRORFRAME
> > > bus-error
> > > error-counter-tx-rx{{104}{0}}
> > > (000.001876) can0 20000080 [8] 00 00 00 00 00 00 70 00 ERRORFRAME
> > > bus-error
> > > error-counter-tx-rx{{112}{0}}
> > > (000.001749) can0 20000080 [8] 00 00 00 00 00 00 78 00 ERRORFRAME
> > > bus-error
> > > error-counter-tx-rx{{120}{0}}
> > > (000.001771) can0 20000084 [8] 00 20 00 00 00 00 80 00 ERRORFRAME
> > > controller-problem{tx-error-passive}
> > Also good.
> > > bus-error
> > > error-counter-tx-rx{{128}{0}}
>
> Also nice :-)
>
> > > (000.001868) can0 20000080 [8] 00 00 00 00 00 00 80 00 ERRORFRAME
> > > bus-error
> > > error-counter-tx-rx{{128}{0}}
> > > (000.001982) can0 20000080 [8] 00 00 00 00 00 00 80 00 ERRORFRAME
> > > bus-error
> > > error-counter-tx-rx{{128}{0}}
> > >
> > > (( Then a continous flood, exactly similar to the above packet, appears.
> > > Unfortunately this flooding is a firmware problem. ))
> > >
> > > <-- Replug the cable, after a good amount of time -->
> > >
> > Where are the reverse state transitions?
> > >
>
> Hmmm...
>
> [ ... ]
> >
> > Reverse state transitions are missing from the logs. See comments above.
> >
>
> When the device is on the _receiving_ end, and I unplug the CAN cable after
> introducing some noise to the level of reaching WARNING or PASSIVE, I
> receive a BUS_ERROR event with the rxerr count reset back to 0 or 1. In
> that case, the driver correctly transitions back the state to ERROR_ACTIVE
> and candump produces something similar to:
>
> (000.000362) can0 2000008C [8] 00 40 40 00 00 00 00 01 ERRORFRAME
> controller-problem{}
> protocol-violation{{back-to-error-active}{}}
> bus-error
> error-counter-tx-rx{{0}{1}}
>
> which is, AFAIK, the correct behaviour from the driver side.
>
> Meanwhile, when the device is on the _sending_ end and I re-plug the CAN
> cable again. Sometimes I receive events with txerr reset to 0 or 1, and
> the driver correctly reverts back to ERROR_ACTIVE in that case. But on
> another times like the quoted case above, I don't receive any events
> resetting txerr back -- only data packets on the bus.
>
> So, What can the driver do given the above?
>
So what you're telling us is that the state does not got back to error-active
unless there is something else transmitting on the bus?
If that's the case, it's almost definitely because state changes aren't
triggering interrupts.

An rx event will give you an interrupt which yields a napi poll which means that
the state will be polled, so in that case you don't need the interrupts.

Look for "Consolidate and unify state change handling" on gmane. There was a lot
of discussion about this kind of issues, especially regarding FlexCAN.

Best regards,
Andri
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/