re-work some FDIR logic #172

Merged
muellerr merged 2 commits from improve-dev-fdir into develop 2024-04-09 10:47:33 +02:00
Owner
No description provided.
muellerr added 1 commit 2024-04-03 12:50:04 +02:00
muellerr requested review from gaisser 2024-04-03 12:50:09 +02:00
muellerr added 1 commit 2024-04-03 12:52:35 +02:00
gaisser requested changes 2024-04-04 20:55:57 +02:00
gaisser left a comment
Member

Please check if it is ok to trigger events during fdirState == NONE.

@@ -186,15 +191,6 @@ void DeviceHandlerFailureIsolation::setFdirState(FDIRState state) {
  fdirState = state;
}
void DeviceHandlerFailureIsolation::triggerEvent(Event event, uint32_t parameter1,
Member

Is this function never called, or why was it removed?

If the base function is used instead, events will be triggered during recoveries, which might break things.
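
To make the concern concrete, here is a minimal, self-contained sketch of what a filtering `triggerEvent` override could look like. It is not the code removed in this PR: `FailureIsolationSketch`, `FilteringFailureIsolation`, `FdirState`, `Severity`, and the simplified `Event` struct are all hypothetical stand-ins, and letting INFO events pass during a recovery is an assumption made for illustration.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical stand-ins for illustration only; these are not the framework's types.
enum class Severity : uint8_t { INFO, LOW, MEDIUM, HIGH };
enum class FdirState : uint8_t { NONE, RECOVERY_ONGOING };

struct Event {
  uint16_t id;
  Severity severity;
};

// Base behaviour: every event is forwarded unconditionally.
class FailureIsolationSketch {
 public:
  virtual ~FailureIsolationSketch() = default;
  virtual void triggerEvent(const Event& event) { sent.push_back(event); }

  std::vector<Event> sent;  // Records forwarded events so they can be inspected.
};

// Filtering override: while a recovery is ongoing, only INFO events are forwarded
// (assumption); everything else is dropped until the state returns to NONE.
class FilteringFailureIsolation : public FailureIsolationSketch {
 public:
  void setFdirState(FdirState state) { fdirState = state; }

  void triggerEvent(const Event& event) override {
    if (fdirState == FdirState::NONE || event.severity == Severity::INFO) {
      FailureIsolationSketch::triggerEvent(event);
    }
  }

 private:
  FdirState fdirState = FdirState::NONE;
};
```

With an override like this, a HIGH severity event raised while the state is `RECOVERY_ONGOING` is dropped. With only the base function, the same event is forwarded, so anything subscribed to it would still see it.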

Author
Owner

Aren't FDIR reactions ignored if the FDIR is active? Also, aren't there cases where I'd at least want to see the events even if the FDIR is active?

Member

I don't know for sure what will happen if another event is triggered during recoveries. For now it seems OK, as the FDIR does not react. However, higher-level FDIRs will react if the system has any.

Member

Have you observed any events during testing while in recovery? It might filter event spam.

Author
Owner

I think this should be fine. Isn't it a configuration error if a device throws tons of unexpected events during reboot handling or when it is faulty? I tested this for EIVE and have not received any unexpected event spam so far.

Author
Owner

You mean events with severity higher than INFO while the FDIR is IDLE? This was already tested.

muellerr merged commit e6e3753324 into develop 2024-04-09 10:47:33 +02:00
muellerr deleted branch improve-dev-fdir 2024-04-09 10:47:34 +02:00