03-02-2011 08:21 AM
HI Experts
I am running SRX3600 on junos 10.4R1.9 in chassis cluster mode.
In this configuration it looks like very unstable.
Most of the time I used to get this error below after which the PICS on the node1 used to become offline.
This is the error
node1.fpc8.pic0 SCHED: Thread 4 (Module Init) ran for 1819 ms without yielding
After above error the PICS in node1 used to get offline randomly.
SRX3600-1# run show chassis fpc pic-status | no-more
node0:
--------------------------------------------------
Slot 0 Online SRX3k SFB 12GE
PIC 0 Online 8x 1GE-TX 4x 1GE-SFP
Slot 2 Online SRX3k 16xGE TX
PIC 0 Online 16x 1GE-TX
Slot 5 Online SRX3k 2x10GE XFP
PIC 0 Online 2x 10GE-XFP
Slot 8 Online SRX3k SPC
PIC 0 Online SPU Cp-Flow
Slot 11 Online SRX3k NPC
PIC 0 Online NPC PIC
node1:
--------------------------------------------------
Slot 0 Offline SRX3k SFB 12GE
Slot 2 Offline SRX3k 16xGE TX
Slot 5 Offline SRX3k 2x10GE XFP
Slot 8 Offline SRX3k SPC
Slot 11 Online SRX3k NPC
PIC 0 Online NPC PIC
After rebooring node 1 all the PICS used to come online for sometime and after that problem happen again.
I wanted to ask if someone has any idea about this error?
Any suggestions to avoid this problem?
Thanks
Regards
08-16-2011 12:41 PM
any solution you got from the TAC or any other recommendations ...
regards
02-07-2012 02:21 PM
I have the same problem with 10.4R8.5.
I have just opened a ticket.
I will post here if I have any feddback.
02-07-2012 08:09 PM
Hi Ahmed,
You get the message node1.fpc8.pic0 SCHED: Thread 4 (Module Init) ran for 1819 ms without yielding when the SPU is booting up, so this is a normal message.
It looks like secondary is going into disabled state and that is the reason why all the cards are offline.
Most probably the fabric link communication is breaking for some reason.
Please post the output of "show chassis cluster information detail"
Regards,
AJ