Routing

chassisd

a week ago

Hi

What could be causing the following messages to appear so frequently?

chassisd[5083]: CHASSISD_SNMP_TRAP6: SNMP trap generated: Over Temperature! (jnxContentsContainerIndex 7, jnxContentsL1Index 2, jnxContentsL2Index 0, jnxContentsL3Index 0, jnxContentsDescr FPC: QFX5210-48S-4C @ 0/*/*, jnxOperatingTemp 39)
chassisd[5083]: CHASSISD_SNMP_TRAP6: SNMP trap generated: Over Temperature! (jnxContentsContainerIndex 7, jnxContentsL1Index 2, jnxContentsL2Index 0, jnxContentsL3Index 0, jnxContentsDescr FPC: QFX5210-48S-4C @ 1/*/*, jnxOperatingTemp 40)

show chassis temperature-thresholds
                                        Fan speed      Yellow alarm      Red alarm      Fire Shutdown
                                       (degrees C)      (degrees C)     (degrees C)      (degrees C)
Item                                  Normal  High   Normal  Bad fan   Normal  Bad fan     Normal
FPC 0 Sensor TopLeft I                    38    64       69       69       75       75         78
FPC 0 Sensor TopLeft E2                   40    61       66       66       72       72         75
FPC 0 Sensor TopCenter E2                 40    61       66       66       72       72         75
FPC 0 Sensor TopLeft E1                   42    63       68       68       74       74         77
FPC 0 Sensor CPULeft I                    38    59       67       67       73       73         76
FPC 0 Sensor CPURight I                   38    59       67       67       73       73         76
FPC 0 Sensor CPU Die Temp                 47    86       86       86       92       92         95
FPC 0 Sensor TopRight OCXO                35    62       67       67       73       73         76
FPC 0 Sensor TopRight E                   43    64       69       69       75       75         78
FPC 0 Sensor BottomLeft E                 40    61       69       69       75       75         78
FPC 0 Sensor TopRight I                   31    54       63       63       69       69         72
FPC 0 Sensor TopCenter E1                 40    61       66       66       72       72         75
FPC 1 Sensor TopLeft I                    40    66       65       65       71       71         74
FPC 1 Sensor TopLeft E2                   42    67       66       66       72       72         75
FPC 1 Sensor TopCenter E2                 58    78       77       77       83       83         86
FPC 1 Sensor TopLeft E1                   39    66       65       65       71       71         74
FPC 1 Sensor CPULeft I                    40    67       66       66       72       72         75
FPC 1 Sensor CPURight I                   41    68       67       67       71       71         74
FPC 1 Sensor CPU Die Temp                 56    93       86       86       92       92         95
FPC 1 Sensor TopRight OCXO                29    56       65       65       71       71         74
FPC 1 Sensor TopRight E                   29    56       65       65       71       71         74
FPC 1 Sensor BottomLeft E                 29    56       65       65       71       71         74
FPC 1 Sensor TopRight I                   29    56       65       65       71       71         74
FPC 1 Sensor TopCenter E1                  0     0        0        0        0        0          0

show chassis fpc
                     Temp  CPU Utilization (%)   CPU Utilization (%)  Memory    Utilization (%)
Slot State            (C)  Total  Interrupt      1min   5min   15min  DRAM (MB) Heap     Buffer
  0  Online            39      6          0        5      5      5    16384      29         49
  1  Online            40      5          0        5      5      5    16384      29         49
  2  Empty
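To track how often each member reports the trap, the two log lines above can be parsed for the FPC description and the reported temperature. This is a minimal sketch: the regular expression is an assumption derived only from the two sample lines in this post, and `parse_trap` is a hypothetical helper name.

```python
import re

# Regex for the CHASSISD_SNMP_TRAP6 lines quoted in this post.
# The layout is assumed from those two samples only.
TRAP_RE = re.compile(
    r"CHASSISD_SNMP_TRAP6: SNMP trap generated: (?P<event>[^(]+?)\s*\("
    r".*?jnxContentsDescr (?P<descr>.+?), jnxOperatingTemp (?P<temp>\d+)\)"
)


def parse_trap(line):
    """Return (event, description, temperature in degrees C), or None if no match."""
    match = TRAP_RE.search(line)
    if match is None:
        return None
    return match.group("event"), match.group("descr"), int(match.group("temp"))


LOG_LINES = [
    "chassisd[5083]: CHASSISD_SNMP_TRAP6: SNMP trap generated: Over Temperature! "
    "(jnxContentsContainerIndex 7, jnxContentsL1Index 2, jnxContentsL2Index 0, "
    "jnxContentsL3Index 0, jnxContentsDescr FPC: QFX5210-48S-4C @ 0/*/*, jnxOperatingTemp 39)",
    "chassisd[5083]: CHASSISD_SNMP_TRAP6: SNMP trap generated: Over Temperature! "
    "(jnxContentsContainerIndex 7, jnxContentsL1Index 2, jnxContentsL2Index 0, "
    "jnxContentsL3Index 0, jnxContentsDescr FPC: QFX5210-48S-4C @ 1/*/*, jnxOperatingTemp 40)",
]

for line in LOG_LINES:
    print(parse_trap(line))
```

The reported values (39 and 40) match the Temp column of `show chassis fpc` for slots 0 and 1.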

 

11 REPLIES

Re: chassisd

a week ago

Hi Arix,

 

Check the airflow for this device. I suspect there is not enough airflow to cool the FPCs, which is triggering this SNMP trap.

 

If this solves your problem, please mark this post as "Accepted Solution" so we can help others too

 

Kudos are appreciated too 

 

 

Regards,

Nadeem

 


Re: chassisd

a week ago

Hello Arix,

 

This log message appears when the temperature of the QFX5210 exceeds the yellow alarm threshold.
The QFX detected a high temperature and generated an SNMP trap.

 

Perform the checks below to narrow down the issue:

> Check "show chassis environment".

> Check the surrounding environment.

> Check the CPU utilization.

> Check the RE memory.

> Check the system processes.
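One way to test the yellow-threshold explanation against the numbers already posted is to parse the `show chassis temperature-thresholds` rows and compare them with the temperature from `show chassis fpc`. This is only a sketch: it assumes the single per-FPC temperature can be compared with each sensor's threshold, it uses a handful of rows rather than the full table, and `parse_thresholds` and `sensors_at_or_above_yellow` are hypothetical helper names.

```python
import re

# A few rows copied from the `show chassis temperature-thresholds` output in this thread.
THRESHOLDS_TEXT = """\
FPC 0 Sensor TopLeft I                    38    64       69       69       75       75         78
FPC 0 Sensor TopRight I                   31    54       63       63       69       69         72
FPC 0 Sensor CPU Die Temp                 47    86       86       86       92       92         95
FPC 1 Sensor TopLeft I                    40    66       65       65       71       71         74
FPC 1 Sensor TopCenter E2                 58    78       77       77       83       83         86
FPC 1 Sensor TopCenter E1                  0     0        0        0        0        0          0
"""

ROW_RE = re.compile(r"^FPC (\d+) Sensor (.+?)\s+((?:\d+\s+){6}\d+)\s*$")
COLUMNS = ("fan_normal", "fan_high", "yellow", "yellow_bad_fan", "red", "red_bad_fan", "fire")


def parse_thresholds(text):
    """Map (fpc, sensor name) to a dict of the seven threshold columns."""
    table = {}
    for line in text.splitlines():
        match = ROW_RE.match(line)
        if match:
            values = [int(v) for v in match.group(3).split()]
            table[(int(match.group(1)), match.group(2))] = dict(zip(COLUMNS, values))
    return table


def sensors_at_or_above_yellow(table, fpc, temp_c):
    """List sensors on one FPC whose yellow threshold is at or below temp_c.

    All-zero rows are skipped on the assumption that they are unpopulated.
    """
    return [
        sensor
        for (slot, sensor), row in table.items()
        if slot == fpc and any(row.values()) and temp_c >= row["yellow"]
    ]


table = parse_thresholds(THRESHOLDS_TEXT)
# Temperatures reported by `show chassis fpc` in the original post: slot 0 = 39, slot 1 = 40.
for fpc, temp in ((0, 39), (1, 40)):
    print(fpc, temp, sensors_at_or_above_yellow(table, fpc, temp))
```

With these rows both lists come out empty, so 39 and 40 degrees C sit below every yellow threshold included here.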

 

I hope this helps. Please mark this post "Accept as solution" if this answers your query.

 

Kudos are always appreciated!

 

Best Regards,

Lingabasappa H


Re: chassisd

a week ago

Hi Arix,

 

The error messages suggest that the FPCs (FPC 0 and FPC 1) are seeing a high-temperature alert.

 

The output below shows the operating temperature of each FPC in the Temp (C) column:

show chassis fpc
                     Temp  CPU Utilization (%)   CPU Utilization (%)  Memory    Utilization (%)
Slot State            (C)  Total  Interrupt      1min   5min   15min  DRAM (MB) Heap     Buffer
  0  Online            39      6          0        5      5      5    16384      29         49
  1  Online            40      5          0        5      5      5    16384      29         49

Can you please paste the output of the following commands?

> show chassis alarms

> show chassis environment

> show version

> show system core-dumps

 

Verify the LED status for the components. 

Please verify if the proper environmental conditions are met for healthy functioning of the device:

https://www.juniper.net/documentation/en_US/release-independent/junos/topics/reference/specification...

 

Please check if the below troubleshooting document helps you:

https://www.juniper.net/documentation/en_US/release-independent/junos/topics/topic-map/qfx5210-troub...

 

Hope this helps.

 

Please mark "Accepted Solution" if this works for you.

Kudos are always appreciated!

 

 


Re: chassisd

a week ago
fpc0:
--------------------------------------------------------------------------
FPC 0 PEM 0 status:
  State                      Online
  Airflow                    Front to Back
  Temp Sensor 0              OK   54 degrees C / 129 degrees F
  Temp Sensor 1              OK   32 degrees C / 89 degrees F
  Temp Sensor 2              OK   27 degrees C / 80 degrees F
  Fan 0                      5520 RPM
  Fan 1                      5340 RPM
  DC Output           Voltage(V) Current(A)  Power(W)  Load(%)
                        12          6             72       11
FPC 0 PEM 1 status:
  State                      Online
  Airflow                    Front to Back
  Temp Sensor 0              OK   54 degrees C / 129 degrees F
  Temp Sensor 1              OK   32 degrees C / 89 degrees F
  Temp Sensor 2              OK   29 degrees C / 84 degrees F
  Fan 0                      6180 RPM
  Fan 1                      5400 RPM
  DC Output           Voltage(V) Current(A)  Power(W)  Load(%)
                        12          6             72       11
FPC 1 PEM 0 status:
  State                      Online
  Airflow                    Front to Back
  Temperature                OK
  DC Output           Voltage(V) Current(A)  Power(W)  Load(%)
                        12          5             60       9
FPC 1 PEM 1 status:
  State                      Online
  Airflow                    Front to Back
  Temperature                OK
  DC Output           Voltage(V) Current(A)  Power(W)  Load(%)
                        12          6             72       11

fpc1:
--------------------------------------------------------------------------
FPC 1 PEM 0 status:
  State                      Online
  Airflow                    Front to Back
  Temp Sensor 0              OK   53 degrees C / 127 degrees F
  Temp Sensor 1              OK   33 degrees C / 91 degrees F
  Temp Sensor 2              OK   28 degrees C / 82 degrees F
  Fan 0                      5220 RPM
  Fan 1                      4500 RPM
  DC Output           Voltage(V) Current(A)  Power(W)  Load(%)
                        12          6             72       11
FPC 1 PEM 1 status:
  State                      Online
  Airflow                    Front to Back
  Temp Sensor 0              OK   53 degrees C / 127 degrees F
  Temp Sensor 1              OK   35 degrees C / 95 degrees F
  Temp Sensor 2              OK   28 degrees C / 82 degrees F
  Fan 0                      4920 RPM
  Fan 1                      4500 RPM
  DC Output           Voltage(V) Current(A)  Power(W)  Load(%)
                        12          6             72       11
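To compare fan speeds and sensor readings across PEMs without reading the output by eye, the PEM sections can be parsed into a small structure. This is a minimal sketch that assumes the line layout shown in the output above; `parse_pems` is a hypothetical helper name and only an excerpt of the output is embedded.

```python
import re

# Excerpt of the PEM output posted above (fpc1 section).
PEM_TEXT = """\
FPC 1 PEM 0 status:
  State                      Online
  Airflow                    Front to Back
  Temp Sensor 0              OK   53 degrees C / 127 degrees F
  Fan 0                      5220 RPM
  Fan 1                      4500 RPM
FPC 1 PEM 1 status:
  State                      Online
  Airflow                    Front to Back
  Temp Sensor 0              OK   53 degrees C / 127 degrees F
  Fan 0                      4920 RPM
  Fan 1                      4500 RPM
"""

HEADER_RE = re.compile(r"^(FPC \d+ PEM \d+) status:")
FAN_RE = re.compile(r"^\s*Fan (\d+)\s+(\d+) RPM")
TEMP_RE = re.compile(r"^\s*Temp Sensor (\d+)\s+(\w+)\s+(\d+) degrees C")


def parse_pems(text):
    """Return {pem name: {"fans": {index: rpm}, "temps": {index: (status, deg C)}}}."""
    pems = {}
    current = None
    for line in text.splitlines():
        header = HEADER_RE.match(line)
        if header:
            current = pems.setdefault(header.group(1), {"fans": {}, "temps": {}})
            continue
        if current is None:
            continue  # ignore anything before the first PEM header
        fan = FAN_RE.match(line)
        if fan:
            current["fans"][int(fan.group(1))] = int(fan.group(2))
        temp = TEMP_RE.match(line)
        if temp:
            current["temps"][int(temp.group(1))] = (temp.group(2), int(temp.group(3)))
    return pems


for name, data in parse_pems(PEM_TEXT).items():
    print(name, data["fans"], data["temps"])
```

Because the full output repeats some PEM headers under both member sections, a real run would need to key on the member (`fpc0:` or `fpc1:`) as well; this sketch does not do that.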

Re: chassisd

a week ago

Arix, 

 

Could you share the output of:

> show chassis environment fpc 0
> show chassis environment fpc 1

Also, are there any alarms?

The following KB article might help. It is written for SRX, but it is valid across Junos products.

https://kb.juniper.net/InfoCenter/index?page=content&id=KB33967&cat=SRX_5600_1&actp=LIST

 

Please mark "Accept as solution" if this answers your query. 

 

Kudos are appreciated too! 

 

Regards, 

Sharat Ainapur


Re: chassisd

a week ago

Hello Arix,

 

Greetings!

These alarms can be raised when the temperature around the FPC 0 and FPC 1 sensors changes and crosses the threshold set for the normal temperature.

 

FPC 0 Sensor TopRight OCXO --> 35

FPC 1 Sensor TopRight OCXO --> 29

 

This doesn't necessarily mean that the issue is with the FPC or that the FPC has failed. These are informational log messages. However, to isolate the issue, please follow the steps below.

 

1. Check if there are any alarms on the device.

2. Check if there are any temperature sensor issues.

3. Check the site temperature where the device is placed. Check if there are any temperature fluctuations.

4. Check if there is any blockage to the airflow of the device.

5. Check the air filter of the device. See if you can clean it and place it back.
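The two OCXO values quoted above are the "Fan speed Normal" thresholds, and the arithmetic against the temperatures from `show chassis fpc` is small enough to write out. This is a sketch only: comparing the per-FPC temperature with one sensor's threshold is an assumption, and `margin_over_normal` is a hypothetical helper name.

```python
# "Fan speed Normal" thresholds quoted in this reply (degrees C), keyed by FPC slot.
OCXO_NORMAL = {0: 35, 1: 29}
# Temperatures from the `show chassis fpc` output at the top of the thread.
FPC_TEMP = {0: 39, 1: 40}


def margin_over_normal(thresholds, temps):
    """Return {slot: temp - threshold}; a positive value means the threshold is exceeded."""
    return {slot: temps[slot] - thresholds[slot] for slot in thresholds}


print(margin_over_normal(OCXO_NORMAL, FPC_TEMP))  # {0: 4, 1: 11}
```

Under that assumption, both FPCs are a few degrees above the normal threshold for this sensor while still far below its yellow alarm values (67 and 65).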


Regards,
Vishaal


Accept as Solution = cool ! (Help fellow community members with similar query be redirected here instead of them reposting again)
Accept as Solution+Kudo = You are a Star !

Re: chassisd

[ Edited ]
a week ago

I checked the temperature on another device next to the switch, and it is fine. There are no alarms on the switch, and I didn't change any thresholds.


Re: chassisd

a week ago

Hi Arix,

 

Additionally, please follow the below-mentioned steps :

1) Check the environment around the chassis to verify that airflow to and from the unit is not being restricted. Air must be allowed to enter and exit only via the air vents on the chassis. Vent restriction will lower the effectiveness of the cooling for the system. Also make sure that the air intake is not next to the exhaust of another system generating heated air, as this will also reduce cooling efficiency.

2) Verify that the air filters on the platform are clean.

3) Verify that empty FPC slots have a blank cover plate installed, to preserve the integrity of the internal airflow inside the chassis. All system components (FPCs, CBs, PICs, REs, etc.) should fit snugly in their slots and not have gaps that allow airflow around the component to the outside environment.

4) Verify that the fans are operating by either visual and/or audio observation. If a fan or fan tray does not seem to be running as expected, or there are log messages reporting a fan failure, try reseating the fan in its slot.

 

5) You may consider cleaning and reseating the air filters.

 

In case you have performed all the above and you are still seeing these logs while no alarms are being reported and no LED status is showing any issue, you may choose to safely ignore this. I will paste a link below that will help you prevent Syslog messages from being written in the log file.

How to prevent certain Syslog messages from being written to the log file:
https://kb.juniper.net/InfoCenter/index?page=content&id=KB9382
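KB9382 covers the on-box configuration. As an offline illustration only, a candidate match pattern can be tried against saved log lines first to see which messages it would hide. The pattern below is an assumption based on the messages in this thread, not something taken from the KB, the second sample line is a placeholder rather than real device output, and `visible_lines` is a hypothetical helper name.

```python
import re

# Hypothetical pattern for the messages in this thread; it is not taken from KB9382.
SUPPRESS_RE = re.compile(r"CHASSISD_SNMP_TRAP6: .*Over Temperature")


def visible_lines(lines, pattern=SUPPRESS_RE):
    """Return the lines that do NOT match the suppression pattern."""
    return [line for line in lines if not pattern.search(line)]


sample = [
    # Shortened form of the trap line from the original post.
    "chassisd[5083]: CHASSISD_SNMP_TRAP6: SNMP trap generated: Over Temperature! (jnxOperatingTemp 39)",
    # Placeholder line, not real device output.
    "chassisd[5083]: EXAMPLE_OTHER_EVENT: placeholder message",
]
for line in visible_lines(sample):
    print(line)
```

Only the placeholder line is printed, which confirms that the pattern is narrow enough to leave other chassisd messages visible.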

 

If you are running one of the following versions, you should ideally not get these false over-temperature alarms: Junos 18.4R3, Junos 19.2R2, Junos 19.1R2.

 

Hope this helps.

 

Please mark "Accepted Solution" if this helps you.

Kudos would be much appreciated.

 

 

 

 


Re: chassisd

a week ago

This is interesting. Let me explain why.

 

There are two devices of the same series (same configuration, same FPC 0 and 1) in different locations. One device doesn't generate any SNMP traps. Here it is:


show version | match Junos:
Junos: 14.1X53-D35.3
Junos: 14.1X53-D35.3

show chassis temperature-thresholds
                                        Fan speed      Yellow alarm      Red alarm      Fire Shutdown
                                       (degrees C)      (degrees C)     (degrees C)      (degrees C)
Item                                  Normal  High   Normal  Bad fan   Normal  Bad fan     Normal
FPC 0 Sensor TopMiddle I                  47    67       65       65       71       71         74
FPC 0 Sensor TopRight C                   41    65       63       63       69       69         72
FPC 0 Sensor TopLeft C                    45    67       64       64       70       70         73
FPC 0 Sensor TopRight I                   42    64       62       62       68       68         71
FPC 0 Sensor CPURight C                   40    67       65       65       71       71         74
FPC 0 Sensor CPULeft I                    44    65       63       63       69       69         72
FPC 0 Sensor CPU Die Temp                 62    93       86       86       92       92         95
FPC 1 Sensor TopMiddle I                  47    67       65       65       71       71         74
FPC 1 Sensor TopRight C                   41    65       63       63       69       69         72
FPC 1 Sensor TopLeft C                    45    67       64       64       70       70         73
FPC 1 Sensor TopRight I                   42    64       62       62       68       68         71
FPC 1 Sensor CPURight C                   40    67       65       65       71       71         74
FPC 1 Sensor CPULeft I                    44    65       63       63       69       69         72
FPC 1 Sensor CPU Die Temp                 62    93       86       86       92       92         95

But the following device generates the SNMP trap repeatedly:

show version | match Junos:
Junos: 18.1R3-S7.1 flex
Junos: 18.1R3-S7.1 flex

show chassis temperature-thresholds
                                        Fan speed      Yellow alarm      Red alarm      Fire Shutdown
                                       (degrees C)      (degrees C)     (degrees C)      (degrees C)
Item                                  Normal  High   Normal  Bad fan   Normal  Bad fan     Normal
FPC 0 Sensor TopLeft I                    38    64       69       69       75       75         78
FPC 0 Sensor TopLeft E2                   40    61       66       66       72       72         75
FPC 0 Sensor TopCenter E2                 40    61       66       66       72       72         75
FPC 0 Sensor TopLeft E1                   42    63       68       68       74       74         77
FPC 0 Sensor CPULeft I                    38    59       67       67       73       73         76
FPC 0 Sensor CPURight I                   38    59       67       67       73       73         76
FPC 0 Sensor CPU Die Temp                 47    86       86       86       92       92         95
FPC 0 Sensor TopRight OCXO                35    62       67       67       73       73         76
FPC 0 Sensor TopRight E                   43    64       69       69       75       75         78
FPC 0 Sensor BottomLeft E                 40    61       69       69       75       75         78
FPC 0 Sensor TopRight I                   31    54       63       63       69       69         72
FPC 0 Sensor TopCenter E1                 40    61       66       66       72       72         75
FPC 1 Sensor TopLeft I                    40    66       65       65       71       71         74
FPC 1 Sensor TopLeft E2                   42    67       66       66       72       72         75
FPC 1 Sensor TopCenter E2                 58    78       77       77       83       83         86
FPC 1 Sensor TopLeft E1                   39    66       65       65       71       71         74
FPC 1 Sensor CPULeft I                    40    67       66       66       72       72         75
FPC 1 Sensor CPURight I                   41    68       67       67       71       71         74
FPC 1 Sensor CPU Die Temp                 56    93       86       86       92       92         95
FPC 1 Sensor TopRight OCXO                29    56       65       65       71       71         74
FPC 1 Sensor TopRight E                   29    56       65       65       71       71         74
FPC 1 Sensor BottomLeft E                 29    56       65       65       71       71         74
FPC 1 Sensor TopRight I                   29    56       65       65       71       71         74
FPC 1 Sensor TopCenter E1                  0     0        0        0        0        0          0
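The two tables differ mainly in their "Fan speed Normal" column, which can be lined up for the FPC 0 sensors that appear on both devices. This is a rough sketch: the values are copied from the two outputs above, applying the same 39 degrees C reading to both devices is an assumption made only for comparison, and `exceeded` is a hypothetical helper name.

```python
# "Fan speed Normal" thresholds (degrees C) for the FPC 0 sensors that appear
# in both tables above, copied from the two outputs.
DEVICE_14_1 = {"TopRight I": 42, "CPULeft I": 44, "CPU Die Temp": 62}
DEVICE_18_1 = {"TopRight I": 31, "CPULeft I": 38, "CPU Die Temp": 47}


def exceeded(thresholds, temp_c):
    """Return the sensors whose threshold is below temp_c, sorted by name."""
    return sorted(name for name, limit in thresholds.items() if temp_c > limit)


# 39 is the FPC 0 temperature reported on the device that sends the traps; applying
# it to the other device as well is an assumption made only for comparison.
print(exceeded(DEVICE_14_1, 39))  # []
print(exceeded(DEVICE_18_1, 39))  # ['CPULeft I', 'TopRight I']
```

At the same reading, the 18.1R3-S7.1 device would be above two of its normal thresholds while the 14.1X53-D35.3 device would be above none, which is at least consistent with only one of them sending the trap.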


Re: chassisd

[ Edited ]
a week ago

Hello Arix,

 

As I understand it, the air filters for both FPC 0 and FPC 1 are not working efficiently on the device that generates the SNMP trap.

 

Are there any other devices in the same environment having the same issue? 

 

For now, can you check the room temperature where the device is placed, clean the air filters, and see whether the traps stop?

 

For details on the QFX5210 fan modules:

https://www.juniper.net/documentation/en_US/release-independent/junos/topics/topic-map/qfx5210-fans-...

For system overview:

https://www.juniper.net/documentation/en_US/release-independent/junos/topics/topic-map/qfx5210-syste...

 

 

I hope this helps. Please mark this post "Accept as solution" if this answers your query.

 

Kudos are always appreciated!

 

Best Regards,

Lingabasappa H


Re: chassisd

a week ago

Hi Arix,

 

In that case, you might want to check the environmental conditions and the airflow in and out of the device, and manually check whether the fans are working as expected. From the outputs you sent, nothing looks suspicious in that respect.

 

If the environmental conditions and setup match what is described in the document I sent earlier, you might have to open a JTAC case for live remote troubleshooting.

 

Hope this helps.

 

Please mark this "Accepted Solution" if this helps you solve your query.

Kudos are always appreciated.