[1] kernel v pruběhu stress-ng crashuje

[2] SEL logs ukazující The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive. těsně před náhlým samovolným restartem
"ID","Severity","Description","Last Update","Count","Category",
"2496","Informational","Browser login: cloud_admin - 10.16.96.4(DNS name not found).","05/16/2023 07:20:25","1","Security, Administration",
"2495","Informational","Browser logout: cloud_admin - 10.16.96.4(DNS name not found).","05/16/2023 07:07:55","1","Security, Administration",
"2494","Informational","Remote console session stopped by: cloud_admin - 10.16.96.4(DNS name not found).","05/16/2023 06:37:53","1","Security, Administration",
"2493","Informational","Remote console session started by: cloud_admin - 10.16.96.4(DNS name not found).","05/16/2023 06:37:36","1","Security, Administration",
"2492","Informational","Browser login: cloud_admin - 10.16.96.4(DNS name not found).","05/16/2023 06:36:36","1","Security, Administration",
"2491","Informational","Browser logout: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 21:56:22","1","Security, Administration",
"2490","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 21:54:06","1","Hardware, Administration",
"2489","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 21:54:06","1","Hardware, Administration",
"2488","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 21:54:06","1","Hardware, Administration",
"2487","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 21:54:06","1","Hardware, Administration",
"2486","Informational","Host REST logout: System Administrator","05/15/2023 21:48:14","1","Security, Administration",
"2485","Informational","Host REST login: System Administrator","05/15/2023 21:46:46","1","Security, Administration",
"2484","Informational","Server power restored.","05/15/2023 21:45:33","1","Maintenance, Administration",
"2483","Caution","The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive.","05/15/2023 21:45:23","1","Hardware, Administration",
"2482","Caution","Server reset.","05/15/2023 21:45:20","1","Maintenance, Administration",
"2481","Caution","The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive.","05/15/2023 21:44:57","1","Hardware, Administration",
"2480","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 21:27:12","1","Hardware, Administration",
"2479","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 21:27:12","1","Hardware, Administration",
"2478","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 21:27:12","1","Hardware, Administration",
"2477","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 21:27:12","1","Hardware, Administration",
"2476","Informational","Remote console session stopped by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 21:22:56","1","Security, Administration",
"2475","Informational","Host REST logout: System Administrator","05/15/2023 21:21:56","1","Security, Administration",
"2474","Informational","Host REST login: System Administrator","05/15/2023 21:20:18","1","Security, Administration",
"2473","Informational","Remote console session started by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 21:20:04","1","Security, Administration",
"2472","Informational","Server power restored.","05/15/2023 21:19:10","1","Maintenance, Administration",
"2471","Caution","Server reset.","05/15/2023 21:18:57","1","Maintenance, Administration",
"2470","Caution","The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive.","05/15/2023 21:18:43","1","Hardware, Administration",
"2469","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 21:11:32","1","Hardware, Administration",
"2468","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 21:11:32","1","Hardware, Administration",
"2467","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 21:11:32","1","Hardware, Administration",
"2466","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 21:11:32","1","Hardware, Administration",
"2465","Informational","Remote console session stopped by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 21:06:20","1","Security, Administration",
"2464","Informational","Host REST logout: System Administrator","05/15/2023 21:05:39","1","Security, Administration",
"2463","Informational","Host REST login: System Administrator","05/15/2023 21:04:11","1","Security, Administration",
"2462","Informational","Server power restored.","05/15/2023 21:03:04","1","Maintenance, Administration",
"2461","Caution","The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive.","05/15/2023 21:02:56","1","Hardware, Administration",
"2460","Informational","Embedded Flash: Restarted","05/15/2023 21:45:22","3","Firmware",
"2459","Caution","Server reset.","05/15/2023 21:02:51","1","Maintenance, Administration",
"2458","Caution","The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive.","05/15/2023 21:02:23","1","Hardware, Administration",
"2457","Informational","Remote console session started by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 20:58:55","1","Security, Administration",
"2456","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 20:57:48","1","Hardware, Administration",
"2455","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 20:57:48","1","Hardware, Administration",
"2454","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 20:57:48","1","Hardware, Administration",
"2453","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 20:57:48","1","Hardware, Administration",
"2452","Informational","Host REST logout: System Administrator","05/15/2023 20:52:07","1","Security, Administration",
"2451","Informational","Host REST login: System Administrator","05/15/2023 20:50:34","1","Security, Administration",
"2450","Informational","Server power restored.","05/15/2023 20:49:30","1","Maintenance, Administration",
"2449","Caution","The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive.","05/15/2023 20:49:19","1","Hardware, Administration",
"2448","Caution","Server reset.","05/15/2023 20:49:17","1","Maintenance, Administration",
"2447","Caution","The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive.","05/15/2023 20:48:41","1","Hardware, Administration",
"2446","Informational","Remote console session stopped by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 20:47:54","1","Security, Administration",
"2445","Informational","Host REST logout: System Administrator","05/15/2023 20:43:31","1","Security, Administration",
"2444","Informational","Host REST login: System Administrator","05/15/2023 20:41:56","1","Security, Administration",
"2443","Informational","Remote console session started by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 20:41:00","1","Security, Administration",
"2442","Informational","Server power restored.","05/15/2023 20:40:41","1","Maintenance, Administration",
"2441","Caution","Server reset.","05/15/2023 20:40:28","1","Maintenance, Administration",
"2440","Informational","Remote console session stopped by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 20:40:04","1","Security, Administration",
"2439","Caution","The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive.","05/15/2023 20:40:00","1","Hardware, Administration",
"2438","Informational","Remote console session started by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 20:39:36","1","Security, Administration",
"2437","Informational","Browser logout: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 20:30:28","1","Security, Administration",
"2436","Informational","Remote console session stopped by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 20:21:55","1","Security, Administration",
"2435","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 20:20:41","1","Hardware, Administration",
"2434","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 20:20:41","1","Hardware, Administration",
"2433","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 20:20:41","1","Hardware, Administration",
"2432","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 20:20:41","1","Hardware, Administration",
"2431","Informational","Host REST logout: System Administrator","05/15/2023 20:15:05","1","Security, Administration",
"2430","Informational","Host REST login: System Administrator","05/15/2023 20:13:45","1","Security, Administration",
"2429","Informational","Remote console session started by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 20:13:05","1","Security, Administration",
"2428","Informational","Server power restored.","05/15/2023 20:12:38","1","Maintenance, Administration",
"2427","Informational","Embedded Flash: Restarted","05/15/2023 20:49:20","3","Firmware",
"2426","Caution","The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive.","05/15/2023 20:12:27","1","Hardware, Administration",
"2425","Caution","Server reset.","05/15/2023 20:12:25","1","Maintenance, Administration",
"2424","Caution","The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive.","05/15/2023 20:11:50","1","Hardware, Administration",
"2423","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 20:02:19","1","Hardware, Administration",
"2422","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 20:02:19","1","Hardware, Administration",
"2421","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 20:02:19","1","Hardware, Administration",
"2420","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 20:02:19","1","Hardware, Administration",
"2419","Informational","Browser login: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 20:00:46","2","Security, Administration",
"2418","Informational","Host REST logout: System Administrator","05/15/2023 19:56:31","1","Security, Administration",
"2417","Informational","Host REST login: System Administrator","05/15/2023 19:55:01","1","Security, Administration",
"2416","Informational","Server power restored.","05/15/2023 19:53:55","1","Maintenance, Administration",
"2415","Caution","Server reset.","05/15/2023 19:53:42","1","Maintenance, Administration",
"2414","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 19:35:48","1","Hardware, Administration",
"2413","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 19:35:48","1","Hardware, Administration",
"2412","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 19:35:47","1","Hardware, Administration",
"2411","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 19:35:47","1","Hardware, Administration",
"2410","Informational","Host REST logout: System Administrator","05/15/2023 19:29:54","1","Security, Administration",
"2409","Informational","Host REST login: System Administrator","05/15/2023 19:28:29","1","Security, Administration",
"2408","Informational","Server power restored.","05/15/2023 19:27:24","1","Maintenance, Administration",
"2407","Caution","Server reset.","05/15/2023 19:27:11","1","Maintenance, Administration",
"2406","Caution","The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive.","05/15/2023 19:26:52","1","Hardware, Administration",
"2405","Informational","Host REST logout: System Administrator","05/15/2023 19:24:35","1","Security, Administration",
"2404","Informational","Host REST login: System Administrator","05/15/2023 19:23:09","1","Security, Administration",
"2403","Informational","Server power restored.","05/15/2023 19:22:04","1","Maintenance, Administration",
"2402","Caution","The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive.","05/15/2023 19:21:53","1","Hardware, Administration",
"2401","Informational","Embedded Flash: Restarted","05/15/2023 19:53:44","3","Firmware",
"2400","Caution","Server reset.","05/15/2023 19:21:51","1","Maintenance, Administration",
"2399","Caution","The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive.","05/15/2023 19:21:14","1","Hardware, Administration",
"2398","Informational","Browser logout: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 19:03:47","1","Security, Administration",
"2397","Informational","Remote console session stopped by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 19:03:47","1","Security, Administration",
"2396","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 18:39:55","1","Hardware, Administration",
"2395","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 18:39:55","1","Hardware, Administration",
"2394","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 18:39:55","1","Hardware, Administration",
"2393","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 18:39:55","1","Hardware, Administration",
"2392","Informational","Host REST logout: System Administrator","05/15/2023 18:34:15","1","Security, Administration",
"2391","Informational","Host REST login: System Administrator","05/15/2023 18:32:42","1","Security, Administration",
"2390","Informational","Remote console session started by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 18:31:55","1","Security, Administration",
"2389","Informational","Browser login: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 18:31:49","1","Security, Administration",
"2388","Informational","Server power restored.","05/15/2023 18:31:36","1","Maintenance, Administration",
"2387","Informational","Embedded Flash: Restarted","05/15/2023 18:31:26","1","Firmware",
"2386","Caution","Server reset.","05/15/2023 18:31:23","1","Maintenance, Administration",
"2385","Informational","Remote console session stopped by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 18:08:41","1","Security, Administration",
"2384","Informational","Browser logout: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 18:08:41","1","Security, Administration",
"2383","Informational","Remote console session started by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 17:38:44","1","Security, Administration",
"2382","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 17:35:38","1","Hardware, Administration",
"2381","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 17:35:38","1","Hardware, Administration",
"2380","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 17:35:38","1","Hardware, Administration",
"2379","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 17:35:38","1","Hardware, Administration",
"2378","Informational","Remote console session stopped by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 17:33:21","1","Security, Administration",
"2377","Informational","Remote console session started by: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 17:30:47","1","Security, Administration",
"2376","Informational","Browser login: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 17:30:34","1","Security, Administration",
"2375","Informational","Host REST logout: System Administrator","05/15/2023 17:29:56","1","Security, Administration",
"2374","Informational","Host REST login: System Administrator","05/15/2023 17:28:29","1","Security, Administration",
"2373","Informational","Server power restored.","05/15/2023 17:27:21","1","Maintenance, Administration",
"2372","Informational","Embedded Flash: Restarted","05/15/2023 17:27:11","1","Firmware",
"2371","Caution","Server reset.","05/15/2023 17:27:08","1","Maintenance, Administration",
"2370","Caution","The iLO health monitoring status of the device / adapter located in Slot 12 is not responsive.","05/15/2023 17:26:46","1","Hardware, Administration",
"2369","Informational","Browser logout: cloud_admin - 10.16.96.4(DNS name not found).","05/15/2023 15:04:57","1","Security, Administration",
"2368","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 14:55:09","1","Hardware, Administration",
"2367","Caution","The iLO health monitoring status of the device / adapter located in Slot  6 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 14:55:09","1","Hardware, Administration",
"2366","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so power sensor is unknown.","05/15/2023 14:55:09","1","Hardware, Administration",
"2365","Caution","The iLO health monitoring status of the device / adapter located in Slot  3 has OS driver missing or not in persistent mode so read thermal limits is not responsive.","05/15/2023 14:55:09","1","Hardware, Administration",
"2364","Informational","Host REST logout: System Administrator","05/15/2023 14:49:38","1","Security, Administration",
"2363","Informational","Host REST login: System Administrator","05/15/2023 14:48:09","1","Security, Administration",

[3] Porušení operačního systému centos7, fáze I
[root@cerit-hdg-009-ostack ~]# semodule -l
libsemanage.semanage_direct_get_module_info: Unable to read check-docker module lang ext file.
semodule:  Failed on list!

[4] Porušení operačního systému centos7, fáze II
[root@cerit-hdg-009-ostack ~]# LC_ALL=C rpm -q kernel
error: db5 error(11) from dbenv->open: Resource temporarily unavailable
error: cannot open Packages index using db5 - Resource temporarily unavailable (11)
error: cannot open Packages database in /var/lib/rpm
error: db5 error(11) from dbenv->open: Resource temporarily unavailable
error: cannot open Packages database in /var/lib/rpm
package kernel is not installed
# fix https://unix.stackexchange.com/a/198704


[5] server onsazení slotů
 Device Inventory  (  hide empty slots  )
MCTP Discovery:   Enabled

Location    Product Name    Product version firmware version    status
Embedded Device 	Embedded Video Controller 		2.5 	 Enabled
OCP 3.0 Slot 10 	Intel Eth Adptr I350T4 OCPv3 	K53978-004 	1.3310.0 	 Enabled
PCI-E Slot 2 	Empty slot 2 		N/A 	 N/A
PCI-E Slot 3 	NVIDIA PCIe GPU Controller 		90.02.30.00.81 	 Enabled
PCI-E Slot 5 	Empty slot 5 		N/A 	 N/A
PCI-E Slot 6 	NVIDIA PCIe GPU Controller 		90.02.30.00.81 	 Enabled
PCI-E Slot 7 	Marvell FastLinQ 41000 Series - 2P 10GbE 10GBASE-T QL41132HLRJ-HC MD2 Adapter - NIC 		8.50.76 	 Unknown
Storage Slot 12 	HPE Smart Array E208i-a SR Gen10 	B 	5.32 	 Enabled

[6] SMART

[root@cerit-hdg-009-ostack ~]# smartctl -a /dev/sdb; echo $?
smartctl 7.0 2018-12-30 r4883 [x86_64-linux-3.10.0-1160.90.1.el7.x86_64] (local build)
Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model:     VK000480GWSRR
Serial Number:    S4NANA0N632958
LU WWN Device Id: 5 002538 e006731b5
Firmware Version: HPG4
User Capacity:    480 103 981 056 bytes [480 GB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    Solid State Device
Form Factor:      2.5 inches
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   ACS-4, ACS-3 T13/2161-D revision 5
SATA Version is:  SATA 3.2, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue May 16 11:10:10 2023 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

...
0
[root@cerit-hdg-009-ostack ~]# smartctl -a /dev/sda; echo $?
smartctl 7.0 2018-12-30 r4883 [x86_64-linux-3.10.0-1160.90.1.el7.x86_64] (local build)
Copyright (C) 2002-18, Bruce Allen, Christian Franke, www.smartmontools.org

=== START OF INFORMATION SECTION ===
Device Model:     VK000480GWSRR
Serial Number:    S4NANA0N632974
LU WWN Device Id: 5 002538 e006731e3
Firmware Version: HPG4
User Capacity:    480 103 981 056 bytes [480 GB]
Sector Sizes:     512 bytes logical, 4096 bytes physical
Rotation Rate:    Solid State Device
Form Factor:      2.5 inches
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   ACS-4, ACS-3 T13/2161-D revision 5
SATA Version is:  SATA 3.2, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Tue May 16 11:10:14 2023 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

...
0