Experiencing some crashes on my hypervisor
Hello everyone.
I'm up against a very strange situation on my setup:
(HP ML350 G6 with dual Xeon X5670 @ 2.93GHz and 48GB of RAM)
running Proxmox on 2 Samsung Evo 870 512GB SSDs in a ZFS array for redundancy.
One VM is running TrueNAS Core, with my HBA (an LSI card) PCI-passed through to it. It manages my files
and gives out SMB, FTP and NFS endpoints that I use to access my files from other (physical or virtual) machines on my network.
Another VM is running Debian with Nextcloud on top. I connected the FTP share from TrueNAS to Nextcloud as external storage, and it has been working like a treat for over a year. (Personal uptime record: 5.5 months)
(plus some other VMs, PBXs, etc.)
I have my music on the TrueNAS share, and I listen to it via Nextcloud, but during the last month I had 2 VERY weird crashes.
The first time, Nextcloud hung, then my other VMs started going offline one by one, ending with Proxmox crashing with no error on either its VGA out or in the logs. In fact, while this was unfolding I wasn't able to use my VGA terminal; the system was just not responding to any of my inputs. A hard reboot later, and after a thorough check of SMART and other reports on both TrueNAS and Proxmox, I was up again for quite a while, doing heavy file transfers, listening to music and watching movies off the server (all IO-heavy tasks).
Fast forward to tonight: I was about to finish my nightly music session... and the whole thing happened again! Nextcloud gone, TrueNAS crashing, Proxmox inaccessible... but this time I had a clue. Right before TrueNAS "went", it sent this to my logger:
Device /dev/da0p2 is causing slow I/O on pool boot-pool.
And the whole setup crashed again. A reboot later, I was back up and immediately checked the drive status.
On the Proxmox side, SMART passed on both SSDs and Wearout was at 14% and 15%. On the TrueNAS side, all my spinning rust had SMART passing, and with
glabel status
I couldn't find da0p2, but da0 is probably the virtual drive that Proxmox created for TrueNAS to boot from.
Please help me out with this one. It's almost cursed, and I have no clues to run off of to troubleshoot.