1291: Network switch of Astro Coma cluster down
The network switch of the Coma cluster seems to be broken, all attached nodes are separated from the rest of the network. We’ll replace the switch a.s.a.p. and (let) analyze the problem after that.
CPK messages are initially sent to the CPK mailing list, you can (un)subscribe via this link. You can also follow the service interruption messages via RSS using the link in the title under the RSS icon. If the CPK takes more time to resolve, any updates are published on this website.
For RU wide service interruption see meldingen.ru.nl.
The network switch of the Coma cluster seems to be broken, all attached nodes are separated from the rest of the network. We’ll replace the switch a.s.a.p. and (let) analyze the problem after that.
February 14, ILS switched off antique versions of TLS (1.0 and 1.1) for the Eduroam authentication on ILS LDAP servers. From then on, SUSE Linux 15.3 clients can’t authenticate with U- or s-number. They only have TLS1.2 and the ILS servers offer TLS1.3 first, after that an error occurs. By using the Science-account to authenticate, these users succeed in connecting to Eduroam.
Due to human error, the connection between the new datacenter switches and the central router was interrupted.
Vmhost07 was accidentally shut down. Cause: human error. labservanttest neurotech2 printvm msql01 indicoimapp ldap2 eftw jupytervm
As a result of the expansion of the Ceph storage cluster, the cluster had performance and availability issues. The problems were resolved this morning.
Two modules of an important switch in the main C&CZ server room lost power during the preparation of planned maintenance. This disconnected ca. 75% of the servers in the room from the network. Moving the modules to new PDU’s kimited the downtime to ca. 15 minutes.
An error in the management software prevented all license processes from starting correctly at the reboot of the license server. After fixing this error, all licenses were available again.
Course software that had been tested caused an overload of the fileserver when it was used by 100 students. The performance of the fileserver was impaired for all users of network shares of this server.
A broken PDU has offlined a switch, which has caused the VPN server to be unreachable (and several other things, which don’t affect users).
Due to an emergency maintenance, the central microsoft exchange server is unavailable for 4 hours. This may also affect systems that are dependent on exchange. E-mail and calendar functionality is expected to be restored when the maintenance is done around 13:30 Today.