CPK messages are initially sent to the CPK mailing list, you can (un)subscribe via this link. You can also follow the service interruption messages via RSS using the link in the title under the RSS icon. If the CPK takes more time to resolve, any updates are published on this website.

For RU wide service interruption see meldingen.ru.nl.

 

Service Interruptions


1407: NFS problems under investigation

When we moved the gateways of our networks from the old location to the new firewalls, we have received some complaints about NFS filesystems having slowness, longer delays or unavailability. In general NFS should never be a requirement for a clusternode job if you can avoid it, because this I/O is always much slower than local I/O from /scratch. We are investigating how we can optimise the network to resolve this issue, but we are hard pressed to know the exact cause of the problem....

Resolved Reports


1290: Interrupted link to new datacenter switches

Due to human error, the connection between the new datacenter switches and the central router was interrupted.

Updated Oct 20, 2022  ·  Bram Daams · Created Dec 15, 2021 · 

1289: vmhost07 poweroff

Vmhost07 was accidentally shut down. Cause: human error. labservanttest neurotech2 printvm msql01 indicoimapp ldap2 eftw jupytervm

Updated Oct 20, 2022  ·  Bram Daams · Created Dec 2, 2021 · 

1288: Ceph storage expansion caused performance issues

As a result of the expansion of the Ceph storage cluster, the cluster had performance and availability issues. The problems were resolved this morning.

Updated Oct 20, 2022  ·  Bram Daams · Created Nov 16, 2021 · 

1287: Server room network switch powerless

Two modules of an important switch in the main C&CZ server room lost power during the preparation of planned maintenance. This disconnected ca. 75% of the servers in the room from the network. Moving the modules to new PDU’s kimited the downtime to ca. 15 minutes.

Updated Oct 20, 2022  ·  Bram Daams · Created Oct 12, 2021 · 

1286: License server problem

An error in the management software prevented all license processes from starting correctly at the reboot of the license server. After fixing this error, all licenses were available again.

Updated Oct 20, 2022  ·  Bram Daams · Created Oct 11, 2021 · 

1285: Fileserver 'flock' overloaded

Course software that had been tested caused an overload of the fileserver when it was used by 100 students. The performance of the fileserver was impaired for all users of network shares of this server.

Updated Oct 20, 2022  ·  Bram Daams · Created Sep 17, 2021 · 

1284: VPN onbereikbaar

A broken PDU has offlined a switch, which has caused the VPN server to be unreachable (and several other things, which don’t affect users).

Updated Oct 20, 2022  ·  Bram Daams · Created Apr 24, 2021 · 

1283: Central E-mail/Calendar disruption (exchange)

Due to an emergency maintenance, the central microsoft exchange server is unavailable for 4 hours. This may also affect systems that are dependent on exchange. E-mail and calendar functionality is expected to be restored when the maintenance is done around 13:30 Today.

Updated Nov 9, 2022  ·  Miek Gieben · Created Apr 14, 2021 · 

1282: Ceph problem

During a routine upgrade of ceph, a bug in the latest version manifested itself and made the ceph manager unreachable. After aborting the upgrade and with help from the ceph-users mailinglist, everything became available again using a workaround.

Updated Oct 26, 2022  ·  Miek Gieben · Created Mar 24, 2021 · 

1281: Windows 7 computers disabled in B-FAC domain

Because of security issues the last remaining Windows 7 machines wil be disabled, effective 24-03-2021, as member of the Active Directory Domain B-FAC. Please upgrade these computers to a more up-to-date OS.

Updated Jul 7, 2024  ·  Bram Daams · Created Mar 24, 2021 ·