CPK messages are initially sent to the CPK mailing list, you can (un)subscribe via this link. You can also follow the service interruption messages via RSS using the link in the title under the RSS icon. If the CPK takes more time to resolve, any updates are published on this website.

For RU wide service interruption see meldingen.ru.nl.

 

Service Interruptions


 No service interruptions.


Resolved Reports


1359: Ceph filesystem failure

Ceph filesystem failure. Services required for the CephFS filesytem cannot start. Failure of these services wil cause failure to access files on Ceph. We are in contact with our support party 42on to remedy the problem. Duration is currently unknown. Update 2024-01-22 We’re working with 42on on the issue and have a meeting scheduled for 17:00 today. Update 2024-01-23 An initial dentry_recover was successful and according to 42on the Ceph journals are OK....

Updated Apr 15, 2024  ·  Miek Gieben · Created Jan 19, 2024

1358: GitLab TLS Issue

On December 10, 2023 the main GitLab TLS certificate expired. A new certificate was generated, but it lacked a proper certificate chain. Browsers disregard that chain, but other GitLab components do care, which meant that the registry and the runners were unable to connect to GitLab. On Monday the 11th we renewed all certificates using Let’s Encrypt. Let’s Encrypt will also auto-renew these.

Updated Dec 11, 2023  ·  Miek Gieben · Created Dec 11, 2023

1357: jitsi videoconferencing unavailable

Jitsi (or actually the internal prosody service) had an old certificate, causing it to refuse startup after the normal reboot. The expired certificate was replaced and after a restart of the prosody service it works again.

1356: Network problems after maintenance on the RU core router

After maintenance on the central routers, our servers on the 25Gbit network couldn’t send traffic to the internet. The cause or nature of the problem appears to be in the central RU routing. By resetting the interface to our network, RU Connectivity can fix this, when it occurs again. RU Connectivity contacted their maintenance provider to investigate this problem. Final solution: After a few failed attempts, it was possible to define a static route in the central routers to our networks....

1355: Jupyterhub restarted

Some users were unable to login, the server had become unstable, which made a reboot necessary. Unfortunately, the jupyterhub service still needs a manual startup, which didn’t happen until 13:24 hours. The service is now working again. We are working to fix the reboot problem.

1354: DHCP server down due to config error

A typo that could propagate to shutdown the DHCP server had the effect that some network devices did not get an IP address when they were switched on and that others, whose address lease expired in this period, lost their IP address and thus their access to the network.. We’ll improve the process to prevent a typo from bringing down the DHCP server in the future.

1353: Jupyterhub22 refused to start this morning

After a scheduled reboot of the machine running jupyterhub, jupyter failed to start properly. The service started after invoking a manual start command. The measures taken to resolve this recurring problem have not proven to be sufficient.

1352: DNS broken for z.science.ru.nl

Due to a misconfiguration of the DNS in the z.science.ru.nl zone, all shares were not available during the outage. Extra tests are added to prevent a future occurrence.

Updated Oct 25, 2023  ·  Simon Oosthoek · Created Oct 23, 2023

1351: All Science services down 15 minutes Thursday Oct 19 07:00-07:15 due to router reboot

The router for most Science services urgently needs a reboot. This has been scheduled for early morning. In the unlikely case that this reboot fails, ILS Connectivity is on campus to fix it.

1350: Mailman not accepting messages

In preparation of the migration to Microsoft Exchange Online Protection (MS EOP), we added another mail exchange server (mx5) to be addressed directly by MS EOP. However, having had mx4 in production for some time, we forgot to test mx5 in conjunction with our mailman server (zaaivm). This could result in a bounced mail ’not accepting messages’ for FNWI users (employees and students) using @ru.nl addresses. Meanwhile, mx5 has been made known to zaaivm thus resolving this issue....