CPK messages are initially sent to the CPK mailing list; you can (un)subscribe via this link. You can also follow the service interruption messages via RSS, using the link under the RSS icon in the title. If a CPK takes longer to resolve, updates are published on this website.

 

Service Interruptions


1365: restore old ceph shares to new locations

Although CPK #1359 is closed, not all user shares have been restored. We have recovered the data from CephFS to temporary storage that is not accessible to users; finding new permanent locations for this data will take a bit more time. The new storage will no longer be cluster-based but single-server ZFS with snapshots.

April 18, 2024 · updated April 18, 2024 ·  Simon

Resolved Reports


1364: Some network shares temporarily unavailable

Some of our network shares were temporarily unavailable earlier due to a configuration error. The issue has been resolved and all shares should be accessible now. We want to assure you that the configuration error that occurred did not impact the integrity or security of the data stored on the file server. We apologize for any inconvenience this may have caused. In the meantime, we’re working on making information about network shares readily available on DIY for your reference....

April 18, 2024 · updated April 18, 2024 ·  Bram

1363: Brief disruption to MATLAB licensing this morning

Some users may have experienced difficulty launching MATLAB this morning due to a temporary issue with the license server. We identified the problem and resolved it by manually restarting the server. MATLAB licensing should now be functioning normally. We apologize for any inconvenience this may have caused.

April 15, 2024 · updated April 15, 2024 ·  Bram

1362: Sieve difference(s) on the current mailbox server

After migrating to a new mailbox server on Sunday, March 31, we noticed different behaviour in the processing of mailbox filters (by the Sieve software). Firstly, the ‘include’ of global spam+virus filtering rules had a side-effect which has been fixed using a newer version of the software. Secondly, the default behaviour of the ‘fileinto’ statement underwent a change: previously, it featured an implicit ‘stop’ statement, but now it requires an explicit ‘stop’ statement after the ‘fileinto’ to achieve the same effect....

April 5, 2024 · updated April 9, 2024 ·  Erik
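
For users who want their existing filters to behave as before, the change described above amounts to following every ‘fileinto’ with a ‘stop’. The sketch below shows one possible way to do that automatically. It is a naive text rewrite, not a Sieve parser: the function name `add_explicit_stop` is hypothetical, it assumes the folder name is a simple quoted string, and it does not skip ‘fileinto’ statements that appear inside comments, so the result should be reviewed by hand.

```python
import re

# Matches one complete `fileinto` statement: optional tagged arguments
# (e.g. :create), a quoted folder name, the closing semicolon, and then,
# optionally, a `stop;` that already follows it.
_FILEINTO = re.compile(
    r'((?<![\w"])fileinto(?:\s+:\w+)*\s+"(?:[^"\\]|\\.)*"\s*;)'
    r'(\s*stop\s*;)?',
    re.IGNORECASE,
)


def add_explicit_stop(script: str) -> str:
    """Return the script with `stop;` added after each bare `fileinto`."""

    def _replace(match: "re.Match[str]") -> str:
        if match.group(2):
            # An explicit stop is already present; leave the text untouched.
            return match.group(0)
        return match.group(1) + " stop;"

    return _FILEINTO.sub(_replace, script)


if __name__ == "__main__":
    example = (
        'require ["fileinto"];\n'
        'if header :contains "subject" "newsletter" {\n'
        '    fileinto "Lists";\n'
        '}\n'
    )
    print(add_explicit_stop(example))
```

Scripts that deliberately file one message into several folders should not be rewritten this way, since ‘stop’ ends processing of the whole script.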

1361: homeserver home1 NFS problem

home1 had a problem with its NFS server after the reboot. Another reboot fixed it.

April 1, 2024 · updated April 9, 2024 ·  Simon

1360: mailserver will be replaced Easter weekend

During Easter weekend (March 30, 2024) the new mailserver will replace the old one. The final switch will take a short while, during which mail clients (e.g. Thunderbird, mail on your phone, etc.) will not be able to connect to the science mailserver. (This is a week later than announced, due to some issues that need to be resolved before the final migration.) Central mail services (Microsoft @ru.nl) are independent of our mailserver and thus will not be affected at all....

March 22, 2024 · updated April 9, 2024 ·  Simon

1359: Ceph filesystem failure

Ceph filesystem failure. Services required for the CephFS filesystem cannot start. Failure of these services will make files on Ceph inaccessible. We are in contact with our support party 42on to remedy the problem. Duration is currently unknown. Update 2024-01-22: We’re working with 42on on the issue and have a meeting scheduled for 17:00 today. Update 2024-01-23: An initial dentry_recover was successful and according to 42on the Ceph journals are OK....

January 19, 2024 · updated April 15, 2024

1358: GitLab TLS Issue

On December 10, 2023 the main GitLab TLS certificate expired. A new certificate was generated, but it lacked a proper certificate chain. Browsers tolerate a missing chain, but other GitLab components do not, which meant that the registry and the runners were unable to connect to GitLab. On Monday the 11th we renewed all certificates using Let’s Encrypt. Let’s Encrypt will also auto-renew these.

December 11, 2023 · updated April 9, 2024
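
Expiry problems like the one above can be spotted in advance with a small check. The following is a minimal sketch using only the Python standard library, not the monitoring we actually run: the function names `days_until_expiry` and `fetch_not_after` are hypothetical, and the host name in the usage example is a placeholder. Because the default SSL context verifies the certificate, the connection itself fails with `ssl.SSLCertVerificationError` when the certificate has already expired or cannot be verified against the trusted roots.

```python
import socket
import ssl
import time
from typing import Optional


def days_until_expiry(not_after: str, now: Optional[float] = None) -> float:
    """Days left until a certificate's notAfter time (negative if expired).

    `not_after` uses the format returned by SSLSocket.getpeercert(),
    e.g. "Dec 10 00:00:00 2023 GMT".
    """
    expires = ssl.cert_time_to_seconds(not_after)
    if now is None:
        now = time.time()
    return (expires - now) / 86400


def fetch_not_after(host: str, port: int = 443, timeout: float = 10.0) -> str:
    """Connect with full verification and return the notAfter field."""
    context = ssl.create_default_context()
    with socket.create_connection((host, port), timeout=timeout) as sock:
        with context.wrap_socket(sock, server_hostname=host) as tls:
            return tls.getpeercert()["notAfter"]


if __name__ == "__main__":
    host = "gitlab.example.org"  # placeholder, replace with the real host
    try:
        remaining = days_until_expiry(fetch_not_after(host))
        print(f"{host}: certificate expires in {remaining:.1f} days")
    except (ssl.SSLError, OSError) as exc:
        print(f"{host}: TLS check failed: {exc}")
```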

1357: jitsi videoconferencing unavailable

Jitsi (or actually the internal prosody service) had an expired certificate, causing it to refuse to start after the normal reboot. The expired certificate was replaced, and after a restart of the prosody service Jitsi works again.

December 6, 2023 · updated April 9, 2024 ·  Simon

1356: Network problems after maintenance on the RU core router

After maintenance on the central routers, our servers on the 25Gbit network couldn’t send traffic to the internet. The cause of the problem appears to lie in the central RU routing. If it occurs again, RU Connectivity can fix it by resetting the interface to our network. RU Connectivity contacted their maintenance provider to investigate this problem. Final solution: After a few failed attempts, it was possible to define a static route in the central routers to our networks....

December 4, 2023 · updated April 9, 2024 ·  Simon