CPK messages are initially sent to the CPK mailing list; you can (un)subscribe via this link. You can also follow the service interruption messages via RSS, using the link in the title under the RSS icon. If a CPK takes more time to resolve, any updates are published on this website.

 

Service Interruptions


1365: restore old ceph shares to new locations

Even though we closed CPK #1359, not all user shares have been restored yet. We have recovered the data from CephFS to temporary storage that is not accessible to users; it will take a bit more time to find new permanent locations for this data. The new storage will no longer be cluster-based, but a single-server ZFS setup with snapshots.

Resolved Reports


1366: license server down

Due to an unexpected reaction to a config change, the license server for several of our licenses was misconfigured for about 40 minutes. The change was reverted, but the outage lasted long enough for users to run into license problems.

1364: Some network shares temporarily unavailable

Some of our network shares were temporarily unavailable earlier due to a configuration error. The issue has been resolved and all shares should be accessible now. We want to assure you that the configuration error that occurred did not impact the integrity or security of the data stored on the file server. We apologize for any inconvenience this may have caused. In the meantime, we’re working on making information about network shares readily available on DIY for your reference....

1363: Brief disruption to MATLAB licensing this morning

Some users may have experienced difficulty launching MATLAB this morning due to a temporary issue with the license server. We identified the problem and resolved it by manually restarting the server. MATLAB licensing should now be functioning normally. We apologize for any inconvenience this may have caused.

1362: Sieve difference(s) current mailbox server

After migrating to a new mailbox server on Sunday, March 31, we noticed different behaviour in the processing of mailbox filters (by the Sieve software). Firstly, the ‘include’ of global spam+virus filtering rules had a side-effect which has been fixed using a newer version of the software. Secondly, the default behaviour of the ‘fileinto’ statement underwent a change: previously, it featured an implicit ‘stop’ statement, but now it requires an explicit ‘stop’ statement after the ‘fileinto’ to achieve the same effect....
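For users who relied on the old implicit ‘stop’, the change can be applied mechanically to an existing filter. The following is a minimal Python sketch, assuming simple scripts in which every folder name is a plain double-quoted string. The helper name `add_explicit_stop` is hypothetical and not part of Sieve or of our mail setup. It is a naive text rewrite and does not handle comments or escaped quotes, so check the result before uploading it.

```python
import re

# A fileinto command: optional tagged arguments such as :create, then one
# double-quoted folder name and the terminating semicolon. The negative
# lookahead skips commands that are already followed by "stop;".
# Assumption: folder names contain no escaped double quotes.
_FILEINTO = re.compile(r'(\bfileinto\s+(?::\w+\s+)*"[^"]*"\s*;)(?!\s*stop\s*;)')


def add_explicit_stop(script: str) -> str:
    """Return the Sieve script with 'stop;' added after each bare fileinto."""
    return _FILEINTO.sub(r"\1 stop;", script)


# Illustrative filter; the subject tag and folder name are made up.
example = '''require ["fileinto"];
if header :contains "subject" "[cpk]" {
    fileinto "CPK";
}
'''
rewritten = add_explicit_stop(example)
```

Running the helper a second time leaves the script unchanged, because every ‘fileinto’ is then already followed by ‘stop;’.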

Updated Apr 5, 2024  ·  Erik Joost Visser · Created Apr 5, 2024 ·  Erik Visser

1361: homeserver home1 nfs problem

home1 had a problem with its NFS server after the reboot. Another reboot fixed it.

1360: mailserver will be replaced Easter weekend

During the Easter weekend (March 30th, 2024) the new mailserver will replace the old one. The final switch will take a short while, during which mail clients (e.g. Thunderbird, mail on your phone, etc.) will not be able to connect to the science mailserver. (This is a week later than announced, due to some issues that need to be resolved before the final migration.) Central mail services (Microsoft @ru.nl) are independent of our mailserver and thus will not be affected at all....

1359: Ceph filesystem failure

Ceph filesystem failure. Services required for the CephFS filesystem cannot start. Failure of these services will make files on Ceph inaccessible. We are in contact with our support party 42on to remedy the problem. Duration is currently unknown. Update 2024-01-22: We’re working with 42on on the issue and have a meeting scheduled for 17:00 today. Update 2024-01-23: An initial dentry_recover was successful and according to 42on the Ceph journals are OK....

Updated Apr 15, 2024  ·  Miek Gieben · Created Jan 19, 2024

1358: GitLab TLS Issue

On December 10, 2023 the main GitLab TLS certificate expired. A new certificate was generated, but it lacked a proper certificate chain. Browsers tolerate an incomplete chain, but other GitLab components do not, which meant that the registry and the runners were unable to connect to GitLab. On Monday the 11th we renewed all certificates using Let’s Encrypt. Let’s Encrypt will also auto-renew these.
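A problem of this kind can be spotted from a client that validates the chain strictly. The following is a minimal Python sketch using only the standard library. The function names are hypothetical, and the host in the usage comment is a placeholder, not the actual GitLab address. To our knowledge Python's `ssl` module does not fetch missing intermediate certificates, so an incomplete chain that a browser accepts may fail here, much like it did for the registry and the runners.

```python
import socket
import ssl
import time
from typing import Optional


def days_until_expiry(not_after: str, now: Optional[float] = None) -> float:
    """Days until a certificate's notAfter timestamp (negative if expired).

    not_after uses the format returned by getpeercert(),
    e.g. "Dec 10 00:00:00 2023 GMT".
    """
    if now is None:
        now = time.time()
    return (ssl.cert_time_to_seconds(not_after) - now) / 86400


def check_tls(host: str, port: int = 443, timeout: float = 10.0) -> float:
    """Connect with strict verification and return days until expiry.

    Raises ssl.SSLCertVerificationError when the certificate has expired
    or the chain cannot be validated against the system trust store.
    """
    context = ssl.create_default_context()
    with socket.create_connection((host, port), timeout=timeout) as sock:
        with context.wrap_socket(sock, server_hostname=host) as tls:
            cert = tls.getpeercert()
    return days_until_expiry(cert["notAfter"])


# Usage (placeholder host name):
#   print(check_tls("gitlab.example.org"))
```

Running such a check periodically and warning when the result drops below a chosen threshold would flag an approaching expiry even where auto-renewal is in place.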

Updated Dec 11, 2023  ·  Miek Gieben · Created Dec 11, 2023

1357: jitsi videoconferencing unavailable

Jitsi (or actually the internal Prosody service) had an expired certificate, causing it to refuse to start after the routine reboot. The expired certificate was replaced, and after a restart of the Prosody service Jitsi works again.