I’m supporting the system administration of a growing OMERO installation. We run version 5.6.1 with a 3 server setup, i.e., one for OMERO.server, OMERO.web and the database each. We have users from different groups that use OMERO to store data from various instruments and run image analysis tasks using the web client, scripts, etc.
We have had a couple of instances where a user’s tasks has produced quite a lot of load on the OMERO.server for a significant amount of time without coming to an end. The problematic process limited the resources of the app server dramatically for other users. Also,the user did not seem to notice that something was wrong which made it difficult for us trace the issue back. From the top command, we were able to get the PID of the process in question, but we have not been able to find out the actual task/job in question and stop it without restarting the server.
This brings me to my questions
From a system administration point of view, I would like to be able to see which user is running what process at the moment. What is the best approach for this? I would like to be able to do it on OMERO.server itself and not via OMERO.web if possible.
I would also like to be able to stop a rogue process of which I know the PID without having to restart the entire server. Is this possible? And if yes, how? Because of the number of users, finding/contacting the user whose job is running, is not always a solution.
I’m fairly new to administrating OMERO and this is my first forum post. I hope that I have given all relevant information. If not, please let me know. So far, I have not found the information that I’m looking for in the documentation and the forum.