High CPU Usage on DHIS2 2.38.4.3

Alex_Tumwesigye · 9 August 2023 15:32

Is there any experiencing high CPU Usage on DHIS2 2.38.4.3? Any ideas are welcome.

Gassim · 11 August 2023 00:25

Welcome back to the community!

It might help more if you provide more information about your instance’s environment and setup. Additionally, you might find this topic helpful: High CPU use in a Linode server

Thanks!

Stephan_Mestach · 17 October 2023 20:59

I would try to gain a bit of visibility in what the machine is doing

the machine : sudo top -bn1
1. find the process eating the cpu or if the machine is swapping
2. you can also have a look a tools like glances or htop
get a grasp of what the java process is doing : this will produce thread dumps with the call stack of the executing code
for PID in jps | grep -v jps | cut "-d " -f1; do jstack $PID ; done'
(note you can use that site to parse and analyse a bit the dumps)
the db : to get the running queries on the db with psql you can launch
COPY (select * from pg_stat_activity) TO STDOUT WITH CSV HEADER
monitor the access/error logs (is there a robot, or over accessed api/page)
cat /var/log/nginx/access.log | awk '{print $7}' | sort | uniq -c | sort -nr | head -n 20 or
cat /var/log/nginx/access.log | awk '{print $9}' | sort | uniq -c | sort -nr | head -n 20
(adapt the field number according to your log format and http server)
check if a heapdump is not produced on your system (you might see a huge file)
for the java process you might be interested by this jvm-mon to profile/monitor the jvm or async-profiler (or use a commercial java profiler)
for the db you can have a look at pg_activity

Note that it’s possible to automate all this and produce reports
Launching several times or when the phenomenon is observed
This might allow you or a dhis2 developer to see a trend or spot some suspects.

In the usual suspect

analytics or continuous analytics running
the machine is swapping
the java process continuously heap dumping or triggering a lot of garbage collection (check the xmx xms options)
a script/batch using the api and hitting the same endpoint (or not reusing the session)
a bot trying a lot of url to find an exploit