If you are using JIRA and/or Confluence in a corporate environment you should expect to hit scalability issues quite fast.
Here our current list of configuration changes that are needed in order to be able to scale the system for more users:
Example of working setups that are sharing a server with 48GB RAM, 32 cores, and SSD:
Number of cores is not so important, it seems that Atlassian products are not able to really use them effectively, the average load is less than 3, and a 100% usage would translate into 32.
Memory and IO speed are essential, SSD added a speed up of almost 10x when it comes to indexing or service start time.
While I do have full HTTP logs that do include backend response time for each request, I am still looking for a tools that is able to parse these and to extract some meaningful information.
We are not fully pleased about the performance and I want to have some realistic data regarding what is causing the slowdowns.
Feel free to add your own hints, so we can build a better tuning tutorial.
At this moment we are using nginx in front on JIRA to provide the following:
server {
    listen 80;
    server_name jira.example.com old-jira.example.com;
    rewrite     ^ https://jira.example.com$request_uri? permanent;
    access_log  /var/log/nginx/redirect_access.log full;
    error_log  /var/log/nginx/redirect_error.log;
}
 
server {
        listen 443 ssl spdy;
        server_name jira.example.com ;
        root /etc/nginx/www/;
        gzip  on;
        gzip_vary on;
        access_log  /var/log/nginx/jira.access.log full;
        error_log  /var/log/nginx/jira.error.log;
        client_max_body_size 150m;
        error_page 502 503 504  /www/maintenance.html;
        location /www/ {
                root /etc/nginx/;
        }
        location / {
                proxy_redirect          off;
                proxy_next_upstream     error timeout invalid_header http_500;
                proxy_connect_timeout   5;
                proxy_pass   http://localhost:8080;
                proxy_redirect  off;
                #proxy_connect_timeout 120;
                proxy_read_timeout      10800s;
                proxy_set_header        Host            $http_host;
                proxy_set_header        X-Real-IP       $remote_addr;
                proxy_set_header        X-Forwarded-Host $host;
                proxy_set_header        X-Forwarded-Server $host;
                proxy_set_header        X-Forwarded-For $proxy_add_x_forwarded_for;
        }
}
Have you been able to set up SPDY to be used by all the requests? I noticed that only a few (13 in my case) first request are being made using SPDY and then basic HTTS is used in my case: https://www.dropbox.com/s/qlowz0gr7qhlza8/Zrzut%20ekranu%202015-04-13%2010.55.29.png?dl=0
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
I have found that the number of custom fields affects the size of the Lucene index. And really big indexes can be slower to update which leads to other actions waiting longer
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
@Rp Subhub I am sure that using caching in nginx would speedup them considerably but at this moment I do not have a set of nginx caching rules. The major problem is that Atlassian products have the bad habit of not using the HTTP caching/expire properties properly so adding caching is really tricky. If anyone has a set of working caching options I would be more than happy to try them. My only experience with enabling the caching was really awful as I ended up with users being switched: logging in as John and ending up as Marry. Still it seems that Atlassian does keep the information regarding what can be be caches really secret, probably as an selling point for TAM and Enterprise level support.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
Great question and insights, Sorin! Thanks a lot! Have you tried setting up caching of static resources by nginx? Does it/will it have an effect on performance/load?
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
Have you experimented with JIRA/Confluence Data Center yet? I would assume large installation optimizations would be in that deployment package.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
Sorry but considering that the hardware usage is below 15%, looking for the datacenter seems just as an unfounded up-selling proposal.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
I have no relationship with Atlassian other than an user of their products, so I was not trying to upsell anything to you. I was suggesting that maybe (the reason for the word experiment) that changes in software architecture (active-active database) and algorithms should have a greater effect than throwing hardware at the problem.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
 
 
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.