Data Center Cache Replication problems

February 25, 2018

I have been trying to set up a demo Data Center instance. It's on Azure, not that I think that matters. I did NOT use the Azure Data Center Marketplace Template. I wanted to set everything up manually. Everything works other than I am failing the 'HealthCheck: Cluster Cache Replication'.

If I am logged into node1 it tells me that node2 isn't replicating. If I am logged into node2 it tells me that node1 isn't replicating.

The Shared Home is in /mnt/sharedhome

The Shared Home is on a different vm and shared through NFS.

Nodes are CentOS 7.4.

Both nodes can see that mount fine. I've already chown'd that directory to the jira user

root $ chown jira /mnt/sharedhome/
root $ chown -R jira /mnt/sharedhome/

I think I've given the jira user the right permissions.

root $ chmod -R u+rwx /mnt/sharedhome/

The jira user can create and delete files in the directory. If I make a change to the directory on one node the change is almost immediately reflected on the other.

jira $ touch /mnt/sharedhome/test.txt      (on node1)
jira $ rm /mnt/sharedhome/test.txt         (on node2)

My cluster.properties file is (changed to node2 on node2 obviously)

# This ID must be unique across the cluster 
jira.node.id = node1 
# The location of the shared home directory for all JIRA nodes 
jira.shared.home = /mnt/sharedhome

On both I am seeing similar messages in catalina.out

2018-02-25 10:49:51,328 Caesium-1-2 INFO ServiceRunner [c.a.j.c.cache.ehcache.BlockingParallelCacheReplicator] Start replicating cache: com.atlassian.jira.plugins.healthcheck.service.HeartBeatService.heartbeat, operation: put, key: <only-in-debug>, stacktrace: <only-in-trace>
2018-02-25 10:49:51,343 Caesium-1-2 INFO ServiceRunner [c.a.j.c.cache.ehcache.BlockingParallelCacheReplicator] Done replicating cache: com.atlassian.jira.plugins.healthcheck.service.HeartBeatService.heartbeat, operation: put, key: <only-in-debug>, numberOfPeers: 1, numberOfSuccess: 1, timeMillis: 14, stacktrace: <only-in-trace>
2018-02-25 10:50:13,997 HealthCheck:thread-7 WARN taylor-local 627x61x2 c52hux 98.247.96.192 /rest/troubleshooting/1.0/check/process/ [c.a.t.j.healthcheck.cluster.ClusterReplicationHealthCheck] Node node1 does not seem to replicate its cache

Specifically bother are saying that they are

Done replicating cache

while both warning that the other node

does not seem to replicate its cache

EDIT: Based on
https://community.atlassian.com/t5/Jira-questions/JIRA-DC-Node-ehcache-connection-refused/qaq-p/634262

https://jira.atlassian.com/browse/JRASERVER-64974

https://jira.atlassian.com/browse/JRASERVER-66608

https://community.atlassian.com/t5/Jira-questions/What-is-the-random-port-opened-by-Jira-Datacenter-used-for-and/qaq-p/346614

I've added this to my cluster.properties file

ehcache.object.port = 40011

And opened that port for both inbound and outbound, as well as port 40001, did not fix the issue.

Again, it's just the Cluster Cache Replication health check that's failing. The Cluster Index Replication and Shared Home health checks are fine.

Product Q&A

Community resources

Support

Top groups

Community resources

Support

Learn

Community resources

Support

Events

Community resources

Support

Get product advice from experts

Join a community group

Advance your career with learning paths

Earn badges and rewards

Connect and share ideas at events

Data Center Cache Replication problems

3 answers

Suggest an answer

Was this helpful?

Thanks!

TAGS

Atlassian Community Events