Mark the node as FAIL when the node is marked as NOADDR #1191

enjoy-binbin · 2024-10-18T08:55:23Z

Imagine we have a cluster, for example a three-shard cluster.
If shard 1 does a CLUSTER RESET HARD, its node name changes,
and the other nodes will mark it as NOADDR since the node
name received in the PONG has changed.

In the eyes of the other nodes, there is still a working primary
node, but it has no address. In this case, the address reported
in MOVED will be invalid and will confuse clients. At the
same time, the replica will not fail over since its primary
is not in the FAIL state, and the cluster looks OK to everyone.

This leaves a cluster that appears OK but has no coverage for
shard 1. Obviously we should do something like CLUSTER FORGET
to remove the node and fix the cluster before using it.

But the point here is that we can mark the NOADDR node as FAIL
to advance the cluster state. If a node is NOADDR, it does
not have a valid address, so we won't reconnect to it, we won't
send it PING, and we won't gossip about it. It seems reasonable
to mark it as FAIL.
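
To illustrate the idea, here is a minimal standalone sketch. It is not
the actual patch: the struct, the flag values, and the function names
(sketch_node, NODE_FLAG_*, sketch_handle_pong,
sketch_replica_may_failover) are all hypothetical and only loosely
mirror what cluster_legacy.c does. It assumes a replica only considers
a failover when its primary carries the FAIL flag.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical constants, loosely modelled on the cluster code. */
#define NODE_NAMELEN 40
#define NODE_FLAG_FAIL (1 << 0)
#define NODE_FLAG_NOADDR (1 << 1)

/* Hypothetical, stripped-down view of a known node. */
typedef struct {
    char name[NODE_NAMELEN]; /* node ID we expect on this link */
    uint16_t flags;          /* NODE_FLAG_* bits */
} sketch_node;

/* Handle a PONG on the link of a known node. If the sender ID does not
 * match the ID we expect, the old ID will not come back, so mark the
 * node as NOADDR and also as FAIL. Returns 1 if the node was marked. */
int sketch_handle_pong(sketch_node *node, const char *sender) {
    if (memcmp(node->name, sender, NODE_NAMELEN) == 0) return 0;
    node->flags |= NODE_FLAG_NOADDR;
    node->flags |= NODE_FLAG_FAIL;
    return 1;
}

/* Assumption: a replica only starts a failover for a FAIL primary. */
int sketch_replica_may_failover(const sketch_node *primary) {
    return (primary->flags & NODE_FLAG_FAIL) != 0;
}
```

Without the second flag assignment, the node would stay NOADDR forever
and sketch_replica_may_failover() would keep returning 0, which is the
stuck state described above.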

Signed-off-by: Binbin <[email protected]>

codecov · 2024-10-18T09:11:26Z

Codecov Report

Attention: Patch coverage is 66.66667% with 1 line in your changes missing coverage. Please review.

Project coverage is 70.68%. Comparing base (a62d1f1) to head (c1bf0e6).

Files with missing lines	Patch %	Lines
src/cluster_legacy.c	66.66%	1 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff              @@
##           unstable    #1191      +/-   ##
============================================
+ Coverage     70.65%   70.68%   +0.02%     
============================================
  Files           114      114              
  Lines         61799    63073    +1274     
============================================
+ Hits          43664    44580     +916     
- Misses        18135    18493     +358

Files with missing lines	Coverage Δ
src/cluster_legacy.c	`86.52% <66.66%> (+0.21%)`	⬆️

... and 90 files with indirect coverage changes

zuiderkwast

It seems to be correct, but I want someone else to take a look. @PingXie?

So when a node is in NOADDR state, it will never be PFAIL and later FAIL? We never get any updates from the old node ID so it should automatically become PFAIL at some point?

Are there any other cases where a node can be marked as NOADDR and come back again? For example, a server whose IP address changed while it is still running?

zuiderkwast · 2024-10-18T10:10:46Z

src/cluster_legacy.c

+                /* We also mark the node as fail because we have disconnected from it,
+                 * and will not reconnect, and obviously we will not gossip NOADDR nodes.
+                 * Marking it as FAIL can help us advance the state, such as the cluster
+                 * state becomes FAIL or the replica can do the failover. Otherwise, the
+                 * NOADDR node will provide an invalid address in redirection and confuse
+                 * the client, and the replica will not initiate a failover since the node
+                 * is not actually in FAIL state. */


Let's combine this comment with the comment above?

             } else if (memcmp(link->node->name, hdr->sender, CLUSTER_NAMELEN) != 0) {
                 /* If the reply has a non matching node ID we
                  * disconnect this node and set it as not having an associated
-                 * address. */
+                 * address. This can happen if the node did CLUSTER RESET and changed
+                 * its node ID. In this case, the old node ID will not come back.
+                 *
+                 * We also mark the node as fail because we have disconnected from it,
+                 * and will not reconnect, and obviously we will not gossip NOADDR nodes.
+                 * Marking it as FAIL can help us advance the state, such as the cluster
+                 * state becomes FAIL or the replica can do the failover. Otherwise, the
+                 * NOADDR node will provide an invalid address in redirection and confuse
+                 * the client, and the replica will not initiate a failover since the node
+                 * is not actually in FAIL state. */

enjoy-binbin requested review from PingXie and zuiderkwast October 18, 2024 08:55

enjoy-binbin force-pushed the NOADDR_nodes branch from e59417b to c1bf0e6 Compare October 18, 2024 08:56

zuiderkwast reviewed Oct 18, 2024
