
source ip is not preserved if out of clustercidr #8812

Closed
ehsan310 opened this issue May 13, 2024 · 5 comments

@ehsan310

We have a cluster up and running, peered with the ToR and with ClusterIPs advertised.
We have noticed that if traffic is received on a ClusterIP, it gets masqueraded unless it comes from the range set as kube-proxy's clusterCIDR, which matches our Calico default IPPool.

According to the docs:

https://docs.tigera.io/calico/latest/about/kubernetes-training/about-kubernetes-services#externaltrafficpolicylocal

With the traffic policy set to Local I was expecting to see the source IP preserved all the way down to the pod, but it only seems to work if I use NodePort and send traffic directly to the node IP address rather than the ServiceIP. Is this expected?

I deployed my cluster via Kubespray, version 2.24.1.
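
For reference, this is the kind of ClusterIP advertisement being discussed; a minimal sketch assuming calicoctl is available, with a placeholder service CIDR rather than the one from this cluster:

```sh
# Advertise the cluster's service ClusterIP range to the BGP peers (ToR).
# 10.96.0.0/12 is a placeholder; substitute the cluster's actual service CIDR.
calicoctl apply -f - <<'EOF'
apiVersion: projectcalico.org/v3
kind: BGPConfiguration
metadata:
  name: default
spec:
  serviceClusterIPs:
  - cidr: 10.96.0.0/12
EOF
```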

@tomastigera
Contributor

This seems related to the iptables dataplane. Do you see a MASQUERADE rule being hit, and is it from kube-proxy or Calico? It looks like this is kube-proxy's doing.

I wonder whether you would see the same with the eBPF dataplane, since there we (a) replace kube-proxy and (b) treat any service resolution hitting the main node interfaces pretty much like a NodePort. We also always preserve the source IP regardless of the externalTrafficPolicy.
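
One rough way to check which masquerade rule is being hit with the iptables dataplane (generic iptables commands, run on the node receiving the traffic):

```sh
# Zero the NAT counters, send some test traffic from outside the clusterCIDR
# to the advertised ClusterIP, then look at which rules saw packets.
iptables -t nat -Z
iptables -t nat -L POSTROUTING -v -n   # per-rule packet/byte counters
iptables -t nat -S | grep -i MASQ      # kube-proxy uses KUBE-MARK-MASQ / KUBE-POSTROUTING,
                                       # Calico's natOutgoing uses cali-nat-outgoing
```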

@caseydavenport
Member

> We have noticed that if traffic is received on a ClusterIP, it gets masqueraded unless it comes from the range set as kube-proxy's clusterCIDR, which matches our Calico default IPPool.

Yeah, the external traffic policy is implemented by kube-proxy in this case.

> The CIDR range of the pods in the cluster. (For dual-stack clusters, this can be a comma-separated dual-stack pair of CIDR ranges.) When --detect-local-mode is set to ClusterCIDR, kube-proxy will consider traffic to be local if its source IP is in this range. (Otherwise it is not used.) This parameter is ignored if a config file is specified by --config.

Source: https://kubernetes.io/docs/reference/command-line-tools-reference/kube-proxy/

I suspect the Calico docs are subtly misleading if they state that ClusterIP advertisement is going to preserve the source IP in this scenario, since kube-proxy will NAT this traffic if it's coming from outside the cluster CIDR (the cluster IP is really intended to be reached from within the cluster).
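
A quick way to confirm what kube-proxy is working with, assuming it is configured through the usual kube-proxy ConfigMap in kube-system (as with kubeadm/Kubespray deployments):

```sh
# clusterCIDR and detectLocalMode are the KubeProxyConfiguration fields that
# drive the "is this source IP local to the cluster?" decision.
kubectl -n kube-system get configmap kube-proxy -o yaml \
  | grep -E 'clusterCIDR|detectLocalMode'
```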

@ehsan310
Author

> We have noticed that if traffic is received on a ClusterIP, it gets masqueraded unless it comes from the range set as kube-proxy's clusterCIDR, which matches our Calico default IPPool.
>
> Yeah, the external traffic policy is implemented by kube-proxy in this case.
>
> The CIDR range of the pods in the cluster. (For dual-stack clusters, this can be a comma-separated dual-stack pair of CIDR ranges.) When --detect-local-mode is set to ClusterCIDR, kube-proxy will consider traffic to be local if its source IP is in this range. (Otherwise it is not used.) This parameter is ignored if a config file is specified by --config.
>
> Source: https://kubernetes.io/docs/reference/command-line-tools-reference/kube-proxy/
>
> I suspect the Calico docs are subtly misleading if they state that ClusterIP advertisement is going to preserve the source IP in this scenario, since kube-proxy will NAT this traffic if it's coming from outside the cluster CIDR (the cluster IP is really intended to be reached from within the cluster).

Agreed, this requires a bit more clarification.

@ehsan310
Author

> This seems related to the iptables dataplane. Do you see a MASQUERADE rule being hit, and is it from kube-proxy or Calico? It looks like this is kube-proxy's doing.
>
> I wonder whether you would see the same with the eBPF dataplane, since there we (a) replace kube-proxy and (b) treat any service resolution hitting the main node interfaces pretty much like a NodePort. We also always preserve the source IP regardless of the externalTrafficPolicy.

Tried it with eBPF on multiple clusters and managed to fix it; it's working as expected for now.
Also tried externalTrafficPolicy set to Local and got the expected BGP advertisements.
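
For anyone following along, a minimal sketch of switching on the eBPF dataplane; this assumes the prerequisites from the Calico eBPF docs are already in place (the kubernetes-services-endpoint ConfigMap and kube-proxy disabled or scaled down):

```sh
# Switch Felix to the eBPF dataplane. Reverting is the same patch with false.
calicoctl patch felixconfiguration default --patch '{"spec": {"bpfEnabled": true}}'
```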

@ehsan310
Author

@tomastigera Noticed one behavior difference when eBPF is enabled: with the IPPool's natOutgoing set to false, connections from pods to the Kubernetes API through the ServiceIP start timing out, since there is no iptables rule sending the service IP traffic out to the correct node.

How does this work in eBPF mode?

I am forced to enable outgoing NAT to solve this issue. Is this a bug or expected behavior?
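
The workaround mentioned above, as a sketch (the pool name is a guess; list pools first with `calicoctl get ippools`):

```sh
# Re-enable outgoing NAT on the default IPv4 pool; this is the workaround that
# restored API-server connectivity via the ServiceIP in this case.
calicoctl patch ippool default-ipv4-ippool --patch '{"spec": {"natOutgoing": true}}'
```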
