Commit 68b7107b authored by Edward Allcutt, committed by David S. Miller

ipv4: icmp: Fix pMTU handling for rare case

Some older router implementations still send Fragmentation Needed
errors with the Next-Hop MTU field set to zero. This is explicitly
described as an eventuality that hosts must deal with by the
standard (RFC 1191) since older standards specified that those
bits must be zero.

Linux had a generic (for all of IPv4) implementation of the algorithm
described in the RFC for searching a list of MTU plateaus for a good
value. Commit 46517008 ("ipv4: Kill ip_rt_frag_needed().")
removed this as part of the changes to remove the routing cache.
Subsequently any Fragmentation Needed packet with a zero Next-Hop
MTU has been discarded without being passed to the per-protocol
handlers or notifying userspace for raw sockets.
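
As a rough illustration of the plateau search mentioned above, the sketch below picks the next lower plateau for a given MTU. The table values are those listed in RFC 1191 (Table 7-1); the helper name next_lower_plateau() is hypothetical, and this is not the code that the earlier commit removed.

```c
#include <stddef.h>
#include <stdint.h>

/* Plateau values as listed in RFC 1191, Table 7-1, largest first. */
static const uint16_t mtu_plateaus[] = {
	65535, 32000, 17914, 8166, 4352, 2002, 1492, 1006, 508, 296, 68
};

/*
 * Hypothetical helper: return the largest plateau strictly below
 * old_mtu. If no plateau is smaller, fall back to 68, the minimum
 * IPv4 MTU.
 */
static uint16_t next_lower_plateau(uint16_t old_mtu)
{
	size_t i;

	for (i = 0; i < sizeof(mtu_plateaus) / sizeof(mtu_plateaus[0]); i++) {
		if (mtu_plateaus[i] < old_mtu)
			return mtu_plateaus[i];
	}
	return 68;
}
```

A host that receives a Fragmentation Needed error with a zero Next-Hop MTU for a 1500-byte datagram would, under this scheme, retry with 1492, then 1006, and so on until packets get through.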

When a router on an MTU-limited path does not implement RFC 1191,
connections stall: large packets are discarded and the local
protocols are never notified, so they never attempt to lower the
pMTU.

One example I have seen is an OpenBSD router terminating IPsec
tunnels. Note that this case is distinct from the BSD 4.2 bug,
which incorrectly calculated the Next-Hop MTU. The distinction
matters because the commit in question dismissed that bug as a
valid concern.

All of the per-protocol handlers implement the simple approach from
RFC 1191 of immediately falling back to the minimum value. Although
this is sub-optimal it is vastly preferable to connections hanging
indefinitely.
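
The fallback described above can be sketched as a simple clamp. This is a minimal illustration, not the kernel's code: sketch_clamp_pmtu() and SKETCH_MIN_PMTU are hypothetical names, and the floor of 552 is an assumption chosen to resemble the usual default of the net.ipv4.route.min_pmtu sysctl.

```c
#include <stdint.h>

/* Assumed floor for this sketch only; not taken from the commit. */
#define SKETCH_MIN_PMTU 552u

/*
 * Hypothetical helper: compute the new path MTU from the Next-Hop MTU
 * reported in a Fragmentation Needed error. A reported value of zero
 * (or anything below the floor) falls straight back to the floor.
 * The result never exceeds the current path MTU.
 */
static uint32_t sketch_clamp_pmtu(uint32_t cur_pmtu, uint32_t reported_mtu)
{
	uint32_t mtu = reported_mtu;

	if (mtu < SKETCH_MIN_PMTU)
		mtu = SKETCH_MIN_PMTU;	/* zero Next-Hop MTU lands here */

	return mtu < cur_pmtu ? mtu : cur_pmtu;
}
```

With a current path MTU of 1500, a zero Next-Hop MTU yields 552 under these assumptions, so the connection shrinks its packets instead of stalling.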

Remove the Next-Hop MTU != 0 check and allow such packets
to follow the normal path.

Fixes: 46517008 ("ipv4: Kill ip_rt_frag_needed().")
Signed-off-by: Edward Allcutt <edward.allcutt@openmarket.com>
Signed-off-by: David S. Miller <davem@davemloft.net>
parent a921e2a3
...
@@ -739,8 +739,6 @@ static void icmp_unreach(struct sk_buff *skb)
 				/* fall through */
 			case 0:
 				info = ntohs(icmph->un.frag.mtu);
-				if (!info)
-					goto out;
 			}
 			break;
 		case ICMP_SR_FAILED:
......