iproute2

Commit Graph

Author	SHA1	Message	Date
David Ahern	09d8ce3db1	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@kernel.org>	2021-08-04 09:24:12 -06:00
Justin Iurman	32f4969d44	New IOAM6 encap type for routes This patch provides a new encap type for routes to insert an IOAM pre-allocated trace: $ ip -6 ro ad fc00::1/128 encap ioam6 trace prealloc type 0x800000 ns 1 size 12 dev eth0 where: - "trace" and "prealloc" may appear as useless but just anticipate for future implementations of other ioam option types. - "type" is a bitfield (=u32) defining the IOAM pre-allocated trace type (see the corresponding uapi). - "ns" is an IOAM namespace ID attached to the pre-allocated trace. - "size" is the trace pre-allocated size in bytes; must be a 4-octet multiple; limited size (see IOAM6_TRACE_DATA_SIZE_MAX). Signed-off-by: Justin Iurman <justin.iurman@uliege.be> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-08-02 11:33:31 -06:00
Justin Iurman	2909812583	Add, show, link, remove IOAM namespaces and schemas This patch provides support for adding, listing and removing IOAM namespaces and schemas with iproute2. When adding an IOAM namespace, both "data" (=u32) and "wide" (=u64) are optional. Therefore, you can either have none, one of them, or both at the same time. When adding an IOAM schema, there is no restriction on "DATA" except its size (see IOAM6_MAX_SCHEMA_DATA_LEN). By default, an IOAM namespace has no active IOAM schema (meaning an IOAM namespace is not linked to an IOAM schema), and an IOAM schema is not considered as "active" (meaning an IOAM schema is not linked to an IOAM namespace). It is possible to link an IOAM namespace with an IOAM schema, thanks to the last command below (meaning the IOAM schema will be considered as "active" for the specific IOAM namespace). $ ip ioam Usage: ip ioam { COMMAND \| help } ip ioam namespace show ip ioam namespace add ID [ data DATA32 ] [ wide DATA64 ] ip ioam namespace del ID ip ioam schema show ip ioam schema add ID DATA ip ioam schema del ID ip ioam namespace set ID schema { ID \| none } Signed-off-by: Justin Iurman <justin.iurman@uliege.be> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-08-02 11:33:05 -06:00
Gokul Sivakumar	cf866f0a5a	ipneigh: add support to print brief output of neigh cache in tabular format Make use of the already available brief flag and print the basic details of the IPv4 or IPv6 neighbour cache in a tabular format for better readability when the brief output is expected. $ ip -br neigh 172.16.12.100 bridge0 b0:fc:36:2f:07:43 172.16.12.174 bridge0 8c:16:45:2f:bc:1c 172.16.12.250 bridge0 04:d9:f5:c1:0c:74 fe80::267b:9f70:745e:d54d bridge0 b0:fc:36:2f:07:43 fd16:a115:6a62:0:8744:efa1:9933:2c4c bridge0 8c:16:45:2f:bc:1c fe80::6d9:f5ff:fec1:c74 bridge0 04:d9:f5:c1:0c:74 And add "ip neigh show" to the list of ip sub commands mentioned in the man page that support the brief output in tabular format. Signed-off-by: Gokul Sivakumar <gokulkumar792@gmail.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-08-02 10:14:50 -06:00
Alexander Mikhalitsyn	459ce6e3d7	ip route: ignore ENOENT during save if RT_TABLE_MAIN is being dumped We started to use in-kernel filtering feature which allows to get only needed tables (see iproute_dump_filter()). From the kernel side it's implemented in net/ipv4/fib_frontend.c (inet_dump_fib), net/ipv6/ip6_fib.c (inet6_dump_fib). The problem here is that behaviour of "ip route save" was changed after `c7e6371bc` ("ip route: Add protocol, table id and device to dump request"). If filters are used, then kernel returns ENOENT error if requested table is absent, but in newly created net namespace even RT_TABLE_MAIN table doesn't exist. It is really allocated, for instance, after issuing "ip l set lo up". Reproducer is fairly simple: $ unshare -n ip route save > dump Error: ipv4: FIB table does not exist. Dump terminated Expected result here is to get empty dump file (as it was before this change). v2: reworked, so, now it takes into account NLMSGERR_ATTR_MSG (see nl_dump_ext_ack_done() function). We want to suppress error messages in stderr about absent FIB table from kernel too. v3: reworked to make code clearer. Introduced rtnl_suppressed_errors(), rtnl_suppress_error() helpers. User may suppress up to 3 errors (may be easily extended by changing SUPPRESS_ERRORS_INIT macro). v4: reworked, rtnl_dump_filter_errhndlr() was introduced. Thanks to Stephen Hemminger for comments and suggestions v5: space fixes, commit message reformat, empty initializers Fixes: `c7e6371bc` ("ip route: Add protocol, table id and device to dump request") Cc: David Ahern <dsahern@gmail.com> Cc: Stephen Hemminger <stephen@networkplumber.org> Cc: Andrei Vagin <avagin@gmail.com> Cc: Alexander Mikhalitsyn <alexander@mihalicyn.com> Signed-off-by: Alexander Mikhalitsyn <alexander.mikhalitsyn@virtuozzo.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-07-07 07:32:56 -07:00
Sergey Ryazanov	6acccd52a2	iplink: support for WWAN devices The WWAN subsystem has been extended to generalize the per data channel network interfaces management. This change implements support for WWAN links handling. And actively uses the earlier introduced ip-link capability to specify the parent by its device name. The WWAN interface for a new data channel should be created with a command like this: ip link add dev wwan0-2 parentdev wwan0 type wwan linkid 2 Where: wwan0 is the modem HW device name (should be taken from /sys/class/wwan) and linkid is an identifier of the opened data channel. Signed-off-by: Sergey Ryazanov <ryazanov.s.a@gmail.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-06-26 04:40:57 +00:00
Sergey Ryazanov	362da458a4	iplink: add support for parent device Add support for specifying a parent device (struct device) by its name during the link creation and printing parent name in the links list. This option will be used to create WWAN links and possibly by other device classes that do not have a "natural parent netdev". Add the parent device bus name printing for links list info completeness. But do not add a corresponding command line argument, as we do not have a use case for this attribute. Signed-off-by: Sergey Ryazanov <ryazanov.s.a@gmail.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-06-26 04:40:22 +00:00
Paolo Lungaroni	3e26254f31	seg6: add support for SRv6 End.DT46 Behavior We introduce the new "End.DT46" action for supporting the SRv6 End.DT46 Behavior in iproute2. The SRv6 End.DT46 Behavior, defined in RFC 8986 [1] section 4.8, can be used to implement L3 VPNs based on Segment Routing over IPv6 networks in multi-tenants environments and it is capable of handling both IPv4 and IPv6 tenant traffic at the same time. The SRv6 End.DT46 Behavior decapsulates the received packets and it performs the IPv4 or IPv6 routing lookup in the routing table of the tenant. As for the End.DT4 and for the End.DT6 in VRF mode, the SRv6 End.DT46 Behavior leverages a VRF device in order to force the routing lookup into the associated routing table using the "vrftable" attribute. To make the End.DT46 work properly, it must be guaranteed that the routing table used for routing lookup operations is bound to one and only one VRF during the tunnel creation. Such constraint has to be enforced by enabling the VRF strict_mode sysctl parameter, i.e.: $ sysctl -wq net.vrf.strict_mode=1 Note that the same approach is used for the End.DT4 Behavior and for the End.DT6 Behavior in VRF mode. An SRv6 End.DT46 Behavior instance can be created as follows: $ ip -6 route add 2001:db8::1 encap seg6local action End.DT46 vrftable 100 dev vrf100 Standard Output: $ ip -6 route show 2001:db8::1 2001:db8::1 encap seg6local action End.DT46 vrftable 100 dev vrf100 metric 1024 pref medium JSON Output: $ ip -6 -j -p route show 2001:db8::1 [ { "dst": "2001:db8::1", "encap": "seg6local", "action": "End.DT46", "vrftable": 100, "dev": "vrf100", "metric": 1024, "flags": [ ], "pref": "medium" } ] This patch updates the route.8 man page and the ip route help with the information related to End.DT46. Considering that the same information was missing for the SRv6 End.DT4 and the End.DT6 Behaviors, we have also added it. [1] https://www.rfc-editor.org/rfc/rfc8986.html#name-enddt46-decapsulation-and-s Signed-off-by: Andrea Mayer <andrea.mayer@uniroma2.it> Signed-off-by: Paolo Lungaroni <paolo.lungaroni@uniroma2.it> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-06-22 15:36:17 +00:00
Jakub Kicinski	49437375b6	ip: dynamically size columns when printing stats This change makes ip -s -s output size the columns automatically. I often find myself using json output because the normal output is unreadable. Even on a laptop after 2 days of uptime byte and packet counters almost overflow their columns, let alone a busy server. For max readability switch to right align. Before: RX: bytes packets errors dropped missed mcast 8227918473 8617683 0 0 0 0 RX errors: length crc frame fifo overrun 0 0 0 0 0 TX: bytes packets errors dropped carrier collsns 691937917 4727223 0 0 0 0 TX errors: aborted fifo window heartbeat transns 0 0 0 0 10 After: RX: bytes packets errors dropped missed mcast 8228633710 8618408 0 0 0 0 RX errors: length crc frame fifo overrun 0 0 0 0 0 TX: bytes packets errors dropped carrier collsns 692006303 4727740 0 0 0 0 TX errors: aborted fifo window heartbt transns 0 0 0 0 10 More importantly, with large values before: RX: bytes packets errors dropped overrun mcast 126570234447969 15016149200 0 0 0 0 RX errors: length crc frame fifo missed 0 0 0 0 0 TX: bytes packets errors dropped carrier collsns 126570234447969 15016149200 0 0 0 0 TX errors: aborted fifo window heartbeat transns 0 0 0 0 10 Note that in this case we have full shift by a column, e.g. the value under "dropped" is actually for "errors" etc. After: RX: bytes packets errors dropped missed mcast 126570234447969 15016149200 0 0 0 0 RX errors: length crc frame fifo overrun 0 0 0 0 0 TX: bytes packets errors dropped carrier collsns 126570234447969 15016149200 0 0 0 0 TX errors: aborted fifo window heartbt transns 0 0 0 0 10 Signed-off-by: Jakub Kicinski <kuba@kernel.org> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-05-09 22:51:59 +00:00
Paolo Lungaroni	02ca3aabe9	seg6: add counters support for SRv6 Behaviors We introduce the "count" optional attribute for supporting counters in SRv6 Behaviors as defined in [1], section 6. For each SRv6 Behavior instance, counters defined in [1] are: - the total number of packets that have been correctly processed; - the total amount of traffic in bytes of all packets that have been correctly processed; In addition, we introduce a new counter that counts the number of packets that have NOT been properly processed (i.e. errors) by an SRv6 Behavior instance. Each SRv6 Behavior instance can be configured, at the time of its creation, to make use of counters specifing the "count" attribute as follows: $ ip -6 route add 2001:db8::1 encap seg6local action End count dev eth0 per-behavior counters can be shown by adding "-s" to the iproute2 command line, i.e.: $ ip -s -6 route show 2001:db8::1 2001:db8::1 encap seg6local action End packets 0 bytes 0 errors 0 dev eth0 [1] https://www.rfc-editor.org/rfc/rfc8986.html#name-counters v2: - add help and route.8 man page updates Signed-off-by: Andrea Mayer <andrea.mayer@uniroma2.it> Signed-off-by: Paolo Lungaroni <paolo.lungaroni@uniroma2.it> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-05-09 22:20:59 +00:00
Jakub Kicinski	570d2cf0ec	ip: align the name of the 'nohandler' stat Before: RX: bytes packets errors dropped missed mcast 8848233056 8548168 0 0 0 0 RX errors: length crc frame fifo overrun nohandler 0 0 0 0 0 101 TX: bytes packets errors dropped carrier collsns compressed 1142925945 4683483 0 0 0 0 101 TX errors: aborted fifo window heartbeat transns 0 0 0 0 14 After: RX: bytes packets errors dropped missed mcast 8848297833 8548461 0 0 0 0 RX errors: length crc frame fifo overrun nohandler 0 0 0 0 0 101 TX: bytes packets errors dropped carrier collsns compressed 1143049820 4683865 0 0 0 0 101 TX errors: aborted fifo window heartbeat transns 0 0 0 0 14 Signed-off-by: Jakub Kicinski <kuba@kernel.org> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-05-06 14:41:19 +00:00
Jianguo Wu	7f1d58d1a1	mptcp: make sure flag signal is set when add addr with port When add address with port, it is mean to send an ADD_ADDR to remote, so it must have flag signal set. Fixes: `42fbca91cd` ("mptcp: add support for port based endpoint") Signed-off-by: Jianguo Wu <wujianguo@chinatelecom.cn> Acked-by: Matthieu Baerts <matthieu.baerts@tessares.net> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-04-30 14:30:24 +00:00
David Ahern	e1e089d1f2	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@kernel.org>	2021-04-28 15:48:28 +00:00
Jethro Beekman	d56dcd3549	ip: Add nodst option to macvlan type source The default behavior for source MACVLAN is to duplicate packets to appropriate type source devices, and then do the normal destination MACVLAN flow. This patch adds an option to skip destination MACVLAN processing if any matching source MACVLAN device has the option set. This allows setting up a "catch all" device for source MACVLAN: create one or more devices with type source nodst, and one device with e.g. type vepa, and incoming traffic will be received on exactly one device. Signed-off-by: Jethro Beekman <kernel@jbeekman.nl> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-04-28 15:45:59 +00:00
Stephen Hemminger	2363bc99f9	Merge git://git.kernel.org/pub/scm/network/iproute2/iproute2-next Required manual fix of devlink/devlink.c Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-04-27 19:39:39 -07:00
Stephen Hemminger	a3fb3fcb7d	remove trailing whitespace Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-04-27 11:55:53 -07:00
Andrea Claudi	38ef5bb7b4	ip: netns: fix missing netns close on some error paths In functions netns_pids() and netns_identify_pid(), the netns file is not closed on some error paths. Fix this using a conditional close and a single return point on both functions. Fixes: `44b563269e` ("ip-nexthop: support flush by id") Signed-off-by: Andrea Claudi <aclaudi@redhat.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-04-26 21:04:02 -07:00
Tony Ambardar	e705b19d48	ip: drop 2-char command assumption The 'ip' utility hardcodes the assumption of being a 2-char command, where any follow-on characters are passed as an argument: $ ./ip-full help Object "-full" is unknown, try "ip help". This confusing behaviour isn't seen with 'tc' for example, and was added in a 2005 commit without documentation. It was noticed during testing of 'ip' variants built/packaged with different feature sets (e.g. w/o BPF support). Mitigate the problem by redoing the command without the 2-char assumption if the follow-on characters fail to parse as a valid command. Fixes: `351efcde4e` ("Update header files to 2.6.14") Signed-off-by: Tony Ambardar <Tony.Ambardar@gmail.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-04-26 02:29:42 +00:00
Andrea Claudi	81bfd01a4c	lib: move get_task_name() from rdma The function get_task_name() is used to get the name of a process from its pid, and its implementation is similar to ip/iptuntap.c:pid_name(). Move it to lib/fs.c to use a single implementation and make it easily reusable. Signed-off-by: Andrea Claudi <aclaudi@redhat.com> Acked-by: Leon Romanovsky <leonro@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-04-22 05:22:16 +00:00
Florian Westphal	ff619e4fd3	mptcp: add support for event monitoring This adds iproute2 support for mptcp event monitoring, e.g. creation, establishment, address announcements from the peer, subflow establishment and so on. While the kernel-generated events are primarily aimed at mptcpd (e.g. for subflow management), this is also useful for debugging. This adds print support for the existing events. Sample output of 'ip mptcp monitor': [ CREATED] token=83f3a692 remid=0 locid=0 saddr4=10.0.1.2 daddr4=10.0.1.1 sport=58710 dport=10011 [ ESTABLISHED] token=83f3a692 remid=0 locid=0 saddr4=10.0.1.2 daddr4=10.0.1.1 sport=58710 dport=10011 [SF_ESTABLISHED] token=83f3a692 remid=0 locid=1 saddr4=10.0.2.2 daddr4=10.0.1.1 sport=40195 dport=10011 backup=0 [ CLOSED] token=83f3a692 Signed-off-by: Florian Westphal <fw@strlen.de>	2021-04-22 05:10:25 +00:00
Andrea Claudi	6a2c51da99	nexthop: fix memory leak in add_nh_group_attr() grps is dinamically allocated with a calloc, and not freed in a return path in the for cycle. This commit fix it. While at it, make the function use a single return point. Fixes: `63df8e8543` ("Add support for nexthop objects") Signed-off-by: Andrea Claudi <aclaudi@redhat.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-04-13 19:16:55 -07:00
Stephen Hemminger	06d0bbf1ee	erspan: fix JSON output The format for erspan/erspan6 output is not valid JSON, as on version 2 a valueless key was presented. The direction should be value and erspan_dir should be the key. Fixes: `2897636267` ("erspan: add erspan version II support") Cc: u9012063@gmail.com Reported-by: Christian Pössinger <christian@poessinger.com> Signed-off-by: Christian Pössinger <christian@poessinger.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-04-10 09:52:48 -07:00
Chunmei Xu	44b563269e	ip-nexthop: support flush by id since id is unique for nexthop, it is heavy to dump all nexthops. use existing delete_nexthop to support flush by id Signed-off-by: Chunmei Xu <xuchunmei@linux.alibaba.com> Reviewed-by: Ido Schimmel <idosch@nvidia.com> Tested-by: Ido Schimmel <idosch@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-04-08 15:38:58 +00:00
Petr Machata	7384c15e0e	ip: Fix batch processing After the comment cited below, batch mode neglects to set the global variable batch_mode to a non-zero value. Netns and VRF commands use this variable, and break in batch mode. Fix by setting the value again. Fixes: `1d9a81b8c9` ("Unify batch processing across tools") Reported-by: Tim Rice <trice@posteo.net> Signed-off-by: Petr Machata <petrm@nvidia.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-03-22 16:30:21 -07:00
David Ahern	76bfc185f2	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@kernel.org>	2021-03-21 17:16:01 +00:00
Sabrina Dubroca	3c75135835	ip: xfrm: add support for tfcpad This patch adds support for setting and displaying the Traffic Flow Confidentiality attribute for an XFRM state, which allows padding ESP packets to a specified length. Signed-off-by: Sabrina Dubroca <sd@queasysnail.net> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-03-21 17:15:07 +00:00
Ido Schimmel	2be6d18b30	nexthop: Add support for nexthop buckets Add ability to dump multiple nexthop buckets and get a specific one. Example: # ip nexthop add id 10 group 1/2 type resilient buckets 8 # ip nexthop id 1 via 192.0.2.2 dev dummy10 scope link id 2 via 192.0.2.19 dev dummy20 scope link id 10 group 1/2 type resilient buckets 8 idle_timer 120 unbalanced_timer 0 unbalanced_time 0 # ip nexthop bucket id 10 index 0 idle_time 28.1 nhid 2 id 10 index 1 idle_time 28.1 nhid 2 id 10 index 2 idle_time 28.1 nhid 2 id 10 index 3 idle_time 28.1 nhid 2 id 10 index 4 idle_time 28.1 nhid 1 id 10 index 5 idle_time 28.1 nhid 1 id 10 index 6 idle_time 28.1 nhid 1 id 10 index 7 idle_time 28.1 nhid 1 # ip nexthop bucket show nhid 1 id 10 index 4 idle_time 53.59 nhid 1 id 10 index 5 idle_time 53.59 nhid 1 id 10 index 6 idle_time 53.59 nhid 1 id 10 index 7 idle_time 53.59 nhid 1 # ip nexthop bucket get id 10 index 5 id 10 index 5 idle_time 81 nhid 1 # ip -j -p nexthop bucket get id 10 index 5 [ { "id": 10, "bucket": { "index": 5, "idle_time": 104.89, "nhid": 1 }, "flags": [ ] } ] Signed-off-by: Ido Schimmel <idosch@nvidia.com> Signed-off-by: Petr Machata <petrm@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-03-19 15:01:25 +00:00
Ido Schimmel	9167671822	nexthop: Add support for resilient nexthop groups Add ability to configure resilient nexthop groups and show their current configuration. Example: # ip nexthop add id 10 group 1/2 type resilient buckets 8 # ip nexthop show id 10 id 10 group 1/2 type resilient buckets 8 idle_timer 120 unbalanced_timer 0 # ip -j -p nexthop show id 10 [ { "id": 10, "group": [ { "id": 1 },{ "id": 2 } ], "type": "resilient", "resilient_args": { "buckets": 8, "idle_timer": 120, "unbalanced_timer": 0 }, "flags": [ ] } ] Signed-off-by: Ido Schimmel <idosch@nvidia.com> Signed-off-by: Petr Machata <petrm@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-03-19 15:01:18 +00:00
Ido Schimmel	b82d6b81fa	nexthop: Add ability to specify group type Next patches are going to add a 'resilient' nexthop group type, so allow users to specify the type using the 'type' argument. Currently, only 'mpath' type is supported. These two commands are equivalent: # ip nexthop add id 10 group 1/2/3 # ip nexthop add id 10 group 1/2/3 type mpath Signed-off-by: Ido Schimmel <idosch@nvidia.com> Signed-off-by: Petr Machata <petrm@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-03-19 15:00:49 +00:00
Petr Machata	28fb925d8b	nexthop: Extract a helper to parse a NH ID NH ID extraction is a common operation, and will become more common still with the resilient NH groups support. Add a helper that does what it usually done and returns the parsed NH ID. Signed-off-by: Petr Machata <petrm@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-03-19 15:00:43 +00:00
Stephen Hemminger	6639fce430	ip: cleanup help message text Wrap help message text at 80 characters, and put list of things in alpha order. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-03-18 11:24:06 -07:00
Sabrina Dubroca	6050055387	ip: xfrm: limit the length of the security context name when printing Security context names are not guaranteed to be NUL-terminated by the kernel, so we can't just print them using %s directly. The length of the string is determined by sctx->ctx_len, so we can use that to limit what fprintf outputs. While at it, factor that out to a separate function, since the exact same code is used to print the security context for both policies and states. Fixes: `b2bb289a57` ("xfrm security context support") Reported-by: Paul Wouters <pwouters@redhat.com> Signed-off-by: Sabrina Dubroca <sd@queasysnail.net> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-03-16 22:53:28 -07:00
David Ahern	27ca8989c1	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@kernel.org>	2021-03-15 15:08:01 +00:00
Luca Boccassi	6739068fb0	iproute: fix printing resolved localhost format_host_rta_r might return a cached hostname via its return value and not use the input buffer. Before: $ ip -resolve -6 route dev lo proto kernel metric 256 pref medium After: $ ip/ip -resolve -6 route localhost dev lo proto kernel metric 256 pref medium Bug-Debian: https://bugs.debian.org/983591 Reported-by: Axel Scheepers <axel.scheepers76@gmail.com> Signed-off-by: Luca Boccassi <bluca@debian.org> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-03-03 18:54:16 -08:00
Paolo Abeni	42fbca91cd	mptcp: add support for port based endpoint The feature is supported by the kernel since 5.11-net-next, let's allow user-space to use it. Just parse and dump an additional, per endpoint, u16 attribute Signed-off-by: Paolo Abeni <pabeni@redhat.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-03-01 00:15:10 +00:00
Stephen Hemminger	52c5f3f043	Merge git://git.kernel.org/pub/scm/network/iproute2/iproute2-next	2021-02-23 23:03:42 -08:00
Andrea Claudi	e833dbe140	ip: lwtunnel: seg6: bail out if table ids are invalid When table and vrftable are used in SRv6, ip should bail out if table ids are not valid, and return a proper error message to the user. Achieve this simply checking rtnl_rttable_a2n return value, as we already do in the rest of iproute. Fixes: `0486388a87` ("add support for table name in SRv6 End.DT* behaviors") Fixes: `69629b4e43` ("seg6: add support for vrftable attribute in SRv6 End.DT4/DT6 behaviors") Signed-off-by: Andrea Claudi <aclaudi@redhat.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-02-22 18:11:48 -08:00
Amit Cohen	33e2471e8f	ip route: Print "rt_offload_failed" indication The kernel signals when offload fails using the 'RTM_F_OFFLOAD_FAILED' flag. Print it to help users understand the offload state of the route. The "rt_" prefix is used in order to distinguish it from the offload state of nexthops, similar to "rt_offload" and "rt_trap". Signed-off-by: Amit Cohen <amcohen@nvidia.com> Reviewed-by: Ido Schimmel <idosch@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-02-13 17:50:15 -07:00
Luca Boccassi	5a37254b71	iproute: force rtm_dst_len to 32/128 Since NETLINK_GET_STRICT_CHK was enabled, the kernel rejects commands that pass a prefix length, eg: ip route get `1.0.0.0/1 Error: ipv4: Invalid values in header for route get request. ip route get 0.0.0.0/0 Error: ipv4: rtm_src_len and rtm_dst_len must be 32 for IPv4 Since there's no point in setting a rtm_dst_len that we know is going to be rejected, just force it to the right value if it's passed on the command line. Print a warning to stderr to notify users. Bug-Debian: https://bugs.debian.org/944730 Reported-By: Clément 'wxcafé' Hertling <wxcafe@wxcafe.net> Signed-off-by: Luca Boccassi <bluca@debian.org> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-02-02 14:32:47 -08:00
Edwin Peer	9764761888	iplink: print warning for missing VF data The kernel might truncate VF info in IFLA_VFINFO_LIST. Compare the expected number of VFs in IFLA_NUM_VF to how many were found in the list and warn accordingly. Signed-off-by: Edwin Peer <edwin.peer@broadcom.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-02-02 14:18:42 -08:00
Guillaume Nault	86d9660dc1	iplink_bareudp: cleanup help message and man page * Fix PROTO description in help message (mpls isn't a valid argument). * Remove SRCPORTMIN description from help message since it doesn't appear in the syntax string. * Use same keywords in help message and in man page. * Use the "ethertype" option name (.B ethertype) rather than the option value (.I ETHERTYPE) in the man page description of [no]multiproto. Signed-off-by: Guillaume Nault <gnault@redhat.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-02-02 14:11:32 -08:00
Oliver Hartkopp	2ce313d1bb	iplink_can: add Classical CAN frame LEN8_DLC support The len8_dlc element is filled by the CAN interface driver and used for CAN frame creation by the CAN driver when the CAN_CTRLMODE_CC_LEN8_DLC flag is supported by the driver and enabled via netlink configuration interface. Add the command line support for cc-len8-dlc for Linux 5.11+ Signed-off-by: Oliver Hartkopp <socketcan@hartkopp.net> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-01-29 15:49:23 +00:00
Jarod Wilson	7887500008	bond: support xmit_hash_policy=vlan+srcmac There's a new transmit hash policy being added to the bonding driver that is a simple XOR of vlan ID and source MAC, xmit_hash_policy vlan+srcmac. This trivial patch makes it configurable and queryable via iproute2. $ sudo modprobe bonding mode=2 max_bonds=1 xmit_hash_policy=0 $ sudo ip link set bond0 type bond xmit_hash_policy vlan+srcmac $ ip -d link show bond0 11: bond0: <BROADCAST,MULTICAST,MASTER> mtu 1500 qdisc noop state DOWN mode DEFAULT group default qlen 1000 link/ether ce:85:5e:24:ce:90 brd ff:ff:ff:ff:ff:ff promiscuity 0 minmtu 68 maxmtu 65535 bond mode balance-xor miimon 0 updelay 0 downdelay 0 peer_notify_delay 0 use_carrier 1 arp_interval 0 arp_validate none arp_all_targets any primary_reselect always fail_over_mac none xmit_hash_policy vlan+srcmac resend_igmp 1 num_grat_arp 1 all_slaves_active 0 min_links 0 lp_interval 1 packets_per_slave 1 lacp_rate slow ad_select stable tlb_dynamic_lb 1 addrgenmode eui64 numtxqueues 16 numrxqueues 16 gso_max_size 65536 gso_max_segs 65535 $ grep Hash /proc/net/bonding/bond0 Transmit Hash Policy: vlan+srcmac (5) $ sudo ip link add test type bond help Usage: ... bond [ mode BONDMODE ] [ active_slave SLAVE_DEV ] [ clear_active_slave ] [ miimon MIIMON ] [ updelay UPDELAY ] [ downdelay DOWNDELAY ] [ peer_notify_delay DELAY ] [ use_carrier USE_CARRIER ] [ arp_interval ARP_INTERVAL ] [ arp_validate ARP_VALIDATE ] [ arp_all_targets ARP_ALL_TARGETS ] [ arp_ip_target [ ARP_IP_TARGET, ... ] ] [ primary SLAVE_DEV ] [ primary_reselect PRIMARY_RESELECT ] [ fail_over_mac FAIL_OVER_MAC ] [ xmit_hash_policy XMIT_HASH_POLICY ] [ resend_igmp RESEND_IGMP ] [ num_grat_arp\|num_unsol_na NUM_GRAT_ARP\|NUM_UNSOL_NA ] [ all_slaves_active ALL_SLAVES_ACTIVE ] [ min_links MIN_LINKS ] [ lp_interval LP_INTERVAL ] [ packets_per_slave PACKETS_PER_SLAVE ] [ tlb_dynamic_lb TLB_DYNAMIC_LB ] [ lacp_rate LACP_RATE ] [ ad_select AD_SELECT ] [ ad_user_port_key PORTKEY ] [ ad_actor_sys_prio SYSPRIO ] [ ad_actor_system LLADDR ] BONDMODE := balance-rr\|active-backup\|balance-xor\|broadcast\|802.3ad\|balance-tlb\|balance-alb ARP_VALIDATE := none\|active\|backup\|all ARP_ALL_TARGETS := any\|all PRIMARY_RESELECT := always\|better\|failure FAIL_OVER_MAC := none\|active\|follow XMIT_HASH_POLICY := layer2\|layer2+3\|layer3+4\|encap2+3\|encap3+4\|vlan+srcmac LACP_RATE := slow\|fast AD_SELECT := stable\|bandwidth\|count Cc: Stephen Hemminger <stephen@networkplumber.org> Cc: Jay Vosburgh <j.vosburgh@gmail.com> Signed-off-by: Jarod Wilson <jarod@redhat.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-01-23 18:33:15 +00:00
Luca Boccassi	8dca565b17	vrf: print BPF log buffer if bpf_program_load fails Necessary to understand what is going on when bpf_program_load fails Signed-off-by: Luca Boccassi <bluca@debian.org> Reviewed-by: David Ahern <dsahern@kernel.org> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2021-01-18 12:32:11 -08:00
Ido Schimmel	9bd498bfcd	ipmonitor: Mention "nexthop" object in help and man page Before: # ip monitor help Usage: ip monitor [ all \| LISTofOBJECTS ] [ FILE ] [ label ] [all-nsid] [dev DEVICE] LISTofOBJECTS := link \| address \| route \| mroute \| prefix \| neigh \| netconf \| rule \| nsid FILE := file FILENAME After: # ip monitor help Usage: ip monitor [ all \| LISTofOBJECTS ] [ FILE ] [ label ] [all-nsid] [dev DEVICE] LISTofOBJECTS := link \| address \| route \| mroute \| prefix \| neigh \| netconf \| rule \| nsid \| nexthop FILE := file FILENAME Signed-off-by: Ido Schimmel <idosch@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-01-10 17:17:32 +00:00
Ido Schimmel	043e03a369	nexthop: Fix usage output Before: # ip nexthop help Usage: ip nexthop { list \| flush } [ protocol ID ] SELECTOR ip nexthop { add \| replace } id ID NH [ protocol ID ] ip nexthop { get\| del } id ID SELECTOR := [ id ID ] [ dev DEV ] [ vrf NAME ] [ master DEV ] [ groups ] [ fdb ] NH := { blackhole \| [ via ADDRESS ] [ dev DEV ] [ onlink ] [ encap ENCAPTYPE ENCAPHDR ] \| group GROUP ] } GROUP := [ id[,weight]>/<id[,weight]>/... ] ENCAPTYPE := [ mpls ] ENCAPHDR := [ MPLSLABEL ] After: # ip nexthop help Usage: ip nexthop { list \| flush } [ protocol ID ] SELECTOR ip nexthop { add \| replace } id ID NH [ protocol ID ] ip nexthop { get \| del } id ID SELECTOR := [ id ID ] [ dev DEV ] [ vrf NAME ] [ master DEV ] [ groups ] [ fdb ] NH := { blackhole \| [ via ADDRESS ] [ dev DEV ] [ onlink ] [ encap ENCAPTYPE ENCAPHDR ] \| group GROUP [ fdb ] } GROUP := [ <id[,weight]>/<id[,weight]>/... ] ENCAPTYPE := [ mpls ] ENCAPHDR := [ MPLSLABEL ] Signed-off-by: Ido Schimmel <idosch@nvidia.com> Signed-off-by: David Ahern <dsahern@kernel.org>	2021-01-10 17:14:08 +00:00
Thomas Karlsson	42f5642a40	iplink:macvlan: Added bcqueuelen parameter This patch allows the user to set and retrieve the IFLA_MACVLAN_BC_QUEUE_LEN parameter via the bcqueuelen command line argument This parameter controls the requested size of the queue for broadcast and multicast packages in the macvlan driver. If not specified, the driver default (1000) will be used. Note: The request is per macvlan but the actually used queue length per port is the maximum of any request to any macvlan connected to the same port. For this reason, the used queue length IFLA_MACVLAN_BC_QUEUE_LEN_USED is also retrieved and displayed in order to aid in the understanding of the setting. However, it can of course not be directly set. Signed-off-by: Thomas Karlsson <thomas.karlsson@paneda.se> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-16 04:02:07 +00:00
Petr Machata	cdd9425315	Move the use_iec declaration to the tools The tools "ip" and "tc" use a flag "use_iec", which indicates whether, when formatting rate values, the prefixes "K", "M", etc. should refer to powers of 1024, or powers of 1000. The flag is currently kept as a global variable in "ip" and "tc", but is nonetheless declared in util.h. Instead, move the declaration to tool-specific headers ip/ip_common.h and tc/tc_common.h. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-09 02:28:43 +00:00
Paolo Lungaroni	69629b4e43	seg6: add support for vrftable attribute in SRv6 End.DT4/DT6 behaviors We introduce the "vrftable" attribute for supporting the SRv6 End.DT4 and End.DT6 behaviors in iproute2. The "vrftable" attribute indicates the routing table associated with the VRF device used by SRv6 End.DT4/DT6 for routing IPv4/IPv6 packets. The SRv6 End.DT4/DT6 is used to implement IPv4/IPv6 L3 VPNs based on Segment Routing over IPv6 networks in multi-tenants environments. It decapsulates the received packets and it performs the IPv4/IPv6 routing lookup in the routing table of the tenant. The SRv6 End.DT4/DT6 leverages a VRF device in order to force the routing lookup into the associated routing table using the "vrftable" attribute. Some examples: $ ip -6 route add 2001:db8::1 encap seg6local action End.DT4 vrftable 100 dev eth0 $ ip -6 route add 2001:db8::2 encap seg6local action End.DT6 vrftable 200 dev eth0 Standard Output: $ ip -6 route show 2001:db8::1 2001:db8::1 encap seg6local action End.DT4 vrftable 100 dev eth0 metric 1024 pref medium JSON Output: $ ip -6 -j -p route show 2001:db8::2 [ { "dst": "2001:db8::2", "encap": "seg6local", "action": "End.DT6", "vrftable": 200, "dev": "eth0", "metric": 1024, "flags": [ ], "pref": "medium" } ] v2: - no changes made: resubmit after pulling out this patch from the kernel patchset. v1: - mixing this patch with the kernel patchset confused patckwork. Signed-off-by: Paolo Lungaroni <paolo.lungaroni@cnit.it> Signed-off-by: Andrea Mayer <andrea.mayer@uniroma2.it> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-09 02:27:42 +00:00
David Ahern	8065d28218	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-04 16:25:12 +00:00
Stephen Hemminger	2e80ae89ca	Merge branch 'gcc-10' into main	2020-12-03 08:33:06 -08:00
Luca Boccassi	975c4944e8	ip/netns: use flock when setting up /run/netns If multiple ip processes are ran at the same time to set up separate network namespaces, and it is the first time so /run/netns has to be set up first, and they end up doing it at the same time, the processes might enter a recursive loop creating thousands of mount points, which might crash the system depending on resources available. Try to take a flock on /run/netns before doing the mount() dance, to ensure this cannot happen. But do not try too hard, and if it fails continue after printing a warning, to avoid introducing regressions. First reported on Debian: https://bugs.debian.org/949235 To reproduce (WARNING: run in a VM to avoid system lockups): for i in {0..9} do strace -e trace=mount -e inject=mount:delay_exit=1000000 ip \ netns add "testnetns$i" 2>&1 \| tee "$i.log" & done wait The strace is to ensure the problem always reproduces, to add an artificial synchronization point after the first mount(). Reported-by: Etienne Dechamps <etienne@edechamps.fr> Signed-off-by: Luca Boccassi <bluca@debian.org> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-12-03 08:31:23 -08:00
Sergey Ryazanov	d7190d4ced	ip: add IP_LIB_DIR environment variable Do not hardcode /usr/lib/ip as a path and allow libraries path configuration in run-time. Signed-off-by: Sergey Ryazanov <ryazanov.s.a@gmail.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-12-02 16:37:07 +00:00
Stephen Hemminger	5bdc4e9151	bridge: fix string length warning Gcc-10 complains about possible string length overflow. This can't happen Ethernet address format is always limited to 18 characters or less. Just resize the temp buffer. Fixes: `70dfb0b883` ("iplink: bridge: export bridge_id and designated_root") Cc: nikolay@cumulusnetworks.com Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-11-29 16:20:16 -08:00
Hangbin Liu	dc800a4ed4	lib: make ipvrf able to use libbpf and fix function name conflicts There are directly calls in libbpf for bpf program load/attach. So we could just use two wrapper functions for ipvrf and convert them with libbpf support. Function bpf_prog_load() is removed as it's conflict with libbpf function name. bpf.c is moved to bpf_legacy.c for later main libbpf support in iproute2. Reviewed-by: Toke Høiland-Jørgensen <toke@redhat.com> Signed-off-by: Hangbin Liu <haliu@redhat.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-24 22:14:04 -07:00
Hangbin Liu	503e9229b0	iproute2: add check_libbpf() and get_libbpf_version() This patch aim to add basic checking functions for later iproute2 libbpf support. First we add check_libbpf() in configure to see if we have bpf library support. By default the system libbpf will be used, but static linking against a custom libbpf version can be achieved by passing libbpf DESTDIR to variable LIBBPF_DIR for configure. Another variable LIBBPF_FORCE is used to control whether to build iproute2 with libbpf. If set to on, then force to build with libbpf and exit if not available. If set to off, then force to not build with libbpf. When dynamically linking against libbpf, we can't be sure that the version we discovered at compile time is actually the one we are using at runtime. This can lead to hard-to-debug errors. So we add a new file lib/bpf_glue.c and a helper function get_libbpf_version() to get correct libbpf version at runtime. Signed-off-by: Hangbin Liu <haliu@redhat.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-24 22:14:02 -07:00
Petr Machata	ca5ec9a17a	ip: iptuntap: Convert to use print_on_off() Instead of rolling a custom on-off printer, use the one added to utils.c. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-24 21:43:41 -07:00
Petr Machata	66e574c4c5	ip: ipnetconf: Convert to use print_on_off() Instead of rolling a custom on-off printer, use the one added to utils.c. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-24 21:43:34 -07:00
Petr Machata	07d82b4a79	ip: iplink_bridge_slave: Convert to use print_on_off() Instead of rolling a custom on-off printer, use the one added to utils.c. Note that _print_onoff() has an extra parameter for a JSON-specific flag name. However that argument is not used, and never was. Therefore when moving over to print_on_off(), drop this argument. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-24 21:43:30 -07:00
Petr Machata	3e0d2a73ba	ip: iplink_bridge_slave: Port over to parse_on_off() Invoke parse_on_off() from bridge_slave_parse_on_off() instead of hand-rolling one. Exit on failure, because the invarg that was ivoked here before would. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-24 21:43:27 -07:00
Petr Machata	5f685d064b	ip: iplink: Convert to use parse_on_off() Invoke parse_on_off() instead of rolling a custom function. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-24 21:43:23 -07:00
Ido Schimmel	0788678991	nexthop: Always print nexthop flags Currently, the nexthop flags are only printed when the nexthop has a nexthop device. The offload / trap indication is therefore not printed for nexthop groups. Instead, always print the nexthop flags, regardless if the nexthop has a nexthop device or not. Signed-off-by: Ido Schimmel <idosch@nvidia.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-22 12:43:56 -07:00
Ido Schimmel	3de35f41be	ip route: Print "trap" nexthop indication The kernel can now signal that a nexthop is trapping packets instead of forwarding them. Print the flag to help users understand the offload state of each nexthop. Signed-off-by: Ido Schimmel <idosch@nvidia.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-22 12:42:20 -07:00
Petr Machata	66a2d71487	lib: parse_mapping: Recognize a keyword "all" The DCB tool will have to provide an interface to a number of fixed-size arrays. Unlike the egress- and ingress-qos-map, it makes good sense to have an interface to set all members to the same value. For example to set strict priority on all TCs besides select few, or to reset allocated bandwidth to all zeroes, again besides several explicitly-given ones. To support this usage, extend the parse_mapping() with a boolean that determines whether this special use is supported. If "all" is given and recognized, mapping_cb is called with the key of -1. Have iplink_vlan pass false for allow_all. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-13 19:43:15 -07:00
Petr Machata	28e663ee65	lib: Extract from iplink_vlan a helper to parse key:value arrays VLAN netdevices have two similar attributes: ingress-qos-map and egress-qos-map. These attributes can be configured with a series of 802.1-priority-to-skb-priority (and vice versa) mappings. A reusable helper along those lines will be handy for configuration of various priority-to-tc, tc-to-algorithm, and other arrays in DCB. Therefore extract the logic to a function parse_mapping(), move to utils.c, and dispatch to utils.c from iplink_vlan.c. That necessitates extraction of a VLAN-specific parse_qos_mapping(). Do that, and propagate addattr_l() return value up, unlike the original. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-13 19:43:15 -07:00
Petr Machata	82604d2852	lib: Add parse_one_of(), parse_on_off() Take from the macsec code parse_one_of() and adapt so that it passes the primary result as the main return value, and error result through a pointer. That is the simplest way to make the code reusable across data types without introducing extra magic. Also from macsec take the specialization of parse_one_of() for parsing specifically the strings "off" and "on". Convert the macsec code to the new helpers. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-13 19:43:15 -07:00
Petr Machata	1d9a81b8c9	Unify batch processing across tools The code for handling batches is largely the same across iproute2 tools. Extract a helper to handle the batch, and adjust the tools to dispatch to this helper. Sandwitch the invocation between prologue / epilogue code specific for each tool. Signed-off-by: Petr Machata <me@pmachata.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-11-13 19:43:15 -07:00
David Ahern	eb12cc9ae1	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@gmail.com>	2020-10-25 15:08:12 -06:00
Jan Engelhardt	0ca1312c20	ip: add error reporting when RTM_GETNSID failed `ip addr` when run under qemu-user-riscv64, fails. This likely is due to qemu-5.1 not doing translation of RTM_GETNSID calls. Aborting ip completely is not helpful for the user however. This patch reworks the error handling. Before: rtest:/ # ip a 2: host0@if4: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000 request send failed: Operation not supported link/ether 46:3f:2d:88:3d:db brd ff:ff:ff:ff:ff:ffrtest:/ # Afterwards: rtest:/ # ip a 2: host0@if4: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000 rtnl_send(RTM_GETNSID): Operation not supported. Continuing anyway. link/ether 46:3f:2d:88:3d:db brd ff:ff:ff:ff:ff:ff link-netnsid 0 inet 192.168.72.147/28 brd 192.168.72.159 scope global host0 valid_lft forever preferred_lft forever inet6 fe80::443f:2dff:fe88:3ddb/64 scope link valid_lft forever preferred_lft forever Signed-off-by: Jan Engelhardt <jengelh@inai.de> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-10-12 08:10:25 -07:00
David Ahern	b5a583fb32	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@gmail.com>	2020-10-11 20:11:09 -06:00
Antony Antony	4322b13c8d	ip xfrm: support setting XFRMA_SET_MARK_MASK attribute in states The XFRMA_SET_MARK_MASK attribute can be set in states (4.19+) It is optional and the kernel default is 0xffffffff It is the mask of XFRMA_SET_MARK(a.k.a. XFRMA_OUTPUT_MARK in 4.18) e.g. ./ip/ip xfrm state add output-mark 0x6 mask 0xab proto esp \ auth digest_null 0 enc cipher_null '' ip xfrm state src 0.0.0.0 dst 0.0.0.0 proto esp spi 0x00000000 reqid 0 mode transport replay-window 0 output-mark 0x6/0xab auth-trunc digest_null 0x30 0 enc ecb(cipher_null) anti-replay context: seq 0x0, oseq 0x0, bitmap 0x00000000 sel src 0.0.0.0/0 dst 0.0.0.0/0 Signed-off-by: Antony Antony <antony@phenome.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-10-07 00:10:47 -06:00
Stephen Hemminger	be1bea8432	addr: Fix noprefixroute and autojoin for IPv4 These were reported as IPv6-only and ignored: # ip address add 192.0.2.2/24 dev dummy5 noprefixroute Warning: noprefixroute option can be set only for IPv6 addresses # ip address add 224.1.1.10/24 dev dummy5 autojoin Warning: autojoin option can be set only for IPv6 addresses This enables them back for IPv4. Fixes: `9d59c86e57` ("iproute2: ip addr: Organize flag properties structurally") Signed-off-by: Adel Belhouane <bugs.a.b@free.fr> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-10-06 15:15:56 -07:00
Eyal Birger	e410c963e3	ipntable: add missing ndts_table_fulls ntable stat Used for tracking neighbour table overflows. Signed-off-by: Eyal Birger <eyal.birger@gmail.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-10-06 15:07:10 -07:00
Kamal Heib	10414de9e6	ip: iplink_ipoib.c: Remove extra spaces Remove the extra space between the reported ipoib attrs - use only one space instead of two. Fixes: `de0389935f` ("iplink: Added support for the kernel IPoIB RTNL ops") Signed-off-by: Kamal Heib <kamalheib1@gmail.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-09-30 22:29:05 -07:00
Jakub Kicinski	b8663da049	ip: promote missed packets to the -s row missed_packet_errors are much more commonly reported: linux$ git grep -c '[.>]rx_missed_errors ' -- drivers/ \| wc -l 64 linux$ git grep -c '[.>]rx_over_errors ' -- drivers/ \| wc -l 37 Plus those drivers are generally more modern than those using rx_over_errors. Since recently merged kernel documentation makes this preference official, let's make ip -s output more informative and let rx_missed_errors take the place of rx_over_errors. Before: 2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP mode DEFAULT group default qlen 1000 link/ether 00:0a:f7:c1:4d:38 brd ff:ff:ff:ff:ff:ff RX: bytes packets errors dropped overrun mcast 6.04T 4.67G 0 0 0 67.7M RX errors: length crc frame fifo missed 0 0 0 0 7 TX: bytes packets errors dropped carrier collsns 3.13T 2.76G 0 0 0 0 TX errors: aborted fifo window heartbeat transns 0 0 0 0 6 After: 2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc mq state UP mode DEFAULT group default qlen 1000 link/ether 00:0a:f7:c1:4d:38 brd ff:ff:ff:ff:ff:ff RX: bytes packets errors dropped missed mcast 6.04T 4.67G 0 0 7 67.7M RX errors: length crc frame fifo overrun 0 0 0 0 0 TX: bytes packets errors dropped carrier collsns 3.13T 2.76G 0 0 0 0 TX errors: aborted fifo window heartbeat transns 0 0 0 0 6 Signed-off-by: Jakub Kicinski <kuba@kernel.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-09-22 20:23:29 -06:00
Roopa Prabhu	6fd53b2a1c	iplink: add support for protodown reason This patch adds support for recently added link IFLA_PROTO_DOWN_REASON attribute. IFLA_PROTO_DOWN_REASON enumerates reasons for the already existing IFLA_PROTO_DOWN link attribute. $ cat /etc/iproute2/protodown_reasons.d/r.conf 0 mlag 1 evpn 2 vrrp 3 psecurity $ ip link set dev vx10 protodown on protodown_reason vrrp on $ip link show dev vx10 14: vx10: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN mode DEFAULT group default qlen 1000 link/ether f2:32:28:b8:35:ff brd ff:ff:ff:ff:ff:ff protodown on protodown_reason <vrrp> $ip -p -j link show dev vx10 [ { <snip> "proto_down": true, "proto_down_reason": [ "vrrp" ] } ] $ip link set dev vx10 protodown_reason mlag on $ip link show dev vx10 14: vx10: <BROADCAST,MULTICAST> mtu 1500 qdisc noop state DOWN mode DEFAULT group default qlen 1000 link/ether f2:32:28:b8:35:ff brd ff:ff:ff:ff:ff:ff protodown on protodown_reason <mlag,vrrp> $ip -p -j link show dev vx10 [ { <snip> "proto_down": true, "protodown_reason": [ "mlag","vrrp" ] } ] $ip -p -j link show dev vx10 $ip link set dev vx10 protodown off protodown_reason vrrp off Error: Cannot clear protodown, active reasons. $ip link set dev vx10 protodown off protodown_reason mlag off $ Note: for somereason the json and non-json key for protodown are different (protodown and proto_down). I have kept the same for protodown reason for consistency (protodown_reason and proto_down_reason). Signed-off-by: Roopa Prabhu <roopa@cumulusnetworks.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-09-01 19:52:13 -06:00
Antony Antony	af27494d2e	ip xfrm: support printing XFRMA_SET_MARK_MASK attribute in states The XFRMA_SET_MARK_MASK attribute is set in states (4.19+). It is the mask of XFRMA_SET_MARK(a.k.a. XFRMA_OUTPUT_MARK in 4.18) sample output: note the output-mark mask ip xfrm state src 192.1.2.23 dst 192.1.3.33 proto esp spi 0xSPISPI reqid REQID mode tunnel replay-window 32 flag af-unspec output-mark 0x3/0xffffff aead rfc4106(gcm(aes)) 0xENCAUTHKEY 128 if_id 0x1 Signed-off-by: Antony Antony <antony@phenome.org> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-09-01 19:49:29 -06:00
Phil Sutter	23203b750e	ip link: Fix indenting in help text Indenting of 'ip link set' options below 'link-netns' was wrong, they should be on the same level as the above. While being at it, fix closing brackets in vf-specific options. Also write node/port_guid parameters in upper-case without curly braces: They are supposed to be replaced by values, not put literally. Fixes: `8589eb4efd` ("treewide: refactor help messages") Fixes: `5a3ec4ba64` ("iplink: Update usage in help message") Signed-off-by: Phil Sutter <phil@nwl.cc> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-08-31 12:32:26 -07:00
Murali Karicheri	68f027724b	iplink: hsr: add support for creating PRP device similar to HSR This patch enhances the iplink command to add a proto parameters to create PRP device/interface similar to HSR. Both protocols are quite similar and requires a pair of Ethernet interfaces. So re-use the existing HSR iplink command to create PRP device/interface as well. Use proto parameter to differentiate the two protocols. Signed-off-by: Murali Karicheri <m-karicheri2@ti.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-08-22 21:14:12 -07:00
Sascha Hauer	7e7a1d107b	iproute2: ip maddress: Check multiaddr length ip maddress add\|del takes a MAC address as argument, so insist on getting a length of ETH_ALEN bytes. This makes sure the passed argument is actually a MAC address and especially not an IPv4 address which was previously accepted and silently taken as a MAC address. While at it, do not print *argv in the error path as this has been modified by ll_addr_a2n() and doesn't contain the full string anymore, which can lead to misleading error messages. Also while at it, replace the hardcoded buffer size with the actual buffer size using sizeof(). Signed-off-by: Sascha Hauer <s.hauer@pengutronix.de> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-08-22 21:12:30 -07:00
David Ahern	e572e3af0d	Merge branch 'main' into next Conflicts: bridge/fdb.c man/man8/bridge.8 Signed-off-by: David Ahern <dsahern@kernel.org>	2020-08-06 16:21:35 +00:00
Stephen Hemminger	fbef655568	replace SNAPSHOT with auto-generated version string Replace the iproute2 snapshot with a version string which is autogenerated as part of the build process using git describe. This will also allow seeing if the version of the command is built from the same sources is as upstream. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-08-03 10:02:47 -07:00
Petr Vaněk	a7f1974f6e	ip-xfrm: add support for oseq-may-wrap extra flag This flag allows to create SA where sequence number can cycle in outbound packets if set. Signed-off-by: Petr Vaněk <pv@excello.cz> Signed-off-by: David Ahern <dsahern@kernel.org>	2020-08-03 14:57:25 +00:00
Matthieu Baerts	3a53ff7e58	mptcp: show all endpoints when no ID is specified According to 'ip mptcp help', 'endpoint show' can accept no argument: ip mptcp endpoint show [ id ID ] It makes sense to print all endpoints when no filter is used. So here if the following command is used, all endpoints are printed: ip mptcp endpoint show Same as: ip mptcp endpoint Fixes: `7e0767cd` ("add support for mptcp netlink interface") Signed-off-by: Matthieu Baerts <matthieu.baerts@tessares.net> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-07-27 16:39:58 -07:00
David Ahern	b78c480532	Merge branch 'main' into next Signed-off-by: David Ahern <dsahern@kernel.org>	2020-07-14 23:52:43 +00:00
Eyal Birger	f33a871b80	ip xfrm: policy: support policies with IF_ID in get/delete/deleteall The XFRMA_IF_ID attribute is set in policies for them to be associated with an XFRM interface (4.19+). Add support for getting/deleting policies with this attribute. For supporting 'deleteall' the XFRMA_IF_ID attribute needs to be explicitly copied. Signed-off-by: Eyal Birger <eyal.birger@gmail.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-07-13 08:51:37 -07:00
Andrea Claudi	a8d6f51c84	ip address: remove useless include utils.h is included two times in ipaddress.c, there is no need for that. Signed-off-by: Andrea Claudi <aclaudi@redhat.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-07-08 08:47:28 -07:00
Stephen Hemminger	d44bcd2fbf	iplink_bareudp: use common include syntax Follow the precedent of other parts of iproute2 follow the example of: Standard libc headers Linux headers Iproute2 support headers Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-07-08 08:38:58 -07:00
Guillaume Nault	a6c5c952ab	ip link: initial support for bareudp devices Bareudp devices provide a generic L3 encapsulation for tunnelling different protocols like MPLS, IP, NSH, etc. inside a UDP tunnel. This patch is based on original work from Martin Varghese: https://lore.kernel.org/netdev/1570532361-15163-1-git-send-email-martinvarghesenokia@gmail.com/ Examples: - ip link add dev bareudp0 type bareudp dstport 6635 ethertype mpls_uc This creates a bareudp tunnel device which tunnels L3 traffic with ethertype 0x8847 (unicast MPLS traffic). The destination port of the UDP header will be set to 6635. The device will listen on UDP port 6635 to receive traffic. - ip link add dev bareudp0 type bareudp dstport 6635 ethertype ipv4 multiproto Same as the MPLS example, but for IPv4. The "multiproto" keyword allows the device to also tunnel IPv6 traffic. Signed-off-by: Guillaume Nault <gnault@redhat.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-07-06 11:11:05 -07:00
Sorah Fukumori	9e5d246877	ip fou: respect preferred_family for IPv6 ip(8) accepts -family ipv6 (-6) option at the toplevel. It is straightforward to support the existing option for modifying listener on IPv6 addresses. Maintain the backward compatibility by leaving ip fou -6 flag implemented, while it's removed from the usage message. Signed-off-by: Sorah Fukumori <her@sorah.jp> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-07-06 11:03:09 -07:00
Roi Dayan	473d18e219	ip address: Fix loop initial declarations are only allowed in C99 On some distros, i.e. rhel 7.6, compilation fails with the following: ipaddress.c: In function ‘lookup_flag_data_by_name’: ipaddress.c:1260:2: error: ‘for’ loop initial declarations are only allowed in C99 mode for (int i = 0; i < ARRAY_SIZE(ifa_flag_data); ++i) { ^ ipaddress.c:1260:2: note: use option -std=c99 or -std=gnu99 to compile your code This commit fixes the single place needed for compilation to pass. Fixes: `9d59c86e57` ("iproute2: ip addr: Organize flag properties structurally") Signed-off-by: Roi Dayan <roid@mellanox.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-06-11 15:05:20 -07:00
Roopa Prabhu	a56d17463c	ipnexthop: support for fdb nexthops This patch adds support to add and delete ecmp nexthops of type fdb. Such nexthops can be linked to vxlan fdb entries. $ip nexthop add id 12 via 172.16.1.2 fdb $ip nexthop add id 13 via 172.16.1.3 fdb $ip nexthop add id 102 group 12/13 fdb $bridge fdb add 02:02:00:00:00:13 dev vx10 nhid 102 self Signed-off-by: Roopa Prabhu <roopa@cumulusnetworks.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-06-11 15:52:29 +00:00
Stephen Hemminger	0a5dbbeddb	Merge git://git.kernel.org/pub/scm/network/iproute2/iproute2-next	2020-06-05 08:33:29 -07:00
Donald Sharp	2c78aba2fb	nexthop: Fix Deletion display Actually display that deletions are happening when monitoring nexthops. Signed-off-by: Donald Sharp <sharpd@cumulusnetworks.com> Acked-by: Roopa Prabhu <roopa@cumulusnetworks.com> Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2020-06-01 08:08:46 -07:00
Ian K. Coolidge	5413a735a6	iproute2: ip addr: Add support for setting 'optimistic' optimistic DAD is controllable via sysctl for an interface or all interfaces on the system. This would affect addresses added by the kernel only. Recent kernels, however, have enabled support for adding optimistic address via userspace. This plumbs that support. Signed-off-by: David Ahern <dsahern@gmail.com>	2020-05-31 23:01:33 +00:00
Ian K. Coolidge	9d59c86e57	iproute2: ip addr: Organize flag properties structurally This creates a nice systematic way to check that the various flags are mutable from userspace and that the address family is valid. Mutability properties are preserved to avoid introducing any behavioral change in this CL. However, previously, immutable flags were ignored and fell through to this confusing error: Error: either "local" is duplicate, or "dadfailed" is a garbage. But now, they just warn more explicitly: Warning: dadfailed option is not mutable from userspace Signed-off-by: David Ahern <dsahern@gmail.com>	2020-05-31 23:01:22 +00:00
Alexander Aring	9f91f1b7b8	lwtunnel: add support for rpl segment routing This patch adds support for rpl segment routing settings. Example: ip -n ns0 -6 route add 2001::3 encap rpl segs \ fe80::c8fe:beef:cafe:cafe,fe80::c8fe:beef:cafe:beef dev lowpan0 Signed-off-by: Alexander Aring <alex.aring@gmail.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-05-27 00:03:17 +00:00
Dmitry Yakunin	d5e6ee0dac	ss: introduce cgroup2 cache and helper functions This patch prepares infrastructure for matching sockets by cgroups. Two helper functions are added for transformation between cgroup v2 ID and pathname. Cgroup v2 cache is implemented as hash table indexed by ID. This cache is needed for faster lookups of socket cgroup. v2: - style fixes (David Ahern) Signed-off-by: Dmitry Yakunin <zeil@yandex-team.ru> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-05-13 14:28:04 +00:00
David Ahern	8c109059b5	Merge branch 'master' into next Signed-off-by: David Ahern <dsahern@gmail.com>	2020-05-05 16:49:38 +00:00
Xin Long	39fa047938	iproute_lwtunnel: add options support for erspan metadata This patch is to add LWTUNNEL_IP_OPTS_ERSPAN's parse and print to implement erspan options support in iproute_lwtunnel. Option is expressed as version:index:dir:hwid, dir and hwid will be parsed when version is 2, while index will be parsed when version is 1. All of these are numbers. erspan doesn't support multiple options. With this patch, users can add and dump erspan options like: # ip netns add a # ip netns add b # ip -n a link add eth0 type veth peer name eth0 netns b # ip -n a link set eth0 up # ip -n b link set eth0 up # ip -n a addr add 10.1.0.1/24 dev eth0 # ip -n b addr add 10.1.0.2/24 dev eth0 # ip -n b link add erspan1 type erspan key 1 seq erspan 123 \ local 10.1.0.2 remote 10.1.0.1 # ip -n b addr add 1.1.1.1/24 dev erspan1 # ip -n b link set erspan1 up # ip -n b route add 2.1.1.0/24 dev erspan1 # ip -n a link add erspan1 type erspan key 1 seq local 10.1.0.1 external # ip -n a addr add 2.1.1.1/24 dev erspan1 # ip -n a link set erspan1 up # ip -n a route add 1.1.1.0/24 encap ip id 1 \ erspan_opts 2:123:1:2 dst 10.1.0.2 dev erspan1 # ip -n a route show # ip netns exec a ping 1.1.1.1 -c 1 1.1.1.0/24 encap ip id 1 src 0.0.0.0 dst 10.1.0.2 ttl 0 tos 0 erspan_opts 2:0:1:2 dev erspan1 scope link PING 1.1.1.1 (1.1.1.1) 56(84) bytes of data. 64 bytes from 1.1.1.1: icmp_seq=1 ttl=64 time=0.124 ms v1->v2: - improve the changelog. - use PRINT_ANY to support dumping with json format. v2->v3: - implement proper JSON object for opts instead of just bunch of strings. v3->v4: - keep the same format between input and output, json and non json. - print version, index, dir and hwid as uint. Signed-off-by: Xin Long <lucien.xin@gmail.com> Signed-off-by: David Ahern <dsahern@gmail.com>	2020-05-01 16:33:09 +00:00

1 2 3 4 5 ...

1695 Commits