iproute2

Commit Graph

Author	SHA1	Message	Date
Daniel Borkmann	c7272ca720	bpf: add initial support for attaching xdp progs Now that we made the BPF loader generic as a library, reuse it for loading XDP programs as well. This basically adds a minimal start of a facility for iproute2 to load XDP programs. There currently only exists the xdp1_user.c sample code in the kernel tree that sets up netlink directly and an iovisor/bcc front-end. Since we have all the necessary infrastructure in place already from tc side, we can just reuse its loader back-end and thus facilitate migration and usability among the two for people familiar with tc/bpf already. Sharing maps, performing tail calls, etc works the same way as with tc. Naturally, once kernel configuration API evolves, we will extend new features for XDP here as well, resp. extend dumping of related netlink attributes. Minimal example: clang -target bpf -O2 -Wall -c prog.c -o prog.o ip [-force] link set dev em1 xdp obj prog.o # attaching ip [-d] link # dumping ip link set dev em1 xdp off # detaching For the dump, intention is that in the first line for each ip link entry, we'll see "xdp" to indicate that this device has an XDP program attached. Once we dump some more useful information via netlink (digest, etc), idea is that 'ip -d link' will then display additional relevant program information below the "link/ ether [...]" output line for such devices, for example. Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Alexei Starovoitov <ast@kernel.org>	2016-12-09 12:44:12 -08:00
Daniel Borkmann	fb24802b9c	bpf: check for owner_prog_type and notify users when differ Kernel commit 21116b7068b9 ("bpf: add owner_prog_type and accounted mem to array map's fdinfo") added support for telling the owner prog type in case of prog arrays. Give a notification to the user when they differ, and the program eventually fails to load. Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Alexei Starovoitov <ast@kernel.org>	2016-12-09 12:44:12 -08:00
Thomas Graf	0f74d0f3a9	bpf: Fix number of retries when growing log buffer The log buffer is automatically grown when the verifier output does not fit into the default buffer size. The number of growing attempts was not sufficient to reach the maximum buffer size so far. Perform 9 iterations to reach max and let the 10th one fail. j:0 i:65536 max:16777215 j:1 i:131072 max:16777215 j:2 i:262144 max:16777215 j:3 i:524288 max:16777215 j:4 i:1048576 max:16777215 j:5 i:2097152 max:16777215 j:6 i:4194304 max:16777215 j:7 i:8388608 max:16777215 j:8 i:16777216 max:16777215 Signed-off-by: Thomas Graf <tgraf@suug.ch> Acked-by: Daniel Borkmann <daniel@iogearbox.net>	2016-12-09 12:42:11 -08:00
Cyrill Gorcunov	9f66764e30	libnetlink: Add test for error code returned from netlink reply In case if some diag module is not present in the system, say the kernel is not modern enough, we simply skip the error code reported. Instead we should check for data length in NLMSG_DONE and process unsupported case. Signed-off-by: Cyrill Gorcunov <gorcunov@gmail.com>	2016-12-01 10:55:56 -08:00
Stephen Hemminger	328374dcfe	Merge branch 'master' into net-next	2016-12-01 10:29:12 -08:00
Stephen Hemminger	2c500a4dc2	libnetlink: style cleanups Follow kernel style related cleanups: * break long lines * remove unnecessary void * cast	2016-11-29 13:15:08 -08:00
Zhang Shengju	1b109a30bf	libnetlink: reduce size of message sent to kernel Fixes commit 246f57c4086d99fa ("ip link: Add support for kernel side filtering"). This patch reduce the size of message sent to kernel space. Before this patch, for command: 'ip link show', we will sent 1056 bytes. With this patch, we only need to send 40 bytes. Signed-off-by: Zhang Shengju <zhangshengju@cmss.chinamobile.com>	2016-11-29 13:03:00 -08:00
Zhang Shengju	2d98dd4821	iproute2: fix the link group name getting error In the situation where more than one entry live in the same hash bucket, loop to get the correct one. Before: $ cat /etc/iproute2/group 0 default 256 test $ sudo ip link set group test dummy1 $ ip link show type dummy 11: dummy0: <BROADCAST,NOARP> mtu 1500 qdisc noop state DOWN mode DEFAULT group 0 qlen 1000 link/ether 4e:3b:d3:6c:f0:e6 brd ff:ff:ff:ff:ff:ff 12: dummy1: <BROADCAST,NOARP> mtu 1500 qdisc noop state DOWN mode DEFAULT group test qlen 1000 link/ether d6:9c:a4:1f:e7:e5 brd ff:ff:ff:ff:ff:ff After: $ ip link show type dummy 11: dummy0: <BROADCAST,NOARP> mtu 1500 qdisc noop state DOWN mode DEFAULT group default qlen 1000 link/ether 4e:3b:d3:6c:f0:e6 brd ff:ff:ff:ff:ff:ff 12: dummy1: <BROADCAST,NOARP> mtu 1500 qdisc noop state DOWN mode DEFAULT group test qlen 1000 link/ether d6:9c:a4:1f:e7:e5 brd ff:ff:ff:ff:ff:ff Signed-off-by: Zhang Shengju <zhangshengju@cmss.chinamobile.com>	2016-11-29 12:48:07 -08:00
Daniel Borkmann	e42256699c	bpf: make tc's bpf loader generic and move into lib This work moves the bpf loader into the iproute2 library and reworks the tc specific parts into generic code. It's useful as we can then more easily support new program types by just having the same ELF loader backend. Joint work with Thomas Graf. I hacked a rough start of a test suite to make sure nothing breaks [1] and looks all good. [1] https://github.com/borkmann/clsact/blob/master/test_bpf.sh Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Signed-off-by: Thomas Graf <tgraf@suug.ch>	2016-11-29 12:35:32 -08:00
stefan@datenfreihafen.org	8ae2c5382b	ip: update link types to show 6lowpan and ieee802.15.4 monitor Both types have been missing here and thus ip always showed only the numbers. Based on a suggestion from Alexander Aring. Signed-off-by: Stefan Schmidt <stefan@datenfreihafen.org>	2016-11-12 10:14:03 +03:00
Stephen Hemminger	d54e3ab985	Merge branch 'master' into net-next	2016-10-09 18:53:52 -07:00
Igor Ryzhov	6cf2609ddb	fix netlink message length checks Signed-off-by: Igor Ryzhov <iryzhov@nfware.com>	2016-10-09 18:48:30 -07:00
Stephen Hemminger	88ba11bc08	Merge branch 'master' into net-next	2016-09-01 09:11:10 -07:00
Nikolay Aleksandrov	56e3eb4c34	ip: route: fix multicast route dumps If we have multicast routes and do ip route show table all we'll get the following output: ... multicast ???/32 from ???/32 table default proto static iif eth0 The "???" are because the rtm_family is set to RTNL_FAMILY_IPMR instead (or RTNL_FAMILY_IP6MR for ipv6). Add a simple workaround that returns the real family based on the rtm_type (always RTN_MULTICAST for ipmr routes) and the rtm_family. Similar workaround is already used in ipmroute, and we can use this helper there as well. After the patch the output is: multicast 239.10.10.10/32 from 0.0.0.0/32 table default proto static iif eth0 Also fix a minor whitespace error and switch to tabs. Reported-by: Satish Ashok <sashok@cumulusnetworks.com> Signed-off-by: Nikolay Aleksandrov <nikolay@cumulusnetworks.com>	2016-09-01 08:41:37 -07:00
Nikolay Aleksandrov	7abf5de677	bridge: vlan: add support to display per-vlan statistics This patch adds support for the stats argument to the bridge vlan command which will display the per-vlan statistics and the device each vlan belongs to with its flags. The supported command filtering options are dev and vid. Also the man page is updated to explain the new option. The patch uses the new RTM_GETSTATS interface with a filter_mask to dump all bridges and ports vlans. Later we can add support for using the per-device dump and filter it in the kernel instead. Example: $ bridge -s vlan show port vlan id br0 1 Egress Untagged RX: 2536 bytes 20 packets TX: 2536 bytes 20 packets 101 RX: 43158 bytes 50 packets TX: 43158 bytes 50 packets eth1 1 Egress Untagged RX: 2536 bytes 20 packets TX: 2536 bytes 20 packets 100 RX: 0 bytes 0 packets TX: 0 bytes 0 packets 101 RX: 43158 bytes 50 packets TX: 43158 bytes 50 packets 102 RX: 16897 bytes 93 packets TX: 0 bytes 0 packets The format is the same as bridge vlan show but with stats, even though under the hood the calls done to the kernel are different. Signed-off-by: Nikolay Aleksandrov <nikolay@cumulusnetworks.com>	2016-08-29 10:58:40 -07:00
Sabrina Dubroca	2b68cb77cd	libgenl: introduce genl_init_handle All users of genl have the same code to open a genl socket and resolve the family for their specific protocol. Introduce a helper to initialize the handle, and use it in all the genl code. Signed-off-by: Sabrina Dubroca <sd@queasysnail.net>	2016-08-17 13:59:21 -07:00
Phil Sutter	f89bb0210f	Replace malloc && memset by calloc This only replaces occurrences where the newly allocated memory is cleared completely afterwards, as in other cases it is a theoretical performance hit although code would be cleaner this way. Signed-off-by: Phil Sutter <phil@nwl.cc> Acked-by: David Ahern <dsa@cumulusnetworks.com>	2016-07-20 12:05:24 -07:00
Phil Sutter	d17b136f7d	Use C99 style initializers everywhere This big patch was compiled by vimgrepping for memset calls and changing to C99 initializer if applicable. One notable exception is the initialization of union bpf_attr in tc/tc_bpf.c: changing it would break for older gcc versions (at least <=3.4.6). Calls to memset for struct rtattr pointer fields for parse_rtattr*() were just dropped since they are not needed. The changes here allowed the compiler to discover some unused variables, so get rid of them, too. Signed-off-by: Phil Sutter <phil@nwl.cc> Acked-by: David Ahern <dsa@cumulusnetworks.com>	2016-07-20 12:05:24 -07:00
Anuradha Karuppiah	d721a14590	json_writer: Removed automatic json-object type from the constructor Top level can be any json type and can be created using jsonw_start_object/jsonw_end_object etc. Signed-off-by: Anuradha Karuppiah <anuradhak@cumulusnetworks.com>	2016-07-20 12:02:02 -07:00
Eli Cohen	d91fb3f4c7	Add support for configuring Infiniband GUIDs Add two NLA's that allow configuration of Infiniband node or port GUIDs by referencing the IPoIB net device set over the physical function. The format to be used is as follows: ip link set dev ib0 vf 0 node_guid 00:02:c9:03:00:21:6e:70 ip link set dev ib0 vf 0 port_guid 00:02:c9:03:00:21:6e:78 Signed-off-by: Eli Cohen <eli@mellanox.com>	2016-07-15 11:25:36 -07:00
Beniamino Galvani	9ba4126dc4	utils: fix hex digits parsing in hexstring_a2n() strtoul() only modifies errno on overflow, so if errno is not zero before calling the function its value is preserved and makes the function fail for valid inputs; initialize it. Signed-off-by: Beniamino Galvani <bgalvani@redhat.com>	2016-06-14 14:25:05 -07:00
Sabrina Dubroca	609640f5f0	utils: provide get_hex to read a hex digit from a char Signed-off-by: Sabrina Dubroca <sd@queasysnail.net> Acked-by: Phil Sutter <phil@nwl.cc>	2016-06-08 09:30:41 -07:00
Sabrina Dubroca	9f7401fa49	utils: add get_be{16, 32, 64}, use them where possible Signed-off-by: Sabrina Dubroca <sd@queasysnail.net> Acked-by: Phil Sutter <phil@nwl.cc>	2016-06-08 09:30:37 -07:00
Sabrina Dubroca	89ae502056	utils: make hexstring_a2n provide the number of hex digits parsed Signed-off-by: Sabrina Dubroca <sd@queasysnail.net> Acked-by: Phil Sutter <phil@nwl.cc>	2016-06-08 09:30:31 -07:00
David Ahern	57bdf8b764	Make builds default to quiet mode Similar to the Linux kernel and perf add infrastructure to reduce the amount of output tossed to a user during a build. Full build output can be obtained with 'make V=1' Builds go from: make[1]: Leaving directory `/home/dsa/iproute2.git/lib' make[1]: Entering directory `/home/dsa/iproute2.git/ip' gcc -Wall -Wstrict-prototypes -Wmissing-prototypes -Wmissing-declarations -Wold-style-definition -Wformat=2 -O2 -I../include -DRESOLVE_HOSTNAMES -DLIBDIR=\"/usr/lib\" -DCONFDIR=\"/etc/iproute2\" -D_GNU_SOURCE -D_FILE_OFFSET_BITS=64 -D_LARGEFILE_SOURCE -D_LARGEFILE64_SOURCE -c -o ip.o ip.c gcc -Wall -Wstrict-prototypes -Wmissing-prototypes -Wmissing-declarations -Wold-style-definition -Wformat=2 -O2 -I../include -DRESOLVE_HOSTNAMES -DLIBDIR=\"/usr/lib\" -DCONFDIR=\"/etc/iproute2\" -D_GNU_SOURCE -D_FILE_OFFSET_BITS=64 -D_LARGEFILE_SOURCE -D_LARGEFILE64_SOURCE -c -o ipaddress.o ipaddress.c to: ... AR libutil.a ip CC ip.o CC ipaddress.o ... Signed-off-by: David Ahern <dsa@cumulusnetworks.com>	2016-05-31 12:13:07 -07:00
David Ahern	b0a4ce620e	ip link: Add support for kernel side filtering Kernel gained support for filtering link dumps with commit dc599f76c22b ("net: Add support for filtering link dump by master device and kind"). Add support to ip link command. If a user passes master device or kind to ip link command they are added to the link dump request message. Signed-off-by: David Ahern <dsa@cumulusnetworks.com>	2016-05-18 11:52:14 -07:00
Jiri Pirko	4952b45946	include: add linked list implementation from kernel Rename hlist.h to list.h while adding it to be aligned with kernel Signed-off-by: Jiri Pirko <jiri@mellanox.com>	2016-03-27 10:56:11 -07:00
Stephen Hemminger	e9e9365b56	scrub out whitespace issues Run script that removes trailing whitespace everywhere.	2016-03-27 10:50:14 -07:00
Marco Varlese	334af76143	fix get_addr() and get_prefix() error messages An attempt to add invalid address to interface would print "???" string instead of the address family name. For example: $ ip address add 256.10.166.1/24 dev ens8 Error: ??? prefix is expected rather than "256.10.166.1/24". $ ip neighbor add proxy 2001:db8::g dev ens8 Error: ??? address is expected rather than "2001:db8::g". With this patch the output will look like: $ ip address add 256.10.166.1/24 dev ens8 Error: inet prefix is expected rather than "256.10.166.1/24". $ ip neighbor add proxy 2001:db8::g dev ens8 Error: inet6 address is expected rather than "2001:db8::g". Signed-off-by: Przemyslaw Szczerbik <przemyslawx.szczerbik@intel.com> Signed-off-by: Marco Varlese <marco.varlese@intel.com>	2016-03-27 10:47:02 -07:00
Phil Sutter	f63ed3e629	lib/ll_addr: improve ll_addr_n2a() a bit Apart from making the code a bit more compact and efficient, this also prevents a potential buffer overflow if the passed buffer is really too small: Although correctly decrementing the size parameter passed to snprintf, it could become negative which would then wrap since snprintf uses (unsigned) size_t for the parameter. Signed-off-by: Phil Sutter <phil@nwl.cc>	2016-03-27 10:37:35 -07:00
Phil Sutter	2e96d2ccd0	utils: make rt_addr_n2a() non-reentrant by default There is only a single user who needs it to be reentrant (not really, but it's safer like this), add rt_addr_n2a_r() for it to use. Signed-off-by: Phil Sutter <phil@nwl.cc>	2016-03-27 10:37:34 -07:00
Phil Sutter	a418e45164	make format_host non-reentrant by default There are only three users which require it to be reentrant, the rest is fine without. Instead, provide a reentrant format_host_r() for users which need it. Signed-off-by: Phil Sutter <phil@nwl.cc>	2016-03-27 10:37:34 -07:00
Phil Sutter	a1121aa1f5	color: introduce color helpers and COLOR_CLEAR This adds two helper functions which map a given data field to a color, so color_fprintf() statements don't have to be duplicated with only a different color value depending on that data field's value. In order for this to work in a generic way, COLOR_CLEAR has been added to serve as a fallback default of uncolored output. Signed-off-by: Phil Sutter <phil@nwl.cc>	2016-03-27 10:37:34 -07:00
Phil Sutter	72b365e8e0	libnetlink: Double the dump buffer size There have been reports about 'ip addr' printing "Message truncated" on systems with large numbers of VFs. Although I haven't been able to get my hands on hardware suitable to reproduce this, increasing the dump buffer has been reported to resolve the issue. For want of a better idea, just double the buffer size to 32k. Feels like this opportunistic buffer size selection is rather workarounding a design flaw in libnetlink or maybe even the netlink protocol itself. Signed-off-by: Phil Sutter <phil@nwl.cc>	2016-03-06 12:51:18 -08:00
Gustavo Zacarias	4a36b4c2ec	iproute2: fix building with musl We need limits.h for PATH_MAX, fixes: rt_names.c:364:13: error: ‘PATH_MAX’ undeclared (first use in this function) Signed-off-by: Gustavo Zacarias <gustavo@zacarias.com.ar>	2016-02-02 15:58:33 +11:00
Lorenzo Colitti	57fdf2d4d9	libnetlink: don't print NETLINK_SOCK_DIAG errors in rtnl_talk This change is a no-op, as currently no code uses rtnl_talk on NETLINK_SOCK_DIAG_BY_FAMILY sockets. It is needed to suppress spurious errors when using SOCK_DESTROY via rtnl_talk. Signed-off-by: Lorenzo Colitti <lorenzo@google.com>	2016-01-18 11:47:03 -08:00
Stephen Hemminger	c13b6b097a	add coverity model file Track any coverity overrides for this project. Signed-off-by: Stephen Hemminger <stephen@networkplumber.org>	2015-12-30 18:06:12 -08:00
Stephen Hemminger	00a2a1748b	Merge branch 'master' into net-next	2015-12-17 17:21:15 -08:00
Tom Herbert	5866bddd9a	ila: Add support for ILA lwtunnels This patch: - Adds a utility function for parsing a 64 bit address - Adds a utility function for converting a 64 bit address to ASCII - Adds and ILA encap type in lwt tunnels Signed-off-by: Tom Herbert <tom@herbertland.com>	2015-12-17 17:07:07 -08:00
Stephen Hemminger	6ad355ca9e	Merge branch 'master' into net-next	2015-12-10 08:56:18 -08:00
Nicolas Dichtel	ed108cfc02	libnetlink: don't confuse variables in rtnl_talk() There is two variables named 'len' in rtnl_talk. In fact, commit `c079e121a7` didn't work. For example, it was possible to trigger a seg fault with this command: $ ip link set gre2 type ip6gre hoplimit 32 Let's rename the argument len to maxlen. Fixes: `c079e121a7` ("libnetlink: add size argument to rtnl_talk") Reported-by: Thomas Faivre <thomas.faivre@6wind.com> Signed-off-by: Nicolas Dichtel <nicolas.dichtel@6wind.com>	2015-12-10 08:45:21 -08:00
Daniel Borkmann	f6793eec46	{f, m}_bpf: allow for user-defined object pinnings The recently introduced object pinning can be further extended in order to allow sharing maps beyond tc namespace. F.e. maps that are being pinned from tracing side, can be accessed through this facility as well. Signed-off-by: Daniel Borkmann <daniel@iogearbox.net> Acked-by: Alexei Starovoitov <ast@kernel.org>	2015-11-29 11:55:16 -08:00
Phil Sutter	8e72880f6b	libnetlink: introduce nc_flags Allow for a filter to ignore certain nlmsg_flags. Signed-off-by: Phil Sutter <phil@nwl.cc>	2015-11-29 11:47:29 -08:00
Stephen Hemminger	68ef507249	rt_names: style cleanup Cleanup all checkpatch complaints about whitespace in rt_names.	2015-11-29 11:41:23 -08:00
David Ahern	13ada95da4	Add support for rt_tables.d Add support for reading table id/name mappings from rt_tables.d directory. Suggested-by: Roopa Prabhu <roopa@cumulusnetworks.com> Signed-off-by: David Ahern <dsa@cumulusnetworks.com>	2015-11-29 11:29:31 -08:00
Stephen Hemminger	1e5aa99024	Merge branch 'master' into net-next	2015-11-03 16:31:57 -08:00
Phil Sutter	b5bb1820e8	lib/utils: improve error messages of get_addr() and get_prefix() Instead of statically complaining about illegal inet address, use get_family() to get the address family right. Based on a patch by Hangbin Liu to print "inet6" for AF_INET6 made more generic by me. Signed-off-by: Phil Sutter <phil@nwl.cc>	2015-11-03 16:28:36 -08:00
Stephen Hemminger	c6646c1ea5	Merge branch 'master' into net-next	2015-10-16 16:03:32 -07:00
Roopa Prabhu	303cc9cbee	libnetlink: introduce rta_nest and u8, u16, u64 helpers for nesting within rtattr This patch introduces two new api's rta_nest and rta_nest_end to nest attributes inside a rta attribute represented by 'struct rtattr' as required to construct a nexthop. Also adds rta_addattr* variants for u8, u16 and u64 as needed to support encapsulation. Signed-off-by: Roopa Prabhu <roopa@cumulusnetworks.com> Signed-off-by: Thomas Graf <tgraf@suug.ch> Acked-by: Jiri Benc <jbenc@redhat.com>	2015-10-16 16:00:47 -07:00
David Ahern	0d238ca2b8	ip neigh: Add support for filtering dumps by master device Add support for filtering neighbor dumps by master device. Kernel side support provided by commit 21fdd092acc7. Since the feature is not available in older kernels the user is given a warning message if the kernel does not support the request. Signed-off-by: David Ahern <dsa@cumulusnetworks.com>	2015-10-12 09:39:37 -07:00

1 2 3 4 5

206 Commits