The conntrack-tools user manual Pablo Neira Ayuso
pablo@netfilter.org
2008-2011 Pablo Neira Ayuso Permission is granted to copy, distribute and/or modify this document under the terms of the GNU Free Documentation License, Version 1.2 or any later version published by the Free Software Foundation; with no Invariant Sections, no Front-Cover Texts, and no Back-Cover Texts. A copy of the license is included in the section entitled "GNU Free Documentation License". This document details how to install and configure the conntrack-tools >= 0.9.15. This software is under development, for that reason, it is likely that this document will evolve in the future to cover new features and changes.
Introduction This document should be a kick-off point to install and configure the conntrack-tools. If you find any error or imprecision in this document, please send an email to the author, it will be appreciated. In this document, the author assumes that the reader is familiar with firewalling concepts and iptables in general. If this is not your case, I suggest you to read the iptables documentation before going ahead. Moreover, the reader must also understand the difference between stateful and stateless firewalls. If this is not your case, I strongly suggest you to read the article Netfilter's Connection Tracking System published in :login; the USENIX magazine. That document contains a general description that should help to clarify the concepts. If you do not fulfill the previous requirements, this documentation is likely to be a source of frustration. Probably, you wonder why I'm insisting on these prerequisites too much, the fact is that if your iptables rule-set is stateless, it is very likely that the conntrack-tools will not be of any help for you. You have been warned! What are the conntrack-tools? The conntrack-tools are a set of free software tools for GNU/Linux that allow system administrators interact, from user-space, with the in-kernel Connection Tracking System, which is the module that enables stateful packet inspection for iptables. Probably, you did not hear about this module so far. However, if any of the rules of your rule-set use the state or ctstate iptables matches, you are indeed using it. The conntrack-tools package contains two programs: conntrack is command line interface conntrack provides a more flexible interface to the connnection tracking system than /proc/net/ip_conntrack. With conntrack, you can show, delete and update the existing state entries; and you can also listen to flow events. conntrackd is the user-space connection tracking daemon. This daemon can be used to deploy fault-tolerant GNU/Linux firewalls but you can also use it to collect flow-based statistics of the firewall use. Although the name of both tools is very similar - and you can blame me for that, I'm not a marketing guy - they are used for very different tasks. Requirements You have to install the following software in order to get the conntrack-tools working. Make sure that you have installed them correctly before going ahead: Linux kernel version >= 2.6.18 that, at least, has support for: Connection Tracking System. CONFIG_NF_CONNTRACK=m CONFIG_NF_CONNTRACK_IPV4=m CONFIG_NF_CONNTRACK_IPV6=m (if your setup supports IPv6) nfnetlink: the generic messaging interface for Netfilter. CONFIG_NETFILTER_NETLINK=m nf_conntrack_netlink: the messaging interface for the Connection Tracking System. CONFIG_NF_CT_NETLINK=m connection tracking event notification API: the flow-based event notification interface. CONFIG_NF_CONNTRACK_EVENTS=y Verifying kernel support Make sure you have loaded nf_conntrack, nf_conntrack_ipv4 (if your setup also supports IPv6, nf_conntrack_ipv6) and nf_conntrack_netlink. libnfnetlink: the netfilter netlink library use the official release available in netfilter.org libnetfilter_conntrack: the netfilter netlink library use the official release available in netfilter.org Installation To compile and install the conntrack-tools run the following commands: (non-root)$ tar xvjf conntrack-tools-x.x.x.tar.bz2 (non-root)$ cd conntrack-tools-x.x.x (non-root)$ ./configure --prefix=/usr (non-root)$ make (root) # make install Fedora Users If you are installing the libraries in /usr/local/, do not forget to do the following things: PKG_CONFIG_PATH=/usr/local/lib/pkgconfig; export PKG_CONFIG_PATH Add `/usr/local/lib' to your /etc/ld.so.conf file and run `ldconfig' Check `ldd' for trouble-shooting, read this for more information on how libraries work. Verifying kernel support To check that the modules are enabled in the kernel, run `conntrack -E' and generate traffic, you should see flow events reporting new connections and updates. Using conntrack: the command line interface The /proc/net/ip_conntrack interface is very limited as it only allows you to display the existing flows, their state and other information: # cat /proc/net/ip_conntrack tcp 6 431982 ESTABLISHED src=192.168.2.100 dst=123.59.27.117 sport=34846 dport=993 packets=169 bytes=14322 src=123.59.27.117 dst=192.168.2.100 sport=993 dport=34846 packets=113 bytes=34787 [ASSURED] mark=0 secmark=0 use=1 tcp 6 431698 ESTABLISHED src=192.168.2.100 dst=123.59.27.117 sport=34849 dport=993 packets=244 bytes=18723 src=123.59.27.117 dst=192.168.2.100 sport=993 dport=34849 packets=203 bytes=144731 [ASSURED] mark=0 secmark=0 use=1 The command line tool conntrack can be used to display the same information: # conntrack -L tcp 6 431982 ESTABLISHED src=192.168.2.100 dst=123.59.27.117 sport=34846 dport=993 packets=169 bytes=14322 src=123.59.27.117 dst=192.168.2.100 sport=993 dport=34846 packets=113 bytes=34787 [ASSURED] mark=0 secmark=0 use=1 tcp 6 431698 ESTABLISHED src=192.168.2.100 dst=123.59.27.117 sport=34849 dport=993 packets=244 bytes=18723 src=123.59.27.117 dst=192.168.2.100 sport=993 dport=34849 packets=203 bytes=144731 [ASSURED] mark=0 secmark=0 use=1 conntrack v0.9.7 (conntrack-tools): 2 flow entries have been shown. You can natively filter the output without using grep: # conntrack -L -p tcp --dport 34856 tcp 6 431982 ESTABLISHED src=192.168.2.100 dst=123.59.27.117 sport=34846 dport=993 packets=169 bytes=14322 src=123.59.27.117 dst=192.168.2.100 sport=993 dport=34846 packets=113 bytes=34787 [ASSURED] mark=0 secmark=0 use=1 conntrack v0.9.7 (conntrack-tools): 1 flow entries have been shown. Update the mark based on a selection, this allows you to change the mark of an entry without using the CONNMARK target: # conntrack -U -p tcp --dport 3486 --mark 10 tcp 6 431982 ESTABLISHED src=192.168.2.100 dst=123.59.27.117 sport=34846 dport=993 packets=169 bytes=14322 src=123.59.27.117 dst=192.168.2.100 sport=993 dport=34846 packets=113 bytes=34787 [ASSURED] mark=1 secmark=0 use=1 conntrack v0.9.7 (conntrack-tools): 1 flow entries has been updated. Delete one entry, this can be used to block traffic if: You have a stateful rule-set that blocks traffic in INVALID state. You have set /proc/sys/net/ipv4/netfilter/ip_conntrack_tcp_loose or /proc/sys/net/netfilter/nf_conntrack_tcp_loose, depending on your kernel version, to zero. # conntrack -D -p tcp --dport 3486 tcp 6 431982 ESTABLISHED src=192.168.2.100 dst=123.59.27.117 sport=34846 dport=993 packets=169 bytes=14322 src=123.59.27.117 dst=192.168.2.100 sport=993 dport=34846 packets=113 bytes=34787 [ASSURED] mark=1 secmark=0 use=1 conntrack v0.9.7 (conntrack-tools): 1 flow entries has been deleted. Display the connection tracking events: # conntrack -E [NEW] udp 17 30 src=192.168.2.100 dst=192.168.2.1 sport=57767 dport=53 [UNREPLIED] src=192.168.2.1 dst=192.168.2.100 sport=53 dport=57767 [UPDATE] udp 17 29 src=192.168.2.100 dst=192.168.2.1 sport=57767 dport=53 src=192.168.2.1 dst=192.168.2.100 sport=53 dport=57767 [NEW] tcp 6 120 SYN_SENT src=192.168.2.100 dst=66.102.9.104 sport=33379 dport=80 [UNREPLIED] src=66.102.9.104 dst=192.168.2.100 sport=80 dport=33379 [UPDATE] tcp 6 60 SYN_RECV src=192.168.2.100 dst=66.102.9.104 sport=33379 dport=80 src=66.102.9.104 dst=192.168.2.100 sport=80 dport=33379 [UPDATE] tcp 6 432000 ESTABLISHED src=192.168.2.100 dst=66.102.9.104 sport=33379 dport=80 src=66.102.9.104 dst=192.168.2.100 sport=80 dport=33379 [ASSURED] You can also display the existing flows in XML format, filter the output based on the NAT handling applied, etc. Setting up conntrackd: the daemon The daemon conntrackd supports two working modes: State table synchronization: the daemon can be used to synchronize the connection tracking state table between several firewall replicas. This can be used to deploy fault-tolerant stateful firewalls. This is the main feature of the daemon. Flow-based statistics collection: the daemon can be used to collect flow-based statistics. This feature is similar to what ulogd-2.x provides. State table synchronization Requirements In order to get conntrackd working in synchronization mode, you have to fulfill the following requirements: A high availability manager like keepalived that manages the virtual IPs of the firewall cluster, detects errors, and decide when to migrate the virtual IPs from one firewall replica to another. Without it, conntrackd will not work appropriately. The state synchronization setup requires a working installation of keepalived, preferibly a recent version. Check if your distribution comes with a recent packaged version. Otherwise, you may compile it from the sources. There is a very simple example file in the conntrackd sources to setup a simple HA cluster with keepalived (see the file keepalived.conf under the doc/sync/ directory). This file can be used to set up a simple VRRP cluster composed of two machines that hold the virtual IPs 192.168.0.100 on eth0 and 192.168.1.100 on eth1. If you are not familiar with keepalived, please read the official documentation available at the keepalived website (http://www.keepalived.org). If you use a different high availability manager, make sure it works correctly before going ahead. A dedicated link. The dedicated link between the firewalls is used to transmit and receive the state information. The use of a dedicated link is mandatory for security reasons as someone may pick the state information that is transfered between the firewalls. A well-formed stateful rule-set. Otherwise you are likely to experience problems during the fail-over. An example of a well-formed stateful iptables rule-set is available in the conntrack-tools website. If your Linux kernel is < 2.6.22, you have to disable TCP window tracking: # echo 1 > /proc/sys/net/ipv4/netfilter/ip_conntrack_tcp_be_liberal Configuring the daemon The daemon conntrackd in synchronization mode supports up to three replication approaches: notrack: this approach is the most simple as it is based on a best effort replication protocol, ie. unreliable protocol. This protocol sends and receives the state information without performing any specific checking. ft-fw: this approach is based on a reliable protocol that performs message tracking. Thus, the protocol can recover from message loss, re-ordering and corruption. alarm: this approach is spamming. It is based on a alarm-based protocol that periodically re-sends the flow state to the backup firewall replicas. This protocol consumes a lot of bandwidth but it resolves synchronization problems fast. The three existing approaches are soft real-time asynchronous replication protocols that are aimed to have negligible impact in terms of latency and bandwidth throughput in the stateful firewall filtering. To configure conntrackd in any of the existing synchronization modes, you have to copy the example configuration file to the directory /etc/conntrackd/ on every firewall replica. Note that _type_ is the synchronization type selected. (conntrack-tools-x.x.x)# cp doc/_type_/conntrackd.conf /etc/conntrackd/conntrackd.conf Do not forget to edit the files before going ahead. There are several parameters that you have to tune to adapt the example configuration file to your setup. Configuration file location If you don't want to put the config file under /etc/conntrackd/, just tell conntrackd where to find it passing the option -C. Active-Backup setup Stateful firewall architectures A good reading to extend the information about firewall architectures is Demystifying cluster-based fault-tolerant firewalls published in IEEE Internet Computing magazine. In the Active-Backup setup, one of the stateful firewall replicas filters traffic and the other acts as backup. If you use this approach, you have to copy the script primary-backup.sh to: (conntrack-tools-x.x.x)# cp doc/sync/primary-backup.sh /etc/conntrackd/ The HA manager invokes this script when a transition happens, ie. If a stateful firewall replica: becomes active to recover the filtering. becomes backup. hits failure (this is available if the HA manager has a failure state, which is true for keepalived. The script is simple, and it contains the different actions that conntrackd performs to recover the filtering or purge obsolete entries from the state table, among others. The script is commented, you can have a look at it if you need further information. Active-Active setup The Active-Active setup consists of having more than one stateful firewall replicas actively filtering traffic. Thus, we reduce the resource waste that implies to have a backup firewall which does nothing. We can classify the type of Active-Active setups in several families: Symmetric path routing: The stateful firewall replicas share the workload in terms of flows, ie. the packets that are part of a flow are always filtered by the same firewall. Asymmetric multi-path routing: The packets that are part of a flow can be filtered by whatever stateful firewall in the cluster. Thus, every flow-states have to be propagated to all the firewalls in the cluster as we do not know which one would be the next to filter a packet. This setup goes against the design of stateful firewalls as we define the filtering policy based on flows, not in packets anymore. As for 0.9.8, the design of conntrackd allows you to deploy an symmetric Active-Active setup based on a static approach. For example, assume that you have two virtual IPs, vIP1 and vIP2, and two firewall replicas, FW1 and FW2. You can give the virtual vIP1 to the firewall FW1 and the vIP2 to the FW2. Unfortunately, you will have to wait for the support for the Active-Active setup based on dynamic approach, ie. a workload sharing setup without directors that allow the stateful firewall share the filtering. On the other hand, the asymmetric scenario may work if your setup fulfills several strong assumptions. However, in the opinion of the author of this work, the asymmetric setup goes against the design of stateful firewalls and conntrackd. Therefore, you have two choices here: you can deploy an Active-Backup setup or go back to your old stateless rule-set (in that case, the conntrack-tools will not be of any help anymore, of course). Launching conntrackd Once you have configured conntrackd, you can run in console mode which is an interactive mode, in that case type 'conntrackd' as root. (root)# conntrackd If you want to run conntrackd in daemon mode, then type: (root)# conntrackd -d You can verify that conntrackd is running by checking the log messages via ps. Moreover, if conntrackd is running fine, you can dump the current status of the daemon: # conntrackd -s cache internal: current active connections: 4 connections created: 4 failed: 0 connections updated: 0 failed: 0 connections destroyed: 0 failed: 0 cache external: current active connections: 0 connections created: 0 failed: 0 connections updated: 0 failed: 0 connections destroyed: 0 failed: 0 traffic processed: 0 Bytes 0 Pckts multicast traffic: 352 Bytes sent 0 Bytes recv 22 Pckts sent 0 Pckts recv 0 Error send 0 Error recv multicast sequence tracking: 0 Pckts mfrm 0 Pckts lost This command displays the number of entries in the internal and external cache: The internal cache contains the states that this firewall replica is filtering, ie. this is a cache of the kernel state table. The external cache contains the states that the other firewall replica is filtering. You can dump the internal cache with the following command: # conntrackd -i tcp 6 ESTABLISHED src=192.168.2.100 dst=139.174.175.20 sport=58491 dport=993 src=139.174.175.20 dst=192.168.2.100 sport=993 dport=58491 [ASSURED] mark=0 secmark=0 [active since 536s] tcp 6 ESTABLISHED src=192.168.2.100 dst=123.59.27.117 sport=38211 dport=993 src=123.59.27.117 dst=192.168.2.100 sport=993 dport=38211 [ASSURED] mark=0 secmark=0 [active since 536s] tcp 6 ESTABLISHED src=192.168.2.100 dst=123.59.27.117 sport=38209 dport=993 src=123.59.27.117 dst=192.168.2.100 sport=993 dport=38209 [ASSURED] mark=0 secmark=0 [active since 536s] tcp 6 TIME_WAIT src=192.168.2.100 dst=74.125.45.166 sport=42593 dport=80 src=74.125.45.166 dst=192.168.2.100 sport=80 dport=42593 [ASSURED] [active since 165s] tcp 6 ESTABLISHED src=192.168.2.100 dst=139.174.175.20 sport=37962 dport=993 src=139.174.175.20 dst=192.168.2.100 sport=993 dport=37962 [ASSURED] mark=0 secmark=0 [active since 536s] You can dump the external cache with the following command: # conntrackd -e If the replication works fine, conntrackd -s displays the active's internal cache should display the same number of entries than the backup's external cache and vice-versa. To verify that the recovery works fine, if you trigger a fail-over, the log files should display the following information: [Thu Sep 18 18:03:02 2008] (pid=9759) [notice] committing external cache [Thu Sep 18 18:03:02 2008] (pid=9759) [notice] Committed 1545 new entries This means that the state entries have been injected into the kernel correctly. Other configuration options The daemon allows several configuration options that you may want to enable. This section contains some information about them. Disabling external cache It is possible to disable the external cache. Thus, conntrackd directly injects the flow-states into the in-kernel Connection Tracking System of the backup firewall. You can do it by enabling the DisableExternalCache option in the conntrackd.conf configuration file: Sync { Mode FTFW { [...] DisableExternalCache Off } } You can also use this option with the NOTRACK and ALARM modes. This increases CPU consumption in the backup firewall but now you do not need to commit the flow-states during the master failures since they are already in the in-kernel Connection Tracking table. Moreover, you save memory in the backup firewall since you do not need to store the foreign flow-states anymore. Disabling internal cache You can also disable the internal cache by means of the DisableInternalCache option in the conntrackd.conf configuration file: Sync { Mode NOTRACK { [...] DisableInternalCache Off } } However, this option is only available for the NOTRACK mode. This mode provides unreliable flow-state synchronization between firewalls. Thus, if flow-states are lost during the synchronization, the protocol provides no way to recover them. Using UDP, TCP or multicast for flow-state synchronization You can use up to three different transport layer protocols to synchronize flow-state changes between the firewalls: UDP, TCP and Multicast. UDP and multicast are unreliable but together with the FT-FW mode provide partial reliable flow-state synchronization. The preferred choice is FT-FW over UDP, or multicast alternatively. TCP introduces latency in the flow-state synchronization due to the congestion control. Under flow-state message are lost, the FIFO delivery becomes also a problem since the backup firewall quickly gets out of sync. For that reason, its use is discouraged. Note that using TCP only makes sense with the NOTRACK mode. Redundant dedicated links You can set redundant dedicated links without using bonding, you have to configure as many redundant links as you want in the configuration file. In case of failure of the master dedicated link, conntrackd failovers to one of the backups. An example of this configuration is the following: Sync { Mode FTFW { [...] } # default master dedicated link UDP Default { IPv4_address 192.168.2.1 IPv4_Destination_Address 192.168.2.2 Port 3780 Interface eth3 SndSocketBuffer 24985600 RcvSocketBuffer 24985600 Checksum on } # backup dedicated link UDP { IPv4_address 192.168.1.3 IPv4_Destination_Address 192.168.1.4 Port 3780 Interface eth2 SndSocketBuffer 24985600 RcvSocketBuffer 24985600 Checksum on } [...] } Troubleshooting Problems with conntrackd? The following list of questions should help for troubleshooting: I see packets lost in conntrackd -s You can rise the value of McastRcvSocketBuffer and McastRcvSocketBuffer, if the problem is due to buffer overruns in the multicast sender or the receiver, the problem should disapear. The log messages report that the maximum netlink socket buffer has been reached. You can increase the values of SocketBufferSize and SocketBufferSizeMaxGrown. I see can't open multicast server in the log messages Make sure that the IPv4_interface clause has the IP of the dedicated link. Can I use wackamole, heartattack or any other HA manager? Absolutely, you can. But before reporting issues, make sure that your HA manager is not the source of the problems. Does conntrackd support TCP flow-recovery with window tracking enabled? Yes, but you require a Linux kernel >= 2.6.36 and the conntrack-tools >= 0.9.15. To enable it, check the TCPWindowTracking clause in the example configuration files. Does conntrackd support the H.323 and SIP connection tracking helpers? No. This is not implemented yet, sorry. If you are interested in sponsoring this support, please contact me. Is there any way to set up a more verbose mode in the log message for debugging? No, but conntrackd provides lots of information that you can look up in runtime via -s option. You can check network statistics to find anomalies: # conntrackd -s network network statistics: recv: Malformed messages: 0 Wrong protocol version: 0 Malformed header: 0 Malformed payload: 0 Bad message type: 0 Truncated message: 0 Bad message size: 0 send: Malformed messages: 0 sequence tracking statistics: recv: Packets lost: 42726 Packets before: 0 UDP traffic (active device=eth3): 564232 Bytes sent 1979844 Bytes recv 2844 Pckts sent 8029 Pckts recv 0 Error send 0 Error recv You can check cache statistics: cache:internal active objects: 0 active/total entries: 0/ 0 creation OK/failed: 11068/ 0 no memory available: 0 no space left in cache: 0 update OK/failed: 4128/ 0 entry not found: 0 deletion created/failed: 11068/ 0 entry not found: 0 cache:external active objects: 0 active/total entries: 0/ 0 creation OK/failed: 10521/ 0 no memory available: 0 no space left in cache: 0 update OK/failed: 8832/ 0 entry not found: 0 deletion created/failed: 10521/ 0 entry not found: 0 You can check runtime miscelaneous statistics: daemon uptime: 14 min netlink stats: events received: 24736 events filtered: 0 events unknown type: 0 catch event failed: 0 dump unknown type: 0 netlink overrun: 0 flush kernel table: 1 resync with kernel table: 0 current buffer size (in bytes): 8000000 runtime stats: child process failed: 0 child process segfault: 0 child process termsig: 0 select failed: 0 wait failed: 0 local read failed: 0 local unknown request: 0 You can check dedicated link statistics: UDP traffic device=eth3 status=RUNNING role=ACTIVE: 566848 Bytes sent 1982612 Bytes recv 3018 Pckts sent 8203 Pckts recv 0 Error send 0 Error recv You can check network queue statistics: allocated queue nodes: 1 queue txqueue: current elements: 0 maximum elements: 2147483647 not enough space errors: 0 queue errorq: current elements: 0 maximum elements: 128 not enough space errors: 0 queue rsqueue: current elements: 1 maximum elements: 131072 not enough space errors: 0