Network reconfiguration tutorial: Difference between revisions

From Grid5000
Jump to navigation Jump to search
No edit summary
 
(74 intermediate revisions by 8 users not shown)
Line 11: Line 11:
= Introduction =
= Introduction =


The goal of this tutorial is to present a example of usage of Grid'5000, which benefits from the capability of '''reconfiguration of the network topology''' of the platform, using '''[[KaVLAN]]'''.
This tutorial presents a example of usage of Grid'5000 which benefits from the capability of '''reconfiguration of the network topology''' of the platform, using '''[[KaVLAN]]'''.


'''[[KaVLAN]]''' is a tool of Grid'5000, which allows a user to manage [https://en.wikipedia.org/wiki/Virtual_LAN VLANs] in the platform. Behind the scenes, [[KaVLAN]] actually changes the configuration of the network switchs, to set the VLAN membership (VLAN ID) for the ports which are cabled to the network interfaces of one or more nodes. That mechanism can provide a layer 2 network isolation for experiments.
'''[[KaVLAN]]''' is a Grid'5000 tool, which allows a user to manage [https://en.wikipedia.org/wiki/Virtual_LAN VLANs] in the platform. '''That mechanism can provide a layer 2 network isolation for experiments.'''


Three kinds of KaVLAN VLANs are available on Grid'5000. You can find more information in the [[KaVLAN]] page. In this tutorial, we will only use '''''global'' and ''local VLANs''''' (but no ''routed VLAN'').
This tutorial will only use 3 nodes, but could be adapted for a usage at scale, with hundreds of nodes.
 
Three kinds of KaVLAN VLANs are available in Grid'5000. You can find more information in the [[KaVLAN]] page. In this tutorial, we will only use '''''global'' and ''local VLANs''''' (but no ''routed VLAN'').


Before going further with the practical setup proposed in this page, it is '''strongly advised to read the [[KaVLAN]] page'''.
Before going further with the practical setup proposed in this page, it is '''strongly advised to read the [[KaVLAN]] page'''.
Line 24: Line 26:
[[File:TP-Kavlan-School2016.png]]
[[File:TP-Kavlan-School2016.png]]


Nodes nancy-1 and rennes-2 are placed in a same "global" (inter-site) vlan, and rennes-2 is also placed in a same vlan as rennes-1 ("local" vlan).
== Reservations of the resources==
== Reservations of the resources==


Line 40: Line 43:


=== In Rennes (Rennes terminal) ===
=== In Rennes (Rennes terminal) ===
In Rennes, we have to reserve 1 ''local VLAN'', and 2 nodes for instance of the ''paravance'' cluster, because that cluster is composed of nodes with 2 network interfaces (in order to find other Grid'5000 clusters providing such a characteristic, you can go to [https://api.grid5000.fr/sid/ui/quick-start.html this API page]).
In Rennes, we have to reserve 1 ''local VLAN'', and 2 nodes for instance of the ''paravance'' cluster, because that cluster is composed of nodes with 2 network interfaces (in order to find other Grid'5000 clusters providing such a characteristic, you can go to [[Hardware#Networking|the Hardware page]]).


We also try to reserve the ''global VLAN'' in Rennes as well.
We also try to reserve the ''global VLAN'' in Rennes as well.
Line 50: Line 53:


Once logged to Rennes' frontend in our first terminal, we execute:
Once logged to Rennes' frontend in our first terminal, we execute:
{{Term|location=rennes:frontend|cmd=<code class="command">oarsub</code>  -t deploy -l {"type='kavlan-global'"}/vlan=1+{"type='kavlan-local'"}/vlan=1+{"cluster='paravance'"}/nodes=2,walltime=<code class="replace">WALLTIME</code> -r "<code class="replace">STARTDATE</code>"}}
{{Term|location=rennes:frontend|cmd=<code class="command">oarsub</code>  -t deploy -l {"type='kavlan-global'"}/vlan=1+{"type='kavlan-local'"}/vlan=1+{paravance}/nodes=2,walltime=<code class="replace">WALLTIME</code> -r "<code class="replace">STARTDATE</code>"}}


{{Note|text=To use KaVLAN, a job of type '''deploy''' (kadeploy) is mandatory}}
{{Note|text=To use KaVLAN, a job of type '''deploy''' (kadeploy) is mandatory}}
Line 62: Line 65:
; If you face issue #1:
; If you face issue #1:


Just replace Rennes by another site which has nodes with several interfaces (see [https://api.grid5000.fr/sid/ui/quick-start.html the API page] to find such a site).
Just replace Rennes by another site which has nodes with several interfaces (see [[Hardware#Networking|the Hardware page]] to find such a site).


; If you face issue #2:
; If you face issue #2:
Line 69: Line 72:


So, '''if the previous command fails''' because Rennes' global VLAN is not available, only execute:
So, '''if the previous command fails''' because Rennes' global VLAN is not available, only execute:
{{Term|location=rennes:frontend|cmd=<code class="command">oarsub</code> -t deploy -l {"type='kavlan-local'"}/vlan=1+{"cluster='paravance'"}/nodes=2,walltime=<code class="replace">WALLTIME</code> -r "<code class="replace">STARTDATE</code>"}}
{{Term|location=rennes:frontend|cmd=<code class="command">oarsub</code> -t deploy -l {"type='kavlan-local'"}/vlan=1+{paravance}/nodes=2,walltime=<code class="replace">WALLTIME</code> -r "<code class="replace">STARTDATE</code>"}}
and do not forget to reserve a global VLAN elsewhere.
(and do not forget to reserve a ''global VLAN'' in the other site, as you will be proposed in the next section about the reservation in Nancy).


'''Either way, a successful <code class="command">oarsub</code> command should have returned a <code class="replace">JOBID</code>.'''
'''Either way, a successful <code class="command">oarsub</code> command should have returned a <code class="replace">JOBID</code>.'''


Let's call our 2 nodes in Rennes <code class="replace">rennes-1</code> and <code class="replace">rennes-2</code> (e.g. you might have <code class="replace">rennes-1</code> = paravance-23).
Let's call our 2 nodes in Rennes <code class="replace">rennes-1</code> and <code class="replace">rennes-2</code> (e.g. you might have <code class="replace">rennes-1</code> = paravance-23).


Now, let's retrieve the ID of our VLANs (local + global, or just the local one if the global VLAN is reserved elsewhere):
Now, let's retrieve the ID of our VLANs (local + global, or just the local one if the global VLAN is reserved elsewhere):
Line 97: Line 101:


You can now look at which VLANs you get, as above.
You can now look at which VLANs you get, as above.
{{Term|location=rennes:frontend|cmd=<code class="command">kavlan</code> -V -j <code class="replace">JOBID</code>}}
{{Term|location=nancy:frontend|cmd=<code class="command">kavlan</code> -V -j <code class="replace">JOBID</code>}}


== Deployment of our operating system for the experiment ==
== Deployment of our operating system for the experiment ==
Line 105: Line 109:
(<code class="replace">JOBID</code> is the one of the job of Rennes of course)
(<code class="replace">JOBID</code> is the one of the job of Rennes of course)


Then we launch the deployment of the Debian Jessie minimal environment to the nodes:
Then we launch the deployment of the Debian minimal environment to the nodes:
{{Term|location=rennes:frontend|cmd=<code class="command">kadeploy3</code> -e jessie-x64-min -k -f $OAR_NODEFILE}}
{{Term|location=rennes:frontend|cmd=<code class="command">kadeploy3</code> -e debian11-x64-min -k -f $OAR_NODEFILE}}
{{Note|text=We choose to not use the ''--vlan <code class="replace">VLAN-ID</code>'' option of <code class="command">kadeploy3</code> here, because we want to tweak VLANs ourselves.}}
{{Note|text=We choose to not use the ''--vlan <code class="replace">VLAN-ID</code>'' option of <code class="command">kadeploy3</code> here, because we want to tweak VLANs ourselves.}}
Line 116: Line 120:


{{Note|text=When the access to a node is broken due to an error in a network setup for instance, the [[Kaconsole]] tool becomes very handy: it allows to connect to the serial console of a node, quite the same way you would connect to a virtual console of a GNU/Linux workstation (CTRL+ALT+F1), to fix the network configuration for instance.
{{Note|text=When the access to a node is broken due to an error in a network setup for instance, the [[Kaconsole]] tool becomes very handy: it allows to connect to the serial console of a node, quite the same way you would connect to a virtual console of a GNU/Linux workstation (CTRL+ALT+F1), to fix the network configuration for instance.
{{Term|location=frontend|cmd=<code class="command">kaconsole3</code> -m <code class="replace">rennes-1</code>}}
{{Term|location=rennes:frontend|cmd=<code class="command">kaconsole3</code> -m <code class="replace">rennes-1</code>}}
Credentials for login on the console with our deployed environment is "root":"grid5000".
Credentials for login on the console with our deployed environment is "root":"grid5000".
}}
}}
Line 122: Line 126:
=== In Nancy (Nancy terminal) ===
=== In Nancy (Nancy terminal) ===
We run from Nancy's frontend the same commands as in Rennes, but use the <code class="replace">JOBID</code> of Nancy of course.
We run from Nancy's frontend the same commands as in Rennes, but use the <code class="replace">JOBID</code> of Nancy of course.


== Installation of useful extra packages for networking ==
== Installation of useful extra packages for networking ==
   
   
In this tutorial we are going to need '''some extra packages''' which are not installed in jessie-min environment, such as <code class="command">tcpdump</code>. Since we are going to modify the network configurations making the '''Internet unreachable''' from the nodes, we must '''install them beforehand'''.
In this tutorial we are going to need '''some extra packages''' which are not installed in debian11-min environment, such as <code class="command">tcpdump</code>. Since we are going to modify the network configurations making the '''Internet unreachable''' from the nodes, we must '''install them beforehand'''.
   
   
Besides, you may wish to play with (or debug) your network configuration with other tools like <code class="command">ethtool</code>, <code class="command">dig</code>, or <code class="command">traceroute</code> as well. If you want to use them later you need to install them now too.
Besides, you may wish to play with (or debug) your network configuration with other tools like <code class="command">ethtool</code>, <code class="command">dig</code>, or <code class="command">traceroute</code> as well. If you want to use them later you need to install them now too.
Line 137: Line 139:
=== In Rennes (Rennes terminal) ===
=== In Rennes (Rennes terminal) ===
   
   
In Rennes, since we reserved several nodes (hey, 2 is more than 1! :-) ), we will be using [[TakTuk]] for speeding up the process:
In Rennes, since we reserved several nodes (hey, 2 is more than 1! :-) ), we will be using [[Using TakTuk|TakTuk]] for speeding up the process:


{{Term|location=rennes:frontend|cmd=<code class="command">taktuk</code> -s -l root -f $OAR_FILE_NODES broadcast exec [ "apt-get update; apt-get --yes install at tcpdump dnsutils traceroute ethtool" ]}}
{{Term|location=rennes:frontend|cmd=<code class="command">taktuk</code> -s -l root -f $OAR_FILE_NODES broadcast exec [ "apt-get update; apt-get --yes install at tcpdump dnsutils traceroute ethtool ipcalc" ]}}
   
   
In addition, we will need OpenVSwitch on <code class="replace">rennes-2</code> in the second part of the tutorial, so we install it too:
In addition, we will need OpenVSwitch on <code class="replace">rennes-2</code> in the second part of the tutorial, so we install it too:
Line 152: Line 154:
* While it can be interesting to manage by ourselves the IP addressing of the nodes in our VLAN, Grid'5000 provides predefined IP addresses for the nodes within any VLAN, which can be very handy (see the [[KaVLAN]] page), especially because they are served by a DHCP service within each VLAN. So both options are possible, and one has to weight the pros and the cons, depending on the actual need.
* While it can be interesting to manage by ourselves the IP addressing of the nodes in our VLAN, Grid'5000 provides predefined IP addresses for the nodes within any VLAN, which can be very handy (see the [[KaVLAN]] page), especially because they are served by a DHCP service within each VLAN. So both options are possible, and one has to weight the pros and the cons, depending on the actual need.
* Setting up our own IP addressing mechanism can actually be seen as an ''"expert mode"'', to which the <code class="command">kavlan</code> command provides some support as well, allowing to disable the DHCP service in a VLANs if required. Please really '''mind''' what you do when in that ''expert mode''.
* Setting up our own IP addressing mechanism can actually be seen as an ''"expert mode"'', to which the <code class="command">kavlan</code> command provides some support as well, allowing to disable the DHCP service in a VLANs if required. Please really '''mind''' what you do when in that ''expert mode''.
* If not using the IP addresses which are provided by Grid'5000, routing to ''Global VLANs'' and ''Routed VLANs'' will not work, and neither will SSH gateways to ''local VLANs''. However, if all or some of those feature as not useful for your experiment, it is not harmful.
* If not using the IP addresses which are provided by Grid'5000, routing to ''Global VLANs'' and ''Routed VLANs'' will not work, and neither will SSH gateways to ''local VLANs''. However, if all or some of those feature are not useful for your experiment, it is not harmful.


That said, for this tutorial and up until the ''openvswitch'' section, we will use the predefined IP addresses provided by Grid'5000. This will be very handy thanks to the DHCP services which are provided in the VLANs.
That said, for this tutorial and up until the ''openvswitch'' section, we will use the predefined IP addresses provided by Grid'5000. This will be very handy thanks to the DHCP services which are provided in the VLANs.
Line 161: Line 163:


=== In Nancy (Nancy terminal) ===
=== In Nancy (Nancy terminal) ===
The <code class="replace">nancy-1</code> node is in the ''global VLAN''. Using DHCP, it will thus receive the IP address which corresponds to <code class="replace">nancy-1</code>-kavlan-<code class="replace">G_ID</code>. See the table in the [[KaVLAN]] page to find out the corresponding IP address, or run:
<code class="replace">nancy-1</code> will be put in the ''global VLAN''. Once in that VLAN, it will be able to ask for an IP address to the DHCP server of the ''global VLAN'', which will answer the IP corresponding to <code class="replace">nancy-1</code>-kavlan-<code class="replace">G_ID</code>. See the table in the [[KaVLAN]] page to find out the corresponding IP address, or run:


{{Term|location=nancy:frontend|cmd=<code class="command">host</code> <code class="replace">nancy-1</code>-kavlan-<code class="replace">G_ID</code>}}
{{Term|location=nancy:frontend|cmd=<code class="command">host</code> <code class="replace">nancy-1</code>-kavlan-<code class="replace">G_ID</code>}}


Since the IP configuration of the network Interface uses DHCP in the environment we deployed, the mechanism we have to trigger is the following:
The mechanism we have to trigger is the following:
# Change the VLAN of the node
# Change the VLAN of the node
# Restart the network service
# Restart the network service


Since our node won't be reachable with its old IP address (the one for the default VLAN) once put in the ''global VLAN'', we have to create before action #1, a ''trigger'' for running action #2 asynchronously. That can be done using the <code class="command">at</code> command, for instance. Let's do that in one command line:
Since our node won't be reachable with its old IP address (the one for the default VLAN) once put in the ''global VLAN'', we won't be able to run action #2 after running action #1 as-is. We will use a trick to create a ''trigger'' before action #1, in order to run action #2 asynchronously. That can be done using the <code class="command">at</code> command, for instance. Let's do that in one command line:


{{Term|location=nancy:frontend|cmd=<code class="command">ssh</code> root@<code class="replace">nancy-1</code> "echo 'service networking restart' &#124; <code class="command">at</code> now + 1 minute" && kavlan -s -i <code class="replace">G_ID</code> -m <code class="replace">nancy-1</code>.nancy.grid5000.fr --verbose}}
{{Term|location=nancy:frontend|cmd=<code class="command">ssh</code> root@<code class="replace">nancy-1</code> "echo 'service networking restart' &#124; <code class="command">at</code> now + 1 minute" && kavlan -s -i <code class="replace">G_ID</code> -m <code class="replace">nancy-1</code>.nancy.grid5000.fr --verbose}}
In this way, the restart of the networking service will be scheduled ''before'' changing the VLAN, and actually done ''one minute after''.


Another option would be to use [[Kaconsole]], to access the node via its serial console to run action #2, but that is definitely not scalable to many nodes, so it is to be kept as a last resort option.
Another option would be to use [[Kaconsole]], to access the node via its serial console to run action #2, but that is definitely not scalable to many nodes, so it is to be kept as a last resort option.
Line 182: Line 186:
=== In Rennes (Rennes terminal) ===
=== In Rennes (Rennes terminal) ===
In Rennes, we want:
In Rennes, we want:
# to put <code class="replace">rennes-1</code> and <code class="replace">rennes-2</code> default network interfaces (eth0) in the ''local VLAN'', and have their IP addresses changed accordingly.
# to put <code class="replace">rennes-1</code> and <code class="replace">rennes-2</code> default network interfaces (eth0/eno1 on paravance) in the ''local VLAN'', and have their IP addresses changed accordingly.
# to put <code class="replace">rennes-2</code> secondary network interface (eth1) in the ''global VLAN'' and have the IP address changed accordingly.
# to put <code class="replace">rennes-2</code> secondary network interface (eth1/eno2 on paravance) in the ''global VLAN'' and have the IP address changed accordingly.


That way, <code class="replace">rennes-2</code> will be in-between the ''local'' and the ''global'' VLANs, which is the heart of our setup.
That way, <code class="replace">rennes-2</code> will be in-between the ''local'' and the ''global'' VLANs, which is the heart of our setup.
Line 195: Line 199:




Notice that <code class="replace">rennes-1</code> and <code class="replace">rennes-2</code> are still accessible in SSH via the VLAN's gateway, named "'''kavlan-<code class="replace">L_ID</code>'''":
Notice that <code class="replace">rennes-1</code> and <code class="replace">rennes-2</code> are still accessible via SSH on the VLAN's gateway, named "'''kavlan-<code class="replace">L_ID</code>'''":
{{Term|location=rennes:frontend|cmd=<code class="command">ssh</code> kavlan-<code class="replace">L_ID</code>}}
{{Term|location=rennes:frontend|cmd=<code class="command">ssh</code> kavlan-<code class="replace">L_ID</code>}}
{{Term|location=rennes:kavlan-L_ID|cmd=<code class="command">ssh</code> root@<code class="replace">rennes-1</code>-kavlan-<code class="replace">L_ID</code>}}
{{Term|location=rennes:kavlan-L_ID|cmd=<code class="command">ssh</code> root@<code class="replace">rennes-1</code>-kavlan-<code class="replace">L_ID</code>}}
Line 202: Line 206:
We now have to put <code class="replace">rennes-2</code>'s secondary network interface in the ''global VLAN''. That secondary interface can be referenced with the following hostname: <code class="replace">rennes-2</code>-eth1.
We now have to put <code class="replace">rennes-2</code>'s secondary network interface in the ''global VLAN''. That secondary interface can be referenced with the following hostname: <code class="replace">rennes-2</code>-eth1.


{{Note|text=According to the information of the Grid'5000 API, for paravance nodes, eth1/eno2 is indeed cabled to the switch: see eno2 in https://api.grid5000.fr/sid/sites/rennes/clusters/paravance/nodes/paravance-1.json?pretty.}}


{{Note|text=According to the information of the Grid'5000 API, for paravance nodes, eth1 is indeed cabled to the switch: see eth1 in https://api.grid5000.fr/sid/sites/rennes/clusters/paravance/nodes/paravance-1.json?pretty, or look at the [[Special Features]] page.}}


So, to do that, run the <code class="command">kavlan</code> command:


Since the node is not reachable anymore from the default VLAN, we will use [[Kaconsole]] to restart the network service in that case, after having run the <code class="command">kavlan</code> command:
{{Term|location=rennes:frontend|cmd=<code class="command">kavlan</code> -s -i <code class="replace">G_ID</code> -m <code class="replace">rennes-2</code>-eth1.rennes.grid5000.fr --verbose}}


{{Term|location=rennes:frontend|cmd=<code class="command">kavlan</code> -s -i <code class="replace">G_ID</code> -m <code class="replace">rennes-2</code>-eth1.rennes.grid5000.fr --verbose}}
Since the node is not reachable anymore from the default VLAN, we will have to go through the local VLAN to restart the network service. So let's bounce on the gateway to reach <code class="replace">rennes-2</code>, like this:
{{Term|location=rennes:frontend|cmd=<code class="command">ssh</code> kavlan-<code class="replace">L_ID</code>}}
{{Term|location=kavlan-L_ID|cmd=<code class="command">ssh</code> root@<code class="replace">rennes-2</code>-kavlan-<code class="replace">L_ID</code>}}


Now run:
And finally, configure the interface and program a network restart in one minute:
{{Term|location=rennes:frontend|cmd=<code class="command">kaconsole3</code> -m <code class="replace">rennes-2</code>}}
{{Term|location=rennes-2|cmd=<code class="command">echo</code> -e "\nauto eno2\n iface eno2 inet dhcp" >> /etc/network/interfaces ; ifup eno2 }}
and execute:
Then exit the terminal before the networking service restarts, otherwise the SSH session will be lost and the terminal will freeze.
{{Term|location=console@rennes-2|cmd=<code class="command">echo</code> -e "\nauto eth1\n iface eth1 inet dhcp" >> /etc/network/interfaces ; <code class="command">service</code> networking restart}}




{{Note|text=As eth1 is not configured by default in /etc/network/interfaces, we have to do it before restarting the network, otherwise this interface won't ask for a DHCP lease.}}
{{Note|text=As eno2 is not configured by default in /etc/network/interfaces, we have to do it before restarting the network, otherwise this interface won't ask for a DHCP lease.}}




Line 245: Line 251:
# Try to ping <code class="replace">nancy-1</code>'s IP in the ''global VLAN'' from <code class="replace">rennes-1</code>
# Try to ping <code class="replace">nancy-1</code>'s IP in the ''global VLAN'' from <code class="replace">rennes-1</code>


For that purpose, you will have to find out the hostnames and IP addresses of the machines in the VLANs ! See [[KaVLAN]].
For that purpose, you will have to find out the hostnames and IP addresses of the machines in the VLANs ! See [[KaVLAN]] and the table above.
 
For instance, <code class="replace">rennes-2</code> should be reachable with the hostname <code class="replace">rennes-2</code>-eth1-kavlan-<code class="replace">G_ID</code>.


The last 3 pings between <code class="replace">nancy-1</code> and Rennes' local VLAN (either way) should fail.
The last 3 pings between <code class="replace">nancy-1</code> and Rennes' local VLAN (either way) should fail at this point.


== Setup of the IP routing between the 2 VLANs ==
== Setup of the IP routing between the 2 VLANs ==
Line 256: Line 264:
To enable that, we will activate the IP routing feature of Linux on <code class="replace">rennes-2</code>. This in fact only consist in activating a '''sysctl''', as follows:
To enable that, we will activate the IP routing feature of Linux on <code class="replace">rennes-2</code>. This in fact only consist in activating a '''sysctl''', as follows:


{{Term|location=rennes:node2|cmd=<code class="command">sysctl</code> -w net.ipv4.ip_forward=1}}
{{Term|location=rennes-2|cmd=<code class="command">sysctl</code> -w net.ipv4.ip_forward=1}}


But we also need to add a route on both <code class="replace">rennes-1</code> and <code class="replace">nancy-1</code>, to make them know how to contact each other.
But we also need to add a route on both <code class="replace">rennes-1</code> and <code class="replace">nancy-1</code>, to make them know how to contact each other.


To find out the IP network of the ''local VLAN'' and of the ''global VLAN'', we can either look at the tables in the [[KaVLAN]] page, or examine the IP configuration of the interfaces of <code class="replace">rennes-1</code>, with <code class="command">ifconfig</code> or <code class="command">ip addr</code>.
To find out the IP network of the ''local VLAN'' and of the ''global VLAN'', we can either look at the tables in the [[KaVLAN]] page, or examine the IP configuration of the interfaces of <code class="replace">rennes-1</code>, with <code class="command">ip addr show</code> and <code class="command">ip route show</code>.


Let's have:
Let's have:
* the network for the ''global VLAN'' written <code class="replace">G_NETWORK/G_NETMASK</code>  
* the network for the ''global VLAN'' written <code class="replace">G_NETWORK/G_NETMASK</code>  
* the network for the ''local VLAN'' written <code class="replace">L_NETWORK/L_NETMASK</code>
* the network for the ''local VLAN'' written <code class="replace">L_NETWORK/L_NETMASK</code>
* the IP address of rennes-2 on the ''global VLAN'' written <code class="replace">G_IP_rennes-2</code>
* the IP address of rennes-2 in the ''global VLAN'' written <code class="replace">G_IP_rennes-2</code>
* the IP address of rennes-2 on the ''local VLAN'' written <code class="replace">L_IP_rennes-2</code>
* the IP address of rennes-2 in the ''local VLAN'' written <code class="replace">L_IP_rennes-2</code>


We can add on <code class="replace">rennes-1</code> a route to the ''global VLAN'' as follows:
We can add on <code class="replace">rennes-1</code> a route to the ''global VLAN'' as follows:
Line 274: Line 282:
And on <code class="replace">nancy-1</code> a route to the ''local VLAN'' as follows:
And on <code class="replace">nancy-1</code> a route to the ''local VLAN'' as follows:


{{Term|location=rennes-1|cmd=<code class="command">ip</code> route add <code class="replace">L_NETWORK/L_NETMASK</code> via <code class="replace">G_IP_rennes-2</code>}}
{{Term|location=nancy-1|cmd=<code class="command">ip</code> route add <code class="replace">L_NETWORK/L_NETMASK</code> via <code class="replace">G_IP_rennes-2</code>}}


Routing should now be operational!
Routing should now be operational!
Line 282: Line 290:
== New tests with IP routing enabled==
== New tests with IP routing enabled==


We first check if packets are able to find their way via our ''router''.
First, you can replay the previous tests and check that they all succeed now:
# Try to ssh to <code class="replace">nancy-1</code> in the ''global VLAN'' from the frontend of any Grid'5000 site, then to <code class="replace">rennes-2</code> and finally to <code class="replace">rennes-1</code>
# Try to ssh to the SSH gateway of the ''local VLAN'', then to <code class="replace">rennes-1</code>, then to <code class="replace">rennes-2</code> and finally to <code class="replace">nancy-1</code>
# Try to ping <code class="replace">rennes-2</code>'s IP in the ''global VLAN'' from <code class="replace">nancy-1</code>
# Try to ping <code class="replace">rennes-2</code>'s IP in the ''local VLAN'' from <code class="replace">nancy-1</code>
# Try to ping <code class="replace">rennes-1</code>'s IP in the ''local VLAN'' from <code class="replace">nancy-1</code>
# Try to ping <code class="replace">nancy-1</code>'s IP in the ''global VLAN'' from <code class="replace">rennes-1</code>
 
 
And to go further, we can observe packets with <code class="command">tcpdump</code>.
For instance let's glance at the packets on our ''router''.


We run on node1 and node2 (in Rennes) the following commands:
We run on <code class="replace">rennes-1</code> and <code class="replace">rennes-2</code> the following commands:


{{Term|location=rennes:node2|cmd=<code class="command">tcpdump</code> -i eth0 icmp}}
{{Term|location=rennes-2|cmd=<code class="command">tcpdump</code> -i eno1 icmp}}
(tcpdump is a network traffic sniffer, here filtering ICMP traffic on eth0)
(tcpdump is a network traffic sniffer, here filtering ICMP traffic on eno1)


{{Term|location=rennes:node1|cmd=<code class="command">ping</code> 192.168.1.1}}
{{Term|location=rennes-1|cmd=<code class="command">ping</code> <code class="replace">IP_nancy-1</code>}}


On the first terminal you should see that the ICMP packets are forwarded on node2 :
On the first terminal you should see that the ICMP requests and replies are forwarded on <code class="replace">rennes-2</code> :


<pre>
<pre>
IP 192.168.200.7 > 192.168.1.1: ICMP echo request, id 4270, seq 1, length 64
listening on eno1, link-type EN10MB (Ethernet), capture size 262144 bytes
IP 192.168.1.1 > 192.168.200.7: ICMP echo reply, id 4270, seq 1, length 64
17:08:55.145241 IP 10.27.200.183 > 192.168.200.68: ICMP echo request, id 3498, seq 1, length 64
IP 192.168.200.7 > 192.168.1.1: ICMP echo request, id 4270, seq 2, length 64
17:08:55.145356 IP 192.168.200.68 > 10.27.200.183: ICMP echo reply, id 3498, seq 1, length 64
IP 192.168.1.1 > 192.168.200.7: ICMP echo reply, id 4270, seq 2, length 64
17:08:56.146970 IP 10.27.200.183 > 192.168.200.68: ICMP echo request, id 3498, seq 2, length 64
IP 192.168.200.7 > 192.168.1.1: ICMP echo request, id 4270, seq 3, length 64
17:08:56.147075 IP 192.168.200.68 > 10.27.200.183: ICMP echo reply, id 3498, seq 2, length 64
IP 192.168.1.1 > 192.168.200.7: ICMP echo reply, id 4270, seq 3, length 64
17:08:57.148675 IP 10.27.200.183 > 192.168.200.68: ICMP echo request, id 3498, seq 3, length 64
17:08:57.148782 IP 192.168.200.68 > 10.27.200.183: ICMP echo reply, id 3498, seq 3, length 64
</pre>
</pre>


If ip forwarding was disabled on node2 (see above the sysctl command to disable it), nothing will happen in tcpdump and ping should report errors.
'''If ip forwarding was disabled''' on <code class="replace">rennes-2</code> (see above the <code class="command">sysctl</code> command to disable it with a '0' instead of a '1'), tcpdump will only see ping requests (not ping replies because <code class="replace">rennes-2</code> doesn't transmit anything to <code class="replace">nancy-1</code>), and ping should fail.


We also check the packet route with traceroute from node1:
We also check the packet route with <code class="command">traceroute</code> from <code class="replace">rennes-1</code>:


{{Term|location=rennes:node1|cmd=<code class="command">traceroute</code> 192.168.1.1}}
{{Term|location=rennes-1|cmd=<code class="command">traceroute</code> <code class="replace">IP_nancy-1</code>}}


We should see two hops: the intermediary router and the target:
We should see two hops: the intermediary router and the target:


<pre>
<pre>
traceroute to 192.168.1.1 (192.168.1.1), 30 hops max, 60 byte packets
traceroute to grisou-9-kavlan-16.nancy (10.27.209.9), 30 hops max, 60 byte packets
  1  192.168.200.8 (192.168.200.8)  0.136 ms  0.122 ms  0.115 ms
  1  paranoia-8-eth0-kavlan-1.rennes.grid5000.fr (192.168.194.8)  0.161 ms  0.167 ms  0.148 ms
  2  192.168.1.1 (192.168.1.125.612 ms  25.617 ms  25.611 ms
  2  grisou-9-eth0-kavlan-16.nancy.grid5000.fr (10.27.209.924.852 ms  24.834 ms  24.816 ms
</pre>
</pre>


The first hop is from node1 to node2, and the second one is from node2 to node_nancy.
The first hop is <code class="replace">rennes-2</code>, and the second one is <code class="replace">nancy-1</code>.


With tcpdump we can check the complete isolation of nodes from the production VLAN (and from any other one) :
With <code class="command">tcpdump</code> we can check the complete isolation of nodes from the production VLAN (and from any other one) :


{{Term|location=rennes:node1|cmd=<code class="command">tcpdump</code> port not 22}}
{{Term|location=rennes-1|cmd=<code class="command">tcpdump</code> port not 22}}


On node1 (in a 12 seconds time):
On <code class="replace">rennes-1</code> (in a 12 seconds time):


<pre>
<pre>
14:26:15.107927 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6c:bc.808e, length 43
listening on eno1, link-type EN10MB (Ethernet), capture size 262144 bytes
14:26:17.109436 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6c:bc.808e, length 43
17:26:28.053495 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6d:7c.8110, length 43
14:26:19.108669 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6c:bc.808e, length 43
17:26:30.053550 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6d:7c.8110, length 43
14:26:21.108675 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6c:bc.808e, length 43
17:26:32.053496 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6d:7c.8110, length 43
14:26:23.108669 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6c:bc.808e, length 43
17:26:34.053499 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6d:7c.8110, length 43
14:26:25.108654 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6c:bc.808e, length 43
17:26:36.053565 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6d:7c.8110, length 43
14:26:27.108674 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6c:bc.808e, length 43
17:26:38.053505 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6d:7c.8110, length 43
17:26:40.053525 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6d:7c.8110, length 43
</pre>
</pre>


The only frames received are spanning tree frames, from the switch.
The only frames received are ethernet-level management frames sent by the switches (STP frames here, but we could also see LLDP frames, or other ones)


In comparison, this is what can be captured on a node in the production VLAN, in only a one second time:
In comparison, this is what can be captured on a node in the production VLAN, in only a one second time:
Line 352: Line 372:


Here we see ARP requests, DNS messages, multicast reports…
Here we see ARP requests, DNS messages, multicast reports…
{{Note|text=As you can see, in these <code class="command">tcpdump</code> commands we '''filtered SSH packets''', because we were watching the results via SSH. This is '''something you always want to be cautious with''', because every output line generates an SSH packet to bring it back to you, this packet will appear in a new output line, etc. If the traffic you are watching is dense and you run <code class="command">tcpdump</code> remotely without filtering SSH packets, the network can rapidly be saturated, making your session freeze.}}


= Bridging VLANs with OpenVSwitch =
= Bridging VLANs with OpenVSwitch =
[[File:TP-Kavlan-School2016-bridge.png|thumb|right|300px|Topology with a software switch]]
[[File:TP-Kavlan-School2016-bridge.png|thumb|right|300px|Topology with a software switch]]
== Setup ==
== Setup ==
Our goal is now to allow <code class="replace">nancy-1</code> and <code class="replace">rennes-1</code> to communicate directly at the Ethernet layer, with no IP routing in between. Only 1 IP network must be used, so lets choose the one of the ''global VLAN'', and forget about the one of the ''local VLAN''.
Our goal is now to allow <code class="replace">nancy-1</code> and <code class="replace">rennes-1</code> to communicate directly at the Ethernet layer, with no IP routing in between. In this configuration, only one IP subnet (range) must be used, so let's chose the ''global VLAN'' one, and forget about the ''local VLAN'' one.


First, we need to unconfigure what we did in the IP routing setup paragraph:
First, we need to unset what we configured when we were doing IP routing:
# we remove the extra IP route to <code class="replace">L_NETWORK</code> on <code class="replace">nancy-1</code>, replacing '''add''' by '''del''' in the <code class="command">ip</code> command.
# remove the extra IP route to <code class="replace">L_NETWORK</code> on <code class="replace">nancy-1</code>
# we deactivate the IP routing functionality on <code class="replace">rennes-2</code> (see the <code class="command">sysctl</code> command: set to ''ip_forward'' to '''0''' instead of '''1'''),
# deactivate the IP routing functionality on <code class="replace">rennes-2</code>
# stop the DHCP service in the ''local VLAN'' (so that it does not confict with the one of the ''global VLAN'')
# stop the DHCP service in the ''local VLAN'' (so that it does not conflict with the ''global VLAN'' one)
# unconfigure both network interfaces of <code class="replace">rennes-2</code>, and setup OpenVSwitch within a [[Kaconsole]] session.
# unconfigure both network interfaces of <code class="replace">rennes-2</code>, and setup OpenVSwitch within a [[Kaconsole]] session.
# restart the networking service of <code class="replace">rennes-1</code> !
# restart the networking service of <code class="replace">rennes-1</code> !


{{Note|text=Bridging could be achieved using Linux bridge driver and the <code class="command">brctl</code> companion command, but here we prefer OpenVSwitch which provides enhanced functionalities}}
Let's detail a little…
 
; action #1:
 
Just replace '''add''' by '''del''' in the <code class="command">ip route</code> command.
 
; action #2:
 
in the <code class="command">sysctl</code> command, set ''ip_forward'' to '''0''' instead of '''1'''


; Let's detail action #3:
; action #3:
{{Term|location=rennes:frontend|cmd=<code class="command">kavlan</code> -d -i <code class="replace">L_ID</code>}}
{{Term|location=rennes:frontend|cmd=<code class="command">kavlan</code> -d -i <code class="replace">L_ID</code> --verbose}}


; then action #4:  
; action #4:  
Run [[Kaconsole]]
Run [[Kaconsole]]
{{Term|location=rennes:frontend|cmd=<code class="command">kaconsole3</code> -m  <code class="replace">rennes-2</code>}}
{{Term|location=rennes:frontend|cmd=<code class="command">kaconsole3</code> -m  <code class="replace">rennes-2</code>}}
Login with root / grid5000.
Unconfigure the IP addresses of the network interfaces:
Unconfigure the IP addresses of the network interfaces:
{{Term|location=console@rennes-2|cmd=<code class="command">ifconfig</code> eth0 0 ; <code class="command">ifconfig</code> eth1 0}}
{{Term|location=console@rennes-2|cmd=<code class="command">ip</code> addr del <code class="replace">L_NETWORK</code>/<code class="replace">L_NETMASK</code> dev eno1}}
{{Term|location=console@rennes-2|cmd=<code class="command">ip</code> addr del <code class="replace">G_NETWORK</code>/<code class="replace">G_NETMASK</code> dev eno2}}
Create the bridge:
Create the bridge:
{{Term|location=console@rennes-2|cmd=<code class="command">ovs-vsctl</code> add-br OVSbr}}
{{Term|location=console@rennes-2|cmd=<code class="command">ovs-vsctl</code> add-br OVSbr}}
Add eth0 and eth1 to the bridge:
Add eno1 and eno2 to the bridge:
{{Term|location=console@rennes-2|cmd=<code class="command">ovs-vsctl</code> add-port OVSbr eth0}}
{{Term|location=console@rennes-2|cmd=<code class="command">ovs-vsctl</code> add-port OVSbr eno1}}
{{Term|location=console@rennes-2|cmd=<code class="command">ovs-vsctl</code> add-port OVSbr eth1}}
{{Term|location=console@rennes-2|cmd=<code class="command">ovs-vsctl</code> add-port OVSbr eno2}}
 
You can list all the ports in the bridge with:
{{Term|location=rennes:rennes-2|cmd=<code class="command">ovs-ofctl</code> show OVSbr}}


; and finally action #5:
; and finally action #5:
Line 393: Line 430:


== Tests ==
== Tests ==
Now that it is done:
Now that everything is configured:
* You should be able to ping <code class="replace">rennes-1</code> with its new IP from <code class="replace">nancy-1</code>.  
* You should be able to ping <code class="replace">rennes-1</code> with its new IP from <code class="replace">nancy-1</code>.  
* Also if you run traceroute, you should notice that there is only one hop between the 2 nodes now.
* Also if you run <code class="command">traceroute</code>, you should notice that there is only one hop between the 2 nodes now.
 
{{Note|text=Bridging could be achieved using Linux bridge driver and the <code class="command">brctl</code> companion command, but here we prefer to use OpenVSwitch which provides enhanced functionalities}}


== Flow control ==
== Flow control ==
You can use openVSwitch to manage flows, for example you can DROP all packets from an IP with this command:
You can use openVSwitch to manage flows, for example you can DROP all pings to a specific IP from a specific interface:
 
{{Term|location=rennes:rennes-2|cmd=<code class="command">ovs-ofctl</code> add-flow OVSbr "in_port=1,ip,nw_src=172.20.0.1,actions=drop"}}
 
"in_port=1" refers to eth0 in our case. You can list all the ports in the bridge with:


{{Term|location=rennes:rennes-2|cmd=<code class="command">ovs-ifctl</code> show OVSbr}}
{{Term|location=rennes:rennes-2|cmd=<code class="command">ovs-ofctl</code> add-flow OVSbr "in_port=1,icmp,nw_src=<code class="replace">G_IP_rennes-1</code>,nw_proto=1,actions=drop"}}
As it can be checked with the "<code class="command">ovs-ofctl</code> show" command, "in_port=1" refers to eno1 in our case.
Now, from <code class="replace">rennes-1</code>, try to ping <code class="replace">nancy-1</code> and check that it doesn't work, but make sure you can still access it via SSH.


You can also DROP all pings to a specific IP from a specific port:
You can also DROP all packets from an IP with this command:


{{Term|location=rennes:rennes-2|cmd=<code class="command">ovs-ofctl</code> add-flow OVSbr "in_port=2,ip,nw_dst=192.168.1.1,nw_proto=1,actions=drop"}}
{{Term|location=rennes:rennes-2|cmd=<code class="command">ovs-ofctl</code> add-flow OVSbr "in_port=1,ip,nw_src=<code class="replace">G_IP_rennes-1</code>,actions=drop"}}
Now make sure you can't SSH from <code class="replace">rennes-1</code> to <code class="replace">nancy-1</code> anymore


And the following command will display all your flow rules :  
Additionally, the following command will display all your flow rules :  


{{Term|location=rennes:rennes-2|cmd=<code class="command">ovs-ofctl</code> dump-flows OVSbr}}
{{Term|location=rennes:rennes-2|cmd=<code class="command">ovs-ofctl</code> dump-flows OVSbr}}
Line 421: Line 459:


If you finished the tutorial, please have a look at the [[TopoMaker]] page, and beta test !
If you finished the tutorial, please have a look at the [[TopoMaker]] page, and beta test !
= Conclusions =
At this stage, you are now ready to perform SDN (Software Defined Networking) and NFV (Network Function Virtualization) research on Grid'5000. You could:
* configure virtual network overlays using VXLAN on your OpenVSwitch instances
* set up an OpenFlow controller such as POX or NOX to control your OpenVSwitch switches
* set up network functions on reserved nodes (firewalls, load balancers)
Also note that Nancy has nodes with four 10Gbps network interfaces (grisou), which might be useful for more complex setups.

Latest revision as of 12:29, 31 March 2022

Note.png Note

This page is actively maintained by the Grid'5000 team. If you encounter problems, please report them (see the Support page). Additionally, as it is a wiki page, you are free to make minor corrections yourself if needed. If you would like to suggest a more fundamental change, please contact the Grid'5000 team.

Introduction

This tutorial presents a example of usage of Grid'5000 which benefits from the capability of reconfiguration of the network topology of the platform, using KaVLAN.

KaVLAN is a Grid'5000 tool, which allows a user to manage VLANs in the platform. That mechanism can provide a layer 2 network isolation for experiments.

This tutorial will only use 3 nodes, but could be adapted for a usage at scale, with hundreds of nodes.

Three kinds of KaVLAN VLANs are available in Grid'5000. You can find more information in the KaVLAN page. In this tutorial, we will only use global and local VLANs (but no routed VLAN).

Before going further with the practical setup proposed in this page, it is strongly advised to read the KaVLAN page.

Topology setup

In this first part of the tutorial, we setup a topology, as presented in the following picture.

TP-Kavlan-School2016.png

Nodes nancy-1 and rennes-2 are placed in a same "global" (inter-site) vlan, and rennes-2 is also placed in a same vlan as rennes-1 ("local" vlan).

Reservations of the resources

First of all we will create jobs to reserve:

  • 2 nodes providing at least 2 network interfaces each, on one site
  • 1 local VLAN on the same site
  • 1 global VLAN
  • 1 node on another site

In the text below, we use the sites of Rennes and Nancy (but other sites could be used, see below):

  • Rennes for the 2 nodes with several network interfaces, and the local VLAN.
  • Nancy for the other node

(we will discuss later about the global VLAN).

Consequently, practically speaking, you are advised to open 2 terminal windows (2 separate shells) for the tutorial, one for Rennes and one for Nancy.

In Rennes (Rennes terminal)

In Rennes, we have to reserve 1 local VLAN, and 2 nodes for instance of the paravance cluster, because that cluster is composed of nodes with 2 network interfaces (in order to find other Grid'5000 clusters providing such a characteristic, you can go to the Hardware page).

We also try to reserve the global VLAN in Rennes as well.

Note.png Note

Please mind the fact that a global VLAN spreads all over Grid'5000, so we only need to reserve it on one site.


Since we have to work on 2 different sites, we will use OAR advance reservations in order to have 2 jobs that start and end at the same dates. Let's define STARTDATE as the start date (e.g. "2016-01-21 14:00:00", don't worry if it is in the past for a few minutes), and WALLTIME as the duration of the job (e.g. the duration for the tutorial).

Once logged to Rennes' frontend in our first terminal, we execute:

Terminal.png rennes:frontend:
oarsub -t deploy -l {"type='kavlan-global'"}/vlan=1+{"type='kavlan-local'"}/vlan=1+{paravance}/nodes=2,walltime=WALLTIME -r "STARTDATE"
Note.png Note

To use KaVLAN, a job of type deploy (kadeploy) is mandatory


Unfortunately, that command may fail (return KO):

  1. either because there are no more paravance nodes available for the time of the tutorial, or no more local VLAN: that should not be too likely to happen
  2. or because the global VLAN of Rennes is already taken: that is very likely to happen during a common tutorial session with many students !

In either case, one will have to try to use another site than Rennes, or try again later.

If you face issue #1

Just replace Rennes by another site which has nodes with several interfaces (see the Hardware page to find such a site).

If you face issue #2

Only reserve 1 local VLAN, and 2 nodes of the paravance cluster, but keep the global VLAN reservation for the other site (to simplify the writing of this tutorial, we assume that the global VLAN of Nancy is available, but any other site with a free global VLAN could do).

So, if the previous command fails because Rennes' global VLAN is not available, only execute:

Terminal.png rennes:frontend:
oarsub -t deploy -l {"type='kavlan-local'"}/vlan=1+{paravance}/nodes=2,walltime=WALLTIME -r "STARTDATE"

(and do not forget to reserve a global VLAN in the other site, as you will be proposed in the next section about the reservation in Nancy).

Either way, a successful oarsub command should have returned a JOBID.

Let's call our 2 nodes in Rennes rennes-1 and rennes-2 (e.g. you might have rennes-1 = paravance-23).


Now, let's retrieve the ID of our VLANs (local + global, or just the local one if the global VLAN is reserved elsewhere):

Terminal.png rennes:frontend:
kavlan -V -j JOBID

You can know which VLAN ID is global and which is local, using the following table:

KaVLAN name in OAR type first id last id
kavlan-local local 1 3
kavlan routed 4 9
kavlan-global global 10 21

(see KaVLAN for more details)

In Nancy (Nancy terminal)

If we managed to reserve the global VLAN in Rennes, we just need to reserve a single node in Nancy. In our second terminal, we log to Nancy's frontend, and execute:

Terminal.png nancy:frontend:
oarsub -t deploy -l nodes=1,walltime=WALLTIME -r "STARTDATE"

If not, we also reserve Nancy's global VLAN. We execute:

Terminal.png nancy:frontend:
oarsub -t deploy -l {"type='kavlan-global'"}/vlan=1+nodes=1,walltime=WALLTIME -r "STARTDATE"

Again, a successful oarsub command should have returned a JOBID (which of course may not be the same as the job ID in Rennes).

If the oarsub fails (returns "KO"), that means that you probably should try to use yet another site than Nancy, where the global VLAN would be available.

Let's call our node in Nancy nancy-1 (e.g. nancy-1 = graphite-3).

You can now look at which VLANs you get, as above.

Terminal.png nancy:frontend:
kavlan -V -j JOBID

Deployment of our operating system for the experiment

In Rennes (Rennes terminal)

Back on the Rennes terminal, we now enter our job in order to benefit from OAR environment variable $OAR_NODEFILE.

Terminal.png rennes:frontend:
oarsub -C JOBID

(JOBID is the one of the job of Rennes of course)

Then we launch the deployment of the Debian minimal environment to the nodes:

Terminal.png rennes:frontend:
kadeploy3 -e debian11-x64-min -k -f $OAR_NODEFILE
Note.png Note

We choose to not use the --vlan VLAN-ID option of kadeploy3 here, because we want to tweak VLANs ourselves.

In order to be capable of doing ssh from one node to the other, we push our SSH private key to the nodes.

Terminal.png rennes:frontend:
for node in $(uniq $OAR_NODEFILE); do scp ~/.ssh/id_rsa root@$node:.ssh/; done


Note.png Note

When the access to a node is broken due to an error in a network setup for instance, the Kaconsole tool becomes very handy: it allows to connect to the serial console of a node, quite the same way you would connect to a virtual console of a GNU/Linux workstation (CTRL+ALT+F1), to fix the network configuration for instance.

Terminal.png rennes:frontend:
kaconsole3 -m rennes-1
Credentials for login on the console with our deployed environment is "root":"grid5000".

In Nancy (Nancy terminal)

We run from Nancy's frontend the same commands as in Rennes, but use the JOBID of Nancy of course.

Installation of useful extra packages for networking

In this tutorial we are going to need some extra packages which are not installed in debian11-min environment, such as tcpdump. Since we are going to modify the network configurations making the Internet unreachable from the nodes, we must install them beforehand.

Besides, you may wish to play with (or debug) your network configuration with other tools like ethtool, dig, or traceroute as well. If you want to use them later you need to install them now too.

In Nancy (Nancy terminal)

Terminal.png nancy:frontend:
ssh root@nancy-1 "apt-get update; apt-get install --yes at tcpdump dnsutils traceroute ethtool ipcalc"

In Rennes (Rennes terminal)

In Rennes, since we reserved several nodes (hey, 2 is more than 1! :-) ), we will be using TakTuk for speeding up the process:

Terminal.png rennes:frontend:
taktuk -s -l root -f $OAR_FILE_NODES broadcast exec [ "apt-get update; apt-get --yes install at tcpdump dnsutils traceroute ethtool ipcalc" ]

In addition, we will need OpenVSwitch on rennes-2 in the second part of the tutorial, so we install it too:

Terminal.png rennes:frontend:
ssh root@rennes-2 "apt-get install --yes openvswitch-switch"

Setup of the network topology configuration

Thanks to KaVLAN, we will have a layer 2 network isolation for our nodes. That means that whatever IP configuration we set in our VLAN, there will be no risk to break the network configuration in the default VLAN or to disturb other users (for instance by misusing reserved IP addresses). We can even setup network services or protocols based on IP broadcast for instance (e.g. DHCP) with no risk of polluting the default VLAN (or conflicting with the platform's default VLAN DHCP service).

However, we must keep in mind that:

  • We must be very careful to what VLAN the node is in at any time. Indeed, a mistake could quickly lead to some changes to the nodes network configuration while the node is actually still in the default VLAN, which could impact other users/experiments.
  • While it can be interesting to manage by ourselves the IP addressing of the nodes in our VLAN, Grid'5000 provides predefined IP addresses for the nodes within any VLAN, which can be very handy (see the KaVLAN page), especially because they are served by a DHCP service within each VLAN. So both options are possible, and one has to weight the pros and the cons, depending on the actual need.
  • Setting up our own IP addressing mechanism can actually be seen as an "expert mode", to which the kavlan command provides some support as well, allowing to disable the DHCP service in a VLANs if required. Please really mind what you do when in that expert mode.
  • If not using the IP addresses which are provided by Grid'5000, routing to Global VLANs and Routed VLANs will not work, and neither will SSH gateways to local VLANs. However, if all or some of those feature are not useful for your experiment, it is not harmful.

That said, for this tutorial and up until the openvswitch section, we will use the predefined IP addresses provided by Grid'5000. This will be very handy thanks to the DHCP services which are provided in the VLANs.

Furthermore, for the ease of reading this document, let be:

  • G_ID = the ID of the global VLAN we reserved
  • L_ID = the ID of the local VLAN we reserved

In Nancy (Nancy terminal)

nancy-1 will be put in the global VLAN. Once in that VLAN, it will be able to ask for an IP address to the DHCP server of the global VLAN, which will answer the IP corresponding to nancy-1-kavlan-G_ID. See the table in the KaVLAN page to find out the corresponding IP address, or run:

Terminal.png nancy:frontend:
host nancy-1-kavlan-G_ID

The mechanism we have to trigger is the following:

  1. Change the VLAN of the node
  2. Restart the network service

Since our node won't be reachable with its old IP address (the one for the default VLAN) once put in the global VLAN, we won't be able to run action #2 after running action #1 as-is. We will use a trick to create a trigger before action #1, in order to run action #2 asynchronously. That can be done using the at command, for instance. Let's do that in one command line:

Terminal.png nancy:frontend:
ssh root@nancy-1 "echo 'service networking restart' | at now + 1 minute" && kavlan -s -i G_ID -m nancy-1.nancy.grid5000.fr --verbose

In this way, the restart of the networking service will be scheduled before changing the VLAN, and actually done one minute after.

Another option would be to use Kaconsole, to access the node via its serial console to run action #2, but that is definitely not scalable to many nodes, so it is to be kept as a last resort option.


Note.png Note

Feel free to make different tests, with at or with Kaconsole. To start over, you can put the node back in the default VLAN with the following command:

Terminal.png nancy:frontend:
kavlan -s -i DEFAULT -m nancy-1.nancy.grid5000.fr --verbose

In Rennes (Rennes terminal)

In Rennes, we want:

  1. to put rennes-1 and rennes-2 default network interfaces (eth0/eno1 on paravance) in the local VLAN, and have their IP addresses changed accordingly.
  2. to put rennes-2 secondary network interface (eth1/eno2 on paravance) in the global VLAN and have the IP address changed accordingly.

That way, rennes-2 will be in-between the local and the global VLANs, which is the heart of our setup.

For the first action, we do the same as for Nancy's node, but with the local VLAN instead of the global VLAN, and using taktuk to benefit from a parallel execution on the 2 nodes:

Terminal.png rennes:frontend:
taktuk -s -l root -f $OAR_NODEFILE broadcast exec [ "echo 'service networking restart' | at now + 1 minute" ] && kavlan -s -i L_ID -m rennes-1.rennes.grid5000.fr -m rennes-2.rennes.grid5000.fr --verbose


Note.png Note

unlike kadeploy, kavlan requires a full hostname for the -m parameter.


Notice that rennes-1 and rennes-2 are still accessible via SSH on the VLAN's gateway, named "kavlan-L_ID":

Terminal.png rennes:frontend:
ssh kavlan-L_ID
Terminal.png rennes:kavlan-L_ID:
ssh root@rennes-1-kavlan-L_ID


We now have to put rennes-2's secondary network interface in the global VLAN. That secondary interface can be referenced with the following hostname: rennes-2-eth1.

Note.png Note

According to the information of the Grid'5000 API, for paravance nodes, eth1/eno2 is indeed cabled to the switch: see eno2 in https://api.grid5000.fr/sid/sites/rennes/clusters/paravance/nodes/paravance-1.json?pretty.


So, to do that, run the kavlan command:

Terminal.png rennes:frontend:
kavlan -s -i G_ID -m rennes-2-eth1.rennes.grid5000.fr --verbose

Since the node is not reachable anymore from the default VLAN, we will have to go through the local VLAN to restart the network service. So let's bounce on the gateway to reach rennes-2, like this:

Terminal.png rennes:frontend:
ssh kavlan-L_ID
Terminal.png kavlan-L_ID:
ssh root@rennes-2-kavlan-L_ID

And finally, configure the interface and program a network restart in one minute:

Terminal.png rennes-2:
echo -e "\nauto eno2\n iface eno2 inet dhcp" >> /etc/network/interfaces ; ifup eno2

Then exit the terminal before the networking service restarts, otherwise the SSH session will be lost and the terminal will freeze.


Note.png Note

As eno2 is not configured by default in /etc/network/interfaces, we have to do it before restarting the network, otherwise this interface won't ask for a DHCP lease.


We are now done with the VLAN setup.

TP-Kavlan-School2016-lineaire.png


The following table summarizes the hostnames of the machines and the way to access them, depending on the VLAN they are in:

nancy-1 rennes-2 rennes-1 access from default VLAN
default VLAN nancy-1 rennes-2 rennes-1
local VLAN rennes-2-kavlan-L_ID rennes-1-kavlan-L_ID via SSH gateway: kavlan-L_ID in the local site
global VLAN nancy-1-kavlan-G_ID rennes-2-eth1-kavlan-G_ID routed from any site

First tests of the topology

We can now run some basic tests:

  1. Try to ssh to nancy-1 in the global VLAN from the frontend of any Grid'5000 site, then to rennes-2 and finally to rennes-1
  2. Try to ssh to the SSH gateway of the local VLAN, then to rennes-1, then to rennes-2 and finally to nancy-1
  3. Try to ping rennes-2's IP in the global VLAN from nancy-1
  4. Try to ping rennes-2's IP in the local VLAN from nancy-1
  5. Try to ping rennes-1's IP in the local VLAN from nancy-1
  6. Try to ping nancy-1's IP in the global VLAN from rennes-1

For that purpose, you will have to find out the hostnames and IP addresses of the machines in the VLANs ! See KaVLAN and the table above.

For instance, rennes-2 should be reachable with the hostname rennes-2-eth1-kavlan-G_ID.

The last 3 pings between nancy-1 and Rennes' local VLAN (either way) should fail at this point.

Setup of the IP routing between the 2 VLANs

Topology with a software router

rennes-1 and nancy-1 are in separate VLANs, so no IP networking is possible directly from rennes-1 to nancy-1 (or the way around).

To enable that, we will activate the IP routing feature of Linux on rennes-2. This in fact only consist in activating a sysctl, as follows:

Terminal.png rennes-2:
sysctl -w net.ipv4.ip_forward=1

But we also need to add a route on both rennes-1 and nancy-1, to make them know how to contact each other.

To find out the IP network of the local VLAN and of the global VLAN, we can either look at the tables in the KaVLAN page, or examine the IP configuration of the interfaces of rennes-1, with ip addr show and ip route show.

Let's have:

  • the network for the global VLAN written G_NETWORK/G_NETMASK
  • the network for the local VLAN written L_NETWORK/L_NETMASK
  • the IP address of rennes-2 in the global VLAN written G_IP_rennes-2
  • the IP address of rennes-2 in the local VLAN written L_IP_rennes-2

We can add on rennes-1 a route to the global VLAN as follows:

Terminal.png rennes-1:
ip route add G_NETWORK/G_NETMASK via L_IP_rennes-2

And on nancy-1 a route to the local VLAN as follows:

Terminal.png nancy-1:
ip route add L_NETWORK/L_NETMASK via G_IP_rennes-2

Routing should now be operational!

TP-Kavlan-School2016-lineaire-router.png

New tests with IP routing enabled

First, you can replay the previous tests and check that they all succeed now:

  1. Try to ssh to nancy-1 in the global VLAN from the frontend of any Grid'5000 site, then to rennes-2 and finally to rennes-1
  2. Try to ssh to the SSH gateway of the local VLAN, then to rennes-1, then to rennes-2 and finally to nancy-1
  3. Try to ping rennes-2's IP in the global VLAN from nancy-1
  4. Try to ping rennes-2's IP in the local VLAN from nancy-1
  5. Try to ping rennes-1's IP in the local VLAN from nancy-1
  6. Try to ping nancy-1's IP in the global VLAN from rennes-1


And to go further, we can observe packets with tcpdump. For instance let's glance at the packets on our router.

We run on rennes-1 and rennes-2 the following commands:

Terminal.png rennes-2:
tcpdump -i eno1 icmp

(tcpdump is a network traffic sniffer, here filtering ICMP traffic on eno1)

Terminal.png rennes-1:
ping IP_nancy-1

On the first terminal you should see that the ICMP requests and replies are forwarded on rennes-2 :

listening on eno1, link-type EN10MB (Ethernet), capture size 262144 bytes
17:08:55.145241 IP 10.27.200.183 > 192.168.200.68: ICMP echo request, id 3498, seq 1, length 64
17:08:55.145356 IP 192.168.200.68 > 10.27.200.183: ICMP echo reply, id 3498, seq 1, length 64
17:08:56.146970 IP 10.27.200.183 > 192.168.200.68: ICMP echo request, id 3498, seq 2, length 64
17:08:56.147075 IP 192.168.200.68 > 10.27.200.183: ICMP echo reply, id 3498, seq 2, length 64
17:08:57.148675 IP 10.27.200.183 > 192.168.200.68: ICMP echo request, id 3498, seq 3, length 64
17:08:57.148782 IP 192.168.200.68 > 10.27.200.183: ICMP echo reply, id 3498, seq 3, length 64

If ip forwarding was disabled on rennes-2 (see above the sysctl command to disable it with a '0' instead of a '1'), tcpdump will only see ping requests (not ping replies because rennes-2 doesn't transmit anything to nancy-1), and ping should fail.

We also check the packet route with traceroute from rennes-1:

Terminal.png rennes-1:
traceroute IP_nancy-1

We should see two hops: the intermediary router and the target:

traceroute to grisou-9-kavlan-16.nancy (10.27.209.9), 30 hops max, 60 byte packets
 1  paranoia-8-eth0-kavlan-1.rennes.grid5000.fr (192.168.194.8)  0.161 ms  0.167 ms  0.148 ms
 2  grisou-9-eth0-kavlan-16.nancy.grid5000.fr (10.27.209.9)  24.852 ms  24.834 ms  24.816 ms

The first hop is rennes-2, and the second one is nancy-1.

With tcpdump we can check the complete isolation of nodes from the production VLAN (and from any other one) :

Terminal.png rennes-1:
tcpdump port not 22

On rennes-1 (in a 12 seconds time):

listening on eno1, link-type EN10MB (Ethernet), capture size 262144 bytes
17:26:28.053495 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6d:7c.8110, length 43
17:26:30.053550 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6d:7c.8110, length 43
17:26:32.053496 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6d:7c.8110, length 43
17:26:34.053499 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6d:7c.8110, length 43
17:26:36.053565 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6d:7c.8110, length 43
17:26:38.053505 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6d:7c.8110, length 43
17:26:40.053525 STP 802.1w, Rapid STP, Flags [Learn, Forward], bridge-id 82bd.8c:60:4f:47:6d:7c.8110, length 43

The only frames received are ethernet-level management frames sent by the switches (STP frames here, but we could also see LLDP frames, or other ones)

In comparison, this is what can be captured on a node in the production VLAN, in only a one second time:

14:27:43.920934 IP paravance-60.rennes.grid5000.fr.38784 > dns.rennes.grid5000.fr.domain: 65121+ PTR? 5.98.16.172.in-addr.arpa. (42)
14:27:43.921384 IP dns.rennes.grid5000.fr.domain > paravance-60.rennes.grid5000.fr.38784: 65121* 1/1/0 PTR parapide-5.rennes.grid5000.fr. (103)
14:27:43.921510 IP paravance-60.rennes.grid5000.fr.49250 > dns.rennes.grid5000.fr.domain: 48890+ PTR? 111.111.16.172.in-addr.arpa. (45)
14:27:43.921816 IP dns.rennes.grid5000.fr.domain > paravance-60.rennes.grid5000.fr.49250: 48890* 1/1/0 PTR kadeploy.rennes.grid5000.fr. (104)
14:27:44.017208 ARP, Request who-has parapide-5.rennes.grid5000.fr tell dns.rennes.grid5000.fr, length 46
14:27:44.201278 IP6 fe80::214:4fff:feca:9470 > ff02::16: HBH ICMP6, multicast listener report v2, 1 group record(s), length 28
14:27:44.201416 IP paravance-60.rennes.grid5000.fr.34416 > dns.rennes.grid5000.fr.domain: 7912+ PTR? 6.1.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.2.0.f.f.ip6.arpa. (90)
14:27:44.284641 ARP, Request who-has parapide-9.rennes.grid5000.fr tell kadeploy.rennes.grid5000.fr, length 46
14:27:44.307171 ARP, Request who-has parapide-5.rennes.grid5000.fr tell metroflux.rennes.grid5000.fr, length 46
14:27:44.398978 IP dns.rennes.grid5000.fr.domain > paravance-60.rennes.grid5000.fr.34416: 7912 NXDomain 0/1/0 (160)

Here we see ARP requests, DNS messages, multicast reports…

Note.png Note

As you can see, in these tcpdump commands we filtered SSH packets, because we were watching the results via SSH. This is something you always want to be cautious with, because every output line generates an SSH packet to bring it back to you, this packet will appear in a new output line, etc. If the traffic you are watching is dense and you run tcpdump remotely without filtering SSH packets, the network can rapidly be saturated, making your session freeze.

Bridging VLANs with OpenVSwitch

Topology with a software switch

Setup

Our goal is now to allow nancy-1 and rennes-1 to communicate directly at the Ethernet layer, with no IP routing in between. In this configuration, only one IP subnet (range) must be used, so let's chose the global VLAN one, and forget about the local VLAN one.

First, we need to unset what we configured when we were doing IP routing:

  1. remove the extra IP route to L_NETWORK on nancy-1
  2. deactivate the IP routing functionality on rennes-2
  3. stop the DHCP service in the local VLAN (so that it does not conflict with the global VLAN one)
  4. unconfigure both network interfaces of rennes-2, and setup OpenVSwitch within a Kaconsole session.
  5. restart the networking service of rennes-1 !

Let's detail a little…

action #1

Just replace add by del in the ip route command.

action #2

in the sysctl command, set ip_forward to 0 instead of 1

action #3
Terminal.png rennes:frontend:
kavlan -d -i L_ID --verbose
action #4

Run Kaconsole

Terminal.png rennes:frontend:
kaconsole3 -m rennes-2

Login with root / grid5000.

Unconfigure the IP addresses of the network interfaces:

Terminal.png console@rennes-2:
ip addr del L_NETWORK/L_NETMASK dev eno1
Terminal.png console@rennes-2:
ip addr del G_NETWORK/G_NETMASK dev eno2

Create the bridge:

Terminal.png console@rennes-2:
ovs-vsctl add-br OVSbr

Add eno1 and eno2 to the bridge:

Terminal.png console@rennes-2:
ovs-vsctl add-port OVSbr eno1
Terminal.png console@rennes-2:
ovs-vsctl add-port OVSbr eno2

You can list all the ports in the bridge with:

Terminal.png rennes:rennes-2:
ovs-ofctl show OVSbr
and finally action #5

From kaconsole on rennes-1:

Terminal.png console@rennes-1:
service networking restart
We are done

rennes-1 should now have an IP of the global VLAN, while still in the local VLAN, with the associated DNS hostname: rennes-1-kavlan-G_ID !

TP-Kavlan-School2016-lineaire-bridge.png

Note.png Note

Please note that it would also be possible to use the IP subnet of Local VLAN, but in that case we would have to set the IP address of nancy-1 statically because the DHCP service of the Local VLAN would not provide an IP to it.

Tests

Now that everything is configured:

  • You should be able to ping rennes-1 with its new IP from nancy-1.
  • Also if you run traceroute, you should notice that there is only one hop between the 2 nodes now.
Note.png Note

Bridging could be achieved using Linux bridge driver and the brctl companion command, but here we prefer to use OpenVSwitch which provides enhanced functionalities

Flow control

You can use openVSwitch to manage flows, for example you can DROP all pings to a specific IP from a specific interface:

Terminal.png rennes:rennes-2:
ovs-ofctl add-flow OVSbr "in_port=1,icmp,nw_src=G_IP_rennes-1,nw_proto=1,actions=drop"

As it can be checked with the "ovs-ofctl show" command, "in_port=1" refers to eno1 in our case. Now, from rennes-1, try to ping nancy-1 and check that it doesn't work, but make sure you can still access it via SSH.

You can also DROP all packets from an IP with this command:

Terminal.png rennes:rennes-2:
ovs-ofctl add-flow OVSbr "in_port=1,ip,nw_src=G_IP_rennes-1,actions=drop"

Now make sure you can't SSH from rennes-1 to nancy-1 anymore

Additionally, the following command will display all your flow rules :

Terminal.png rennes:rennes-2:
ovs-ofctl dump-flows OVSbr
Note.png Note

If you want to know more about flow syntax go to this man page and look for the "Flow Syntax" paragraph

Bonus: beta testing the TopoMaker tool!

The development of a new tool was recently started to automate most of the previous reconfiguration steps.

If you finished the tutorial, please have a look at the TopoMaker page, and beta test !

Conclusions

At this stage, you are now ready to perform SDN (Software Defined Networking) and NFV (Network Function Virtualization) research on Grid'5000. You could:

  • configure virtual network overlays using VXLAN on your OpenVSwitch instances
  • set up an OpenFlow controller such as POX or NOX to control your OpenVSwitch switches
  • set up network functions on reserved nodes (firewalls, load balancers)

Also note that Nancy has nodes with four 10Gbps network interfaces (grisou), which might be useful for more complex setups.