Grid5000:School2011 Program: Difference between revisions

From Grid5000
Jump to navigation Jump to search
(New page: {{Author|David Margery}} {{Maintainer|David Margery}} {{Status|Draft}} {{See also|Spring School 2011 main page | [[Grid5000:School2011 Program|Spring School 20...)
 
 
(70 intermediate revisions by 9 users not shown)
Line 1: Line 1:
{{Author|David Margery}}
{{Author|David Margery}}
{{Maintainer|David Margery}}
{{Maintainer|David Margery}}
{{Status|Draft}}
{{Status|In Production}}
{{See also|[[Grid5000:School2011|Spring School 2011 main page]] | [[Grid5000:School2011 Program|Spring School 2011 program]] | [[Grid5000:School2011 TPs|Spring School 2011 practical sessions]] |  [https://helpdesk.grid5000.fr/g5kschool11 registrations to practical sessions]}}
{{Portal|User}}
[[Image:Grid5000-Reims.jpg|center|505px|Une image de Reim]]
{{See also|[[Grid5000:School2011|Spring School 2011 main page]] | [[Grid5000:School2011 Program|Spring School 2011 program]] | [[Grid5000:School2011 TPs|Spring School 2011 practical sessions]] |  [https://helpdesk.grid5000.fr/g5kschool registrations to practical sessions]}}
[[Image:Grid5000-Reims.jpg|center|640px|Reims Spring School 2011]]
== Overview, with authors ==
== Overview, with authors ==
This program is the first draft of the program. Practical session will be detailed on a [https://helpdesk.grid5000.fr/g5kschool11 registration site] used to plan capacity. Initial list with [[Grid5000:School2011 TPs|more details]] is available.
This program is the final version.  
Practical session will be detailed on a [https://helpdesk.grid5000.fr/g5kschool registration site] used to plan capacity. Initial list with [[Grid5000:School2011 TPs|more details]] is available.
 
To ease management of the event, please plan your arrival either
* between 13:30 and 14:00 on the Monday,
* between 10:30 and 11:00 or between 13:30 and 14:00 on the Tuesday.
 
 
* The green practical sessions are the sessions recommended to new users
* The green practical sessions are the sessions recommended to new users
* The orange practical sessions are the sessions recommended to current users
* The orange practical sessions are contributed by users
* The first slot on the program each morning is for lessons.
* The first slot on the program each morning is for lessons.


Line 20: Line 28:


  <tr>
  <tr>
   <td >9h-9h30</td>
   <td >9h-10h</td>
   <td rowspan="8"></td>
   <td rowspan="7"></td>
   <td colspan=4 rowspan=2>Kadeploy3 usage on Grid'5000</td>
   <td colspan=2 rowspan=2>Executive Committee meeting: April 6th review debriefing</td>
   <td colspan=4 rowspan="2" style="background-color: #F966F9;">Advanced tools for Grid'5000 (taktuk)</td>
   <td colspan=2 rowspan=2 style="padding: 0px 0px; background-color: #99ff99;">Practical session: efficient customization of Grid'5000 : using Kadeploy3 on Grid'5000</td>
   <td colspan=4 rowspan="2" style="background-color: #F966F9;"></td>
   <td colspan=4 rowspan="1">Advanced tools for Grid'5000: taktuk <em>Pierre Neyron</em></td>
</tr>
   <td colspan=4 rowspan="1">[[Media:Grid5000-2011-Future.pdf|The future of Grid'5000]] <em>David Margery, Technical director</em></td>
 
<tr>
  <td>9h30-10h</td>
   <td colspan=4>Scientific Challenges using Grid'5000<br><em>Christian Perez</em></td>
  </tr>
  </tr>


  <tr>
  <tr>
   <td>10h-10h30</td>
   <td>10h-10h30</td>
  <!--- td>Hemera: a Large-scale Initiative Actions (Christian Perez)</td> --->
   <td colspan=4>[[Media:Grid5000-2011-Paper3.pdf|Automating the measure of the NAS MG cache behavior on many architectures using CControl and Grid'5000]] ''Swann Perarnau and Pierre Neyron.''</td>
   <td colspan=4>Poster session</td>
   <td colspan=4>Focus on networking: tools prepared by the technical staff</td>
  <td colspan=4 style="background-color: #F966F9;">Data analysis with R. <em>Louis-Claude Canon</em></td>
   <td colspan=4>Challenge winner's presentation</td>
  </tr>
  </tr>


  <tr>
  <tr>
   <td>10h30-11h00</td>
   <td>10h30-11h00</td>
   <td colspan=12>coffee break</td>
   <td colspan=2>Registration/coffee break</td>
  <td colspan=10>coffee break</td>
  </tr>
  </tr>


  <tr>
  <tr>
   <td>11h-11h30</td>
   <td>11h-11h30</td>
   <td colspan=4>Experimenting OAR in a virtual cluster environment for btch schedulers comparative evaluation. <em>Joseph Emeras and Yiannis Geiorgiou</em></td>
   <td colspan=2 rowspan="3">Hemera working group: Methodology</td>
   <td colspan=4>Shrinker: Efficient Wide Area Live Virtual Machine Migration using Distributed Content-Based Addressing. <em>Pierre Riteau, Christine Morin and Thierry Priol</em></td>
  <td colspan=2 rowspan="3" style="padding: 0px 0px; background-color: #99ff99;" >Practical session: Grid'5000 is programmable (API Basics)</td>
   <td colspan=4>Sparse linear algebra over GF(2): The rsa768 matrix on Grid'5000.<br><em>Emmanuel Thomé</em></td>
   <td colspan=4 rowspan="1">[https://www.grid5000.fr/school2011/4_Djamai.pdf Experimental Study of a P2P B&B approach on top of Grid’5000] ''Mathieu Djamaï, [http://www.lifl.fr/~derbel/ Bilel Derbel] and [http://www.lifl.fr/~melab/ Nouredine Melab]''</td>
   <td colspan=4 rowspan="2">Awards and Challenge winner's presentation</td>
  </tr>
  </tr>


  <tr>
  <tr>
   <td>11h30-12h</td>
   <td>11h30-12h</td>
   <td colspan=4>Virtual resources allocation on Aladdin/Grid'5000 infrastructure through the HIPerNet Middleware. <em>Tram Truong Huu and Johan Montagnat</em></td>
   <td colspan=4>[[Media:Grid5000-2011-Paper5.pdf|Deploying a fault-tolerant computing middleware over Grid'5000: performance analysis of CONFIIT and its integration with a quantum molecular docking application]] ''[http://cosy.univ-reims.fr/~lsteffenel Luiz Angelo Steffenel], [http://cosy.univ-reims.fr/~jcboisson Jean-Charles Boisson], Jaillet Christophe, [http://cosy.univ-reims.fr/flauzac Olivier Flauzac], Michael Krajecki, Chantal Barberot, Eric Henon and Stéphane Gérard.''</td>
  <td colspan=4>Modelization and Performance Evaluation of the DIET Middleware. <em>Eddy Caron, Benjamin Depardon and Frédéric Desprez</em></td>
  <td colspan=4>Evaluating JACEP2P-V2 and some parallel asynchronous iterative methods over Grid'5000. <em>Jean-Claude Charr</em></td>
  </tr>
  </tr>


  <tr>
  <tr>
   <td>12h00-12h30</td>
   <td>12h00-12h30</td>
   <td colspan=4>Efficient Grid Resource Selection for a CEM Application. <em>Eddy Caron, Cristian Klein and Christian Perez</em></td>
   <td colspan=8 style="border-bottom: 0px;"></td>
  <td colspan=4>How to enhance the grid accessibility for non-expert users using autonomic computing. <em>Rémi Sharrock, Patricia Stolf, Thierry Monteil and Mihai Alexandru</em></td>
  <td colspan=4>Distributed Branch-and-Bound Algorithm: A Pure Peer-to-Peer Approach. <em>Mathieu Djamaï, Bilel Derbel and Nouredine Melab</em></td>
  </tr>
  </tr>


  <tr>
  <tr>
   <td>12h30-13h</td>
   <td>12h30-13h30</td>
   <td colspan=4>Invited Talk : <em>Kate Keahey</em> (Argonne National Laboratory/FutureGrid) </td>
   <td colspan=12 rowspan=1 style="border-left: 0px; border-bottom: 0px;">Lunch break</td>
  <td colspan=4>Rethinking Internet fairness in swarm based content distribution.<br><em>Massimo Gallo</em></td>
  <td colspan=4>Parallel Hybrid Optimization for Permutation Problems on the Computational Grid. <em>Malika Mehdi, Nouredine Melab, EL-Ghazali Talbi and Pascal Boubry</em></td>
  </tr>
  </tr>


  <tr>
  <tr>
   <td>13h-14h</td>
   <td>13h30-14h</td>
   <td>as from 13:30 : registration and coffee</td>
   <td style="border-top: 1px solid #000000;">as from 13:30 : registration and coffee</td>
   <td colspan=12 rowspan=2>Lunch</td>
   <td colspan=4 style="border-top: 0px;"></td>
  <td colspan=4 rowspan=2 style="border-top: 1px solid #000000; border-bottom: 1px">Challenge: [https://www.grid5000.fr/school2011/Grid5000-2011-Challenge1.pdf Leveraging BlobSeer to Boost up the Deployment and Execution of Hadoop Applications in Nimbus Cloud Environments on Grid'5000]</td>
  <td colspan=6 style="border-top: 1px solid #000000; border-top: 0px;"></td>
  </tr>
  </tr>


  <tr>
  <tr>
   <td>14h-14h30</td>
   <td>14h-14h30</td>
   <th style="border-top: 2px solid; border-left: 2px solid; border-right: 2px solid;">BoF : interconnecting experimental infrastructures</th>.
   <td rowspan=2>Introductory lessons</td>
  <td colspan=4 rowspan=2>[http://graal.ens-lyon.fr/~cperez/110419-Hemera-G5K-School.pdf Hemera Scientific Challenges on Grid'5000]</td>
  <td rowspan=13 style='background:#ffffff;border-right:0px #ffffff solid;border-bottom:0;'></td>
  </tr>  
  </tr>  


  <tr>
  <tr>
    <td>14h30-16h00</td>
  <td>14h30-15h00</td>
   
  <td colspan=4 rowspan=2 style="border-top: 1px solid #000000; border-bottom: 1px">Challenge: [[Media:Grid5000-2011-Challenge2.pdf|Grid-in-a-grid: Deployment of a gLite Grid inside Grid'5000]]</td>  
    <!--- Tuesday --->
</tr>
    <td style="padding: 0px 0px; border-left: 2px solid; border-right: 2px solid; border-bottom: 2px solid;">
      <table style="padding:0;border-collapse:collapse;background-color: #f9f9f9;">
      <tr><td>14:00</td><td style="padding: 0px 0px; border-right: 0px">Grid'5000 presentation and highlights on the network connection to other Grids. <em>David Margery</em> </td></tr>
      <tr><td>14:30</td><td style="padding: 0px 0px; border-right: 0px">OneLab <em>Timur Friedman</em></td></tr>
      <tr><td>15:00</td><td style="padding: 0px 0px; border-right: 0px">IDHAL (connecting G5K and Onelab) Experience around using Grid'5000 and Onelab in the same experiment session <em>Jean-Noël Quintin</em></td></tr>
      <tr><td style="padding: 0px 0px; border-left: 0px; border-bottom: 0px;">15:30</td><td style="padding: 0px 0px; border-right: 0px; border-bottom: 0px;">Round table</td></tr>
      </table>
    </td>


    <!--- Wednesday --->
<tr>
    <td rowspan="2" style="padding: 0px 0px; border-left: 0px solid; border-right: 0px solid;">
  <td>15h00-15h30</td>
      <table style="padding:0;border-collapse:collapse;background-color: #f9f9f9;">
  <td rowspan=2 style="padding: 0px 0px; background-color: #99ff99;">Practical session: from ssh access to initial submissions</td>
      <tr><td style="padding: 0px 0px; border-right: 0px; background-color: #F966F9;">Lesson: using Grid'5000<br><em>David Margery</em></td></tr>
  <td colspan=4>[https://www.grid5000.fr/school2011/1_DianaMoise.pdf A Cost-Evaluation of MapReduce Applications in the Cloud] ''Diana Moise, Alexandra Carpen-Amarie, [http://www.irisa.fr/kerdata/doku.php?id=people:gabriel.antoniu Gabriel Antoniu] and Luc Bouge.''</td>
      <tr><td style="padding: 0px 0px; border-right: 0px; border-bottom: 0px solid; background-color: #99ff99;">First steps on Grid'5000</td></tr>
</tr>
      </table>
    </td>
    <td rowspan="2" style="padding: 0px 0px; background-color: #ffB999;">Customizing Grid'5000 with Kadeploy</td>
    <td rowspan="2" style="padding: 0px 0px; ">Metroflux and Kavlan : mastering Grid'5000's network</td>
    <td rowspan="2" style="padding: 0px 0px; ">Running Hadoop on Grid'5000 <em>Vinicius V. Cogo, Marcelo Pasin and Andrea S. Charão</em> (User contributed)</td>


    <!--- Thursday --->
<tr>
    <td rowspan="2" style="padding: 0px 0px; background-color: #99ff99;">Customizing Grid'5000 with Kadeploy</td>
  <td>15h30-16h00</td>
    <td rowspan="2" style="padding: 0px 0px; background-color: #ffB999;">Metrology API Howto</td>
  <td colspan=4>[[Media:Grid5000-2011-Paper2.pdf|Hybrid Distributed Computing Infrastructure Experiments in Grid5000: Supporting QoS in Desktop Grids with Cloud Resources]] ''Simon Delamare and Gilles Fedak.''</td>
    <td rowspan="2" style="padding: 0px 0px; ">Tools to analyze and present experiment data</td>
  <td colspan=4 rowspan=1 style="border-top: 1px solid #000000; border-bottom: 1px;border-left: 1px">Challenge: [[Media:Grid5000-2011-Challenge3.pdf|Deploy a Highly-dynamic Virtual Cluster Based on OpenNebula and Xen in Grid'5000]]</td>  
    <td rowspan="2" style="padding: 0px 0px; border-right: 0px;">Put some Green in your Grid5000 experiments ! <em>Marcos Dias de Assuncao, Laurent Lefevre, Olivier Mornard and Anne-Cécile Orgerie.</em> (User contributed)</td>
</tr>


    <!--- Friday --->
<tr>
    <td rowspan="2" style="padding: 0px 0px; background-color: #99ff99;">Discovering Grid'5000's API</td>
  <td>16h-16h30</td>
    <td rowspan="2" style="padding: 0px 0px; background-color: #ffB999;">From .iso to Kadeploy3 image</td>
  <td colspan="9">Coffee Break</td>
    <td rowspan="2" style="padding: 0px 0px; ">Running MPI on Grid'5000</td>
    <td rowspan="2" style="padding: 0px 0px; ">HIPerNet: virtual infrastructure manipulation<em><br>Romaric Guillier, Fabienne Anhalt, Guilherme Koslovski and Pascale Vicat-Blanc Primet.</em><br>(User contributed)</td>
  </tr>
  </tr>


  <tr>
  <tr>
   <td>16h-16h15</td>
   <td>16h30-17h</td>
   <td rowspan="2">Coffee Break</td>
  <td colspan="1" rowspan="4" style="padding: 0px 0px; background-color: #99ff99;">Practical session: from ssh access to initial submissions</td>
  <td rowspan="4" style="padding: 0px 0px; background-color: #ff9966;"> [[Put Some Green In Your Experiments | Put some Green in your Grid’5000 experiments !]], <em>Marcos Dias de Assuncao, Mohammed El Mehdi Diouri, Laurent Lefèvre, Olivier Mornard, Anne-Cécile Orgerie, Ghislain Landry, Tsafack Chetsa</em></td>
  <td rowspan="4" style="padding: 0px 0px; background-color: #ff9966;">[[Deploying and Using IaaS Clouds on Grid'5000]], <em>Alexandra Carpen-Amarie, Houssem-Eddine Chihoub, Pierre Riteau</em></td>
  <td rowspan="4">[[Deploy a virtualized environment|Deploy a virtualized environment]]</td>
  <td rowspan="4">[[Run_MPI_On_Grid%275000|Run MPI on Grid'5000]]</td>
  <td colspan="1" rowspan="4" style="padding: 0px 0px; background-color: #ff9966;">[[Creating Customized Grid'5000 Environments with Chef]], <em>Pierre Riteau</em></td>
  <td colspan="1" rowspan="4" style="padding: 0px 0px; background-color: #ff9966;">[[Large-Scale Trace Visualization Triva|Large-Scale Trace Visualization Analysis with Triva and Pajé -- the G5K case study --]], <em>Lucas Mello Schnorr, Arnaud Legrand</em></td>
  <td rowspan="4">[[KaVLAN|Network isolation on Grid'5000]]</td>
   <td rowspan="4">[[API_Metrology_Practical|Use the API to gather metrics]]</td>
  </tr>
  </tr>


  <tr>
  <tr>
   <td>16h15-16h30</td>
   <td>17h-17h30</td>
  <td colspan="8" rowspan="2">Coffee break</td>
  <td colspan="4" rowspan="5" style="border-bottom: 0px;border-right: 0px;background-color:#FFFFFF;"></td>
  </tr>
  </tr>


  <tr>
  <tr>
  <td>16h30-16h45</td>
  <td>17h30-18h</td>
  <th style="border-top: 2px solid; border-left: 2px solid; border-right: 2px solid;">BOF : Managing large scale experimental infrastructures</th>.
</tr>
<tr>
  <td>18h-18h30</td>
  </tr>
  </tr>


  <tr>
  <tr>
  <td>16h45-18h30</td>
  <td>18h30-18h45</td>
  <td style="padding: 0px 0px; border-left: 2px solid; border-right: 2px solid; border-bottom: 2px solid; border-top: 2px solid;">
  <td colspan="9" style="border-bottom: 0px;"></td>
  <table style="padding:0;border-collapse:collapse;background-color: #f9f9f9;">
</tr>
    <tr><td>16:30</td><td>Lessons learned while trying to administer Grid'5000 and current tools and architecture for Grid'5000 system admin <em>Pascal Morillon, Rémi Palancher and David Margery*</em></td></tr>
    <tr><td>17:15</td> <td>Future Grid<em>Kate Keahey</em> </td></tr>
    <tr><td>18:00</td><td>Discussion </td></tr>
  </table>
  </td>


<tr>
  <td>18h45-19h</td>
  <td colspan="1" style="border-bottom: 0px;"></td>
  <td colspan="4" rowspan='2' style="border-top:1px solid #000000;">Champagne cellar visit and tasting</td>
  <td colspan="4" style="border-bottom:1px solid #000000;"></td>
</tr>


    <!--- Wednesday --->
<tr>
 
  <td>19h00-20h</td>
    <td style="padding: 0px 0px; background-color: #99ff99;">Resource allocation with OAR and OARGRID</td>
  <td colspan="1" style="border-bottom: 0px;"></td>
    <td style="padding: 0px 0px; background-color: #ffB999;">Discovering Grid'5000's API</td>
   <td colspan="4" style="border:1px solid #000000;">Official cocktail at City Hall</td>
    <td style="padding: 0px 0px; ">Deploying virtual machines with Kadeploy</td>
    <td style="padding: 0px 0px; border-right: 0px;">Programming with MPI</td>
 
    <!--- Thursday --->
    <td rowspan="1" colspan=2 height="16,6%" style="padding: 0px 0px; ">Production Grids session (cancelled due to strikes)</td>
    <td rowspan="1" colspan=2 height="16,7%" style="padding: 0px 0px; ">Grid'5000 challenge final runs</td>
</tr>
<tr>
  <td>18h30-20h</td>
   <td colspan="9" style="border-bottom: 0px;"></td>
  </tr>
  </tr>


Line 170: Line 156:
  </tr>
  </tr>
</table>
</table>
== List of user presentations ==
* '''Experimental Study of a P2P B&amp;B approach on top of Grid’5000.''' ''Mathieu Djamaï, [http://www.lifl.fr/~derbel/ Bilel Derbel] and [http://www.lifl.fr/~melab/ Nouredine Melab]'' [https://www.grid5000.fr/school2011/4_Djamai.pdf PDF]
** '''Abstract:''': The state-of-the-art large scale approach for solving NP-hard permutation-like problems using parallel Branch-and-Bound (B&B) techniques are based on a Master-Slave model which is known to be limited in terms of scalability. To get beyond this limitation, we designed a Pure P2P approach which provides fully distributed mechanisms dealing with the B\&amp;B algorithm.<br/>In this paper, we propose an experimental study of our approach to analyze its performances in both small-scale and large-scale environments. We compare our approach to a state-of-the-art Master-Slave approach and we study its intrinsic performances (Parallel Speed-up). We also study the impact of the network topology on the performances of our approach. The evaluation was carried out on the Aladdin/Grid’5000 testbed. Experimental results show an improved scalability, a reduced communication overhead and a satisfactory speed-up for our approach.
* '''A Cost-Evaluation of MapReduce Applications in the Cloud''' ''Diana Moise, Alexandra Carpen-Amarie, [http://www.irisa.fr/kerdata/doku.php?id=people:gabriel.antoniu Gabriel Antoniu] and Luc Bouge.'' [https://www.grid5000.fr/school2011/1_DianaMoise.pdf PDF]
** '''Abstract:''': MapReduce is a powerful paradigm that enables rapid implementation of a wide range of distributed data-intensive applications. Its open source implementation, the Hadoop project, has been recently adopted by the Cloud computing community. This paper focuses on evaluating the cost of moving MapReduce applications to the Cloud, in order to find a proper trade-off between cost and performance for this class of applications. We provide a cost evaluation of running MapReduce applications in the Cloud, by looking into two aspects: the overhead implied by executing the job on the Cloud, compared to executing it on a Grid, and the actual costs of renting cloud resources. We compared the runtime of two MapReduce applications when they are executed with the Hadoop framework, in two similar environments: clusters belonging to the Grid&apos;5000 platform and in a Nimbus Cloud deployed on top of Grid'5000 nodes.
* '''Hybrid Distributed Computing Infrastructure Experiments in Grid'5000: Supporting QoS in Desktop Grids with Cloud Resources''' ''Simon Delamare and Gilles Fedak.'' [[Media:Grid5000-2011-Paper2.pdf|PDF]]
** '''Abstract:''': Hybrid Distributed Computing Infrastructures (DCIs) allow users to combine Grids, Desktop Grids, Clouds, etc. to obtain for their users even larger computing capabilities. In this paper, we present an experimental study of the SpeQuloS framework which aims at providing QoS to Desktop Grid by provisioning on-demand Cloud resources. We describe the experimental platform which relies on Grid5000 to mimic both a Desktop Grid system and a Cloud system. Preliminary results are presented which shows the potential of the SpeQuloS approach.
* '''Deploying a fault-tolerant computing middleware over Grid'5000: performance analysis of CONFIIT and its integration with a quantum molecular docking application''' ''[http://cosy.univ-reims.fr/~lsteffenel Luiz Angelo Steffenel], [http://cosy.univ-reims.fr/~jcboisson Jean-Charles Boisson], Jaillet Christophe, [http://cosy.univ-reims.fr/flauzac Olivier Flauzac], Michael Krajecki, Chantal Barberot, Eric Henon and Stéphane Gérard.'' [[Media:Grid5000-2011-Paper5.pdf|PDF]]
** '''Abstract:''': P2P computing middlewares are interesting options for grid computing applications that require scalability and resiliency. Nevertheless, most P2P computation systems rely on partially centralized or hybrid decentralized architectures to distribute tasks and collect the results, raising fault tolerance and bottleneck issues. CONFIIT (Computation Over Network with Finite number of Independent and Irregular Tasks) is a purely decentralized middleware for grid computing, relying on a virtual ring for topology management and for task scheduling. Firstly this paper evaluates the impact of node placement and task granularity on the performance of CONFIIT while solving the well-known Langford permutation problem. Secondly, an application of CONFIIT on a complex real life problem is presented.
* '''Automating the measure of the NAS MG cache behavior on many architectures using CControl and Grid'5000''' ''Swann Perarnau and Pierre Neyron.'' [[Media:Grid5000-2011-Paper3.pdf|PDF]]
** '''Abstract:''' The NAS Parallel Benchmarks are a well recognized set of parallel benchmarks, implementing classical HPC algorithms. Understanding the memory behavior of such algorithms enables the development of better architectures and better parallel runtimes. In this paper we discuss the use of Grid'5000 for the various system architectures it features, to measure and understand the memory behavior of the NAS benchmarks. We achieve that thanks to a tool we recently developed: CControl, which allows to control the cache available to an application. Using this tool we present several measurements of the working sets of the NAS MG program, showing how its performance evolves when the cache conguration varies. While we would have liked to gather a wide range of results using the different architectures featured by the platform for comparison, we discovered that unfortunately such measurements cannot be fully automated on Grid'5000 yet. We explain why.

Latest revision as of 15:31, 29 April 2011

Reims Spring School 2011

Overview, with authors

This program is the final version. Practical session will be detailed on a registration site used to plan capacity. Initial list with more details is available.

To ease management of the event, please plan your arrival either

  • between 13:30 and 14:00 on the Monday,
  • between 10:30 and 11:00 or between 13:30 and 14:00 on the Tuesday.


  • The green practical sessions are the sessions recommended to new users
  • The orange practical sessions are contributed by users
  • The first slot on the program each morning is for lessons.
April 18th (Monday) April 19th (Tuesday) April 20th (Wednesday) April 21st (Thursday)
9h-10h Executive Committee meeting: April 6th review debriefing Practical session: efficient customization of Grid'5000 : using Kadeploy3 on Grid'5000 Advanced tools for Grid'5000: taktuk Pierre Neyron The future of Grid'5000 David Margery, Technical director
10h-10h30 Automating the measure of the NAS MG cache behavior on many architectures using CControl and Grid'5000 Swann Perarnau and Pierre Neyron. Focus on networking: tools prepared by the technical staff
10h30-11h00 Registration/coffee break coffee break
11h-11h30 Hemera working group: Methodology Practical session: Grid'5000 is programmable (API Basics) Experimental Study of a P2P B&B approach on top of Grid’5000 Mathieu Djamaï, Bilel Derbel and Nouredine Melab Awards and Challenge winner's presentation
11h30-12h Deploying a fault-tolerant computing middleware over Grid'5000: performance analysis of CONFIIT and its integration with a quantum molecular docking application Luiz Angelo Steffenel, Jean-Charles Boisson, Jaillet Christophe, Olivier Flauzac, Michael Krajecki, Chantal Barberot, Eric Henon and Stéphane Gérard.
12h00-12h30
12h30-13h30 Lunch break
13h30-14h as from 13:30 : registration and coffee Challenge: Leveraging BlobSeer to Boost up the Deployment and Execution of Hadoop Applications in Nimbus Cloud Environments on Grid'5000
14h-14h30 Introductory lessons Hemera Scientific Challenges on Grid'5000
14h30-15h00 Challenge: Grid-in-a-grid: Deployment of a gLite Grid inside Grid'5000
15h00-15h30 Practical session: from ssh access to initial submissions A Cost-Evaluation of MapReduce Applications in the Cloud Diana Moise, Alexandra Carpen-Amarie, Gabriel Antoniu and Luc Bouge.
15h30-16h00 Hybrid Distributed Computing Infrastructure Experiments in Grid5000: Supporting QoS in Desktop Grids with Cloud Resources Simon Delamare and Gilles Fedak. Challenge: Deploy a Highly-dynamic Virtual Cluster Based on OpenNebula and Xen in Grid'5000
16h-16h30 Coffee Break
16h30-17h Practical session: from ssh access to initial submissions Put some Green in your Grid’5000 experiments !, Marcos Dias de Assuncao, Mohammed El Mehdi Diouri, Laurent Lefèvre, Olivier Mornard, Anne-Cécile Orgerie, Ghislain Landry, Tsafack Chetsa Deploying and Using IaaS Clouds on Grid'5000, Alexandra Carpen-Amarie, Houssem-Eddine Chihoub, Pierre Riteau Deploy a virtualized environment Run MPI on Grid'5000 Creating Customized Grid'5000 Environments with Chef, Pierre Riteau Large-Scale Trace Visualization Analysis with Triva and Pajé -- the G5K case study --, Lucas Mello Schnorr, Arnaud Legrand Network isolation on Grid'5000 Use the API to gather metrics
17h-17h30
17h30-18h
18h-18h30
18h30-18h45
18h45-19h Champagne cellar visit and tasting
19h00-20h Official cocktail at City Hall
20h-... Gala dinner

List of user presentations

  • Experimental Study of a P2P B&B approach on top of Grid’5000. Mathieu Djamaï, Bilel Derbel and Nouredine Melab PDF
    • Abstract:: The state-of-the-art large scale approach for solving NP-hard permutation-like problems using parallel Branch-and-Bound (B&B) techniques are based on a Master-Slave model which is known to be limited in terms of scalability. To get beyond this limitation, we designed a Pure P2P approach which provides fully distributed mechanisms dealing with the B\&B algorithm.
      In this paper, we propose an experimental study of our approach to analyze its performances in both small-scale and large-scale environments. We compare our approach to a state-of-the-art Master-Slave approach and we study its intrinsic performances (Parallel Speed-up). We also study the impact of the network topology on the performances of our approach. The evaluation was carried out on the Aladdin/Grid’5000 testbed. Experimental results show an improved scalability, a reduced communication overhead and a satisfactory speed-up for our approach.
  • A Cost-Evaluation of MapReduce Applications in the Cloud Diana Moise, Alexandra Carpen-Amarie, Gabriel Antoniu and Luc Bouge. PDF
    • Abstract:: MapReduce is a powerful paradigm that enables rapid implementation of a wide range of distributed data-intensive applications. Its open source implementation, the Hadoop project, has been recently adopted by the Cloud computing community. This paper focuses on evaluating the cost of moving MapReduce applications to the Cloud, in order to find a proper trade-off between cost and performance for this class of applications. We provide a cost evaluation of running MapReduce applications in the Cloud, by looking into two aspects: the overhead implied by executing the job on the Cloud, compared to executing it on a Grid, and the actual costs of renting cloud resources. We compared the runtime of two MapReduce applications when they are executed with the Hadoop framework, in two similar environments: clusters belonging to the Grid'5000 platform and in a Nimbus Cloud deployed on top of Grid'5000 nodes.
  • Hybrid Distributed Computing Infrastructure Experiments in Grid'5000: Supporting QoS in Desktop Grids with Cloud Resources Simon Delamare and Gilles Fedak. PDF
    • Abstract:: Hybrid Distributed Computing Infrastructures (DCIs) allow users to combine Grids, Desktop Grids, Clouds, etc. to obtain for their users even larger computing capabilities. In this paper, we present an experimental study of the SpeQuloS framework which aims at providing QoS to Desktop Grid by provisioning on-demand Cloud resources. We describe the experimental platform which relies on Grid5000 to mimic both a Desktop Grid system and a Cloud system. Preliminary results are presented which shows the potential of the SpeQuloS approach.
  • Deploying a fault-tolerant computing middleware over Grid'5000: performance analysis of CONFIIT and its integration with a quantum molecular docking application Luiz Angelo Steffenel, Jean-Charles Boisson, Jaillet Christophe, Olivier Flauzac, Michael Krajecki, Chantal Barberot, Eric Henon and Stéphane Gérard. PDF
    • Abstract:: P2P computing middlewares are interesting options for grid computing applications that require scalability and resiliency. Nevertheless, most P2P computation systems rely on partially centralized or hybrid decentralized architectures to distribute tasks and collect the results, raising fault tolerance and bottleneck issues. CONFIIT (Computation Over Network with Finite number of Independent and Irregular Tasks) is a purely decentralized middleware for grid computing, relying on a virtual ring for topology management and for task scheduling. Firstly this paper evaluates the impact of node placement and task granularity on the performance of CONFIIT while solving the well-known Langford permutation problem. Secondly, an application of CONFIIT on a complex real life problem is presented.
  • Automating the measure of the NAS MG cache behavior on many architectures using CControl and Grid'5000 Swann Perarnau and Pierre Neyron. PDF
    • Abstract: The NAS Parallel Benchmarks are a well recognized set of parallel benchmarks, implementing classical HPC algorithms. Understanding the memory behavior of such algorithms enables the development of better architectures and better parallel runtimes. In this paper we discuss the use of Grid'5000 for the various system architectures it features, to measure and understand the memory behavior of the NAS benchmarks. We achieve that thanks to a tool we recently developed: CControl, which allows to control the cache available to an application. Using this tool we present several measurements of the working sets of the NAS MG program, showing how its performance evolves when the cache conguration varies. While we would have liked to gather a wide range of results using the different architectures featured by the platform for comparison, we discovered that unfortunately such measurements cannot be fully automated on Grid'5000 yet. We explain why.