Grid5000:Software: Difference between revisions
| m (kavlan availability) | No edit summary | ||
| Line 10: | Line 10: | ||
| ===Grid5000 Team=== | ===Grid5000 Team=== | ||
| {{Template:Software| | {{Template:Software| | ||
| |title= | |title=OAR2 | ||
| |purpose=OAR is a resource manager (or batch scheduler) for large clusters. It allows cluster users to submit or reserve nodes either in an interactive or a batch mode. | |purpose=OAR is a resource manager (or batch scheduler) for large clusters. It allows cluster users to submit or reserve nodes either in an interactive or a batch mode. | ||
| |logo=[[Image:Logo_oar.png|150px|right]] | |logo=[[Image:Logo_oar.png|150px|right]] | ||
Revision as of 16:39, 22 June 2013
Software mainly developed in Grid'5000 and available for its users.
System Software
Grid5000 Team
OAR2
OAR is a resource manager (or batch scheduler) for large clusters. It allows cluster users to submit or reserve nodes either in an interactive or a batch mode.
- Status: Production/Stable
- Contacts: Olivier Richard
- Homepage: http://oar.imag.fr
- Availability: OAR 2 is installed on all Grid'5000 clusters
Kadeploy 3
Kadeploy is a fast and scalable deployment system towards cluster and grid computing. It provides a set of tools, for cloning, configuring (post installation) and managing a set of nodes. Currently it deploys successfully linux, *BSD, Windows, Solaris on x86 and 64 bits computers.
- Status: Production/Stable
- Contacts: Emmanuel Jeanvoine
- Homepage: https://gforge.inria.fr/projects/kadeploy3/
- Availability: Kadeploy is installed on all Grid'5000 clusters
KaVLAN
VLAN manipulation tool for network isolation of experiment
- Status: Production
- Contacts: Nicolas Niclausse
- Homepage: http://kavlan.gforge.inria.fr/
- Availability: See status table on KaVLAN page
Grid5000 Community
TakTuk 3
TakTuk is a tool for deploying parallel remote executions of commands to a potentially large set of remote nodes. It spreads itself using an adaptive algorithm and sets up an interconnection network to transport commands and perform I/Os multiplexing/demultiplexing. The TakTuk mechanics dynamically adapts to environment (machine performance and current load, network contention) by using a reactive work-stealing algorithm that mix local parallelization and work distribution.
- Status: Production/Stable
- Contacts: Guillaume Huard
- Homepage: http://taktuk.gforge.inria.fr/
- Availability: -
Katapult 3
Katapult is a small, well-tested script to automatically start experiments using deployments. Most experiments start by deploying the nodes, re-deploying the nodes if too many of them failed, copying the user's SSH key to the node, etc. Katapult automates all those tasks. This tool is available on most clusters under the name katapult3 and is compliant with kadeploy3.
- Status: Production/Stable
- Contacts: Lucas Nussbaum
- Homepage: http://www.loria.fr/~lnussbau/katapult.html
- Availability: tested on most clusters under the name katapult3.
GRUDU 1.1.0
GRUDU for Grid5000 Reservation Utility for Deployment Usage is a tool for managing Grid5000 resources, reservations and deployments. It provides :
- the access to the Grid5000 grid environment through a GUI interface allowing the user to get detailed informations on the grid, the sites, and the present jobs.
- the reservation of resources on all queues/types thanks to an user-friendly interface
- the deployment of images on the nodes reserved with the deploy type (through Kadeploy)
- the Ganglia monitoring for all nodes of a sites and the history of the Ganglia metrics on jobs' nodes
- file transfert and synnchrinisation capability with the JFTP module
- the acces to a terminal on each level of the grid (the connection frontale, a cluster frontale or the master node of a reservation)
As GRUDU is a JAVA-based and client-side software it is cross-platform and you only need a JVM 1.5.0 or later on your computer to run it.
- Status: Production/Stable
- Contacts: David Loureiro
- Homepage: http://graal.ens-lyon.fr/DIET/grudu.html
- Availability: Portable (written in Java)
Adage
ADAGE is an automatic deployment tool of applications in a grid environnement. It targets dynamic applications by providing a large set of services to deal with resources like information services, resources reservation/allocation, file transfert and job launching anfd monitoring.
ADAGE is internally based on a generic application description model (GADe) so as to support any kind of programming model and/or to support multi-programming model applications
- Status: Production/Stable
- Contacts: Christian Perez
- Homepage: http://adage.gforge.inria.fr/
- Availability: should work on all clusters.has been tested on various Debian versions, on RH/Fedora-likes, and on *BSD
Experiment Tools
Wrekavoc
The goal of Wrekavoc is to define and control the heterogeneity of a given platform by degrading CPU, network or memory capabilities of each node composing this platform. The degradation is done remotely, without restarting the hardware. The control is fine, reproducible and independent (one may degrade CPU without modifying the network bandwidth).
- Status: Production/Stable
- Contacts: Emmanuel Jeannot
- Homepage: http://wrekavoc.gforge.inria.fr
- Availability: Wrekavoc has been succesfully tested on several linux clusters
CoRDAGe
Co-deployment and Re-deployment of Generic Applications.
- Status: Beta
- Contacts: Loïc Cudennec
- Homepage: http://cordage.gforge.inria.fr
- Availability: Grid'5000
NXE
Network eXperiment Engine is a tool written in Python to automate networking experiments in real testbeds environments. It allows to simply script experiments involving hundreds of nodes. The scenarios are described through XML files that provides a simple and hierarchical description of the topology, the general configuration and the interactions between the end-hosts.
- Status: Stable
- Contacts: Romaric Guillier
- Homepage: http://ens-lyon.fr/LIP/RESO/Software/NXE
- Availability: requires Paramiko SSH library (available on compil.lyon.grid5000.fr)
Execo
Execo is an experiment toolkit. It offers a Python API for local or remote, standalone or parallel, processes execution. It is especially well suited for quickly scripting workflows of parallel/distributed operations on local or remote hosts: conducting experiments, performing automated tests, etc. It includes an API for dealing with oar, oargrid, kadeploy, and grid5000 API.
- Status: Version 1.2. Stable
- Contacts: matthieu dot imbert at inria dot fr
- Homepage: http://execo.gforge.inria.fr
- Availability: Should work on all clusters, needs to be installed.
Development Environments / Middleware
Kaapi
KAAPI means Kernel for Adaptative, Asynchronous Parallel and Interactive programming. It is a C++ library that allows to execute multithreaded computation with data flow synchronization between threads. The library is able to schedule fine/medium size grain program on distributed machine. The data flow graph is dynamic (unfold at runtime).Target architectures are clusters of SMP machines.
- Status: Release versions are stable
- Contacts: kaapi-dev-info@lists.gforge.inria.fr, Thierry Gautier
- Homepage: http://kaapi.gforge.inria.fr
- Availability: Should work on all clusters
DIET
[1] DIET means Distributed Interactive Engineering Toolbox. It is a C/C++ grid middleware, based on the GridRPC paradigm. DIET provides lots of mechanisms for simplifying the usage of a grid: adaptable job scheduling, performance prediction, data management (replication, persistency...), workflow management (either dataflow and workflows with conditional structures and loops), transparent job submission to batch schedulers (OAR, PBS, SGE, LoadLeveler...), transparent submission to cloud systems (Eucalyptus, Amazon EC2...).
- Status: Release versions are stable. Current version: 2.6.1
- Contacts: diet-dev@ens-lyon.fr, Eddy Caron
- Homepage: http://graal.ens-lyon.fr/DIET
- Availability: Works on all clusters
Marcel
Marcel is a POSIX-compliant thread library featuring a programmable scheduler designed for hierarchical multiprocessor architectures.
- Status: Production
- Contacts: Samuel Thibault
- Homepage: http://runtime.futurs.inria.fr/marcel/
- Availability: Bordeaux
Mad-MPI
Mad-MPI is an efficient implementation of MPI for fast networks.
- Status: Production
- Contacts: Nathalie Furmento
- Homepage: http://runtime.futurs.inria.fr/MadMPI/
- Availability: Bordeaux
MPICH-Madeleine
MPICH-Madeleine is an MPI implementation for clusters and clusters of clusters with heterogenous networks.
- Status: Production
- Contacts: Nathalie Furmento, Guillaume Mercier
- Homepage: http://runtime.futurs.inria.fr/mpi/
- Availability: Bordeaux
NewMadeleine
the NewMadeleine communication library provides extended capabilities for dynamic communication optimization on top of high performance networks.
- Status: Production
- Contacts: Olivier Aumage, Elisabeth Brunet
- Homepage: http://runtime.futurs.inria.fr/newmadeleine/
- Availability: Bordeaux
|   | Grid'5000 users, please send an email to web-staff if you would like to see a new software appear on this page. | 





