Grid5000:Home: Difference between revisions

From Grid5000
Jump to navigation Jump to search
No edit summary
No edit summary
 
(134 intermediate revisions by 12 users not shown)
Line 1: Line 1:
__NOTOC__ __NOEDITSECTION__
__NOTOC__ __NOEDITSECTION__
{|width="95%"
{|width="95%"
|-
|- valign="top"
| width="20%" |
|bgcolor="#f5fff5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[[Image:Logo_Aladdin.png|250px]]
[[Image:g5k-backbone.png|thumbnail|260px|right|Grid'5000]]
|
'''Grid'5000 is a large-scale and flexible testbed for experiment-driven research in all areas of computer science, with a focus on parallel and distributed computing including Cloud, HPC and Big Data and AI.'''
= ALADDIN-G5K : ensuring the development of '''Grid'5000''' =
 
= for the 2008-2012 period =
Key features:
* provides '''access to a large amount of resources''': 15000 cores, 800 compute-nodes grouped in homogeneous clusters, and featuring various technologies: PMEM, GPU, SSD, NVMe, 10G and 25G Ethernet, Infiniband, Omni-Path
* '''highly reconfigurable and controllable''': researchers can experiment with a fully customized software stack thanks to bare-metal deployment features, and can isolate their experiment at the networking layer
* '''advanced monitoring and measurement features for traces collection of networking and power consumption''', providing a deep understanding of experiments
* '''designed to support Open Science and reproducible research''', with full traceability of infrastructure and software changes on the testbed
* '''a vibrant community''' of 500+ users supported by a solid technical team
 
<br>
Read more about our [[Team|teams]], our [[Publications|publications]], and the [[Grid5000:UsagePolicy|usage policy]] of the testbed. Then [[Grid5000:Get_an_account|get an account]], and learn how to use the testbed with our [[Getting_Started|Getting Started tutorial]] and the rest of our [[:Category:Portal:User|Users portal]].


''An infrastructure distributed in 9 sites around France, for research in large-scale parallel and distributed systems''
<b>Grid'5000 is merging with [https://fit-equipex.fr FIT] to build the [http://www.silecs.net/ SILECS Infrastructure for Large-scale Experimental Computer Science]. Read [http://www.silecs.net/wp-content/uploads/2018/04/Desprez-SILECS.pdf an Introduction to SILECS] (April 2018)</b>


Engineers ensuring the development and day to day support of the infrastructure are mostly provided by INRIA, under the ''ADT ALADDIN-G5K''  initiative.
<br>
Recently published documents and presentations:
* [[Media:Grid5000.pdf|Presentation of Grid'5000]] (April 2019)
* [https://www.grid5000.fr/mediawiki/images/Grid5000_science-advisory-board_report_2018.pdf Report from the Grid'5000 Science Advisory Board (2018)]


Older documents:
* [https://www.grid5000.fr/slides/2014-09-24-Cluster2014-KeynoteFD-v2.pdf Slides from Frederic Desprez's keynote at IEEE CLUSTER 2014]
* [https://www.grid5000.fr/ScientificCommittee/SAB%20report%20final%20short.pdf Report from the Grid'5000 Science Advisory Board (2014)]
<br>
Grid'5000 is supported by a scientific interest group (GIS) hosted by Inria and including CNRS, RENATER and several Universities as well as other organizations. Inria has been supporting Grid'5000 through ADT ALADDIN-G5K (2007-2013), ADT LAPLACE (2014-2016), and IPL [[Hemera|HEMERA]] (2010-2014).
|}
|}


{|width="95%"
<br>
|- valign="top"
{{#status:0|0|0|http://bugzilla.grid5000.fr/status/upcoming.json}}
|bgcolor="#f5fff5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<br>
==Latest news==
[[Image:G5KSS10-Carre.png|left|120px|]]
=== [[Grid5000:School2010|Grid'5000 Spring School 2010 announced]] ([http://registration.net-resa.com/cgi-bin/WebObjects/gnetresa.woa/wa/newParticipant?idevt=444&profil=905 Registration] now opened) ===
Organized in Lille, from April 6th to April 9th 2010, this [[Grid5000:School2010|spring school]] will bring together, but is not limited to, Grid'5000's users, technical team and
executive committee for 4 days of tutorials and talks focusing on
best-practices and results. Presentations and practical sessions will
cover both basic usage of the platform, for new users, or potential users of Grid'5000 and advanced and new usage of the platform, for current users.


A '''challenge''' to showcase tools and environments demonstrating the deployment of distributed or Grid middleware on Grid'5000 will be held this year for the first time.
== Random pick of publications ==
{{#publications:}}


(<small>image ©-maxime-dufour-photographies</small>)
==Latest news==
----
<rss max=4 item-max-length="2000">https://www.grid5000.fr/rss/G5KNews.php</rss>
=== Latest updated experiment descriptions ===
{{#experiments:3}}
----
=== Latest updated publications ===
{{#publications:3}}
----
=== Research in Grids - Production Grids workshop held October 13th, 2009 ===
L’[http://www.idgrilles.fr Institut des Grilles (IdG)] et l’action Aladdin INRIA lancent un [[Appel Interfaces Recherche en grilles/Grilles de production 2009|appel commun à propositions]] pour dynamiser les recherches à l’interface entre la recherche sur les grilles, les grilles de recherche, et les grilles de production.
 
Le colloque présentera les questions ouvertes aux interfaces et les projets proposés dans le cadre de cet appel, afin de favoriser la création de nouveaux réseaux. Le but de la journée est de faire émerger des synergies. La présentation de travaux en cours est donc vivement encouragée. [http://graal.ens-lyon.fr/~desprez/FILES/ProdRech.html Plus d’informations].
---
=== Supercomputing and grid computing days at Lille ===
The first edition of the supercomputing and grid computing days at Lille will be held at Université de Lille1 and INRIA Lille - Nord Europe on December 2, 3 and 7, 2009. Half of the event is dedicated to the Grid5000 nation-wide grid infrastructure. A series of presentations including feedbacks on the use of Grid5000 is scheduled together with a one day practical training on it. The detailed program is available at: http://www2.lifl.fr/~melab/pmwiki/index.php?n=Main.Journ%e9esCIGIL
----
----
[[News|Read more news]]


[[Grid5000:News|read more news]]
=== Grid'5000 sites===
|}
{|width="100%" cellspacing="3"  
 
<br>
==Grid'5000 at a glance==
[[Image:site_map.png|thumbnail|128px|right|Grid'5000 sites]]
* '''Grid'5000''' is a scientific instrument for the study of large scale parallel and distributed systems. It aims at providing a '''highly reconfigurable, controlable and monitorable experimental platform''' to its users. The initial aim (circa 2003) was to reach 5000 processors in the platform. It has been reframed at 5000 cores, and was reached during winter 2008-2009.
* The infrastructure of Grid'5000 is geographically distributed on different sites hosting the instrument, initialy 9 in France. Porto Alegre, Brazil is now officially becoming the 10th site.
===Sites:===
{|width="75%" cellspacing="3"  
|- valign="top"
|- valign="top"
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
* [[Bordeaux:Home|Bordeaux]]
* [[Grenoble:Home|Grenoble]]
* [[Grenoble:Home|Grenoble]]
* [[Lille:Home|Lille]]
* [[Lille:Home|Lille]]
* [[Luxembourg:Home|Luxembourg]]
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
* [[Lyon:Home|Lyon]]
* [[Lyon:Home|Lyon]]
* [[Nancy:Home|Nancy]]
* [[Nancy:Home|Nancy]]
* [[Orsay:Home|Orsay]]
* [[Nantes:Home|Nantes]]
* [[PortoAlegre:Home|Porto Alegre]]
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
* [[Rennes:Home|Rennes]]
* [[Rennes:Home|Rennes]]
Line 70: Line 60:
|-
|-
|}
|}
[[Image:Software layers.png|thumbnail|271px|left|Grid'5000 allows experiments in all these software layers]]
* '''Grid'5000''' is a research effort developping a '''large scale nation wide infrastructure for large scale parallel and distributed computing research'''.
* '''17 [[Grid5000:Laboratories|laboratories]]''' are involved in France with the objective of providing the community a testbed allowing experiments in all the software layers between the network protocols up to the applications.
The current plans are to extend from the 9 initial sites each with 100 to a thousand PCs, connected by the [http://www.renater.fr RENATER] Education and Research Network to a bigger platform including a few sites outside France not necessarily connected through a dedicated network connection. Sites in Brazil and Luxembourg should join shortly.
All sites in France are connected to [http://www.renater.fr RENATER] with a 10Gb/s link.
This high collaborative research effort is funded by INRIA, CNRS, the Universities of all sites and some regional councils.
== Initial Rationale==
'''The foundations of Grid'5000''' have emerged from a thorough analysis and numerous discussions about methodologies used for scientific research in the Grid domain. A report presents the [http://www-sop.inria.fr/aci/grid/public/Library/rapport-grid5000-V3.pdf rationale for Grid'5000].
In addition to theory, simulators and emulators, there is a strong need for '''large scale testbeds''' where real life experimental conditions hold. '''The size of Grid'5000''', in terms of number of sites and number of processors per site, was established according to the scale of the experiments and the number of researchers involved in the project.


== Current funding ==
== Current funding ==
As from June 2008, INRIA is the main contributor to [[Grid5000:Funding|Grid'5000 funding]].  
As from June 2008, Inria is the main contributor to [[Grid5000:Funding|Grid'5000 funding]].  
{|width="100%" cellspacing="3"
{|width="100%" cellspacing="3"
|-
|-
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
===INRIA===
===INRIA===
[[Image:Logo-inria.png]]
[[Image:Logo_INRIA.gif|300px]]
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
===CNRS===
===CNRS===
[[Image:Logo-cnrs.png]]
[[Image:CNRS-filaire-Quadri.png|125px]]
|-
|-
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
===Universities===
===Universities===
University Joseph Fourier, Grenoble<br/>
IMT Atlantique<br/>
University of Rennes 1, Rennes<br/>
Université Grenoble Alpes, Grenoble INP<br/>
Université Rennes 1, Rennes<br/>
Institut National Polytechnique de Toulouse / INSA / FERIA / Université Paul Sabatier, Toulouse<br/>
Institut National Polytechnique de Toulouse / INSA / FERIA / Université Paul Sabatier, Toulouse<br/>
University Bordeaux 1, Bordeaux<br/>
Université Bordeaux 1, Bordeaux<br/>
University Lille 1, Lille<br/>
Université Lille 1, Lille<br/>
Ecole Normale Supérieure, Lyon<br/>
École Normale Supérieure, Lyon<br/>
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
===Regional councils===
===Regional councils===
Aquitaine<br/>
Aquitaine<br/>
Auvergne-Rhône-Alpes<br/>
Bretagne<br/>
Bretagne<br/>
Champagne-Ardenne<br/>
Provence Alpes Côte d'Azur<br/>
Provence Alpes Côte d'Azur<br/>
Nord Pas de Calais<br/>
Hauts de France<br/>
Lorraine<br/>
Lorraine<br/>
|}
|}

Latest revision as of 10:29, 26 October 2023

Grid'5000

Grid'5000 is a large-scale and flexible testbed for experiment-driven research in all areas of computer science, with a focus on parallel and distributed computing including Cloud, HPC and Big Data and AI.

Key features:

  • provides access to a large amount of resources: 15000 cores, 800 compute-nodes grouped in homogeneous clusters, and featuring various technologies: PMEM, GPU, SSD, NVMe, 10G and 25G Ethernet, Infiniband, Omni-Path
  • highly reconfigurable and controllable: researchers can experiment with a fully customized software stack thanks to bare-metal deployment features, and can isolate their experiment at the networking layer
  • advanced monitoring and measurement features for traces collection of networking and power consumption, providing a deep understanding of experiments
  • designed to support Open Science and reproducible research, with full traceability of infrastructure and software changes on the testbed
  • a vibrant community of 500+ users supported by a solid technical team


Read more about our teams, our publications, and the usage policy of the testbed. Then get an account, and learn how to use the testbed with our Getting Started tutorial and the rest of our Users portal.

Grid'5000 is merging with FIT to build the SILECS Infrastructure for Large-scale Experimental Computer Science. Read an Introduction to SILECS (April 2018)


Recently published documents and presentations:

Older documents:


Grid'5000 is supported by a scientific interest group (GIS) hosted by Inria and including CNRS, RENATER and several Universities as well as other organizations. Inria has been supporting Grid'5000 through ADT ALADDIN-G5K (2007-2013), ADT LAPLACE (2014-2016), and IPL HEMERA (2010-2014).


Current status (at 2024-07-28 20:18): 2 current events, 1 planned (details)


Random pick of publications

Five random publications that benefited from Grid'5000 (at least 2515 overall):

  • Tulika Bose, Nikolaos Aletras, Irina Illina, Dominique Fohr. Domain Classification-based Source-specific Term Penalization for Domain Adaptation in Hate-speech Detection. COLING 2022 - Proceedings of the 29th International Conference on Computational Linguistics, Oct 2022, Gyeongju, South Korea. hal-03815708 view on HAL pdf
  • John Baena, Pierre Briaud, Daniel Cabarcas, Ray Perlner, Daniel Smith-Tone, et al.. Improving Support-Minors rank attacks: applications to GeMSS and Rainbow. CRYPTO 2022 - 42nd Annual International Cryptology Conference, Aug 2022, Santa Barbara (CA), United States. pp.376--405, 10.1007/978-3-031-15982-4_13. hal-03533455v2 view on HAL pdf
  • Safa Alsaidi, Miguel Couceiro, Esteban Marquer, Sophie Quennelle, Anita Burgun, et al.. An analogy based framework for patient-stay identification in healthcare. ATA@ICCBR 2022 - Workshop Analogies: from Theory to Applications, Sep 2022, Nancy, France. hal-03763772 view on HAL pdf
  • Shakeel Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni. Robust Stuttering Detection via Multi-task and Adversarial Learning. EUSIPCO 2022 - 30th European Signal Processing Conference, Aug 2022, Belgrade, Serbia. hal-03629785 view on HAL pdf
  • Shakeel Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni. Machine Learning for Stuttering Identification: Review, Challenges & Future Directions. Neurocomputing, 2022, 514 (2022), pp.17. 10.1016/j.neucom.2022.10.015. hal-03634072v2 view on HAL pdf


Latest news

Rss.svgNew Access Rules for Production Queue

We are introducing new access rules for clusters in the production queue. Clusters in this queue are now accessed based on priority levels that reflect their funding sources. Jobs submitted at higher priority levels are scheduled before those at lower levels and may also have longer maximum durations.

For more detailed information, please visit: https://www.grid5000.fr/w/Production#Using_production_resources . However, an important thing to note is that job submission commands that worked previously will still work after that change (but you might get higher or lower priority depending on your resources selection).

You can check your priority level for each cluster using https://api.grid5000.fr/explorer . Currently, this tool only displays basic information; however, we plan to add more features soon.

Please be aware that these changes apply only to the production clusters, which are currently available only in Nancy and Rennes. There are no changes to the "default" queue.

If you encounter any issues or have feedback regarding this new feature, or if you believe your priority level on specific resources is not adequate, please contact us at <support-staff@lists.grid5000.fr>.

-- Grid'5000 Team 9:25, June 4th 2024 (CEST)

Rss.svgCluster "estats" is now in the default queue in Toulouse

We are pleased to announce that the estats cluster of Toulouse (the name refers to Pica d'Estats) is now available in the default queue.

As a reminder, estats is composed of 12 edge-class nodes powered by Nvidia AGX Xavier SoCs. Each node features:

  • 1 ARM64 CPU (Nvidia Carmel micro-arch) with 8 cores
  • 1 Nvidia GPU (Nvidia Volta micro-arch)
  • 32 GB RAM shared between CPU and GPU
  • 1 NVMe of 2TB
  • 1 Gbps NIC
  • Since it is not a cluster of server-class machines (unlike all current other Grid'5000 nodes), estats runs a different default system environment, but other common functionalities are the same (kadeploy etc., except kavlan which is not supported yet).

    For the experimentations, it is recommended to deploy Ubuntu L4T.

    More information in the Jetson page.

    The cluster was funded by a CNRS grant.

    -- Grid'5000 Team 9:51, March 6th 2024 (CEST)

    Rss.svgThe big variant of Debian 12 "Bookworm" environments is ready for deployments

    We are pleased to inform you that the big variant of Debian 12 (Bookworm) environments is now supported for deployments in Grid'5000. Check `kaenv3 -l debian12%` for detailed information.

    Notably, the NVIDIA driver has been updated to version 535.129.03, and CUDA has been upgraded to version 12.2.2_535.104.05_linux for the amd64 architecture.

    The default environment available on nodes will continue to be debian11-std for the foreseeable future.

    Please refer to the updated wiki documentation¹ for guidance on Debian 12-min|nfs|big usage.

    ¹: https://www.grid5000.fr/w/Getting_Started#On_Grid.275000_reference_environments

    -- Grid'5000 Team 14:21, Jan 22nd 2024 (CEST)

    Rss.svgCluster "montcalm" is now in the default queue in Toulouse

    We have the pleasure to announce that the "montaclm" cluster is now available in the default queue of the Toulouse site, which makes the site full-fledged again!

    This cluster consists of 10 HPE Proliant DL360 Gen10+ nodes with 2 CPUs Intel Xeon Silver 4314 (16 cores per CPUs), 256 GB of DDR4 RAM, and 894GB SSD.

    Jobs submitted on the Toulouse site will run by default on this cluster.

    Beside the "montcalm" cluster, the "edge-class" cluster "estats" is still available in the testing queue for now.

    In order to support the SLICES-FR project, the site infrastructure has been funded by CNRS/INS2I and the "montcalm" cluster has been funded by University Paul Sabatier (UT3).

    -- Grid'5000 Team 10:30, 18 Jan 2024 (CET)


    Read more news

    Grid'5000 sites

    Current funding

    As from June 2008, Inria is the main contributor to Grid'5000 funding.

    INRIA

    Logo INRIA.gif

    CNRS

    CNRS-filaire-Quadri.png

    Universities

    IMT Atlantique
    Université Grenoble Alpes, Grenoble INP
    Université Rennes 1, Rennes
    Institut National Polytechnique de Toulouse / INSA / FERIA / Université Paul Sabatier, Toulouse
    Université Bordeaux 1, Bordeaux
    Université Lille 1, Lille
    École Normale Supérieure, Lyon

    Regional councils

    Aquitaine
    Auvergne-Rhône-Alpes
    Bretagne
    Champagne-Ardenne
    Provence Alpes Côte d'Azur
    Hauts de France
    Lorraine