Grid5000:Home: Difference between revisions
No edit summary |
Lpouilloux (talk | contribs) No edit summary |
||
(47 intermediate revisions by 9 users not shown) | |||
Line 1: | Line 1: | ||
__NOTOC__ __NOEDITSECTION__ | __NOTOC__ __NOEDITSECTION__ | ||
{|width="95%" | {|width="95%" | ||
|- valign="top" | |- valign="top" | ||
|bgcolor="#f5fff5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | |bgcolor="#f5fff5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | ||
[[Image: | [[Image:g5k-backbone.png|thumbnail|260px|right|Grid'5000]] | ||
'''Grid'5000 is a large-scale and flexible testbed for experiment-driven research in all areas of computer science, with a focus on parallel and distributed computing including Cloud, HPC and Big Data and AI.''' | |||
'''Grid'5000 is a large-scale and | |||
Key features: | Key features: | ||
* provides '''access to a large amount of resources''': | * provides '''access to a large amount of resources''': 15000 cores, 800 compute-nodes grouped in homogeneous clusters, and featuring various technologies: PMEM, GPU, SSD, NVMe, 10G and 25G Ethernet, Infiniband, Omni-Path | ||
* '''highly reconfigurable and controllable''': researchers can experiment with a fully customized software stack thanks to bare-metal deployment features, and can isolate their experiment at the networking layer | * '''highly reconfigurable and controllable''': researchers can experiment with a fully customized software stack thanks to bare-metal deployment features, and can isolate their experiment at the networking layer | ||
* '''advanced monitoring and measurement features for traces collection of networking and power consumption''', providing a deep understanding of experiments | * '''advanced monitoring and measurement features for traces collection of networking and power consumption''', providing a deep understanding of experiments | ||
Line 16: | Line 14: | ||
<br> | <br> | ||
Read more about our [[ | Read more about our [[Team|teams]], our [[Publications|publications]], and the [[Grid5000:UsagePolicy|usage policy]] of the testbed. Then [[Grid5000:Get_an_account|get an account]], and learn how to use the testbed with our [[Getting_Started|Getting Started tutorial]] and the rest of our [[:Category:Portal:User|Users portal]]. | ||
<b>Grid'5000 is merging with [https://fit-equipex.fr FIT] to build the [http://www.silecs.net/ SILECS Infrastructure for Large-scale Experimental Computer Science]. Read [http://www.silecs.net/wp-content/uploads/2018/04/Desprez-SILECS.pdf an Introduction to SILECS] (April 2018)</b> | |||
<br> | <br> | ||
Recently published documents: | Recently published documents and presentations: | ||
* [[Media:Grid5000.pdf|Presentation of Grid'5000]] (April 2019) | |||
* [https://www.grid5000.fr/mediawiki/images/Grid5000_science-advisory-board_report_2018.pdf Report from the Grid'5000 Science Advisory Board (2018)] | |||
Older documents: | |||
* [https://www.grid5000.fr/slides/2014-09-24-Cluster2014-KeynoteFD-v2.pdf Slides from Frederic Desprez's keynote at IEEE CLUSTER 2014] | * [https://www.grid5000.fr/slides/2014-09-24-Cluster2014-KeynoteFD-v2.pdf Slides from Frederic Desprez's keynote at IEEE CLUSTER 2014] | ||
* [https://www.grid5000.fr/ScientificCommittee/SAB%20report%20final%20short.pdf Report from the Grid'5000 Science Advisory Board | * [https://www.grid5000.fr/ScientificCommittee/SAB%20report%20final%20short.pdf Report from the Grid'5000 Science Advisory Board (2014)] | ||
<br> | <br> | ||
Line 27: | Line 31: | ||
|} | |} | ||
<br> | <br> | ||
{{#status:0|0|0|http://bugzilla.grid5000.fr/status/upcoming.json}} | {{#status:0|0|0|http://bugzilla.grid5000.fr/status/upcoming.json}} | ||
<br> | |||
== Random pick of publications == | |||
{{#publications:}} | |||
==Latest news== | |||
<rss max=4 item-max-length="2000">https://www.grid5000.fr/rss/G5KNews.php</rss> | |||
< | |||
---- | ---- | ||
[[ | [[News|Read more news]] | ||
=== Grid'5000 sites=== | === Grid'5000 sites=== | ||
{|width=" | {|width="100%" cellspacing="3" | ||
|- valign="top" | |- valign="top" | ||
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | |width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | ||
Line 101: | Line 50: | ||
* [[Lille:Home|Lille]] | * [[Lille:Home|Lille]] | ||
* [[Luxembourg:Home|Luxembourg]] | * [[Luxembourg:Home|Luxembourg]] | ||
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | |||
* [[Lyon:Home|Lyon]] | * [[Lyon:Home|Lyon]] | ||
* [[Nancy:Home|Nancy]] | * [[Nancy:Home|Nancy]] | ||
* [[Nantes:Home|Nantes]] | * [[Nantes:Home|Nantes]] | ||
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | |||
* [[Rennes:Home|Rennes]] | * [[Rennes:Home|Rennes]] | ||
* [[Sophia:Home|Sophia-Antipolis]] | * [[Sophia:Home|Sophia-Antipolis]] | ||
* [[Toulouse:Home|Toulouse]] | * [[Toulouse:Home|Toulouse]] | ||
Line 114: | Line 62: | ||
== Current funding == | == Current funding == | ||
As from June 2008, | As from June 2008, Inria is the main contributor to [[Grid5000:Funding|Grid'5000 funding]]. | ||
{|width="100%" cellspacing="3" | {|width="100%" cellspacing="3" | ||
|- | |- | ||
Line 122: | Line 70: | ||
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | | width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | ||
===CNRS=== | ===CNRS=== | ||
[[Image:CNRS-filaire- | [[Image:CNRS-filaire-Quadri.png|125px]] | ||
|- | |- | ||
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | | width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | ||
===Universities=== | ===Universities=== | ||
IMT Atlantique<br/> | |||
Université Grenoble Alpes, Grenoble INP<br/> | |||
Université Rennes 1, Rennes<br/> | |||
Institut National Polytechnique de Toulouse / INSA / FERIA / Université Paul Sabatier, Toulouse<br/> | Institut National Polytechnique de Toulouse / INSA / FERIA / Université Paul Sabatier, Toulouse<br/> | ||
Université Bordeaux 1, Bordeaux<br/> | |||
Université Lille 1, Lille<br/> | |||
École Normale Supérieure, Lyon<br/> | |||
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | | width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | ||
===Regional councils=== | ===Regional councils=== | ||
Aquitaine<br/> | Aquitaine<br/> | ||
Auvergne-Rhône-Alpes<br/> | |||
Bretagne<br/> | Bretagne<br/> | ||
Champagne-Ardenne<br/> | Champagne-Ardenne<br/> | ||
Provence Alpes Côte d'Azur<br/> | Provence Alpes Côte d'Azur<br/> | ||
Hauts de France<br/> | |||
Lorraine<br/> | Lorraine<br/> | ||
|} | |} |
Latest revision as of 10:29, 26 October 2023
Grid'5000 is a large-scale and flexible testbed for experiment-driven research in all areas of computer science, with a focus on parallel and distributed computing including Cloud, HPC and Big Data and AI. Key features:
Grid'5000 is merging with FIT to build the SILECS Infrastructure for Large-scale Experimental Computer Science. Read an Introduction to SILECS (April 2018)
Older documents:
|
Random pick of publications
Five random publications that benefited from Grid'5000 (at least 2517 overall):
- Mathilde Jay, Vladimir Ostapenco, Laurent Lefèvre, Denis Trystram, Anne-Cécile Orgerie, et al.. An experimental comparison of software-based power meters: focus on CPU and GPU. CCGrid 2023 - 23rd IEEE/ACM international symposium on cluster, cloud and internet computing, May 2023, Bangalore, India. pp.1-13, 10.1109/CCGrid57682.2023.00020. hal-04030223v2 view on HAL pdf
- Shakeel Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni. Robust Stuttering Detection via Multi-task and Adversarial Learning. EUSIPCO 2022 - 30th European Signal Processing Conference, Aug 2022, Belgrade, Serbia. hal-03629785 view on HAL pdf
- Manuel Alejandro Diaz-Zapata, Özgür Erkent, Christian Laugier, Jilles Dibangoye, David Sierra González. LAPTNet: LiDAR-Aided Perspective Transform Network. ICARCV 2022 - 17th International Conference on Control, Automation, Robotics and Vision, Dec 2022, Singapore, Singapore. hal-03851513 view on HAL pdf
- Didier El Baz. On Parallel or Distributed Asynchronous Iterations with Unbounded Delays and Possible Out of Order Messages or Flexible Communication for Convex Optimization Problems and Machine Learning. 2022 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), May 2022, Lyon, France. pp.807-813, 10.1109/IPDPSW55747.2022.00135. hal-03793141 view on HAL pdf
- Clément Barthélemy, Francieli Zanon Boito, Emmanuel Jeannot, Guillaume Pallez, Luan Teylo. Implementation of an unbalanced I/O Bandwidth Management system in a Parallel File System. RR-9537, Inria. 2024. hal-04417412 view on HAL pdf
Latest news
Cluster "estats" is now in the default queue in Toulouse
We are pleased to announce that the estats cluster of Toulouse (the name refers to Pica d'Estats) is now available in the default queue.
As a reminder, estats is composed of 12 edge-class nodes powered by Nvidia AGX Xavier SoCs. Each node features:
Since it is not a cluster of server-class machines (unlike all current other Grid'5000 nodes), estats runs a different default system environment, but other common functionalities are the same (kadeploy etc., except kavlan which is not supported yet).
For the experimentations, it is recommended to deploy Ubuntu L4T.
More information in the Jetson page.
The cluster was funded by a CNRS grant.
-- Grid'5000 Team 9:51, March 6th 2024 (CEST)
The big variant of Debian 12 "Bookworm" environments is ready for deployments
We are pleased to inform you that the big variant of Debian 12 (Bookworm) environments is now supported for deployments in Grid'5000. Check `kaenv3 -l debian12%` for detailed information.
Notably, the NVIDIA driver has been updated to version 535.129.03, and CUDA has been upgraded to version 12.2.2_535.104.05_linux for the amd64 architecture.
The default environment available on nodes will continue to be debian11-std for the foreseeable future.
Please refer to the updated wiki documentation¹ for guidance on Debian 12-min|nfs|big usage.
¹: https://www.grid5000.fr/w/Getting_Started#On_Grid.275000_reference_environments
-- Grid'5000 Team 14:21, Jan 22nd 2024 (CEST)
Cluster "montcalm" is now in the default queue in Toulouse
We have the pleasure to announce that the "montaclm" cluster is now available in the default queue of the Toulouse site, which makes the site full-fledged again!
This cluster consists of 10 HPE Proliant DL360 Gen10+ nodes with 2 CPUs Intel Xeon Silver 4314 (16 cores per CPUs), 256 GB of DDR4 RAM, and 894GB SSD.
Jobs submitted on the Toulouse site will run by default on this cluster.
Beside the "montcalm" cluster, the "edge-class" cluster "estats" is still available in the testing queue for now.
In order to support the SLICES-FR project, the site infrastructure has been funded by CNRS/INS2I and the "montcalm" cluster has been funded by University Paul Sabatier (UT3).
-- Grid'5000 Team 10:30, 18 Jan 2024 (CET)
New "edge-computing"-class nodes in Toulouse's testing queue: cluster Estats with 12 Nvidia AGX Xavier SoCs
A new cluster named "estats" is available in the testing queue of the Toulouse site, composed of 12 "Edge computing"-class nodes.
Estats is composed of 12 Nvidia AGX Xavier SoCs¹. Each SoC features:
The 12 modules are packaged in a chassis manufactured by Connecttech⁴.
Since it is not a cluster of server-class machines (unlike all current other Grid'5000 nodes), estats runs a different default system environment. This environment includes Nvidia's Linux for Tegra²³ overlay on top of the Grid'5000 standard environment. This means:
This default environment does not include the required Tegra-specific version of Cuda.
To benefit from the whole Nvidia stack with e.g. the specific Cuda version and DL accelerators support for Nvidia Tegra, it is advised to deploy on the node the Nvidia-supported Ubuntu 20.04 OS with the full L4T support, using kadeploy
. You can use the ubuntul4t200435-big
environment. E.g.:
ftoulouse$
oarsub
-q testing -t exotic -p estats -t deploy
-l nodes=1 -I
ftoulouse$
kadeploy3
ubuntul4t200435-big
This tutorial page explains how this ubuntul4t200435-big
environment is built and how to...
Grid'5000 sites
Current funding
As from June 2008, Inria is the main contributor to Grid'5000 funding.
INRIA |
CNRS |
UniversitiesIMT Atlantique |
Regional councilsAquitaine |