Toulouse:Hardware: Difference between revisions

From Grid5000
Jump to navigation Jump to search
No edit summary
No edit summary
Line 35: Line 35:
|[[#estats|estats]]||<b>testing</b>&nbsp;queue,<br/><b>[[Getting_Started#Selecting_specific_resources|exotic]]</b>&nbsp;job&nbsp;type||2023-06-13||12||1||Nvidia Carmel||8&nbsp;cores/CPU||aarch64||32&nbsp;GiB||data-sort-value="1863"|<b>2.0&nbsp;TB&nbsp;SSD</b>||data-sort-value="1000"|1&nbsp;Gbps&nbsp;||Nvidia AGX&nbsp;Xavier&nbsp;(30&nbsp;GiB)
|[[#estats|estats]]||<b>testing</b>&nbsp;queue,<br/><b>[[Getting_Started#Selecting_specific_resources|exotic]]</b>&nbsp;job&nbsp;type||2023-06-13||12||1||Nvidia Carmel||8&nbsp;cores/CPU||aarch64||32&nbsp;GiB||data-sort-value="1863"|<b>2.0&nbsp;TB&nbsp;SSD</b>||data-sort-value="1000"|1&nbsp;Gbps&nbsp;||Nvidia AGX&nbsp;Xavier&nbsp;(30&nbsp;GiB)
|-
|-
|[[#montcalm|montcalm]]||<b>testing</b>&nbsp;queue||2022-12-01||10||2||Intel Xeon Silver 4314||16&nbsp;cores/CPU||x86_64||256&nbsp;GiB||data-sort-value="894"|<b>960&nbsp;GB&nbsp;SSD</b>||data-sort-value="10000"|10&nbsp;Gbps&nbsp;||
|[[#montcalm|montcalm]]||||2022-12-01||10||2||Intel Xeon Silver 4314||16&nbsp;cores/CPU||x86_64||256&nbsp;GiB||data-sort-value="894"|<b>960&nbsp;GB&nbsp;SSD</b>||data-sort-value="10000"|10&nbsp;Gbps&nbsp;||
|-
|-
|}
|}
''**: crossed GPUs are not supported by Grid'5000 default environments''
''**: crossed GPUs are not supported by Grid'5000 default environments''
= Clusters in the testing queue =
= Clusters in the [https://intranet.grid5000.fr/oar/Toulouse/drawgantt-svg/ default queue] =


== [https://intranet.grid5000.fr/oar/Toulouse/drawgantt-svg/?filter=estats%20only estats] ==
== [https://intranet.grid5000.fr/oar/Toulouse/drawgantt-svg/?filter=montcalm%20only montcalm] ==


'''12 nodes, 12 cpus, 96 cores''' ([https://public-api.grid5000.fr/stable/sites/toulouse/clusters/estats/nodes.json?pretty=1 json])
'''10 nodes, 20 cpus, 320 cores''' ([https://public-api.grid5000.fr/stable/sites/toulouse/clusters/montcalm/nodes.json?pretty=1 json])


'''Reservation example:'''
'''Reservation example:'''


{{Term|location=ftoulouse|cmd=<code class="command">oarsub</code> <code class="replace">-q testing</code> <code class="replace">-t exotic</code> <code class="env">-p estats</code> <code>-I</code>}}
{{Term|location=ftoulouse|cmd=<code class="command">oarsub</code> <code class="env">-p montcalm</code> <code>-I</code>}}


{|
{|
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Access condition:'''
| testing queue, exotic job type<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Model:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Model:'''
| Connecttech/Nvidia Jetson AGX Xavier<br/>
| HPE Proliant DL360 Gen10+<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Date of arrival:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Date of arrival:'''
| 2023-06-13<br/>
| 2022-12-01<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''CPU:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''CPU:'''
| Nvidia Carmel (Carmel), aarch64, 1&nbsp;CPU/node, 8&nbsp;cores/CPU<br/>
| Intel Xeon Silver 4314 (Ice Lake), x86_64, 2.40GHz, 2&nbsp;CPUs/node, 16&nbsp;cores/CPU<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Memory:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Memory:'''
| 32&nbsp;GiB<br/>
| 256&nbsp;GiB<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Storage:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Storage:'''
|  disk0, 2.0&nbsp;TB SSD NVME Samsung Samsung SSD 970 EVO Plus 2TB (dev: <code class="file">/dev/disk0</code>, by-path: <code class="file">/dev/disk/by-path/platform-14180000.pcie-pci-0000:01:00.0-nvme-1</code>)  (primary disk)<br/>
|  disk0, 960&nbsp;GB SSD SATA HP VK000960GXAWL (dev: <code class="file">/dev/disk0</code>, by-path: <code class="file">/dev/disk/by-path/pci-0000:00:17.0-ata-1</code>)  (primary disk)<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Network:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Network:'''
|  
|  
* eth0, Ethernet, configured rate: 1&nbsp;Gbps, model: N/A, driver: nvethernet - no KaVLAN<br/>
* eth0/ens10f0np0, Ethernet, configured rate: 10&nbsp;Gbps, model: Broadcom Inc. and subsidiaries BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller, driver: bnxt_en<br />
|-
* <span style="color:grey">eth1/ens10f1np1, Ethernet, model: Broadcom Inc. and subsidiaries BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller, driver: bnxt_en - unavailable for experiment</span><br/>
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''GPU:'''
| Nvidia AGX&nbsp;Xavier&nbsp;(30&nbsp;GiB)<br>Compute&nbsp;capability:&nbsp;7.2<br/>
|-
|-
|}
|}


== [https://intranet.grid5000.fr/oar/Toulouse/drawgantt-svg/?filter=montcalm%20only montcalm] ==
= Clusters in the testing queue =
 
== [https://intranet.grid5000.fr/oar/Toulouse/drawgantt-svg/?filter=estats%20only estats] ==


'''10 nodes, 20 cpus, 320 cores''' ([https://public-api.grid5000.fr/stable/sites/toulouse/clusters/montcalm/nodes.json?pretty=1 json])
'''12 nodes, 12 cpus, 96 cores''' ([https://public-api.grid5000.fr/stable/sites/toulouse/clusters/estats/nodes.json?pretty=1 json])


'''Reservation example:'''
'''Reservation example:'''


{{Term|location=ftoulouse|cmd=<code class="command">oarsub</code> <code class="replace">-q testing</code> <code class="env">-p montcalm</code> <code>-I</code>}}
{{Term|location=ftoulouse|cmd=<code class="command">oarsub</code> <code class="replace">-q testing</code> <code class="replace">-t exotic</code> <code class="env">-p estats</code> <code>-I</code>}}


{|
{|
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Access condition:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Access condition:'''
| testing queue<br/>
| testing queue, exotic job type<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Model:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Model:'''
| HPE Proliant DL360 Gen10+<br/>
| Connecttech/Nvidia Jetson AGX Xavier<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Date of arrival:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Date of arrival:'''
| 2022-12-01<br/>
| 2023-06-13<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''CPU:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''CPU:'''
| Intel Xeon Silver 4314 (Ice Lake), x86_64, 2.40GHz, 2&nbsp;CPUs/node, 16&nbsp;cores/CPU<br/>
| Nvidia Carmel (Carmel), aarch64, 1&nbsp;CPU/node, 8&nbsp;cores/CPU<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Memory:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Memory:'''
| 256&nbsp;GiB<br/>
| 32&nbsp;GiB<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Storage:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Storage:'''
|  disk0, 960&nbsp;GB SSD SATA HP VK000960GXAWL (dev: <code class="file">/dev/disk0</code>, by-path: <code class="file">/dev/disk/by-path/pci-0000:00:17.0-ata-1</code>)  (primary disk)<br/>
|  disk0, 2.0&nbsp;TB SSD NVME Samsung Samsung SSD 970 EVO Plus 2TB (dev: <code class="file">/dev/disk0</code>, by-path: <code class="file">/dev/disk/by-path/platform-14180000.pcie-pci-0000:01:00.0-nvme-1</code>)  (primary disk)<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Network:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Network:'''
|  
|  
* eth0/ens10f0np0, Ethernet, configured rate: 10&nbsp;Gbps, model: Broadcom Inc. and subsidiaries BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller, driver: bnxt_en<br />
* eth0, Ethernet, configured rate: 1&nbsp;Gbps, model: N/A, driver: nvethernet - no KaVLAN<br/>
* <span style="color:grey">eth1/ens10f1np1, Ethernet, model: Broadcom Inc. and subsidiaries BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller, driver: bnxt_en - unavailable for experiment</span><br/>
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''GPU:'''
| Nvidia AGX&nbsp;Xavier&nbsp;(30&nbsp;GiB)<br>Compute&nbsp;capability:&nbsp;7.2<br/>
|-
|-
|}''<small>Last generated from the Grid'5000 Reference API on 2023-12-21 ([https://gitlab.inria.fr/grid5000/reference-repository/commit/7cb5ee1926 commit 7cb5ee1926])</small>''
|}''<small>Last generated from the Grid'5000 Reference API on 2024-01-18 ([https://gitlab.inria.fr/grid5000/reference-repository/commit/b6ecd12701 commit b6ecd12701])</small>''

Revision as of 10:14, 18 January 2024

See also: Network topology for Toulouse

Summary

  • 2 clusters
  • 22 nodes
  • 416 CPU cores
  • 12 GPUs
  • 6144 GPUs cores
  • 2.88 TiB RAM
  • 22 SSDs and 0 HDDs on nodes (total: 33.61 TB)
  • 36.6 TFLOPS (excluding GPUs)

Clusters

Cluster Access Condition Date of arrival Nodes CPU Memory Storage Network Accelerators
# Name Cores Architecture
estats testing queue,
exotic job type
2023-06-13 12 1 Nvidia Carmel 8 cores/CPU aarch64 32 GiB 2.0 TB SSD 1 Gbps  Nvidia AGX Xavier (30 GiB)
montcalm 2022-12-01 10 2 Intel Xeon Silver 4314 16 cores/CPU x86_64 256 GiB 960 GB SSD 10 Gbps 

**: crossed GPUs are not supported by Grid'5000 default environments

Clusters in the default queue

montcalm

10 nodes, 20 cpus, 320 cores (json)

Reservation example:

Terminal.png ftoulouse:
oarsub -p montcalm -I
Model: HPE Proliant DL360 Gen10+
Date of arrival: 2022-12-01
CPU: Intel Xeon Silver 4314 (Ice Lake), x86_64, 2.40GHz, 2 CPUs/node, 16 cores/CPU
Memory: 256 GiB
Storage: disk0, 960 GB SSD SATA HP VK000960GXAWL (dev: /dev/disk0, by-path: /dev/disk/by-path/pci-0000:00:17.0-ata-1) (primary disk)
Network:
  • eth0/ens10f0np0, Ethernet, configured rate: 10 Gbps, model: Broadcom Inc. and subsidiaries BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller, driver: bnxt_en
  • eth1/ens10f1np1, Ethernet, model: Broadcom Inc. and subsidiaries BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller, driver: bnxt_en - unavailable for experiment

Clusters in the testing queue

estats

12 nodes, 12 cpus, 96 cores (json)

Reservation example:

Terminal.png ftoulouse:
oarsub -q testing -t exotic -p estats -I
Access condition: testing queue, exotic job type
Model: Connecttech/Nvidia Jetson AGX Xavier
Date of arrival: 2023-06-13
CPU: Nvidia Carmel (Carmel), aarch64, 1 CPU/node, 8 cores/CPU
Memory: 32 GiB
Storage: disk0, 2.0 TB SSD NVME Samsung Samsung SSD 970 EVO Plus 2TB (dev: /dev/disk0, by-path: /dev/disk/by-path/platform-14180000.pcie-pci-0000:01:00.0-nvme-1) (primary disk)
Network:
  • eth0, Ethernet, configured rate: 1 Gbps, model: N/A, driver: nvethernet - no KaVLAN
GPU: Nvidia AGX Xavier (30 GiB)
Compute capability: 7.2

Last generated from the Grid'5000 Reference API on 2024-01-18 (commit b6ecd12701)