Toulouse:Hardware: Difference between revisions

From Grid5000
No edit summary
No edit summary
 
(8 intermediate revisions by 5 users not shown)
Line 1: Line 1:
__NOTOC__
__NOEDITSECTION__
__NOEDITSECTION__
{{Portal|User}}
{{Portal|User}}
<div class="sitelink">Hardware: [[Hardware|Global]] | [[Grenoble:Hardware|Grenoble]] | [[Lille:Hardware|Lille]] | [[Luxembourg:Hardware|Luxembourg]] | [[Lyon:Hardware|Lyon]] | [[Nancy:Hardware|Nancy]] | [[Nantes:Hardware|Nantes]] | [[Rennes:Hardware|Rennes]] | [[Sophia:Hardware|Sophia]] | [[Toulouse:Hardware|Toulouse]]</div>
<div class="sitelink">Hardware: [[Hardware|Global]] | [[Grenoble:Hardware|Grenoble]] | [[Lille:Hardware|Lille]] | [[Luxembourg:Hardware|Luxembourg]] | [[Lyon:Hardware|Lyon]] | [[Nancy:Hardware|Nancy]] | [[Nantes:Hardware|Nantes]] | [[Rennes:Hardware|Rennes]] | [[Sophia:Hardware|Sophia]] | [[Strasbourg:Hardware|Strasbourg]] | [[Toulouse:Hardware|Toulouse]]</div>
'''See also:''' [[Toulouse:Network|Network topology for Toulouse]]
'''See also:''' [[Toulouse:Network|Network topology for Toulouse]]
= Summary =
= Summary =
Line 14: Line 13:
* 36.6 TFLOPS (excluding GPUs)
* 36.6 TFLOPS (excluding GPUs)


= Clusters =
= Clusters summary =
{|class="wikitable sortable"
{|class="wikitable sortable"
|-
|-
Line 34: Line 33:
|-
|-


|[[#estats|estats]]||<b>testing</b>&nbsp;queue,<br/><b>[[Exotic#Toulouse:_estats|exotic]]</b>&nbsp;job&nbsp;type||2023-06-13||2022-12-01||12||1||Nvidia Carmel||8&nbsp;cores/CPU||aarch64||32&nbsp;GiB||data-sort-value="1863"|<b>2.0&nbsp;TB&nbsp;SSD</b>||data-sort-value="1000"|1&nbsp;Gbps&nbsp;||Nvidia AGX&nbsp;Xavier&nbsp;(30&nbsp;GiB)
|[[#estats|estats]]||<b>[[Exotic#Toulouse:_estats|exotic]]</b>&nbsp;job&nbsp;type||2023-06-13||2022-12-01||12||1||Nvidia Carmel||8&nbsp;cores/CPU||aarch64||data-sort-value="34359738368"|32&nbsp;GiB||data-sort-value="1863"|<b>2.0&nbsp;TB&nbsp;SSD</b>||data-sort-value="1000"|1&nbsp;Gbps&nbsp;||Nvidia AGX&nbsp;Xavier&nbsp;(30&nbsp;GiB)
|-
|-
|[[#montcalm|montcalm]]||||2022-12-01||2022-11-06||10||2||Intel Xeon Silver 4314||16&nbsp;cores/CPU||x86_64||256&nbsp;GiB||data-sort-value="894"|<b>960&nbsp;GB&nbsp;SSD</b>||data-sort-value="10000"|10&nbsp;Gbps&nbsp;||
|[[#montcalm|montcalm]]||||2022-12-01||2022-11-06||10||2||Intel Xeon Silver 4314||16&nbsp;cores/CPU||x86_64||data-sort-value="274877906944"|256&nbsp;GiB||data-sort-value="894"|<b>960&nbsp;GB&nbsp;SSD</b>||data-sort-value="10000"|10&nbsp;Gbps&nbsp;||
|-
|-
|}
|}
Line 42: Line 41:
= Clusters in the [https://intranet.grid5000.fr/oar/Toulouse/drawgantt-svg/ default queue] =
= Clusters in the [https://intranet.grid5000.fr/oar/Toulouse/drawgantt-svg/ default queue] =


== [https://intranet.grid5000.fr/oar/Toulouse/drawgantt-svg/?filter=montcalm%20only montcalm] ==
== [https://intranet.grid5000.fr/oar/Toulouse/drawgantt-svg/?filter=estats%20only estats] ==


'''10 nodes, 20 cpus, 320 cores''' ([https://public-api.grid5000.fr/stable/sites/toulouse/clusters/montcalm/nodes.json?pretty=1 json])
'''12 nodes, 12 cpus, 96 cores''' ([https://public-api.grid5000.fr/stable/sites/toulouse/clusters/estats/nodes.json?pretty=1 json])


'''Reservation example:'''
'''Reservation example:'''


{{Term|location=ftoulouse|cmd=<code class="command">oarsub</code> <code class="env">-p montcalm</code> <code>-I</code>}}
{{Term|location=ftoulouse|cmd=<code class="command">oarsub</code> <code class="replace">-t exotic</code> <code class="env">-p estats</code> <code>-I</code>}}


{|
{|
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Access condition:'''
| [[Exotic#Toulouse:_estats|exotic]] job type<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Model:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Model:'''
| HPE Proliant DL360 Gen10+<br/>
| Connecttech/Nvidia Jetson AGX Xavier<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Manufacturing date:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Manufacturing date:'''
| 2022-11-06<br/>
| 2022-12-01<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Date of arrival:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Date of arrival:'''
| 2022-12-01<br/>
| 2023-06-13<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''CPU:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''CPU:'''
| Intel Xeon Silver 4314 (Ice Lake), x86_64, 2.40GHz, 2&nbsp;CPUs/node, 16&nbsp;cores/CPU<br/>
| Nvidia Carmel (Carmel), aarch64, 1&nbsp;CPU/node, 8&nbsp;cores/CPU<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Memory:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Memory:'''
| 256&nbsp;GiB<br/>
| 32&nbsp;GiB<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Storage:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Storage:'''
|  disk0, 960&nbsp;GB SSD SATA HP VK000960GXAWL (dev: <code class="file">/dev/disk0</code>, by-path: <code class="file">/dev/disk/by-path/pci-0000:00:17.0-ata-1</code>)  (primary disk)<br/>
|  disk0, 2.0&nbsp;TB SSD NVME Samsung Samsung SSD 970 EVO Plus 2TB (dev: <code class="file">/dev/disk0</code>)  (primary disk)<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Network:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Network:'''
|  
|  
* eth0/ens10f0np0, Ethernet, configured rate: 10&nbsp;Gbps, model: Broadcom Inc. and subsidiaries BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller, driver: bnxt_en<br />
* eth0, Ethernet, configured rate: 1&nbsp;Gbps, model: N/A, driver: nvethernet - no KaVLAN<br/>
* <span style="color:grey">eth1/ens10f1np1, Ethernet, model: Broadcom Inc. and subsidiaries BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller, driver: bnxt_en - unavailable for experiment</span><br/>
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''GPU:'''
| Nvidia AGX&nbsp;Xavier&nbsp;(30&nbsp;GiB)<br>Compute&nbsp;capability:&nbsp;7.2<br/>
|-
|-
|}
|}


This cluster is defined as exotic: please read the '''[[Exotic#Toulouse:_montcalm|exotic]]''' page for more information.
'''Note:''' This cluster is defined as exotic. Please read the '''[[Exotic#Toulouse:_estats|exotic]]''' page for more information.<br/>


= Clusters in the testing queue =
== [https://intranet.grid5000.fr/oar/Toulouse/drawgantt-svg/?filter=montcalm%20only montcalm] ==


== [https://intranet.grid5000.fr/oar/Toulouse/drawgantt-svg/?filter=estats%20only estats] ==
'''10 nodes, 20 cpus, 320 cores''' ([https://public-api.grid5000.fr/stable/sites/toulouse/clusters/montcalm/nodes.json?pretty=1 json])
 
'''12 nodes, 12 cpus, 96 cores''' ([https://public-api.grid5000.fr/stable/sites/toulouse/clusters/estats/nodes.json?pretty=1 json])


'''Reservation example:'''
'''Reservation example:'''


{{Term|location=ftoulouse|cmd=<code class="command">oarsub</code> <code class="replace">-q testing</code> <code class="replace">-t exotic</code> <code class="env">-p estats</code> <code>-I</code>}}
{{Term|location=ftoulouse|cmd=<code class="command">oarsub</code> <code class="env">-p montcalm</code> <code>-I</code>}}


{|
{|
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Access condition:'''
| testing queue, exotic job type<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Model:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Model:'''
| Connecttech/Nvidia Jetson AGX Xavier<br/>
| HPE Proliant DL360 Gen10+<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Manufacturing date:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Manufacturing date:'''
| 2022-12-01<br/>
| 2022-11-06<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Date of arrival:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Date of arrival:'''
| 2023-06-13<br/>
| 2022-12-01<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''CPU:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''CPU:'''
| Nvidia Carmel (Carmel), aarch64, 1&nbsp;CPU/node, 8&nbsp;cores/CPU<br/>
| Intel Xeon Silver 4314 (Ice Lake-SP), x86_64, 2.40GHz, 2&nbsp;CPUs/node, 16&nbsp;cores/CPU<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Memory:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Memory:'''
| 32&nbsp;GiB<br/>
| 256&nbsp;GiB<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Storage:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Storage:'''
|  disk0, 2.0&nbsp;TB SSD NVME Samsung Samsung SSD 970 EVO Plus 2TB (dev: <code class="file">/dev/disk0</code>, by-path: <code class="file">/dev/disk/by-path/platform-14180000.pcie-pci-0000:01:00.0-nvme-1</code>)  (primary disk)<br/>
|  disk0, 960&nbsp;GB SSD SATA HP VK000960GXAWL (dev: <code class="file">/dev/disk0</code>)  (primary disk)<br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Network:'''
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''Network:'''
|  
|  
* eth0, Ethernet, configured rate: 1&nbsp;Gbps, model: N/A, driver: nvethernet - no KaVLAN<br/>
* eth0/ens10f0np0, Ethernet, configured rate: 10&nbsp;Gbps, model: Broadcom Inc. and subsidiaries BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller, driver: bnxt_en<br />
* <span style="color:grey">eth1/ens10f1np1, Ethernet, model: Broadcom Inc. and subsidiaries BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller, driver: bnxt_en - unavailable for experiment</span><br/>
|-
|-
| valign="top" style="background-color: #f9f9f9; padding: 0px 10px 0px 3px;" |'''GPU:'''
|}''<small>Last generated from the Grid'5000 Reference API on 2024-09-11 ([https://gitlab.inria.fr/grid5000/reference-repository/commit/bfd60eb378 commit bfd60eb378])</small>''
| Nvidia AGX&nbsp;Xavier&nbsp;(30&nbsp;GiB)<br>Compute&nbsp;capability:&nbsp;7.2<br/>
|-
|}
 
This cluster is defined as exotic: please read the '''[[Exotic#Toulouse:_estats|exotic]]''' page for more information.''<small>Last generated from the Grid'5000 Reference API on 2024-02-27 ([https://gitlab.inria.fr/grid5000/reference-repository/commit/c32e945514 commit c32e945514])</small>''

Latest revision as of 15:27, 11 September 2024

See also: Network topology for Toulouse

Summary

  • 2 clusters
  • 22 nodes
  • 416 CPU cores
  • 12 GPUs
  • 6144 GPU cores
  • 2.88 TiB RAM
  • 22 SSDs and 0 HDDs on nodes (total: 33.61 TB)
  • 36.6 TFLOPS (excluding GPUs)
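
The summary figures above can be cross-checked against the per-cluster values listed on this page. The following is a minimal sketch: the dictionary layout and the function name `site_totals` are illustrative, and the value of one GPU per estats node is inferred from the page (12 GPUs for 12 nodes), not stated explicitly. Disk sizes are the nominal ones shown in the cluster table.

```python
# Per-cluster figures copied from this page.
# "gpus_per_node" for estats is an inference (12 GPUs / 12 nodes), not a quoted value.
CLUSTERS = {
    "estats": {
        "nodes": 12, "cpus_per_node": 1, "cores_per_cpu": 8,
        "ram_gib": 32, "disk_tb": 2.0, "gpus_per_node": 1,
    },
    "montcalm": {
        "nodes": 10, "cpus_per_node": 2, "cores_per_cpu": 16,
        "ram_gib": 256, "disk_tb": 0.96, "gpus_per_node": 0,
    },
}


def site_totals(clusters):
    """Aggregate node, core, GPU, RAM and disk totals over all clusters."""
    totals = {"clusters": len(clusters), "nodes": 0, "cores": 0,
              "gpus": 0, "ram_gib": 0, "disk_tb": 0.0}
    for c in clusters.values():
        n = c["nodes"]
        totals["nodes"] += n
        totals["cores"] += n * c["cpus_per_node"] * c["cores_per_cpu"]
        totals["gpus"] += n * c["gpus_per_node"]
        totals["ram_gib"] += n * c["ram_gib"]
        totals["disk_tb"] += n * c["disk_tb"]
    return totals


if __name__ == "__main__":
    t = site_totals(CLUSTERS)
    print(t)
    print("RAM in TiB:", t["ram_gib"] / 1024)
```

With these inputs the totals come to 22 nodes, 416 cores, 12 GPUs and 2944 GiB of RAM (2.875 TiB, shown as 2.88 TiB in the summary). The nominal disk sizes add up to about 33.6 TB; the 33.61 TB in the summary presumably reflects the exact disk capacities.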

Clusters summary

{| class="wikitable sortable"
|-
! Cluster !! Access Condition !! Date of arrival !! Manufacturing date !! Nodes !! CPU # !! CPU Name !! CPU Cores !! CPU Architecture !! Memory !! Storage !! Network !! Accelerators
|-
| estats || exotic job type || 2023-06-13 || 2022-12-01 || 12 || 1 || Nvidia Carmel || 8 cores/CPU || aarch64 || 32 GiB || 2.0 TB SSD || 1 Gbps || Nvidia AGX Xavier (30 GiB)
|-
| montcalm || || 2022-12-01 || 2022-11-06 || 10 || 2 || Intel Xeon Silver 4314 || 16 cores/CPU || x86_64 || 256 GiB || 960 GB SSD || 10 Gbps ||
|}

**: crossed GPUs are not supported by Grid'5000 default environments

Clusters in the default queue

estats

12 nodes, 12 cpus, 96 cores (json)
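
The "json" link points to the cluster's node list in the Grid'5000 Reference API (`.../sites/toulouse/clusters/estats/nodes.json?pretty=1`). The sketch below shows how the "nodes, cpus, cores" line could be recomputed from such a document. The URL pattern is taken from this page; the payload layout (an `items` list whose entries carry `architecture.nb_procs` and `architecture.nb_cores`, the latter counted per node) is an assumption and should be checked against the actual API response. The function names are illustrative, and fetching is left out because access to the API may require Grid'5000 credentials.

```python
from urllib.parse import quote

API_BASE = "https://public-api.grid5000.fr/stable"


def nodes_json_url(site, cluster):
    """Build the nodes.json URL for a cluster, following the pattern used on this page."""
    return f"{API_BASE}/sites/{quote(site)}/clusters/{quote(cluster)}/nodes.json?pretty=1"


def count_nodes_cpus_cores(payload):
    """Return (nodes, cpus, cores) from a parsed nodes.json document.

    ASSUMPTION: payload["items"] is a list of nodes, each with an
    "architecture" dict holding "nb_procs" (CPUs per node) and
    "nb_cores" (cores per node). Verify against the real API output.
    """
    items = payload["items"]
    cpus = sum(node["architecture"]["nb_procs"] for node in items)
    cores = sum(node["architecture"]["nb_cores"] for node in items)
    return len(items), cpus, cores


if __name__ == "__main__":
    print(nodes_json_url("toulouse", "estats"))
    # Synthetic payload shaped like the assumed schema, mirroring estats:
    # 12 nodes, 1 CPU per node, 8 cores per node.
    fake = {"items": [{"architecture": {"nb_procs": 1, "nb_cores": 8}}
                      for _ in range(12)]}
    print(count_nodes_cpus_cores(fake))
```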

Reservation example:

ftoulouse:
oarsub -t exotic -p estats -I
Access condition: exotic job type
Model: Connecttech/Nvidia Jetson AGX Xavier
Manufacturing date: 2022-12-01
Date of arrival: 2023-06-13
CPU: Nvidia Carmel (Carmel), aarch64, 1 CPU/node, 8 cores/CPU
Memory: 32 GiB
Storage: disk0, 2.0 TB SSD NVME Samsung Samsung SSD 970 EVO Plus 2TB (dev: /dev/disk0) (primary disk)
Network:
  • eth0, Ethernet, configured rate: 1 Gbps, model: N/A, driver: nvethernet - no KaVLAN
GPU: Nvidia AGX Xavier (30 GiB)
Compute capability: 7.2

Note: This cluster is defined as exotic. Please read the exotic page for more information.
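
The reservation examples on this page differ only in the access condition: estats needs the `-t exotic` job type, while montcalm needs no extra option. A small helper can make that rule explicit when generating commands for several clusters. This is only a sketch; `build_oarsub_command` is a hypothetical name, and the only `oarsub` options used are those that appear in the examples on this page (`-t`, `-p`, `-I`).

```python
import shlex


def build_oarsub_command(cluster, job_types=(), interactive=True):
    """Assemble an oarsub command line like the reservation examples on this page.

    cluster     -- cluster name passed to -p (e.g. "estats", "montcalm")
    job_types   -- job types passed with -t (e.g. ["exotic"] for estats)
    interactive -- append -I for an interactive job
    """
    args = ["oarsub"]
    for job_type in job_types:
        args += ["-t", job_type]
    args += ["-p", cluster]
    if interactive:
        args.append("-I")
    # shlex.join quotes arguments safely for a POSIX shell (Python 3.8+).
    return shlex.join(args)


if __name__ == "__main__":
    print(build_oarsub_command("estats", job_types=["exotic"]))
    print(build_oarsub_command("montcalm"))
```

The two calls reproduce the commands shown for estats and montcalm respectively; the resulting strings are meant to be run on the Toulouse frontend (`ftoulouse`).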

montcalm

10 nodes, 20 cpus, 320 cores (json)

Reservation example:

ftoulouse:
oarsub -p montcalm -I
Model: HPE Proliant DL360 Gen10+
Manufacturing date: 2022-11-06
Date of arrival: 2022-12-01
CPU: Intel Xeon Silver 4314 (Ice Lake-SP), x86_64, 2.40GHz, 2 CPUs/node, 16 cores/CPU
Memory: 256 GiB
Storage: disk0, 960 GB SSD SATA HP VK000960GXAWL (dev: /dev/disk0) (primary disk)
Network:
  • eth0/ens10f0np0, Ethernet, configured rate: 10 Gbps, model: Broadcom Inc. and subsidiaries BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller, driver: bnxt_en
  • eth1/ens10f1np1, Ethernet, model: Broadcom Inc. and subsidiaries BCM57414 NetXtreme-E 10Gb/25Gb RDMA Ethernet Controller, driver: bnxt_en - unavailable for experiment

Last generated from the Grid'5000 Reference API on 2024-09-11 (commit bfd60eb378)