Production: Difference between revisions
No edit summary |
|||
Line 1: | Line 1: | ||
{{Portal|User}} | {{Portal|User}} | ||
{{Note|text=2025-01-30 - '''A specific documentation Web site for Abaca will go live shortly. In the meantime, specific pages for “Production” use are hosted in the Grid'5000 documentation.'''}} | |||
= Introduction = | = Introduction = | ||
{{Note|text='''Abaca is the name of Inria's national computing infrastructure dedicated to production applications.''' | |||
Abaca clusters are hosted on Inria sites alongside clusters dedicated to the Grid'5000 platform. Abaca and Grid'5000 use the same technical management tools, and the Abaca and Grid'5000 support teams work together to administer both platforms.<br> | |||
'''In the remainder of this document, “Production” refers to the use of the Abaca platform.'''}} | |||
The Abaca usage rules differ from the rest of Grid'5000. | |||
= Using | = Using Production resources = | ||
== Getting an account == | == Getting an account == | ||
Users from the ''' | Users from the '''Inria''' [https://www.inria.fr/en/inria-research-centres research centres] that want to access for a production usage must use that '''[[Special:G5KRequestAccountUMS|request form]]''' to open an account, like regular Grid'5000 users. | ||
* The following fields must be filled as follows: | * The following fields must be filled as follows: | ||
** ''Group Granting Access'' (GGA): either the group '''named after the research team''' | ** ''Group Granting Access'' (GGA): either the group '''named after the research team''' | ||
** ''Laboratory'': LORIA or IRISA | ** ''Laboratory'': the name of your Inria research center or LORIA or IRISA | ||
** ''Team'': | ** ''Team'': the name of your research team. | ||
Other users from Nancy (not belonging to the Loria laboratory) can ask to join using the '''<code>nancy-misc</code>''' Group Granting Access while other users from Rennes (not belonging to the Irisa laboratory) can ask to join using the '''<code>rennes-misc</code>''' Group Granting Access. | Other users from Nancy (not belonging to the Loria laboratory) can ask to join using the '''<code>nancy-misc</code>''' Group Granting Access while other users from Rennes (not belonging to the Irisa laboratory) can ask to join using the '''<code>rennes-misc</code>''' Group Granting Access. | ||
Line 22: | Line 25: | ||
* Users are automatically subscribed to the Grid'5000 users mailing lists: [mailto:users@lists.grid5000.fr users@lists.grid5000.fr]. This list is the user-to-user or user-to-admin communication mean to address help/support requests for Grid'5000. The technical team can be reached on [mailto:support-staff@lists.grid5000.fr support-staff@lists.grid5000.fr]. | * Users are automatically subscribed to the Grid'5000 users mailing lists: [mailto:users@lists.grid5000.fr users@lists.grid5000.fr]. This list is the user-to-user or user-to-admin communication mean to address help/support requests for Grid'5000. The technical team can be reached on [mailto:support-staff@lists.grid5000.fr support-staff@lists.grid5000.fr]. | ||
== | == Visualizing resources == | ||
== Using | {{Note|text='''At that date (2025-02-01), only the Nancy, Rennes, Grenoble and Sophia sites host clusters Production use (Abaca)'''. }} | ||
See [https://api.grid5000.fr/explorer/hardware/ Hardware] to learn about the site's resources and your priority access to resources. | |||
== Using resources == | |||
The Production usage rules differ from the rest of Grid'5000: | |||
* Advance reservations (<code>oarsub -r</code>) are not allowed (to avoid fragmentation). Only submissions (and reservations that start immediately) are allowed. | |||
* All Grid'5000 users can use Production nodes (provided they meet the conditions stated in [[Grid5000:UsagePolicy]]), but it is expected that users will use their local Production resources in priority, and mostly use those resources for tasks that require Grid'5000 features. | |||
To access production resources, you need to submit jobs to the production queue using the command <code>-q production</code>. Job submissions in the production queue are prioritized based on who funded the material. There are four levels of priority, each with a maximum job duration: | To access production resources, you need to submit jobs to the production queue using the command <code>-q production</code>. Job submissions in the production queue are prioritized based on who funded the material. There are four levels of priority, each with a maximum job duration: | ||
* '''p1''' -- 168h (one week) | * '''p1''' -- 168h (one week) | ||
Line 37: | Line 44: | ||
* You may also have access to the clusters on [[Production#Can_I_use_besteffort_jobs_in_production_?|besteffort]]. | * You may also have access to the clusters on [[Production#Can_I_use_besteffort_jobs_in_production_?|besteffort]]. | ||
'''You can check your priority level for any cluster using''' https://api.grid5000.fr/explorer. | |||
<br> | |||
{{Note|text=Moreover, with '''p1''' priority, user can submit advanced reservation. More information about that in the [[Advanced_OAR#Batch_jobs_vs._advance_reservation_jobs|Advanced OAR Page]]. For example, to reserve one week from now: {{Term|location=fnancy|cmd=<code class="command">oarsub</code> <code>-q p1</code> <code>-r "$(date +'%F %T' --date='+1 week')"</code>}} | {{Note|text=Moreover, with '''p1''' priority, user can submit advanced reservation. More information about that in the [[Advanced_OAR#Batch_jobs_vs._advance_reservation_jobs|Advanced OAR Page]]. For example, to reserve one week from now: {{Term|location=fnancy|cmd=<code class="command">oarsub</code> <code>-q p1</code> <code>-r "$(date +'%F %T' --date='+1 week')"</code>}} | ||
Line 42: | Line 51: | ||
}} | }} | ||
{{Warning|text=These limits '''DO NOT''' replace the [[Production#I_submitted_a_job,_there_are_free_resources,_but_my_job_doesn't_start_as_expected!|maximum walltime per node]] which are still in effects.}} | {{Warning|text=These limits '''DO NOT''' replace the [[Production:FAQ#I_submitted_a_job,_there_are_free_resources,_but_my_job_doesn't_start_as_expected!|maximum walltime per node]] which are still in effects.}} | ||
{{Note|text=As of today, the resources explorer only shows basic information. Additional information will be added in the near future.}} | {{Note|text=As of today, the resources explorer only shows basic information. Additional information will be added in the near future.}} | ||
Line 51: | Line 58: | ||
{{Term|location=fnancy|cmd=<code class="command">oarsub</code> <code>-q production</code> <code>-I</code>}} | {{Term|location=fnancy|cmd=<code class="command">oarsub</code> <code>-q production</code> <code>-I</code>}} | ||
''Using the command above will generally place your job at the lowest priority to allow usage of all clusters, even those where your priority is '''p4'''.'' | ''Using the command above will generally place your job at the lowest priority to allow usage of all clusters, even those where your priority is '''p4'''.'' | ||
When you specify a cluster, your job will be set to your highest priority level for that cluster: | When you specify a cluster, your job will be set to your highest priority level for that cluster: | ||
{{Term|location=fnancy|cmd=<code class="command">oarsub</code> <code>-q production</code> <code class="replace">-p grele</code> <code>-I</code>}} | {{Term|location=fnancy|cmd=<code class="command">oarsub</code> <code>-q production</code> <code class="replace">-p grele</code> <code>-I</code>}} | ||
You can also limit a job submission to a cluster at a specific priority level using <code>-q</code><code class="replace">PRIORITY LEVEL</code>: | You can also limit a job submission to a cluster at a specific priority level using <code>-q</code><code class="replace">PRIORITY LEVEL</code>: | ||
Line 62: | Line 67: | ||
== Dashboards and status pages == | == Dashboards and status pages == | ||
* [https://www.grid5000.fr/status/ planned and ongoing maintenances, events and issues on | * [https://www.grid5000.fr/status/ planned and ongoing maintenances, events and issues on Abaca or Grid'5000 ] | ||
== Resources reservations (OAR) status == | |||
== | {| | ||
|bgcolor="#aaaaaa" colspan="10"| | |||
'''Drawgantt''' ''(past, current and future OAR jobs scheduling)'' | |||
|- | |||
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | |||
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-prod/ '''Grenoble nodes (production)''']<br> | |||
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | |||
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-prod/ '''Nancy nodes (production)''']<br> | |||
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | |||
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-prod/ '''Rennes nodes (production)''']<br> | |||
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | |||
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg-prod/ '''Sophia nodes (production)''']<br> | |||
|- | |||
|bgcolor="#aaaaaa" colspan="10"| | |||
'''Monika''' ''(current placement and queued jobs status)'' | |||
|- | |||
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | |||
[https://intranet.grid5000.fr/oar/Grenoble/monika-prod.cgi '''Grenoble (production)'''] | |||
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | |||
[https://intranet.grid5000.fr/oar/Nancy/monika-prod.cgi '''Nancy (production)'''] | |||
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | |||
[https://intranet.grid5000.fr/oar/Rennes/monika-prod.cgi '''Rennes (production)'''] | |||
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | |||
[https://intranet.grid5000.fr/oar/Sophia/monika-prod.cgi '''Sophia (production)'''] | |||
|} | |||
== Learning to use Production == | |||
Refer to the [[Production:Getting Started]] Production tutorial (derived from [[Getting Started]] Grid'5000 tutorial). | |||
== | = Information and support = | ||
{{Note|text=For the time being, access to support is common to both Abaca and Grid'5000....}} | |||
{{ | |||
Before asking for support, you're advised to verify your issue is not documented somewhere on the Grid'5000 website. In particular, you should check: | |||
* the [[Production:Getting_Started]], for general usage | |||
* the [[Production:FAQ]] | |||
* the [https://www.grid5000.fr/status/ events status page], for ongoing maintenances or incidents | |||
You may contact the Support staff directly by sending an e-mail to support-staff@lists.grid5000.fr. | |||
Latest revision as of 18:29, 5 February 2025
Note | |
---|---|
2025-01-30 - A specific documentation Web site for Abaca will go live shortly. In the meantime, specific pages for “Production” use are hosted in the Grid'5000 documentation. |
Introduction
The Abaca usage rules differ from the rest of Grid'5000.
Using Production resources
Getting an account
Users from the Inria research centres that want to access for a production usage must use that request form to open an account, like regular Grid'5000 users.
- The following fields must be filled as follows:
- Group Granting Access (GGA): either the group named after the research team
- Laboratory: the name of your Inria research center or LORIA or IRISA
- Team: the name of your research team.
Other users from Nancy (not belonging to the Loria laboratory) can ask to join using the nancy-misc
Group Granting Access while other users from Rennes (not belonging to the Irisa laboratory) can ask to join using the rennes-misc
Group Granting Access.
- Users are automatically subscribed to the Grid'5000 users mailing lists: users@lists.grid5000.fr. This list is the user-to-user or user-to-admin communication mean to address help/support requests for Grid'5000. The technical team can be reached on support-staff@lists.grid5000.fr.
Visualizing resources
Note | |
---|---|
At that date (2025-02-01), only the Nancy, Rennes, Grenoble and Sophia sites host clusters Production use (Abaca). |
See Hardware to learn about the site's resources and your priority access to resources.
Using resources
The Production usage rules differ from the rest of Grid'5000:
- Advance reservations (
oarsub -r
) are not allowed (to avoid fragmentation). Only submissions (and reservations that start immediately) are allowed. - All Grid'5000 users can use Production nodes (provided they meet the conditions stated in Grid5000:UsagePolicy), but it is expected that users will use their local Production resources in priority, and mostly use those resources for tasks that require Grid'5000 features.
To access production resources, you need to submit jobs to the production queue using the command -q production
. Job submissions in the production queue are prioritized based on who funded the material. There are four levels of priority, each with a maximum job duration:
- p1 -- 168h (one week)
- p2 -- 96h (four days)
- p3 -- 48h (two days)
- p4 -- 24h (one day)
- You may also have access to the clusters on besteffort.
You can check your priority level for any cluster using https://api.grid5000.fr/explorer.
Note | |
---|---|
Moreover, with p1 priority, user can submit advanced reservation. More information about that in the Advanced OAR Page. For example, to reserve one week from now: p1 priority level also allow to extend the duration of a job. The extension is only apply 24h before the end of the job and cannot be longer than 168h. More information about this feature can be found also on the Advance Oar Page. |
Warning | |
---|---|
These limits DO NOT replace the maximum walltime per node which are still in effects. |
Note | |
---|---|
As of today, the resources explorer only shows basic information. Additional information will be added in the near future. |
When submitting a job, by default, you will be placed at the highest priority level that allows you to maximize resources:
Using the command above will generally place your job at the lowest priority to allow usage of all clusters, even those where your priority is p4.
When you specify a cluster, your job will be set to your highest priority level for that cluster:
You can also limit a job submission to a cluster at a specific priority level using -q
PRIORITY LEVEL
:
Dashboards and status pages
Resources reservations (OAR) status
Drawgantt (past, current and future OAR jobs scheduling) | |||||||||
Monika (current placement and queued jobs status) | |||||||||
Learning to use Production
Refer to the Production:Getting Started Production tutorial (derived from Getting Started Grid'5000 tutorial).
Information and support
Before asking for support, you're advised to verify your issue is not documented somewhere on the Grid'5000 website. In particular, you should check:
- the Production:Getting_Started, for general usage
- the Production:FAQ
- the events status page, for ongoing maintenances or incidents
You may contact the Support staff directly by sending an e-mail to support-staff@lists.grid5000.fr.