ACCESS Allocated Production Compute

Infrastructure Integration Roadmap Description

Infrastructure Type(s)

This roadmap integrates a high-performance computing “HPC” cluster (https://en.wikipedia.org/wiki/High-performance_computing). Researchers typically use HPC clusters by logging in to front-end nodes where they install application software and run jobs under a batch scheduler. Science gateways, workflow engines and other software tools may perform these actions on the researchers’ behalf. Jobs may run entirely in the background reading input data and generating output data, may use interactive or steered input, and may generate live or streamed visual output.

Summary

The ACCESS program (https://access-ci.org) enables researchers and educators to gain access to advanced computing, visualization, and data resources to accomplish their research or classroom objectives using an allocations process described at https://allocations.access-ci.org/.

This roadmap details the tasks that a resource provider organization must perform to make a specific HPC cluster ACCESS allocated in production. These tasks cover all aspects of integration and operations in production, including coordination, technical integration tasks, and ongoing support activities.

Operator Information

Operators planning to integrate an HPC resource into ACCESS with this roadmap should contact ACCESS Operations to start the integration process (see details in the first task below). ACCESS Operations will assign a Concierge Integration Expert to provide personalized hands-on assistance with the integration process, schedule an online meeting between resource provider staff and ACCESS staff to launch the integration process, and provide instructions on how to track integration progress.

Operators must perform the Required Tasks below and may perform the Optional Tasks below.

See related Roadmap Task Timeline.

Support Information

Submit a Request: Resource providers needing assistance with integration tasks can submit an ACCESS Integration and Operation Support Request using instructions on this page. Pick an “ACCESS Operational Support Issues” that best matches your request, or the catch-all issue type “ACCESS-wide: Provider Integration - Infrastructure Integration and Roadmaps“.

Slack Discussion: ACCESS has a Slack channel for infrastructure operators and ACCESS to discuss integration topics. For access submit a request to be invited to the RP-ACCESS #integration-roadmaps Slack channel.

By Weekly Meetings: ACCESS holds a Zoom meeting every second Friday from 11:00 AM to Noon Central for infrastructure operators and ACCESS to discuss integration topics. To obtain Zoom coordinates submit a request for the RP and ACCESS Roadmaps Zoom meeting coordinates.

Integration Consultants: ACCESS allocated resource providers and some other types of infrastructure operators have a Concierge Integration Expert consultant available for one-on-one integration support.

Documentation:

Planning phase

  1. Coordination: ACCESS Allocated Resource Integration Coordination v1

  2. Coordination: Infrastructure Description v2

  3. Technology: Cybersecurity Requirements for RPs v1

  4. Technology: Data and Network Integration v1

Integration phase

  1. Coordination: ACCESS Allocation Policies v1

  2. Coordination: Knowledge Base v1

  3. Coordination: RP Forum Participation v1

  4. Coordination: Cybersecurity Governance Council Participation v1

  5. Technology: Resource Metrics Data Availability Assessment v1

  6. (Optional) Technology: ACCESS DNS Entries v1 (NEW)

  7. (Optional) Technology: Local Service ACCESS IAM Integration v1

  8. (Optional) Technology: ACCESS OnDemand Portal Integration v1

Operations phase

  1. Technology: Deploy Globus Endpoint v1

  2. Technology: Publish Dynamic Resource Information v2

  3. Support: Incident Response and Coordination v1

  4. Support: Ticket Handling v2

  5. Support: Operational Status Communications v1

  6. (Optional) Support: Request RP or Site Staff Allocation v1

  7. Technology: AMIE and Usage Reporting v1

  8. Technology: Performance Data reporting v1

Document Management

Status: Official

Official date: 04/24/2023

Coordinators: Jess Haney, ACCESS Operations

Last revised date: 08/22/2023

Retired date: