HPC AI Systems Administrator
USD 111K-211K Senior-level Full Time
Tasks
- Administer virtualized lab infrastructure
- Configure root slots for OS images
- Coordinate with remote administrators vendors and partners
- Design and operate HPC AI lab environments
- Design fault tolerant highly available environments
- Design lab network layouts and operational policies
- Ensure lab stability availability and resource utilization
- Escalate troubleshoot complex HPC AI lab issues
- Image configure and upgrade Linux servers
- Install configure and manage high performance storage
- Install configure and support job scheduling tools
- Mentor junior system administrators and lab staff
- Oversee lab transitions and facility moves
- Perform capacity planning and expansion recommendations
- Prioritize and coordinate lab work requests
- Provide advanced hardware diagnostics and troubleshoot power CPU GPU issues
Perks/Benefits
- N/A
Skills/Tech-stack
CPU troubleshooting | Capacity Planning | Cluster administration | Cybersecurity | Cybersecurity Standards | Fault Tolerance | Firmware Updates | GPU troubleshooting | HPC cluster | HPC cluster administration | Hardware Diagnostics | High Availability | High Performance | High-Performance Computing | Job Scheduling | Linux | Networking | Performance Computing | Power troubleshooting | Resource Management | Server Administration | Storage management | Switch configuration | Virtual Server | Virtual Server Administration | Virtualization
Education
Bachelor of Engineering | Bachelor of Science | Master of Science
Regions
Countries
States
Cities
Related jobs
-
Linux System Administrator USD 86K-130KACAS | ATO | AWS GovCloud | Ansible | BashAfter-hours supportMid-level Full TimeMCB Quantico, VA, United States1h ago
-
Systems Administrator Level 2 USD 98K-135KClient Help Desk | Client-Server | Database Administration | Dispatch systems | Help deskSenior-level Full TimeAnnapolis Junction, United States6h ago
-
Senior-level Full TimeMegaCenter, MD6h ago
-
Mid-level Full TimeMegaCenter, MD6h ago
-
API Management | AWS | Azure | Azure DevOps | Batch jobsOnsite work requirementMid-level Full TimeTallahassee, FL7h ago
-
Archer GRC Engineer I USD 40K-58KBash | DHCP | DNS | Database systems | Incident ManagementFlexible onsite schedule | Hybrid work model | Stable compliant remote workstation requirementsEntry-level Full Time399 Revolution Drive Somerville (Assembly Row …19h ago
-
IT Manager & Salesforce System Administrator USD 80K-100KAccess Management | Amazon Web Services | Cloud Computing | Cloud platform | Container Technologies401k company match | Discounted Employee Services | Discounted products | Medical/Dental/Vision insurance | Paid time offMid-level Full TimeBurlingame, CA R1d ago
-
Senior Linux System Administrator USD 128K-214KAnsible | Capacity Planning | Configuration Management | Hardware Troubleshooting | LinuxHealth insurance | Holiday pay | Learning and development | Life insurance | Long-term disabilitySenior-level Full TimeUSA-MD-Laurel2d ago
-
Systems Administrator, Junior USD 70K-90KACAS | Change Management | DISA STIG | Desktop infrastructure | EMASSEntry-level Full TimeSan Diego, CA, US2d ago
-
Systems Administrator IV USD 118K-188KCapacity Planning | Citrix | NAS | Patching | Performance optimizationHybrid workMid-level Full TimeBoston2d ago
-
Systems Administrator - Infrastructure USD 56K-74KAnsible | Backup and Recovery | Bash | Disaster Recovery | High AvailabilityAfter-hours support | On-call support | Onsite work scheduleMid-level Full TimeBeachwood, OH, United States3d ago
-
Mid-level Full TimeMegaCenter, MD3d ago
-
Mid-level Full TimeUDC, UT3d ago
-
Mid-level Full TimeOPS 2A, MD3d ago
-
Mid-level Full TimeOPS 2A, MD3d ago
-
Mid-level Full TimeOPS 2A, MD3d ago
-
Mid-level Full TimeOPS 2A, MD3d ago
-
Senior-level Full TimeMegaCenter, MD3d ago
-
Senior-level Full TimeMegaCenter, MD3d ago
-
System Administrator II USD 90K-115KAnsible | Bash | CentOS | Confluence | GitActive security clearance | Full Scope PolygraphMid-level Full TimeMegaCenter, MD3d ago
-
Mid-level Full TimeMegaCenter, MD3d ago
-
Mid-level Full TimeMegaCenter, MD3d ago
-
Mid-level Full TimeMegaCenter, MD3d ago
-
Mid-level Full TimeMegaCenter, MD3d ago
-
Mid-level Full TimeMegaCenter, MD3d ago