HPC AI Systems Administrator
San Jose, California, United States of America
USD 111K-211K Senior-level Full Time
Tasks
- Administer Linux servers
- Administer virtual lab infrastructure
- Collaborate with vendors and partners for specialized support
- Communicate lab status risks and failures to management
- Configure firmware and switch settings
- Coordinate power CPU and GPU issue resolution
- Design fault tolerant virtual environments
- Design lab networks and operational policies
- Diagnose hardware issues
- Ensure cybersecurity and asset protection compliance
- Install configure and manage high performance storage
- Install configure and support job scheduling tools
- Manage Lustre storage performance
- Manage root slots for OS images
- Mentor junior system administrators and lab staff
- Perform capacity planning and resource recommendations
- Prioritize and coordinate lab work requests
- Provision and validate HPC clusters
- Support facility moves and infrastructure refreshes
- Troubleshoot lab escalations
Perks/Benefits
- N/A
Skills/Tech-stack
AI benchmarks | Capacity Planning | Cybersecurity | Fault Tolerance | Firmware Updates | HPC | Hardware Diagnostics | Job Scheduling | Linux | Lustre | Networking | Resource Management | Server Administration | Storage Administration | Switch configuration | Virtual Server | Virtual Server Administration | Virtualization
Related jobs
-
Systems Administrator I USD 90K-118KAWS | Active Directory | Backup and Recovery | Bash | BicepHybrid work environment | Occasional travel | On-call rotation | Scheduled maintenance windowsSenior-level Full TimeVancouver, WA, United States6h ago
-
Systems Administrator Senior - Information Assurance USD 133K-180KCISSP | Cisco | Cloud Computing | Compliance | DHCPSenior-level Full TimeDayton, OH7h ago
-
COOP Systems Administrator - Journeyman USD 104K-143KApache HTTP | Apache HTTP Server | Apache Tomcat | Backup Job Monitoring | Backup OperationsSenior-level Full TimeFAIRFAX, VA, United States8h ago
-
Cloud Administrator - Journeyman USD 108K-177KAccess Control | Alerting | Cloud Computing | Cloud Security | ComputeSenior-level Full TimeFAIRFAX, VA, United States8h ago
-
Endpoint Management Technician [Linux] - Journeyman USD 90K-138KCompliance Management | Hotfixes | Linux | MECM | Microsoft IntuneSenior-level Full TimeFAIRFAX, VA, United States8h ago
-
Network Administrator - Journeyman USD 85K-180KConfiguration Management | Cybersecurity compliance | DODIN | Firmware Updates | Network Device HealthSenior-level Full TimeFAIRFAX, VA, United States8h ago
-
Systems Administrator USD 98K-163KAnsible | Backup and Recovery | Diagnostics | IT automation | LinuxMid-level Full TimeUSA-FL-Doral11h ago
-
System Administrator (TS SCI Clearance Required) USD 86K-138KAgile | Amazon Web Services | Bitbucket | CI Polygraph | ConfluenceBenefits | Flexible work-life balance | Long term projectsMid-level Full TimeChantilly, Virginia, United States21h ago
-
Senior System Administrator USD 113K-149K800-53 | Air-gapped | Air-gapped systems | Ansible | AutomationOn-call schedule | On-site supportSenior-level Full TimeAshville, Ohio, United States22h ago
-
System Administrator USD 87K-116K800-53 | Air-gapped | Ansible | DISA STIG | Information AssuranceOn-call scheduleMid-level Full TimeAshville, Ohio, United States22h ago
-
Senior Systems Administrator USD 146K-194K800-53 | Air-gapped | Air-gapped systems | Ansible | DISA STIGSenior-level Full TimeCosta Mesa, California, United States22h ago
-
Network and Systems Engineer USD 78K-102KActive Directory | Asset Management | Backup and Recovery | Capacity Planning | Cisco CatalystCross functional project collaboration | On-site work dailyMid-level Full TimeSpringfield, TN, USA1d ago
-
Systems Administrator USD 95K-110KActive Directory | Configuration Management | Cybersecurity | Linux | Network InfrastructureClassified cleared environment | On-site workMid-level Full TimeWright Patterson Air Force Base, OH1d ago
-
Enterprise Workday Administrator USD 116K-116KAPI | Access Control | Access Management | Business Process | Business Process Configuration401-k match | Employee assistance program | Employee referral program | FSA | HSASenior-level Full TimeAustin, TX, United States1d ago
-
Enterprise Workday Administrator USD 116K-116KAccess Control | Access Management | Business Process | Business Process Configuration | Calculated Fields401k match | Employee assistance program | Employee referral program | Flexible spending account | Health savings accountSenior-level Full TimeDallas, TX, United States1d ago
-
Network & Systems Administrator USD 70K-95KAccess Points | Active Directory | DHCP | DNS | Disaster RecoveryPeriodic travelMid-level Full TimeLincoln, NE1d ago
-
System Administrator (Clearance Required) USD 100K-150K800-53 | Ansible | At Rest Encryption | Auditd | Bash401k matching | Dental insurance | Health insurance | Life insurance | Long-term disabilityMid-level Full TimeWashington, DC, United States1d ago
-
System Administrator Junior Level USD 110K-145KCapacity Planning | Certification and accreditation | Hypervisor | IA Metrics | Information Assurance401k match | Career development | Dental insurance | Federal Holidays | Health savings accountEntry-level Full TimeLaurel, MD, US1d ago
-
Systems Administrator USD 78K-100KAHV | Active Directory | Bash | Certificate Services | DHCP401k matching | Company HSA contribution | Company-Paid Holidays | Dental insurance | Employee assistance programMid-level Full TimeCoppell, Texas1d ago
-
Systems Administrator USD 90K-117KAccess Control | Active Directory | Cisco | Cybersecurity compliance | DISA STIG401k matching | Continuing education assistance | Dental insurance | Employee assistance program | Health insuranceMid-level Full TimeFort Dix, NJ1d ago
-
Senior Systems Administrator (5331) USD 105K-175KAccess Control | Account Management | Active Directory | Agile | BackupsHealth insurance | Paid leave | Retirement planSenior-level Full TimePatuxent River, MD1d ago
-
Systems Administrator USD 91K-135KActive Directory | Ansible | Backup and Recovery | Centralized Logging | Cisco IOS401k matching | Buy your own device reimbursement | Cell phone and internet reimbursement | Maternity & paternity leave | Paid time offMid-level Full TimeLexington, MA, United States1d ago
-
Azure Systems Administrator USD 75K-110KActive Directory | Azure | Azure Virtual | Azure Virtual Desktop | DNS401k matching | Career development opportunities | Flexible spending accounts | Insurance benefits | Paid HolidaysMid-level Full TimeMilan 49051d ago
-
SYSTEM ADMINISTRATOR - Enterprise Infrastructure - 5+ yrs of Experience - TS/SCI w/Poly clearance is required - ES A USD 150K-155KArchitecture Design | Architecture development | Capacity Analysis | Certification and accreditation | Information Assurance401k match | Dental insurance | Federal Holidays | Floating holidays | Life insuranceMid-level Full TimeLaurel, United States1d ago
-
Access Control | Account Management | Backup and Restore | Database Administration | Email Account ManagementSenior-level Full TimeDecatur, GA, United States1d ago