HPC AI Systems Administrator
San Jose, California, United States of America
USD 111K-211K Senior-level Full Time
Tasks
- Administer virtualized lab infrastructure
- Communicate lab risks failures and issues to management
- Configure and manage root slots for OS images
- Coordinate lab stability availability and resource utilization
- Design highly available fault tolerant lab environments
- Design lab layouts networks and operational policies
- Escalate troubleshoot complex HPC and AI lab issues
- Image configure and upgrade servers with Linux
- Install configure and manage high performance storage
- Install configure and support job scheduling and resource management tools
- Mentor junior system administrators
- Oversee lab transitions facility moves and infrastructure refresh
- Perform hardware and software installation and configuration
- Prioritize and coordinate lab work activities
- Recommend lab resource usage and capacity planning
- Run advanced hardware diagnostics
Perks/Benefits
- Career advancement programs
- Flexible work arrangements
- Health and wellbeing benefits
- Inclusion programs
- Professional development programs
Skills/Tech-stack
AI | Capacity Planning | Cybersecurity | Fault-tolerant | Fault-tolerant systems | Firmware Updates | HPC | Hardware Diagnostics | High Performance | High-Performance Storage | Job Scheduling | Linux | Lustre | Network Administration | Resource Management | Server Administration | Storage Administration | Switch configuration | Virtual Server | Virtual Server Administration | Virtualization
Related jobs
-
System Administrator (TS SCI Clearance Required) USD 86K-138KAgile | Amazon Web Services | Bitbucket | CI Polygraph | ConfluenceBenefits | Flexible work-life balance | Long term projectsMid-level Full TimeChantilly, Virginia, United States11h ago
-
Senior System Administrator USD 113K-149K800-53 | Air-gapped | Air-gapped systems | Ansible | AutomationOn-call schedule | On-site supportSenior-level Full TimeAshville, Ohio, United States12h ago
-
System Administrator USD 87K-116K800-53 | Air-gapped | Ansible | DISA STIG | Information AssuranceOn-call scheduleMid-level Full TimeAshville, Ohio, United States12h ago
-
Senior Systems Administrator USD 146K-194K800-53 | Air-gapped | Air-gapped systems | Ansible | DISA STIGSenior-level Full TimeCosta Mesa, California, United States12h ago
-
Enterprise Workday Administrator USD 116K-116KAPI | Access Control | Access Management | Business Process | Business Process Configuration401-k match | Employee assistance program | Employee referral program | FSA | HSASenior-level Full TimeAustin, TX, United States16h ago
-
Enterprise Workday Administrator USD 116K-116KAccess Control | Access Management | Business Process | Business Process Configuration | Calculated Fields401k match | Employee assistance program | Employee referral program | Flexible spending account | Health savings accountSenior-level Full TimeDallas, TX, United States16h ago
-
Network & Systems Administrator USD 70K-95KAccess Points | Active Directory | DHCP | DNS | Disaster RecoveryPeriodic travelMid-level Full TimeLincoln, NE17h ago
-
System Administrator (Clearance Required) USD 100K-150K800-53 | Ansible | At Rest Encryption | Auditd | Bash401k matching | Dental insurance | Health insurance | Life insurance | Long-term disabilityMid-level Full TimeWashington, DC, United States18h ago
-
Systems Administrator USD 90K-117KAccess Control | Active Directory | Cisco | Cybersecurity compliance | DISA STIG401k matching | Continuing education assistance | Dental insurance | Employee assistance program | Health insuranceMid-level Full TimeFort Dix, NJ21h ago
-
Senior Systems Administrator (5331) USD 105K-175KAccess Control | Account Management | Active Directory | Agile | BackupsHealth insurance | Paid leave | Retirement planSenior-level Full TimePatuxent River, MD22h ago
-
Systems Administrator USD 91K-135KActive Directory | Ansible | Backup and Recovery | Centralized Logging | Cisco IOS401k matching | Buy your own device reimbursement | Cell phone and internet reimbursement | Maternity & paternity leave | Paid time offMid-level Full TimeLexington, MA, United States22h ago
-
Azure Systems Administrator USD 75K-110KActive Directory | Azure | Azure Virtual | Azure Virtual Desktop | DNS401k matching | Career development opportunities | Flexible spending accounts | Insurance benefits | Paid HolidaysMid-level Full TimeMilan 49051d ago
-
SYSTEM ADMINISTRATOR - Enterprise Infrastructure - 5+ yrs of Experience - TS/SCI w/Poly clearance is required - ES A USD 150K-155KArchitecture Design | Architecture development | Capacity Analysis | Certification and accreditation | Information Assurance401k match | Dental insurance | Federal Holidays | Floating holidays | Life insuranceMid-level Full TimeLaurel, United States1d ago
-
Senior Linux System Administrator USD 140K-165KAPT | Ansible | AppArmor | Authentication | Backup and RecoverySenior-level Full TimeAlbuquerque, NM, United States1d ago
-
Network and Computer Systems Administrator-Linux (2943) USD 100K-138KBackup Management | LAN | Linux | Log Management | Microsoft OfficeMid-level Full TimePatuxent River, Maryland, United States1d ago
-
Senior Network Engineer Ii- Bdmtc 1656 USD 104K-138KActive Directory | Ansible | Bash | DNS | FirewallsSenior-level Full TimeButlerville, IN1d ago
-
Cloud System Administrator USD 135K-216KAWS | Amazon EC2 | Ansible | Infrastructure as Code | KubernetesBonus plan eligibility | Health insurance subsidy | Paid time offMid-level Full TimeAnnapolis Junction, MD, United States1d ago
-
Splunk Administrator USD 69K-158KAccess Control | Automation Scripting | Bash | CentOS | Disaster RecoveryMid-level Full TimeUSA, MD, Indian Head (3767 Strauss …1d ago
-
Onsite Administrator Print USD 44K-60KAgile methodology | Amazon Web Services | Auditing | Automation | Change ManagementDental insurance | Employee assistance program | Flexible spending account | Health insurance | Life insuranceMid-level Full TimeTW2PA - Teleworker/Offsite-USA-PA, United States R1d ago
-
Data Center Administrator USD 53K-89KCooling systems | Data center | Dell Server | Dell Server Hardware | HP ServerEntry-level Full TimeRemote - OH, United States R1d ago
-
Apache | Freshservice | Google Workspace | Google Workspace for Education | LinuxFlexible spending accounts | Flexible work options | Health insurance | Paid time off | Professional developmentEntry-level Full Time3401 Walnut Street B/C Wing - …1d ago
-
Staff Cyber Systems Administrator USD 161K-241KAWS | Agile | Ansible | Bash | Configuration Management9/80 work schedule | Education assistance | Paid time off | Relocation assistance | Training and developmentSenior-level Full TimeVACH06, United States1d ago
-
Systems Administrator Level 2/3 - Top Secret USD 75K-141KAccess Control | Account administration | Active Directory | Audit Logging | Backup and RecoveryCompany-Paid Holidays | Paid time off | Relocation assistanceMid-level Full TimeWAOH10, United States1d ago
-
Senior System Administrator USD 168K-420KAWS CloudFormation | AWS EC2 | AWS IAM | AWS Lambda | Active Directory401k match | Dental coverage | Flexible work-life balance | Gym reimbursement | Health savings accountSenior-level Full TimeLaurel, MD1d ago
-
Database Administrator Sr Principal USD 129K-155KAWS Cloud | AWS Cloud Computing | Ansible | Bash | Cloud Computing401k company match | Dental insurance | Flexible work schedule | Health insurance | Paid HolidaysSenior-level Full TimeUSA NC Home Office (NCHOME), United …1d ago