HPC AI Systems Administrator
San Jose, California, United States of America
USD 111K-211K Senior-level Full Time
Tasks
- Administer virtualized lab infrastructure
- Configure manage root slots for cluster provisioning
- Coordinate with remote administrators vendors and partners
- Design and operate HPC AI lab environments
- Design fault tolerant highly available environments
- Design lab network and security policies
- Image configure upgrade Linux servers
- Install configure and manage high performance storage
- Install configure and support job scheduling and resource management
- Mentor junior system administrators
- Oversee lab transitions and facility moves
- Perform hardware diagnostics and resolve power CPU GPU issues
- Prioritize coordinate lab requests
- Recommend lab resource usage and capacity planning
- Troubleshoot lab issues and escalate failures
Perks/Benefits
Skills/Tech-stack
Capacity Planning | Cybersecurity | Fault-tolerant | Fault-tolerant systems | Firmware | HPC cluster | Hardware Diagnostics | High Availability | High Performance | High-Performance Storage | Job Scheduling | Linux | Lustre | Networking | Resource Management | Server Administration | Virtual Server | Virtual Server Administration | Virtualization
Education
Related jobs
-
System Administrator, Lab Operation Support USD 120K-140KAWS | Access Control | Azure | Backup and Recovery | FirewallOn-call support rotation | Travel to other sitesMid-level Full TimeSouth San Francisco, California, United States12h ago
-
Senior Systems Administrator USD 146K-194KAccess Management | Automation | Configuration Management | Confluence | FirewallSenior-level Full TimeCosta Mesa, California, United States12h ago
-
IT Systems Administrator, Launch USD 95K-115KActive Directory | Ansible | Automation | Bash | Configuration ManagementExtended hours | Night shift | On-call rotation | Weekend workMid-level Full TimeCape Canaveral, FL13h ago
-
Senior-level Full TimeVienna, VA15h ago
-
Systems Administrator (5249) USD 73K-121KActive Directory | Containers | Continuous Monitoring | DOD 8140 | Disaster RecoveryHealth insurance | Paid leave | RetirementMid-level Full TimePatuxent River, MD16h ago
-
Network Administrator (4962) USD 86K-143KAccess Control | Cisco ISE | Cisco Prime | Cisco Routers | Cisco SDAHealth insurance | Paid leave | RetirementMid-level Full TimePatuxent River, MD17h ago
-
Cloud Infrastructure Administrator USD 32K-52KActive Directory | Azure | Azure Active Directory | Azure Functions | Azure Storage401k match | Dental insurance | Direct Deposit | Disability insurance | Employee stock ownership planMid-level Full TimeAbingdon, VA, US20h ago
-
Data Guard | Data encryption | Database Scalability | Effort Estimation | Enterprise Linux24x7 on-call rotationSenior-level ContractCLEMSON, United States22h ago
-
Mid Linux Systems Administrator USD 114KAnsible | Ansible Playbook | Bash | Bash Scripting | Capacity Planning401k match | Dental insurance | Flexible spending account | HSA or FSA or DFSA | Health insuranceMid-level Full TimeSilver Spring, Maryland, United States23h ago
-
Oracle EBS Apps OCI Database Administrator USD 92K-199KADOP | Adpatch | Ansible | Application Server | AutomationDental insurance | Health insurance | Paid Holidays | Paid life insurance | Paid time offSenior-level Full TimeRemote (United States) R1d ago
-
Windows System Administrator (TS/SCI) USD 150K-170KActive Directory | ESXi | Group Policy | Group Policy Objects | Linux system401k match | Dental insurance | Flexible spending accounts | Health insurance | Paid HolidaysMid-level Full TimeFort Belvoir, VA, US1d ago
-
IT Systems Administrator USD 73K-125KAzure AD | Backup and Recovery | Compliance | Conditional Access | Data RetentionSenior-level Full TimeWakefield, MA, United States1d ago
-
Sharepoint System Administrator USD 85K-100KBI Report Server | Firewall | Microsoft SharePoint | Networking | Power BIActive security clearance supportMid-level Full TimeAlbuquerque, NM, United States1d ago
-
Mid-level Full TimeUnited States1d ago
-
IT Procurement & Operations Administrator USD 72K-88KAsset Management | Budget Management | Contract Management | IT Asset Management | Inventory ManagementMid-level Full TimeBrooklyn, NY, United States1d ago
-
Mid-level Full TimeBrooklyn, NY, United States1d ago
-
IT Security Administrator USD 38K-56K800-53 | Application Firewall | Authentication | Authorization | COBITMedical plans | Tuition supportMid-level Full TimeBOISE, ID, United States1d ago
-
ProjectWise Administrator USD 120K-170KActive Directory | Database Administration | Design integration | Document Properties | File ServersSenior-level Full TimePhoenix, AZ, United States1d ago
-
ProjectWise Administrator USD 120K-170KActive Directory | Autodesk applications | BI Automation | Cache Server Administration | Cache serverSenior-level Full TimeGreenwood Village, CO, United States1d ago
-
IT Procurement & Operations Administrator USD 72K-88KAsset Management | Compliance Auditing | Contract Management | Information security | Inventory ManagementCarbon neutral initiativesMid-level Full TimeBrooklyn, NY, United States1d ago
-
ProjectWise Administrator USD 120K-170KActive Directory | CADD | Database Administration | Document Management | Electronic document managementSenior-level Full TimeAtlanta, GA, United States1d ago
-
Mid-level Full TimeBrooklyn, NY, United States1d ago
-
Systems Administrator USD 82K-173KAWS CLI | AWS CloudFormation | AWS Console | Agile | Amazon Web ServicesEqual opportunity workplace | On-call rotationSenior-level Full TimeChantilly, VA, United States1d ago
-
Senior Data Ops Administrator USD 120K-140KAmazon Web Services | Database Administration | Database monitoring | Database replication | Database securitySenior-level Full TimeSchrafft City Center, United States1d ago
-
Mid-level Full TimeWashington, DC (DC Metro Area), United …1d ago