engineering
Posted Mar 20Data Center Operations Systems Engineer (Kansas City, MO)
at Lambdalabs
Kansas City, United StatesOn-site
Responsibilities
- - Troubleshoot hardware and software issues in some of the world’s most advanced GPU and Networking systems.
Requirements
- Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers.
- Our customers range from AI researchers to enterprises and hyperscalers.
- If you'd like to build the world's best AI cloud, join us. *Note: This position requires presence in our Kansas City, MO Data Center 5 days per week, 7/24 shift coverage
- - Work with RMA team to ensure faulty parts are returned and replacements are ordered - Follow installation standards and documentation for placement, labeling, and cabling to drive consistency and discoverability across all data centers You - Have strong past experiences with critical infrastructure systems supporting data centers, such as power distribution, air flow management, environmental monitoring, capacity planning, DCIM software, structured cabling, and cable management - Be familiar with carrier
- experience with critical infrastructure systems supporting data centers, such as power distribution, air flow management, environmental monitoring, capacity planning, DCIM software, structured cabling, and cable management -
- Experience with/or knowledge of network topology and configurations and 400gb Infiniband architectures. -
- Experience with/or knowledge of DDP or SCM cluster storage systems. - Have 3+ years working with and reporting from a ticketing systems like JIRA and Zendesk - Advanced
- experience with Linux administration -
- About Lambda - Founded in 2012, with 500+ employees, and growing fast - Our investors notably include TWG Global, US Innovative Technology Fund (USIT), Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, Gradient Ventures, Mercato Partners, SVB, 1517, and Crescent Cove - We have research papers accepted at top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG - Our
Experience
- - Are action-oriented and willingness to train junior staff on best practices - Are willing to travel for bring up of new data center locations as needed Nice to Have - Have 2+ years