jobloom

JobLoom finds jobs directly from company career sites before many job boards, then routes you into detailed role pages like this one.

infrastructure

Posted Jan 29

ML Infrastructure Engineer

at Echothat

San Francisco, United StatesRemote

Requirements

  • COMPANY OVERVIEW Echo Neurotechnologies is an exciting new startup in the Brain-Computer Interface (BCI) space, driving innovation through advanced hardware engineering and AI solutions.
  • JOB SUMMARY We are seeking a Senior Machine Learning Infrastructure Engineer to join our team.
  • The person who fills this role will design, build, and scale infrastructure to power massive-scale data, modeling, and analysis platforms, playing a critical role in shaping a high-performance, production-grade ML ecosystem to support rapid experimentation with diverse datasets spanning neural signals, behavior, and more.
  • This person will have significant ownership over the ML R&D platform, working closely with domain experts to architect new cloud infrastructure, data pipelines, and modeling flows.
  • RESPONSIBILITIES - Create flexible and performant ML infrastructure - Design and build systems ML cloud infrastructure to enable massive-scale modeling and analytics - Support diverse model exploration, hyperparameter optimization, pretraining, fine-tuning, and evaluation processes - Design and optimize scalable distributed training pipelines, with support for features such model sharding, cross-GPU communication, and real-time training monitoring - Create, operate, and maintain robust ML platforms and
  • - Support ML R&D operations while preparing for eventual incorporation into product pipelines REQUIRED
  • QUALIFICATIONS - Bachelor's degree in Computer Science, Electrical Engineering, or a related technical discipline - 5+ years of industry
  • experience in software engineering, large-scale data infrastructure, or systems ML - Extensive proficiency in Python - Familiarity with PyTorch -
  • Experience working with distributed-training frameworks (e.g. FSDP, DeepSpeed, Megatron-LM, Ray, etc.) -
  • Experience building or optimizing ML training pipelines for transformers or other large neural-network models - Demonstrated ability to partner closely with research and modeling teams to productionize workflows - Excellent communication and collaboration skills to work effectively on cross-functional and interdisciplinary teams -
  • Experience having technical ownership over at least one successfully implemented collaborative project PREFERRED
  • QUALIFICATIONS - Advanced degree (MS or PhD) in Computer Science, Electrical Engineering, or a related technical discipline - Proficiency in C++, Go, CUDA, Rust, and/or Java -
  • Experience in data engineering and systems ML for time-series data - Deep understanding of the fundamentals of distributed systems, including scalability, fault tolerance, monitoring, observability, scheduling, performance tuning, and resource management -
  • Experience with cloud-native environments and orchestration (Kubernetes, Docker, etc.) -

Benefits

  • Experience scaling foundation-model training infrastructure or multi-cluster computing environments WHAT WE OFFER - An opportunity to work on exciting, cutting-edge projects to transform patients’ lives in a highly collaborative work environment. - Competitive compensation, including stock options. - Comprehensive
  • benefits package. - 401(k) program with matching contributions.

Additional details

  • Our mission is to deliver cutting-edge technologies that restore autonomy to people living with disabilities and improve their quality of life.
  • TEAM CULTURE Join a small, dedicated team of knowledgeable and motivated professionals.
  • Our early-stage environment offers the opportunity to take ownership of broad decisions with significant and long-lasting impact.
  • We emphasize continuous learning and growth, fostering cross-functional collaboration where your contributions are vital to our success.
  • The work will ultimately enable the development of cutting-edge models for neuroscientific discovery and neural decoding, empowering brain-computer interface technology to improve the lives of patients living with severe neurological conditions. KEY
  • Experience designing, building, and maintaining high-throughput data pipelines for large and diverse datasets -
  • We celebrate diversity and are committed to creating an inclusive environment for all employees.
  • CONFIDENTIALITY All applications will be treated confidentially.
  • Applicants may be asked to sign an NDA after the initial stages of the interview process.

Find more real-time jobs on JobLoom.