Thanks for your interest in the HPC System Engineer position.
Unfortunately this position has been closed but you can search our 629 open jobs by
.MINIMUM REQUIREMENTS:
Education and Experience
Bachelor's degree and
eight years of related increasingly technical work experience or a combination
of education and relevant experience. Strong, demonstrated knowledge of Linux
and demonstrated experience managing multiuser compute clusters and
associated storage environments are required as well.
Knowledge, Skills and
Abilities
Advanced knowledge of
Linux and HPC cluster management and operation are required; experience
managing, using, supporting and consulting on research computing
cyberinfrastructure in an academic or research environment is strongly
preferred. Proven ability to deliver outstanding system and service
administration and end-user support in a thorough and timely manner is
needed. This position requires that you be able to juggle multiple
competing priorities, work quickly and accurately, and demonstrate initiative
in conceptualizing and moving technical projects successfully to
completion. The position must be able to do independent analysis,
troubleshooting and problem resolution, but also must work collaboratively with
other team members and across organizational group boundaries. An
essential component of the job is keeping up with and mastering current and
emerging technologies to facilitate researchers’ computing work and also
that streamline and automate system administration tasks; that requires a
demonstrated passion for and curiosity about the breadth of HPC technologies
and tools and also of technology trends in general.
This position requires
hands-on experience building and supporting multi-tenant Linux servers/clusters
and their associated networks, file systems and storage devices in production
research environments. Specifically, this technical knowledge needed to be
successful in this position includes:
- Expert demonstrated knowledge
of Linux and managing Linux-based environments, including securing
systems, and day-to-day troubleshooting, monitoring, support, software
packaging, and working within industry-wide best practices
- Experience administering,
configuring, and supporting systems with accelerators, and shared file
systems and large-scale storage platforms. This includes hardware
installation, configuration, upgrades and repairs
- Knowledge of and experience
utilizing data and system security techniques, practices and standards as
they relate to multi-user systems, storage and networks
- Hands-on experience installing,
configuring and supporting job schedulers and resource managers (e.g.,
SLURM, OGE, LSF, Torque, Maui, etc.) is desirable.
- Familiarity with deploying
virtualization technologies and basic knowledge of container technologies
- Exceptional written and verbal
communication skills
- Experience using shells scripts
(bash), programming languages (Python), and programming automated system
management tools (e.g. Puppet)
- Familiarity with TCP/IP,
Internet Routing Protocols, private and public networks, VLANs, Firewalls,
Load Balancers, addressing schemes, subnet creation and subnet masking.
Proven ability to troubleshoot basic network issues and communicate and
work with a team of network engineers to solve possible network design
issues
- Familiarity with the
intersection of storage and networking disciplines: transport media,
speeds of media, storage networks, IP based storage delivery, other
storage delivery technologies
- Experience with some the
following applications: Git, Apache, Kerberos, LDAP
- Software installation and
maintenance experience supporting research codes and clients
- Exceptional client service and
communication, focusing on proactive system administrator actions and
interactions to reduce or remove barriers to clients’ efficient use of
resources to advance research
PHYSICAL REQUIREMENTS:
This position requires
the ability to lift and manipulate storage and compute servers, rack and unrack
equipment up to 40 pounds, and occasionally climb ladders.
WORKING CONDITIONS
This position requires
the ability to lift and manipulate storage and compute servers up to 40 pounds,
rack and unrack equipment, and occasionally climb ladders. The position
will support equipment in off-campus locations, so having a valid driver’s
license is necessary. The position is expected to respond to critical system problems
off-hours and also must also be available for routine on-site system
maintenance and patching, typically scheduled for evenings and weekends so to
minimize the disruption of research work. The position is expected to
rotate on-call duties during winter break and other closures.
WORK STANDARDS
- Interpersonal Skills:
Demonstrates the ability to work well with Stanford colleagues and clients
and with external organizations.
- Promote Culture of Safety:
Demonstrates commitment to personal responsibility and value for safety;
communicates safety concerns; uses and promotes safe behaviors based on
training and lessons learned.
- Subject to and expected to
comply with all applicable University policies and procedures, including
but not limited to the personnel policies and other policies found in the
University’s Administrative Guide, http://adminguide.stanford.edu/.
Why Stanford is for You:
- Stanford University has
revolutionized the way we live and enrich the world. Supporting this
mission is our diverse and dedicated 17,000 staff. We seek talent driven
to impact the future of our legacy. Our culture and unique perks empower
you with:
- Freedom to grow. We offer
career development programs, tuition reimbursement, or audit a course.
Join a TedTalk, film screening, or listen to a renowned author or global
leader speak.
- A caring culture. We
provide superb retirement plans, generous time-off, and family care
resources.
- A healthier you. Climb our
rock wall, or choose from hundreds of health or fitness classes at our
world-class exercise facilities. We also provide excellent health care
benefits.
- Discovery and fun. Stroll
through historic sculptures, trails, and museums.
- Enviable resources. Enjoy
free commuter programs, ridesharing incentives, discounts and more.
Stanford is an equal
employment opportunity and affirmative action employer. All qualified
applicants will receive consideration for employment without regard to race,
color, religion, sex, sexual orientation, gender identity, national origin,
disability, protected veteran status, or any other characteristic protected by
law. Stanford welcomes applications from all who would bring additional
dimensions to the University’s research, teaching and clinical missions.
Consistent with its
obligations under the law, the University will provide reasonable accommodation
to any employee with a disability who requires accommodation to perform the
essential functions of the job.