System Administrator – HPC

Job Posting Number: 22831

Information Technology
Lyndhurst, NJ
May 31, 2018

Company Overview:

At Memorial Sloan Kettering (MSK), we’re not only changing the way we treat cancer, but also the way the world thinks about it. By working together and pushing forward with innovation and discovery, we’re driving excellence and improving outcomes.

For the 28th year, MSK has been named a top hospital for cancer by U.S. News & World Report. We are proud to be on the Becker’s Healthcare list of 150 Great Places to Work in Healthcare in 2018, as well as one of Glassdoor’s Employees’ Choice Best Places to Work for 2018. We’re treating cancer, one patient at a time. Join us and make a difference every day.

Job Details:

System Administrator for High Performance Computing Cluster, Montvale, NJ

The role:

Our data center houses AIRI, the most advanced architecture ever built for scale-out AI. Specifically designed for deep learning by NVIDIA and Pure, AIRI has a GPU performance of more than 10 petaFLOPS.

AIRI is an essential component. AI has exclusive access to MSKCC’s huge archive of pathology slides for the purposes of computational pathology; the archive will be used to train an AI.

Candidates should have excellent communication skills and seek to work with people from a range of disciplines to help prioritize and manage the development of a fast, scalable, and secure application that will operate in both cloud and on-premises environments.

Responsibilities include:

Computer systems administration and troubleshooting:

Performs system administration functions including monitoring system and network performance, troubleshooting system problems, maintaining user accounts and various tracking systems, and liaising with other institutional information system and information technology groups.

Performance tuning and optimization:

Plans, configures and maintains a heterogeneous collection of high performance computing systems consisting of single and clustered Linux servers, network switches and clustered file systems. Optimizes performance of hardware systems, network systems and operating systems.

Software installation and maintenance:

Installs and updates software including Linux operating systems, web servers (e.g., Apache, Tomcat), email servers, programming languages (e.g., Java, Python, Ruby), relational database servers (e.g., MySQL, PostgreSQL) and statistical packages (e.g., R), as well as specialized software packages focused particularly on high-throughput data processing and analysis (e.g., microarray and sequence data).

MSK is an equal opportunity and affirmative action employer committed to diversity and inclusion in all aspects of recruiting and employment. All qualified individuals are encouraged to apply and will receive consideration without regard to race, color, gender, gender identity or expression, sexual orientation, national origin, age, religion, creed, disability, veteran status or any other factor which cannot lawfully be used as a basis for an employment decision.

Federal law requires employers to provide reasonable accommodation to qualified individuals with disabilities. Please tell us if you require a reasonable accommodation to apply for a job or to perform your job. Examples of reasonable accommodation include making a change to the application process or work procedures, providing documents in an alternate format, using a sign language interpreter, or using specialized equipment.