We are looking for a Software and Data Engineer to help implement the MSK MIND data platform that will integrate multi-dimensional datasets and enable advanced analytics including machine learning and artificial intelligence. You will be working closely with other software and data engineers, data scientists, clinicians, and molecular biologists.
Enthusiastic about solving problems in cancer research and their clinical applications.
A software and data engineer who has experience in developing software and data infrastructure.
Eager to learn and apply new technologies and ideas to benefit the entire organization.
A person who enjoys working in a team and coach other team members.
Work with data architect to develop software and data architecture of MSK MIND, including data pipelines, data storage, and data API.
Develop data pipelines to extract, transform and load ( ETL ) a variety of datasets from existing sources, including imaging, genomics, and clinical data.
Develop cloud and on-premise data lake and data warehouse .
Develop data governance/stewardship policies into the platform.
Develop the API for accessing the MSK MIND datasets.
Develop web-based interfaces for data visualization.
Work with machine learning experts to develop and scale machine learning applications for multi-model cancer research.
Bachelors degree in Computer Science or related field with 7+ years experience OR Masters degree with 5+ years experience.
Hands-on experience in software development.
Strong skills in a programming language (e.g. Python, Java, Scala, Go).
Experienced in relational and nonrelational databases (MySQL, PostgreSQL, MongoDb, Cassandra, Redis)
Experience with Agile software development and participating in a Scrum team.
Good verbal, writing, and interpersonal skills.
Nice to have:
Ph.D. degree in Computer Science or related field.
Knowledge in fundamental algorithms in machine learning.
Prior involvement in health informatics or bioinformatics domain.
Experienced in using open source frameworks and non-traditional data stores (Hadoop, Spark, Flink, Elasticsearch, etc).
Experienced in container technologies (Docker, Kubernetes).
Experienced in cloud computing, storage, and deployment.
Understanding the Findable, Accessible, Interoperable and Reusable (FAIR) data principles.
Understanding medical imaging tech stacks (DICOM, PACS, VNA).
Understanding Fast Healthcare Interoperability Resources (FHIR).
Knowledge in healthcare data models.
Experience in developing interface against Electronic Medical Record (EMR) systems.
Experience in web development.
Internal Number: 2019-38836
About Memorial Sloan-Kettering Cancer Center
As one of the world's premier cancer centers, Memorial Sloan-Kettering Cancer Center is committed to exceptional patient care, leading-edge research, and superb educational programs. The close collaboration between our physicians and scientists is one of our unique strengths, enabling us to provide patients with the best care available today as we work to discover more effective strategies to prevent, control, and ultimately cure cancer in the future. Our education programs train future physicians and scientists, and the knowledge and experience they gain at Memorial Sloan-Kettering has an impact on cancer treatment and the biomedical research agenda around the world.