City of Hope

REMOTE - Data Engineer (Machine Learning)

Job description

Job Ref:

Irwindale, CA

Information Technology

Job Type:
Full-time, Regular


About City of Hope
City of Hope is an independent biomedical research and treatment organization for cancer, diabetes and other life-threatening diseases.
Founded in 1913, City of Hope is a leader in bone marrow transplantation and immunotherapy such as CAR T cell therapy. City of Hope’s translational research and personalized treatment protocols advance care throughout the world. Human synthetic insulin, monoclonal antibodies and numerous breakthrough cancer drugs are based on technology developed at the institution. AccessHope™, a subsidiary launched in 2019, serves employers and their health care partners by providing access to City of Hope’s specialized cancer expertise.
A National Cancer Institute-designated comprehensive cancer center and a founding member of the National Comprehensive Cancer Network, City of Hope is ranked among the nation’s “Best Hospitals” in cancer by U.S. News & World Report and received Magnet Recognition from the American Nurses Credentialing Center. Its main campus is located near Los Angeles, with additional locations throughout Southern California and in Arizona.

City of Hope’s commitment to Diversity, Equity and Inclusion
We believe diversity, equity and inclusion is key in serving our mission to provide compassionate patient care, drive innovative discovery, and advance vital education focused on eliminating cancer and diabetes in all of our communities. Our commitment to Diversity, Equity and Inclusion ensures we bring the full range of skills, perspectives, cultural backgrounds and experiences to our work - and that our teams align with the people we serve in order to build trust and understanding. We are dedicated to fostering a community that embraces diversity - in ideas, backgrounds and perspectives; this is reflected in our work and represented in our people.

** This is a Fully Remote Opportunity. You may work from any of the 48 states. **

Position Summary
The Data Engineer will work directly with the Applied AI & Data Science team to take data pipelines for model deployment from concept to reality, while improving the functionality and performance of existing pipelines in an evolving cancer care delivery environment. This is a hands-on technical role focused on designing, developing, and monitoring real-time and data warehouse pipelines for machine learning applications, as well as designing and managing ETL processes, including system notifications, reporting measures, and scheduled jobs.

Key Responsibilities include:

  • Collaborate on technical designs with Data Warehouse teams and IT specialists
  • Develop queries and API connections to various data sources
  • Define technical requirements and necessary resources for pipeline development
  • Design and build pilot and production-level data pipelines for predictive models
  • Maintain and support data pipelines in production
  • Perform unit/functional testing, integration testing, user acceptance testing, and performance testing
  • Develop data models and ETL processes
  • Develop test script automation
  • Produce technical documentation, visualizations, and diagrams of data systems
  • Identify technical dependencies and coordinate resources across departments
  • Provide on-call technical support as needed
  • Other duties as assigned


Basic education, experience and skills required for consideration:

  • Master’s degree in math, computer science, information technology or a related technical or quantitative field with 1+ years of experience; or Bachelor’s degree with 3+ years of experience as a Data Engineer, Data Architect, or in a similar role.
  • 3+ years of experience with Python
  • Experience with at least one other programming language (e.g., Java, R).
  • 3+ years of experience with relational database design and interaction (MSSQL, PostgreSQL, or similar).
  • 1+ years of experience with development and deployment of Docker images in a Kubernetes environment.
  • 1+ years of experience with Kafka, Confluent, or other real-time streaming message services.
  • 1+ years of experience with end-to-end deployments on cloud services platforms (Amazon AWS, Microsoft Azure, Google Cloud Platform, etc.).
  • Familiarity with machine learning packages (sklearn, Keras, TensorFlow, etc.).
  • 1+ years of experience with technical testing/validation procedures (unit/functional testing, integration testing, user acceptance testing, performance testing, etc.).
  • 3+ years of experience with data pipelines, data models, and ETL (SQL, DBT, etc.).
  • 1+ years of experience with test script automation development.
  • 3+ years of experience using git or similar code versioning software.

Preferred education, experience and skills:

  • Master’s degree in math, programming, or a related technical or quantitative field

Additional Information:

  • As a condition of employment, City of Hope requires staff to comply with all state and federal vaccination mandates.

City of Hope is committed to creating a diverse environment and is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, religion, color, national origin, sex, sexual orientation, gender identity, age, status as a protected veteran, or status as a qualified individual with disability.
