Data Engineering Intern

2026-01-27
RefinedScience
Data Engineering Intern
At RefinedScience, our mission is to advance care by bringing together the best science, data and minds – disease by disease, patient by patient, cell by cell to discover pathways to life beyond disease.   
WHAT WE ARE LOOKING FOR
We are seeking a motivated Data Engineering Intern to join our team. This internship is open to undergraduate and graduate students who are interested in building data infrastructure that supports advanced analytics, data science, and AI-driven insights in healthcare and life sciences.
You will work closely with data scientists, bioinformaticians, and engineers to help design, build, and improve data pipelines and platforms that power RefinedScience's research and analytics initiatives.
KEY ACTIVITIES

Assist in building and maintaining data pipelines for ingesting, transforming, and validating clinical, biological, and real-world data
Support integration of data from multiple sources (e.g., clinical data, analytics outputs, external datasets)
Help develop and optimize ETL/ELT workflows to ensure data quality and reliability
Collaborate with data science and bioinformatics teams to support analytics and machine learning workflows
Contribute to data modeling, documentation, and best practices for data infrastructure
Participate in code reviews, testing, and performance improvements
Participate in Quality Reviews and Troubleshooting
Communicate progress and findings to cross-functional teams

MUST HAVES

Currently enrolled in a Bachelor's, Master's, or Ph.D. program in Data Engineering, Computer Science, Data Science, Software Engineering, or a related field
Experience with Python and/or SQL through coursework, projects, or internships
Basic understanding of data pipelines, databases, and data transformation concepts
Familiarity with version control (e.g., Git)
Strong analytical thinking and problem-solving skills
Ability to learn quickly and work collaboratively in a team envirPlease mention the word **LOGICAL** and tag RODguMTk4Ljk5LjE0Mw== when applying to show you read the job post completely (#RODguMTk4Ljk5LjE0Mw==). This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.