Logo of Huzzle

Data Science Engineer - Summer Intern

Applications are closed

  • Internship
    Full-time
    Summer Internship
  • Data
  • New York

Requirements

  • Pursuing a Master's degree in computer science/statistics field.
  • Proficient in writing complex SQL queries.
  • Proficient in Python data-related libraries (pandas, NumPy).
  • Experience with Git version control.
  • Familiarity with AWS data engineering services.
  • Experience using Docker for containerization and orchestration.
  • Familiarity with data warehousing concepts.
  • Ability to work independently, in a fast-paced and agile environment.

Responsibilities

  • Collaborate with the Data Science engineering team to build ETL data pipelines.
  • Develop and optimize complex Snowflake SQL queries for efficient data extraction and transformation
  • Utilize Python data libraries for data extraction , analysis, and visualization.
  • Gain familiarity with cloud data engineering services like AWS S3, Redshift, Glue, and EMR
  • Work with project leadership to define data collection requirements from upstream and downstream systems
  • Gain exposure to data warehousing concepts and tools.
  • Perform troubleshooting of data issues and data pipelines.

Powering Smarter Treatments and Healthier People

Technology
Industry
1001-5000
Employees

Mission & Purpose

Medidata is leading the digital transformation of clinical research, creating hope for millions of patients. Our unified platform combines AI powered insights with unrivaled patient-centric clinical trial solutions to help pharmaceutical, biotech, medical device and diagnostics companies, as well as academic researchers accelerate value, minimize risk, and optimize outcomes. Medidata is the first life sciences tech company to surpass 30,000 trials and 9 million participants. We’re powering smarter treatments and healthier people, with over 1.5 million registered users across 2,000+ customers and partners harnessing the world's most trusted clinical trials software for clinical development, commercial, and real-world data. We are a Dassault Systèmes company (Euronext Paris: FR0014003TT8, DSY.PA) headquartered in New York City with offices around the world. See why experience matters at www.medidata.com and follow us on Twitter (@Medidata) and Instagram (@medidata.solutions).