Overview: Responsible for expanding and optimizing VNS Health's data architecture and pipeline infrastructure to support cross-functional data needs. This role focuses on enhancing data flow and collection processes, ensuring consistency and scalability across ongoing initiatives. The Data Engineer contributes to the design and implementation of frameworks and processes that support the modernization and re-engineering of data systems, enabling future product development and data-driven initiatives. Collaborates closely with technical and business teams to deliver reliable, high-performance data solutions. Works under general supervision. Responsibilities:
What We Provide
-
Referral bonus opportunities
-
Generous paid time off (PTO), starting at 30 days of paid time off and 9 company holidays
-
Health insurance plan for you and your loved ones, Medical, Dental, Vision, Life and Disability
-
Employer-matched retirement saving funds
-
Personal and financial wellness programs
-
Pre-tax flexible spending accounts (FSAs) for healthcare and dependent care
-
Generous tuition reimbursement for qualifying degrees
-
Opportunities for professional growth and career advancement
-
Internal mobility, generous tuition reimbursement, CEU credits, and advancement opportunities
What You Will Do
-
Develops and maintains scalable data pipelines and builds out new API integrations to support continuing increases in data volume and complexity.
-
Implements processes and systems to monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it.
-
Identifies, designs, and implements internal process improvements such as automating manual processes, refactoring legacy code, optimizing data delivery, generating reports, and re-designing infrastructure for greater scalability.
-
Performs ongoing data validation required to troubleshoot data related issues and assist in their resolution.
-
Provides production support as needed, including during non-business hours and weekends.
-
Writes unit/integration tests, contributes to our knowledge base, and documents work.
-
Works closely with the data science and business intelligence teams to develop data models, pipelines, reports, and dashboards to support all lines of business.
-
Engages non-technical personnel and embraces opportunities to connect technical work to broader business goals.
-
Exhibits ability to deliver short-term projects while working on long-term projects.
-
Stays abreast of and advocates for best practices in data engineering.
-
Participates in special projects and performs other duties as assigned.
Qualifications:
Education:
- Bachelor's Degree in Computer Science, Statistics, Informatics, Information Systems or a related field required
-
Master's Degree in Computer Science or a related field preferred
Work Experience:
-
Minimum of five years of Python experience required
-
Minimum of five years of Shell scripting experience required
-
Minimum of five years of object-oriented programming (Java/C++) required
-
Minimum of five years working on relational databases and data warehouses required
-
Experience with AWS cloud services such as EC2, RDS, S3, Athena, Redshift, and Dynamo required
-
Experience with Gitlab/Github CI/CD required
-
Experience with data pipeline and workflow management tools such as AWS Glue and Apache Airflow required
-
Experience building and optimizing big data pipelines, architectures and data sets required
-
Experience with stream-processing systems including PySpark, Storm, and Spark-Streaming required
-
Experience with demonstrable Snowflake experience preferred
-
Experience with demonstrable Apache Iceberg experience preferred
-
Administrative experience (system or database) preferred
Pay Range: USD $98,200.00 - USD $130,800.00 /Yr.