We are looking for a Sr. Data Engineer to join our growing Data Science team and lead the development of scalable machine learning infrastructure, robust data pipelines, and analytics platforms. You will work closely with data scientists, analysts, and engineering teams to design systems that power fatigue risk modeling, reporting, and operational decision-making. This is a unique opportunity to join a small, mission-driven team and have a major impact on the architecture and scalability of our products.
Pulsar Informatics provides real-time fatigue risk management tools to mission-critical or safety-centric workforces. Our team has pioneered research and development in fatigue risk assessment, working in aviation, long-haul trucking, space exploration, and military operations. We’re now building scalable infrastructure to serve global markets across industries.
Your role:
- Design and maintain reliable, scalable data pipelines (batch and real-time) to support analytics and ML.
- Collaborate with data scientists to build feature stores and deploy ML models into production.
- Architect and optimize data warehouses and reporting databases (e.g., Redshift, Postgres, BigQuery).
- Own data orchestration workflows (e.g., Airflow, Prefect, dbt) and CI/CD for data products.
- Work with product and engineering teams to ensure data availability, consistency, and reliability.
- Implement and monitor data quality checks, validations, and lineage tracking.
- Build infrastructure to support ad hoc and self-service data analysis by analysts and business stakeholders.
- Support compliance with data governance and security best practices.
Skills and Experience:
- MS or BS in Computer Science, Engineering, Mathematics, Statistics, or a related field, or equivalent practical experience in software or data engineering.
- 3+ years of experience building and operating production data pipelines and data systems.
- Proficiency in SQL, and Python or R.
- Experience with tools in modern data stacks (e.g., dbt, Airflow, Kafka, Snowflake).
- Familiarity with cloud data infrastructure (AWS preferred), including S3, Lambda, ECS, Athena, and Redshift.
- Experience with ML lifecycle tools (MLflow, Feast, TensorFlow Extended, or similar).
- Comfortable working with Git-based development workflows and CI/CD pipelines.
- Strong communication skills and ability to collaborate with technical and non-technical teammates.
Bonus Experience
- Experience operationalizing machine learning models.
- Working knowledge of dimensional modeling and OLAP/BI best practices.
- Familiarity with containerization and orchestration (Docker, Kubernetes).
- Exposure to data privacy, access control, and compliance requirements (e.g., HIPAA, GDPR, SOC2).
Benefits
We offer a competitive salary with a full set of benefits, including health, dental, and vision coverage, FSA, 401(k) plan contributions, and stock options. We provide a dedicated learning budget to support everyone’s professional development.
Eligibility
You must be a U.S. citizen or U.S. permanent resident.
Flexible Work
Our teams operate fully remotely, and we welcome new team members from anywhere within the U.S. We provide home office furniture if required, a computer workstation, and internet service cost reimbursement.
Our Commitment to Fairness
At Pulsar Informatics, we are proud to be an equal opportunity workplace, and we are committed to building a team that reflects the world around us as we scale the business.
Job Type: Full-time
Pay: $125,000.00 - $175,000.00 per year
Work Location: Remote