We are seeking a highly skilled and motivated Data Scientist with expertise in Epic EMR (Electronic Medical Records) data to join our team. The successful candidate will play a critical role in leveraging Snowflake's data platform to accelerate the adoption of AI in health and hospital projects. This role involves implementing rules-based logic and creating structured data models for Retrieval-Augmented Generation (RAG) to support decision-making and improve operational efficiency and clinical outcomes.
KEY RESPONSIBILITIES:
Data Management and Analysis:
• Extract, clean, and analyze large datasets from Epic EMR and other healthcare data sources.
• Develop and maintain data pipelines to ensure the accurate and timely flow of data.
• Perform data validation and ensure data quality and integrity.
Implementation of Rules-Based Logic:
• Develop and implement rules-based logic to support various healthcare use cases, including One Stop Benefits, Charge Capture Automation, and Denials Optimization.
• Create structured data models that facilitate the application of rules-based logic.
RAG (Retrieval-Augmented Generation) Implementation:
• Design and implement structured data models for RAG to enhance data retrieval and generation processes.
• Develop dashboards and visualizations to present RAG insights and other key performance indicators to stakeholders.
• Utilize AI and machine learning techniques to enhance the accuracy and predictive capabilities of the RAG models.
Collaboration and Communication:
• Work closely with cross-functional teams, including data engineers, developers, and healthcare professionals, to understand project requirements and deliver data-driven solutions.
• Communicate complex analytical results and insights to non-technical stakeholders in a clear and concise manner.
Project Execution:
• Participate in agile development processes and contribute to sprint planning, reviews, and retrospectives.
• Ensure timely delivery of project milestones and adhere to project timelines.
Preferred Skills:
• Design and implement predictive models using various machine learning techniques, including both supervised and unsupervised algorithms.
• Utilize deep statistical analysis to understand and model complex public health data.
• Develop and deploy Large Language Models (LLMs) for creating chatbots and extracting insights, enhancing user engagement and information dissemination.
• Analyze clinical data to derive insights that improve patient care and clinical workflows.
QUALIFICATIONS:
Education:
• Bachelor's or Master's degree in Data Science, Computer Science, Statistics, or a related field. A Ph.D. is a plus.
Experience:
• Minimum of 5 years of experience in data science, with a focus on healthcare analytics.
• Proven experience working with Epic EMR data, including extraction, transformation, and analysis.
• Strong background in implementing rules-based logic and RAG in a healthcare setting.
Knowledge, Skills, Abilities, and Other Requirements:
• Proficiency in programming languages such as Python, R, and SQL.
• Experience with data visualization tools such as Tableau, Power BI, or similar.
• Familiarity with Snowflake or other cloud-based data platforms.
• Knowledge of ETL processes and tools.
• Experience with AI and machine learning techniques to enhance data analysis and RAG implementation.
• Proficiency in data analysis tools, including but not limited to SQL, Excel, and/or SAS.
• Proficiency in data catalog and visualization tools, including but not limited to Tableau, Power BI, Informatica, and/or Snowflake.
• Knowledge of electronic medical records systems, preferably Epic.