Job Title: Data Engineer
Location: San Jose, CA (Hybrid – 3 days onsite)
Data Engineer with prompt writing, ML/AI, and Python experience.
About the Role:
We are seeking a Data Engineer I to join our hybrid team in San Jose, CA. In this role, you will execute tactical tasks within larger data projects, leveraging Python, scripting, and machine learning to build data infrastructure and enable data-driven decision-making.
Ideal candidates will have a strong background in Python, experience with data infrastructure, and proficiency in scripting and machine learning workflows. A PhD or advanced degree is strongly preferred.
Key Responsibilities:
- Develop and support data visualizations using internal tools (PLX, Data Studio) and external BI tools (Tableau, Looker).
- Maintain and enhance reports, queries, and dashboards for internal stakeholders.
- Perform exploratory data analysis, profiling, and custom data modeling to solve business problems.
- Leverage and deploy pre-trained machine learning models, and assist in training workflows.
- Collaborate across teams to understand requirements, identify data sources, and translate business needs into technical solutions.
- Design, build, and maintain ETL pipelines, data models, and scalable data processing systems.
- Assist in open-source model training and work with other teams on shared platforms such as Colab.
- Ensure best practices for data governance, lifecycle management, and secure infrastructure.
- Write and run code notebooks for analytics, automation, and ML tasks.
Must-Have Qualifications:
- 2–3+ years of hands-on experience as a Data Analyst/Engineer with a focus on Python programming.
- Bachelor’s degree in Computer Science or related field (Advanced degree or PhD strongly preferred).
- Strong ability to write Python code and create automated/manual scripts for data analysis and reporting.
- Experience with machine learning, especially training and deploying models in collaborative environments.
- Familiarity with public/open-source datasets and experience fine-tuning or training open-source models like Gemma.
- Experience with prompt engineering and writing effective prompts for Generative AI models.
Preferred Skills:
- Experience with big data infrastructure and cloud platforms.
- Proficiency in data pipeline design, ETL processes, and data modeling.
- Working knowledge of ML serving systems, data validation, and data quality management.
- Excellent English writing and communication skills to document analyses and collaborate across teams.
- Knowledge of BI tools (PLX, Data Studio, Tableau, Looker).
- Understanding of statistical methods and business intelligence metrics.
Soft Skills & Competencies:
- Strong problem-solving abilities and a passion for data.
- Ability to work collaboratively in cross-functional teams.
- Eagerness to learn and apply technical best practices.
- Strong attention to security, compliance, and data lifecycle principles.
Dexian is a leading provider of staffing, IT, and workforce solutions with over 12,000 employees and 70 locations worldwide. Formed in 2023 through the merger of DISYS and Signature Consultants, Dexian is one of the largest IT staffing companies and the 2nd largest minority-owned staffing company in the U.S. Combining the best elements of its core companies, Dexian's platform connects talent, technology, and organizations to produce game-changing results that help everyone achieve their ambitions and goals.
Dexian's brands include Dexian DISYS, Dexian Signature Consultants, Dexian Government Solutions, Dexian Talent Development and Dexian IT Solutions. Visit https://dexian.com/ to learn more.
Dexian is an Equal Opportunity Employer that recruits and hires qualified candidates without regard to race, religion, sex, sexual orientation, gender identity, age, national origin, ancestry, citizenship, disability, or veteran status.