Role : AI/ML Engineer
Location : San Jose, CA
Skills : Cloud resource allocation, Auto-scaling, performance tuning, DevOps
Role: AI/ML Engineer
Location: San Jose, CA (5 days WFO)
Notice period: 2 weeks
Visa: Any (Except OPT and CPT)
Note: Need atleast 1 or 2 resumes by today EOD please try to submit profiles please.
Job Description
- Design and implement AI Agents to optimize cloud resource allocation, auto-scaling, and performance tuning.
- Develop predictive models for failure detection, incident management, and system health monitoring.
- Automate operational workflows using machine learning and intelligent scripting.
- Integrate AI-driven insights with existing cloud monitoring tools.
- Collaborate with DevOps and SRE teams to deploy, monitor, and improve ML models in production environments.
- Conduct anomaly detection for security, cost optimization, and performance analytics.
- Continuously evaluate emerging AI technologies and tools for operational improvements.
- Maintain documentation and best practices for AI/ML integration in cloud systems.
Our Minimum Requirements Include
- Bachelor's or equivalent experience or master’s degree in computer science, Data Science, or related technical field.
- Proven ability building and deploying ML models, with at least 2 years focused on infrastructure or cloud operations.
- Solid knowledge of hybrid cloud technologies (AWS, GCP, OpenStack, Kubernetes).
- Experience with Python, Jupiter, and ML libraries such as PyTorch, TensorFlow, or scikit-learn.
- Familiarity with cloud-native monitoring, logging, and automation tools (e.g., Terraform, Ansible, Prometheus, Splunk, AppDynamics).
- Comfortable working with streaming data, APIs, and telemetry systems.
- Strong communication and multi-functional collaboration skills.
- Experience with Agile and DevOps operating models, including project tracking tools (e.g., Jira), Git (any Version Control systems), and CI/CD systems (e.g., GitLab, GitHub Actions, Jenkins).
- Proficient in general-purpose programming languages (Python, GoLang, Bash and/or C/C++) and development platforms and technologies.
Preferred Qualifications
- Deep understanding of operating systems and experience with Cisco technologies (UCS, Nexus, Thousand Eyes)
- Established record of leading technical initiatives, delivering results, and a commitment to fostering a supportive work environment.
- Hard-working, dedicated to providing quality support for your customers