Objective/Scope
The Manager, AI/ML (Computer Vision & LLMs/Agents) leads a team of machine learning engineers and applied scientists responsible for designing, deploying, and operating production AI systems with a primary emphasis on computer vision and large language model / agentic applications. This leader owns the end-to-end ML lifecycle, from problem framing and data strategy through model development, deployment, and continuous monitoring, and is accountable for the scalability, availability, and accuracy of the AI systems the team ships.
The Manager sets and enforces the standards that make AI systems production-grade: rigorous model evaluation and accuracy targets, MLOps tooling and automation, observability and drift detection, and clear service-level objectives for latency and uptime. They translate AI strategy into an executable roadmap, ensuring solutions meet performance, reliability, and responsible-AI standards appropriate to a clinical/healthcare context including patient-safety-relevant accuracy and applicable ethical and regulatory requirements.
As a people leader, the Manager grows and mentors the team, sets goals, manages performance, and removes blockers, while partnering closely with cross-functional teams (product, data engineering, DevOps/infrastructure, and clinical stakeholders) to align priorities and deliver impact. They remain current with the field, evaluating emerging computer vision and LLM/agent techniques and bringing the most valuable advances into the team's practice.
Job Functions
Technical Leadership & Architecture
Lead the architecture and end-to-end execution of the ML lifecycle - data strategy, model development, deployment, and continuous operation - primarily for computer vision and LLM/agentic systems.
Own the MLOps foundation: training and deployment pipelines, model serving, CI/CD for models, and reproducible experimentation.
Set and enforce standards for model accuracy and quality (evaluation frameworks, offline and online metrics, regression testing of models, and A/B testing) and hold the team to defined targets.
Ensure production AI systems are scalable and highly available: define service-level objectives for latency, throughput, and uptime, and establish monitoring, drift detection, alerting, and rollback practices.
Lead requirements definition and feasibility assessment for new ML features and functionality, balancing technical trade-offs against business and clinical impact.
Guide design and code quality across the team; contribute hands-on in Python and the ML stack as needed to unblock the team and set the technical bar.
Delivery & Project Management
Plan and manage team workload: delegate tasks, set daily, weekly, and monthly goals, and track progress against them.
Provide project estimates and timelines; manage scope of tasks, features, and epics before, during, and after delivery.
Identify risks early and form contingency plans; surface and remove blockers that impede the team's progress.
Manage software/feature releases and communicate release status and outcomes to stakeholders.
Report project status, write progress updates, and deliver presentations to relevant stakeholders, including management and customers.
People Leadership
Lead, mentor, and develop the ML engineering team; conduct regular 1-1s, set goals, and manage performance.
Motivate the team and foster a transparent, psychologically safe environment where people can raise questions, concerns, challenges, failures, and successes openly.
Evaluate team performance, provide guidance to sustain productivity, and partner with senior management and HR on performance management and compensation.
Participate in hiring assessing candidates and providing interview feedback to grow the team.
Cross-Functional Collaboration & Strategy
Partner with product, data engineering, DevOps/infrastructure, and clinical stakeholders to align priorities and drive projects forward.
Translate AI strategy into an executable roadmap, ensuring solutions comply with performance, reliability, and responsible-AI standards appropriate to a clinical/healthcare context.
Stay current with advances in computer vision, LLMs, and agentic systems; evaluate emerging techniques and bring the most valuable into the team's practice.
Analyze existing operations and processes, and drive improvements through training, tooling, and best practices.
Competencies
AI/ML Domain (core)
Computer Vision: image/video understanding, detection, segmentation, real-time inference
Large Language Models & Agentic Systems: LLM application development, prompting/fine-tuning, RAG, agent orchestration
Deep Learning: modern neural architectures (CNNs, transformers) and training practices
Model Evaluation & Accuracy: evaluation frameworks, metrics, A/B testing, regression testing of models
Data Science, Data Modeling, and Predictive Modeling
Engineering & Operations
MLOps & ML Lifecycle: training/deployment pipelines, model serving, CI/CD for models
Production Reliability: scalability, high availability, latency/throughput SLOs, monitoring, drift detection
DevOps & Cloud Infrastructure (incl. GPU/accelerated compute)
Software Engineering Best Practices: design patterns, test-driven development, debugging, code quality
Leadership & Delivery
People Leadership: mentoring, performance management, team development
Technical Project Management: estimation, scope, and delivery
Strategic Planning & Strategy Management
Cross-Functional Collaboration & Stakeholder Communication
Responsible AI: ethics, safety, and (clinical/regulatory) compliance
Supervisory Requirements
Manage machine learning team members including tasks, performance, and related mentoring.
Education and Experience
Required
Bachelor's degree in a related field (Computer Science, Computer Information Systems, or equivalent). Advanced degree in Computer Science, Machine Learning, or a related discipline is preferred.
7+ years of related machine learning or software engineering experience, including 3+ years managing technical teams.
Demonstrated experience leading teams that delivered ML/AI systems to production at scale, with accountability for accuracy, reliability, and availability.
Hands-on expertise in computer vision and/or LLM/agentic systems, with the ability to set the technical direction for both.
Expert-level proficiency in Python and strong command of the modern ML stack.
Proficiency in at least two ML frameworks (for example, PyTorch, TensorFlow, or Hugging Face Transformers).
Working knowledge of MLOps practices: model deployment, serving, monitoring, drift detection
Working knowledge of MLOps practices: model deployment, serving, monitoring, drift detection, and CI/CD for models.
Solid software engineering fundamentals, including design patterns, test-driven development, unit testing, and effective troubleshooting and debugging practices.
Leadership and Delivery
Able to lead, support, motivate, and mentor technical team members.
Able to define and review software requirements, and to build and manage task hierarchies for new features and fixes.
Able to manage scope of tasks, features, and epics before, during, and after delivery.
Able to provide realistic project estimates and timelines, and to identify and remove bottlenecks that impede progress.
Communicates effectively with peers, internal customers, and stakeholders across all levels of the organization.
Takes initiative on resolving issues and continuously improving the codebase and team practices.
Preferred
Experience deploying AI systems in a clinical, healthcare, or other regulated environment.
Familiarity with cloud and GPU/accelerated compute infrastructure for training and inference.
Experience with responsible-AI practices, including model safety, bias evaluation, and applicable compliance requirements.
Physical Demands and Working Conditions:
Must be able to sit for much of the workday with periodic walking and/or standing.
Must be able to work in an office environment.
Minimal travel is required for this role.
Other Duties:
Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities, and activities may change at any time with or without
Compensation & Benefits
Base Salary Range: $180,000 - $200,000 per year
Bonus Eligibility: Yes, bonus eligible
Benefits Offered:AvaSure sponsored Medical, Dental & Vision
Safe Harbor 401K with Employer Matching up to 4%
HSA Employer Contributions, Employer Paid Life, Short-term and Long-term Disability, and AD&D Insurance Plans
Flexible Time Off Plan & Paid Holidays
Parental Leave
Generous Tuition & Continuing Education Reimbursement available
Employee Referral Bonus
FLSA Classification: Exempt
Position Type: Full-time
Location: United States | Remote
Why AvaSure?
As the pioneer and expert in inpatient telehealth, AvaSure has provided safer environments with over 2 million patients monitored and 200 million monitored hours. By continuing to reduce adverse events, and by optimizing workforce efficiencies for the nation's top health systems, AvaSure has consistently lowered the cost of care while providing safer, more efficient healthcare for everyone.
Diversity creates a healthier work environment: AvaSure is an Equal Employment Opportunity/Affirmative Action employer, and all qualified applicants