TixelJobs
C
Centralreachvia Greenhouse

Clinical AI Evaluation Specialist

REMOTEPosted 5d ago
OtherMid LevelFull-time#remote

Not sure if you're a good fit?

Upload your resume and TixelJobs AI will compare it against Clinical AI Evaluation Specialist at Centralreach. Get a match score, missing keywords, and improvement tips before you apply.

Free preview · Your resume stays private

About the Role

CentralReach is a leading provider of autism and IDD care software for Applied Behavior Analysis (ABA), multidisciplinary therapy, and special education. Trusted by more than 200,000 users, we enable therapy providers, educators, and employers to scale the way they deliver ABA and related therapies with innovative technology, market-leading industry expertise, and world-class customer satisfaction. 

This role is at the intersection of behavioral healthcare, artificial intelligence, and product quality. As a Clinical AI Evaluation Specialist on CentralReach's AI Governance team, you will be responsible for defining what "good" looks like for CentralReach's AI outputs and enhancing the evaluation and monitoring systems that ensure we continue to meet that standard at scale. 

You will design evaluation frameworks, develop and refine automated monitoring approaches (including prompt engineering for evaluation automation), conduct structured reviews of AI-generated content, and serve as a clinical subject matter expert that product and engineering teams work alongside to ensure AI features meet the standards of ABA practice, CentralReach's AI Governance Policy, and the Responsible AI for Behavior Analysis (R.ai.BA) framework.  

Key Accountabilities: 

  • Contribute to evaluation frameworks for clinical AI products, including defining acceptance criteria, test plans, clinically relevant rubrics, and performance benchmarks.  
  • Conduct structured sampling reviews of AI-generated outputs, assessing across criteria alignment with ABA principles and BACB ethical standards.  
  • Design and implement monitoring automation in collaboration with engineering, including automated quality checks, alerting thresholds, and drift detection for AI systems.  
  • Contribute to the creation and maintenance of reusable governance artifacts including system cards, monitoring playbooks, evaluation of SOP templates, and risk assessment documentation.  
  • Assist with pre-deployment validation testing for new AI products and features, executing defined testing and results of documentation.  
  • Participate in risk tiering assessments for new and evolving AI products, providing clinical and evaluative perspectives.  
  • Support incident investigation for AI-related issues if/as needed and in alignment with clinical best practice and CR Development Policy. 
  • Collaborate with product management and engineering teams to provide input during product planning and development, ensuring governance considerations are integrated early.  
  • Share