Large language models are increasingly recognized for their potential in providing mental health support. A recent study, published on June 29, 2026, introduces a novel framework aimed at enhancing the therapeutic quality of these models. Researchers, including Mizanur Rahman and six co-authors, developed a two-stage system that utilizes human-aligned evaluations to improve therapeutic responses.
Introducing TheraJudge: A New Evaluator
In Stage I, the team presents TheraJudge, an open-source therapeutic evaluator trained using preference-based optimization on human-annotated data. This innovative tool assesses responses across seven psychological dimensions, ensuring more reliable judgments compared to traditional methods.
TheraJudge demonstrates strong agreement with clinician ratings, achieving intraclass correlation coefficients (ICC) between 0.87 and 0.95. This performance surpasses existing supervised baselines and closed-source judges, particularly excelling in critical areas such as Safety, Relevance, and Empathy.
Operationalizing Evaluations with TheraAgent
Stage II introduces TheraAgent, which operationalizes TheraJudge's evaluations through a coordinated refinement process. This system employs specialized roles, including Critic, Coach, and Therapist, to translate evaluative signals into targeted response revisions.





