Automated Audio Analysis of Dysarthria in Huntington’s disease (Audio-HD)
Luis Sierra1, Karen Hildebrand1, Clementina Ullman1, Magdaline Mwangi1, Henry O'Connell2, Samuel Frank1, Simon Laganiere1
1Neurology, Beth Israel Deaconess Medical Center, 2Canary Speech
Objective:
Using an exploratory approach, our goals were 1) identify features of speech that differentiated HD from controls, 2) use key speech features to train a classifier model and 3) compare these results to clinical assessments.
Background:
The diagnosis of Huntington's disease (HD) is based primarily on the motor exam component of the Unified Huntington’s Disease Rating Scale (UHDRSTM). The UHDRSTM grades dysarthria from 0-4 (normal to anarthria), however automated speech feature detection could capture subtle changes that might serve as early disease biomarkers.
Design/Methods:
HD participants (n=46) and healthy controls (HC) (n=32) were matched for sex, age, and education. Participants were recorded during an 8 minute tablet-based protocol (Audio-HD) that included open-ended questions, passage reading, narrative prompt, picture description, and audio recording of the Stroop Color and Word Test (SCWT). Audio data underwent automated feature detection by Canary Speech after which key features were selected and used to train a classifier model. Model performance was compared to clinical assessment. 
Results:
HD average UHDRSTM total motor score was 16.1 (SD=15.72) and CAG repeat length 43.1 (SD=3.62). Welch’s t-test (95% CI) identified  >1000 features of speech that differentiated HD from HC. The 20 most significant features were entered into a Random Forest classifier to generate a trained model. Classifier accuracy (HD vs. HC) was 96% (91% sensitivity, 98% specificity) and sensitivity was 96% for detecting dysarthria in HD. Interestingly, the model also correctly labeled 40% of HD cases lacking clinical dysarthria, suggesting it was capturing speech changes not apparent to the examiner.
Conclusions:
Audio-HD protocol using Canary Speech has significant potential as an efficient and sensitive tool for identifying speech biomarkers in HD. Further training of the model and longitudinal follow up will be critical for determining the overall utility of this approach.
10.1212/WNL.0000000000203723