Benchmarking Large Language Models for Neurological Imaging Interpretation Using a Multiple Sclerosis Lesion Segmentation Dataset

Assessing Large Language Models as Multimodal Reasoning Engines for Parkinson's Disease Detection