Xuanru Zhou

I'm a senior undergraduate at Zhejiang University. I'm working with Prof. Gopala Krishna Anumanchipalli and PhD Jiachen Lian at Berkeley Speech Group.

I'm actively seeking Intern, RA or visiting positions. If you are interested in my research, feel free to reach out to me!

Email: xuanruzhou15@gmail.com

Email  /  Google Scholar  /  Github  /  LinkedIn

profile photo

Research

Specifically, the topics I am exploring include:

  • Advanced Speech Understanding and Reasoning
    Exploring how cognitive science and human-centered theories can enhance advanced speech understanding, focusing on dynamic reasoning beyond simple CoT methods.
  • Grounded Language and Speech Intelligence
    Leveraging interactive multi-agent systems to explore how agents learn to communicate and collaborate via speech, inspired by human developmental stages, to understand the emergence of grounded language.
  • Audio Representation Learning
    Scaling audio-text training across different tasks to explore and bridge the modality gap.

I have previously worked on the following topics, primarily in speech and language technology for healthcare:

  • Speech Dysfluency Detection
    Developed two end-to-end dysfluency detection methods (Time-based and Token-based) and simulated large-scale dysfluency datasets, establishing a benchmark for the field.
  • Speech Pronunciation Assessment
    Proposed phoneme similarity modeling to improve verbatim speech transcription.

Selected Publications

Towards Accurate Phonetic Error Detection through Phoneme Similarity Modeling
Xuanru Zhou, Jiachen Lian, Cheol Jun Cho, Tejas Prabhune, Shuhe Li, William Li, Rodrigo Ortiz, Zoe Ezzes, Jet Vonk, Brittany Morin, Rian Bogley, Lisa Wauters, Zachary Miller, Maria Gorno-Tempini, Gopala Anumanchipalli
2025 Interspeech. A phonetic error detection system for pronunciation evaluation and articulatory feedback.
[Project Page] ( Oral Presentation )
Automatic Detection of Articulatory-Based Disfluencies in Primary Progressive Aphasia
Jiachen Lian, Xuanru Zhou, Zoe Ezzes, Jet Vonk, Brittany Morin, David Baquirin, Zachary Miller, Maria Luisa Gorno Tempini and Gopala Krishna Anumanchipalli,
2025 JSTSP. An efficient AI Agent for Language Screening and Spoken Language Learning.
[Project Page]
SSDM: Scalable Speech Dysfluency Modeling
Jiachen Lian, Xuanru Zhou, Zoe Ezzes, Jet Vonk, Brittany Morin, David Baquirin, Zachary Miller, Maria Luisa Gorno Tempini and Gopala Krishna Anumanchipalli,
2024 NeurIPS. An AI Agent for Speech Therapy and Spoken Language Learning. A foundation model for scientific research, engineering deployment and business development .
[Project Page] ( NeurIPs Scholar Award )
Time and Tokens: Benchmarking End-to-End Speech Dysfluency Detection
Xuanru Zhou, Jiachen Lian, Cheol Jun Cho, Zoe Ezzes, Jet M.J. Vonk, Brittany T. Morin, David Paul Galang Baquirin, Zachary A. Miller, Maria Luisa Gorno-Tempini, Gopala Anumanchipalli,
Preprint. Open Source Benchmarking Dysfluency Modeling
[Project Page] [Code]
SoK: Dataset Copyright Auditing in Machine Learning Systems
Linkang Du*, Xuanru Zhou*, Min Chen, Chusong Zhang, Zhou Su, Peng Cheng, Jiming Chen, Zhikun Zhang
2025 IEEE S&P
Stutter-Solver: End-to-end Multi-lingual Dysfluency Detection
Xuanru Zhou, Cheol Jun Cho, Ayati Sharma, Brittany Morin, David Baquirin, Jet Vonk, Zoe Ezzes, Zachary Miller, Maria Luisa Gorno Tempini, Jiachen Lian, and Gopala Krishna Anumanchipalli,
2024 SLT. Multi-lingual Co-Dysfluency Detector with Articulatory Simulation
[Code] ( Student Grant Award )
YOLO-Stutter: End-to-End Region-Wise Speech Dysfluency Detection
Xuanru Zhou, Anshul Kashyap, Steve Li, Ayati Sharma, Brittany Morin, David Baquirin, Jet Vonk, Zoe Ezzes, Zachary Miller, Maria Luisa Gorno Tempini, Jiachen Lian, and Gopala Krishna Anumanchipalli,
2024 Interspeech. Dysfluency Modeling as Object Detection . [Code] ( ISCA Student Grant Award ).

Selected Awards

2024 NeurIPs Scholar Award

2024 ISCA Student Travel Award

2024 IEEE SLT Student Travel Grant

Zhejiang Provincial Government Scholarship

Trivia

My favorite artist is Lady Gaga, go support her new album MAYHEM! I'm also fan of Kpop music.

Before I went to middle school, I was a member of the city swimming team.

I enjoy playing with dogs and cats.