Profile picture

Muqiao Yang

(How to pronouce?)


first_name@google.com


CV  ORCID


New York, NY


  Hi! Welcome to Muqiao Yang's homepage. I am a research scientist at Google Speech Team. Previously, I obtained my PhD degree from Carnegie Mellon University and bachelor degree from The Hong Kong Polytechnic University. My research interest is mainly in speech processing & natural language processing!

   I enjoy talking to and learn from different people, especially helping junior PhD/Masters/Undergrad students about research, career plans, and life. Feel free to drop an email if you would like to talk!


Education


Experience


Publications


uSee: Unified Speech Enhancement and Editing with Conditional Diffusion Models

Muqiao Yang, Chunlei Zhang, Yong Xu, Zhongweiyang Xu, Heming Wang, Bhiksha Raj, Dong Yu

[pdf] [Demo]


Rethinking Voice-Face Correlation: A Geometry View

Xiang Li, Yandong Wen, Muqiao Yang, Jinglu Wang, Rita Singh, Bhiksha Raj

ACM Multimidea (MM), 2023

[pdf]


Unifying Robustness and Fidelity: A Comprehensive Study of Pretrained Generative Methods for Speech Enhancement in Adverse Conditions

Heming Wang, Meng Yu, Hao Zhang, Chunlei Zhang, Zhongweiyang Xu, Muqiao Yang, Yixuan Zhang, Dong Yu

preprint

[pdf]


SpatialCodec: Neural Spatial Speech Coding

Zhongweiyang Xu, Yong Xu, Vinay Kothapally, Heming Wang, Muqiao Yang, Dong Yu

preprint

[pdf]


Sequence-Level Knowledge Distillation for Class-Incremental End-to-End Spoken Language Understanding

Umberto Cappellazzo, Muqiao Yang, Falavigna Daniele, Alessio Brutti.

Conference of the International Speech Communication Association (InterSpeech), 2023

[pdf]


Backdoor Attacks with Input-unique Triggers in NLP

Xukun Zhou, Jiwei Li, Tianwei Zhang, Lingjuan Lyu, Muqiao Yang, Jun He

preprint

[pdf]


Simulating realistic speech overlaps improves multi-talker ASR

Muqiao Yang, Naoyuki Kanda, Xiaofei Wang, Jian Wu, Sunit Sivasankaran, Zhuo Chen, Jinyu Li, Takuya Yoshioka

International Conference on Acoustics, Speech and Signal (ICASSP), 2023

[pdf]


PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement

Muqiao Yang, Joseph Konan, David Bick, Yunyang Zeng, Shuo Han, Anurag Kumar, Shinji Watanabe, Bhiksha Raj

International Conference on Acoustics, Speech and Signal (ICASSP), 2023

[pdf] [code]


TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement

Yunyang Zeng, Joseph Konan, Shuo Han, David Bick, Muqiao Yang, Anurag Kumar, Shinji Watanabe, Bhiksha Raj

International Conference on Acoustics, Speech and Signal (ICASSP), 2023

[pdf] [code]


Improving Speech Enhancement through Fine-Grained Speech Characteristics

Muqiao Yang*, Joseph Konan*, David Bick*, Anurag Kumar, Shinji Watanabe, Bhiksha Raj

Conference of the International Speech Communication Association (InterSpeech), 2022

[pdf] [code]


Online Continual Learning of End-to-End Speech Recognition Models

Muqiao Yang, Ian Lane, Shinji Watanabe

Conference of the International Speech Communication Association (InterSpeech), 2022

[pdf]


Self-supervised Representation Learning with Relative Predictive Coding

Yao-Hung Hubert Tsai, Martin Q. Ma, Muqiao Yang, Ruslan Salakhutdinov, Louis-Philippe Morency

International Conference on Learning Representations (ICLR), 2021

[pdf] [code]


Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis

Yao-Hung Hubert Tsai*, Martin Q. Ma*, Muqiao Yang*, Ruslan Salakhutdinov, Louis-Philippe Morency

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020

[pdf] [code]


Complex Transformer: A Framework for Modeling Complex-Valued Sequence

Muqiao Yang*, Martin Q. Ma*, Dongyu Li, Yao-Hung Hubert Tsai, Ruslan Salakhutdinov

International Conference on Acoustics, Speech and Signal (ICASSP), 2020

Neural Information Processing Systems (NeurIPS) Science meets Engineering of Deep Learning workshop, 2019 (Oral)

[pdf] [code]


Improving Lesion Segmentation for Diabetic Retinopathy using Adversarial Learning

Qiqi Xiao, Jiaxu Zou, Muqiao Yang, Alex D. Gaudio, Kris Kitani, Asim Smailagic, Pedro Costa, Min Xu

International Conference on Image Analysis and Recognition, 2019

[pdf] [code]


Storing and Querying Large-Scale Spatio-Temporal Graphs with High-Throughput Edge Insertions

Mengsu Ding, Muqiao Yang, Shimin Chen

arXiv preprint arXiv:1904.09610, 2019

[pdf]



Teaching


  • 11-785, Introduction to Deep Learning, Spring 2023, Fall 2023, Spring 2024
  • 18-661, Introduction to Machine Learning for Engineers, Fall 2022
  • 18-660, Optimization, Spring 2022
  • 10-605, Machine Learning with Large Datasets, Fall 2021

Service


  • Reviewer at NeurIPS, EMNLP, ICASSP, Interspeech, SLT, AACL-IJCNLP, Speech Communication