Follow
Mengyue Wu
Title
Cited by
Cited by
Year
Multiple sound sources localization from coarse to fine
R Qian, D Hu, H Dinkel, M Wu, N Xu, W Lin
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
1412020
Audio caption: Listen and tell
M Wu, H Dinkel, K Yu
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
732019
Investigating local and global information for automated audio captioning with transfer learning
X Xu, H Dinkel, M Wu, Z Xie, K Yu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
562021
Towards duration robust weakly supervised sound event detection
H Dinkel, M Wu, K Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 887-900, 2021
562021
What does a Car-ssette tape tell?
X Xu, H Dinkel, M Wu, K Yu
arXiv preprint arXiv:1905.13448v1, 2019
52*2019
A CRNN-GRU Based Reinforcement Learning Approach to Audio Captioning.
X Xu, H Dinkel, M Wu, K Yu
DCASE, 225-229, 2020
452020
Depa: Self-supervised audio embedding for depression detection
P Zhang, M Wu, H Dinkel, K Yu
Proceedings of the 29th ACM international conference on multimedia, 135-143, 2021
402021
Voice activity detection in the wild: A data-driven approach using teacher-student training
H Dinkel, S Wang, X Xu, M Wu, K Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1542-1555, 2021
402021
Building interpretable interaction trees for deep nlp models
D Zhang, H Zhang, H Zhou, X Bao, D Huo, R Chen, X Cheng, M Wu, ...
Proceedings of the AAAI conference on artificial intelligence 35 (16), 14328 …, 2021
352021
Can audio captions be evaluated with image caption metrics?
Z Zhou, Z Zhang, X Xu, Z Xie, M Wu, KQ Zhu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
332022
Voice activity detection in the wild via weakly supervised sound event detection
H Dinkel, Y Chen, M Wu, K Yu
arXiv preprint arXiv:2003.12222, 2020
302020
The SJTU system for DCASE2022 challenge task 6: Audio captioning with audio-text retrieval pre-training
X Xu, Z Xie, M Wu, K Yu
DCASE 2022 Challenge, Tech. Rep., 2022
292022
Text-based depression detection on sparse data
H Dinkel, M Wu, K Yu
arXiv preprint arXiv:1904.05154, 2019
252019
Audio-text retrieval in context
S Lou, X Xu, M Wu, K Yu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
232022
Decoupled dialogue modeling and semantic parsing for multi-turn text-to-SQL
Z Chen, L Chen, H Li, R Cao, D Ma, M Wu, K Yu
arXiv preprint arXiv:2106.02282, 2021
192021
Psychiatric scale guided risky post screening for early detection of depression
Z Zhang, S Chen, M Wu, KQ Zhu
arXiv preprint arXiv:2205.09497, 2022
172022
Text-to-audio grounding: Building correspondence between captions and sound events
X Xu, H Dinkel, M Wu, K Yu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
172021
Audio caption in a car setting with a sentence-level loss
X Xu, H Dinkel, M Wu, K Yu
2021 12th International Symposium on Chinese Spoken Language Processing …, 2021
172021
The SJTU system for DCASE2021 challenge task 6: Audio captioning based on encoder pre-training and reinforcement learning
X Xu, Z Xie, M Wu, K Yu
DCASE2021 Challenge, Tech. Rep, Tech. Rep, 2021
162021
LLM-empowered chatbots for psychiatrist and patient simulation: application and evaluation
S Chen, M Wu, KQ Zhu, K Lan, Z Zhang, L Cui
arXiv preprint arXiv:2305.13614, 2023
152023
The system can't perform the operation now. Try again later.
Articles 1–20