Folgen
Dacheng Yin
Titel
Zitiert von
Zitiert von
Jahr
Phasen: A phase-and-harmonics-aware speech enhancement network
D Yin, C Luo, Z Xiong, W Zeng
Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 9458-9465, 2020
3522020
ART-V: Auto-Regressive Text-to-Video Generation with Diffusion Models
W Weng, R Feng, Y Wang, Q Dai, C Wang, D Yin, Z Zhao, K Qiu, J Bao, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
222024
Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph
D Yin, X Ren, C Luo, Y Wang, Z Xiong, W Zeng
International Conference on Learning Representations (ICLR), 2022
152022
TridentSE: Guiding speech enhancement with 32 global tokens
D Yin, Z Zhao, C Tang, Z Xiong, C Luo
arXiv preprint arXiv:2210.12995, 2022
142022
RetrieverTTS: Modeling Decomposed Factors for Text-Based Speech Insertion
D Yin, C Tang, Y Liu, X Wang, Z Zhao, Y Zhao, Z Xiong, S Zhao, C Luo
Interspeech 2022, 2022
142022
Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
C Tang, C Luo, Z Zhao, D Yin, Y Zhao, W Zeng
Interspeech 2021, 2021
92021
Microcinema: A divide-and-conquer approach for text-to-video generation
Y Wang, J Bao, W Weng, R Feng, D Yin, T Yang, J Zhang, Q Dai, Z Zhao, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
82024
General-purpose speech representation learning through a self-supervised multi-granularity framework
Y Zhao, D Yin, C Luo, Z Zhao, C Tang, W Zeng, ZJ Zha
arXiv preprint arXiv:2102.01930, 2021
82021
MMAR: Towards Lossless Multi-Modal Auto-Regressive Probabilistic Modeling
J Yang, D Yin, Y Zhou, F Rao, W Zhai, Y Cao, ZJ Zha
arXiv preprint arXiv:2410.10798, 2024
42024
Learning trajectories are generalization indicators
J Fu, Z Zhang, D Yin, Y Lu, N Zheng
Advances in Neural Information Processing Systems 36, 2024
32024
Filler Word Detection with Hard Category Mining and Inter-Category Focal Loss
Z Zhao, L Wu, C Tang, D Yin, Y Zhao, C Luo
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
2023
Decomposing style, content, and motion for videos
Y Hu, D Yin, Y Wang, Z Chen, C Luo
Journal of Visual Communication and Image Representation 89, 103686, 2022
2022
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–12