Jing-Xuan Zhang (张景宣)
Title/Position: Lecturer
Phone:
Homepage:
Email: jxzhanggg@snnu.edu.cn
Research interests: speech recognition, speech synthesis, multimodal speech
Office:
Biography
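Jing-Xuan Zhang received a bachelor's degree in Electronic Information Engineering from the School of the Gifted Young, University of Science and Technology of China (USTC), in 2016, and a Ph.D. in Information and Communication Engineering from USTC in 2021. In 2020, Zhang was a joint-training student at the Centre for Speech Technology Research, University of Edinburgh, UK. From 2021 to 2023, Zhang worked at the joint postdoctoral workstation of USTC and iFLYTEK, and has served as a lecturer since July 2023. Zhang has published more than ten papers at the leading international speech conferences ICASSP and INTERSPEECH and in the top international journal IEEE/ACM TASLP. Research interests include multimodal speech, speech recognition, speech synthesis, and unsupervised speech pre-training.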
Publications
[1] Jing-Xuan Zhang, Genshun Wan, Zhen-Hua Ling, Jia Pan, Jianqing Gao, Cong Liu, “Self-Supervised Audio-Visual Speech Representations Learning By Multimodal Self-Distillation”, IEEE ICASSP, 2023
[2] Jing-Xuan Zhang, Genshun Wan, Jia Pan, “Is Lip-Region-of-Interest Sufficient for Lipreading?”, ACM ICMI, 2022
[3] Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai, “Non-Parallel Sequence-to-Sequence Voice Conversion with Disentangled Linguistic and Speaker Representations”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 28, no. 1, pp. 540-552, 2020
[4] Jing-Xuan Zhang, Zhen-Hua Ling, Li-Juan Liu, Yuan Jiang, Li-Rong Dai, “Sequence-to-Sequence Acoustic Modeling for Voice Conversion”, IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 27, no. 3, pp. 631-644, 2019
[5] Jing-Xuan Zhang, Korin Richmond, Zhen-Hua Ling, Li-Rong Dai, “TaLNet: Voice Reconstruction from Tongue and Lip Articulation with Transfer Learning from Text-to-Speech Synthesis”, Proceedings of the AAAI Conference on Artificial Intelligence, 35(16), pp. 14402-14410, 2021
[6] Jing-Xuan Zhang, Zhen-Hua Ling, Yuan Jiang, Li-Juan Liu, Chen Liang, Li-Rong Dai, “Improving Sequence-to-Sequence Acoustic Modeling by Adding Text-Supervision”, IEEE ICASSP, pp. 6785-6789, 2019
[7] Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai, “Forward Attention in Sequence-to-sequence Acoustic Modelling for Speech Synthesis”, IEEE ICASSP, pp. 4789-4793, 2018
[8] Jing-Xuan Zhang, Zhen-Hua Ling, Li-Rong Dai, “Recognition-Synthesis Based Non-Parallel Voice Conversion with Adversarial Learning”, INTERSPEECH, pp. 771-775, 2020
[9] Jing-Xuan Zhang, Li-Juan Liu, Yan-Nian Chen, Ya-Jun Hu, Yuan Jiang, Zhen-Hua Ling, Li-Rong Dai, “Voice Conversion by Cascading Automatic Speech Recognition and Text-to-Speech Synthesis with Prosody Transfer”, Joint Workshop for the Blizzard Challenge and Voice Conversion Challenge, pp. 121-125, 2020
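Patents
[1] Jing-Xuan Zhang, Genshun Wan, Jianqing Gao, Cong Liu, Guoping Hu, Qingfeng Liu, Yu Hu, “Speech recognition method, speech recognition device, and computer-readable storage medium”, China National Intellectual Property Administration, invention patent, 2022.8, ZL202210400143
[2] Jing-Xuan Zhang, Genshun Wan, Jianqing Gao, Cong Liu, Guoping Hu, Qingfeng Liu, “Speech recognition method, training method for a speech recognition model, and related apparatus”, China National Intellectual Property Administration, invention patent, 2022.2, ZL202111666006
[3] Zhansheng Wen, Jing-Xuan Zhang, Wanjun Gao, “Audio-video synchronization method, apparatus, medium, device, and program product”, China National Intellectual Property Administration, invention patent, 2022.4, ZL202210095944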