Comparative Study of AI-Synthesized Speech and Natural Speech

Chinese Journal of Forensic Sciences, 2026, Issue 2: 38-45. DOI: 10.3969/j.issn.1671-2072.2026.02.005

Special Topic: New Quality Productive Forces Empowering Multi-Scenario Applications of Forensic Appraisal

LIAO Fangling (廖方菱)¹, CHEN Manqing (陈蔓青)¹, CHEN Shengxiang (陈胜湘)¹, GUO Yuhang (郭宇航)¹, YANG Yingcang (杨英仓)¹, MU Fan (牟帆)²

1. Department of Forensic Science and Technology, Guizhou Police College
2. Criminal Investigation Division, Guiyang Municipal Public Security Bureau

Received: 2025-04-24; Published: 2026-03-15; Online: 2026-03-25

Abstract

Objective: With the rapid development of artificial intelligence (AI) speech synthesis, the detectability of synthesized speech has become a key issue in forensic appraisal. This study compares AI-synthesized speech with natural speech along two dimensions, auditory perception and acoustic quantification, to provide a reference for identifying, guarding against, examining, and appraising synthesized speech in judicial practice.

Methods: In the auditory examination, a Likert scale was used to rate the consistency between natural and synthesized speech. In the acoustic examination, fundamental frequency, formants, intensity, and duration were extracted with the Praat speech analysis software, and paired-sample t-tests were run in SPSS 27 to quantify the differences between natural and synthesized speech.

Results: Compared with natural speech, AI-synthesized speech performed worse on the auditory features of single-syllable completeness, erhua (rhotacized finals), stress, speech rate, and fluency. In the acoustic examination, fundamental frequency and formants differed significantly between the two, whereas intensity and duration did not.

Conclusion: Combining preliminary screening by ear with acoustic quantification in forensic appraisal can effectively distinguish AI-synthesized speech from natural speech and provides technical support for the examination and appraisal of synthesized speech.

Key words: artificial intelligence (AI), synthesized speech, natural speech, comparative study, forensic appraisal
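The paired-sample t-test named in the Methods can be illustrated with a minimal Python sketch. It is not the authors' SPSS procedure: the function name `paired_t_statistic` is hypothetical, the fundamental-frequency values are placeholders rather than data from the study, and only the t statistic and degrees of freedom are computed (no p-value).

```python
import math
import statistics


def paired_t_statistic(natural, synthetic):
    """Return (t, degrees_of_freedom) for a paired-sample t-test.

    Each position in `natural` is paired with the same position in
    `synthetic` (for example, the same utterance in both conditions).
    """
    if len(natural) != len(synthetic):
        raise ValueError("paired samples must have equal length")
    if len(natural) < 2:
        raise ValueError("need at least two pairs")
    # Work on the per-pair differences, as the paired test requires.
    diffs = [n - s for n, s in zip(natural, synthetic)]
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample SD (n - 1 denominator)
    if sd_diff == 0:
        raise ValueError("differences have zero variance; t is undefined")
    t = mean_diff / (sd_diff / math.sqrt(len(diffs)))
    return t, len(diffs) - 1


# Placeholder values for illustration only, NOT measurements from the study:
# mean fundamental frequency (Hz) of the same five utterances.
natural_f0 = [210.0, 198.0, 225.0, 204.0, 217.0]
synthetic_f0 = [202.0, 193.0, 214.0, 199.0, 208.0]

t_value, df = paired_t_statistic(natural_f0, synthetic_f0)
print(f"t = {t_value:.3f}, df = {df}")
```

The statistic is then compared with a tabulated critical value for the given degrees of freedom (2.776 for a two-tailed test at the 0.05 level with 4 degrees of freedom). Where SciPy is available, `scipy.stats.ttest_rel` returns the same statistic together with a p-value.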

CLC number: DF794.1

LIAO Fangling, CHEN Manqing, CHEN Shengxiang, GUO Yuhang, YANG Yingcang, MU Fan. Comparative Study of AI-Synthesized Speech and Natural Speech[J]. Chinese Journal of Forensic Sciences, 2026(2): 38-45.


Link to this article: http://www.chsfjd.cn/CN/10.3969/j.issn.1671-2072.2026.02.005

http://www.chsfjd.cn/CN/Y2026/V0/I2/38

