面向多场景的环境异常声音识别 Environmental Abnormal Sound Recognition for Multiple Scenes期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

按检索

面向多场景的环境异常声音识别

引用本文：	郑文宾,何蔚.面向多场景的环境异常声音识别[J].科学技术与工程,2023,23(17):7444-7449.

作者姓名：	郑文宾何蔚

作者单位：	上海电力大学计算机科学与技术学院;公安部第三研究所

基金项目：	国家自然科学基金(61872230,U1936213)；上海市学术带头人计划(21XD1421500)；上海市科委项目(20020500600)

摘要：	随着人工智能和大数据的发展，各种场景中对异常声音识别的需求日益增长，基于人工智能的声音识别技术正在兴起并被高度重视。现行主流的异常声音识别算法多为浅层机器学习模型结构，对异常声音的识别率较低，且识别的声音类型单一。为了有效识别异常声音，提出一种基于梅尔频率倒谱系数(Mel-frequency cepstral coefficient, MFCC)和卷积神经网络(convolution neural network, CNN)的环境声音识别算法，对各类异常声音进行采集和有效识别，并及时反馈声音状态，为各类声识别应用场景提供精细化管理技术手段。结果表明：提出的算法对5类场景下环境异常声音的识别率得到极大提高，适用于更广泛的声学场景，具有明显的优势。
关键词：	异常声音梅尔频率倒谱系数卷积神经网络音频事件检测
收稿时间：	2022/5/10 0:00:00
修稿时间：	2022/12/14 0:00:00
Environmental Abnormal Sound Recognition for Multiple Scenes

Zheng Wenbin,He Wei.Environmental Abnormal Sound Recognition for Multiple Scenes[J].Science Technology and Engineering,2023,23(17):7444-7449.

Authors:	Zheng Wenbin He Wei

Institution:	College of Computer Science and Technology,Shanghai University of Electric Power

Abstract:	With the development of artificial intelligence and big data, there is an increasing demand for abnormal sound recognition in various scenes. Sound recognition technology based on artificial intelligence is emerging and is highly valued. Most of the current mainstream abnormal sound recognition algorithms are shallow machine learning model structure. The recognition rate of abnormal sound is low, and the recognition of the sound type is single. In this paper, a high effective algorithm of sound recognition based on Mel-frequency cepstral coefficient (MFCC) and Convolution neural network (CNN) is proposed. It could collect all kinds of abnormal sounds and feed back the sound state in time, which provides effective management technical means for all kinds of sound recognition application scenarios. The results show that the recognition rate of the proposed algorithm for five kinds of abnormal sounds in the environment is greatly improved, which is suitable for a wider range of acoustic scenes and has obvious advantages.

Keywords:	abnormal sound Mel-frequency cepstral coefficient convolutional neural network audio event detection

	点击此处可从《科学技术与工程》浏览原始摘要信息
	点击此处可从《科学技术与工程》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏