基于Matlab的场景图像分类算法研究毕业论文

2021-03-13 11:03

摘要

图像信息是人类活动中常用的记载形式，随着图片信息的日益增多，如何利用优秀的算法来解决图像分类问题是图像处理领域的一大难题，基于这一技术要求，本论文探究提取图像局部特征的方法，特征的描述方法和分类算法的模型。

得到场景图像的语义标签是图像分类技术的重点，本论文在传统的空间金字塔模型的基础上，构建了一种基于SIFT(Scale-invariant feature transform，即尺度不变特征转换)局部特征稀疏编码的空间金字塔匹配的场景图像分类算法。在论文中，首先对图像进行稀疏采样，具体方法是以一定像素为采样步长对图像进行滑动网格划分，得到图像块之后检测SIFT特征，并对检测到的特征进行稀疏编码。与传统的矢量量化（Vector Quantization，VQ）方法相比较，稀疏编码在降低量化误差的方面取得了十分显著的效果（提升了14%的正确率），因而能够得到更加精准的语义表示。获得疏编码之后，使用最大池化（max pooling）函数得到图像特征的统计特性，max pooling函数对于局部变换具备鲁棒性。最后将图像进行空间分割，利用底层特征构建图像语义的空间金字塔模型，以一定的图像进行训练通过线性的SVM(support vector machine，即支持向量机)分类器来对测试图像进行分类。利用MATLAB软件编写代码，对论文提出的分类算法进行仿真，得到该分类算法的分类精确度并得到相应的混淆矩阵。

关键词：场景图像分类，SIFT特征，稀疏编码，空间金字塔，支持向量机

Abstract

Image information is a common form of human activities, with the increasing number of image information, how to use excellent algorithms to solve the image classification problem is a major problem in the field of image processing, based on this technical requirements, this paper explores the detection of image Local feature method, feature description method and classification algorithm model.

The semantic label of the scene image is the focus of the image classification technology. Based on the traditional spatial pyramid model, this paper constructs a scene image classification algorithm based on SIFT local feature sparse coding spatial pyramid matching. In the paper, the image is sparse and sampled. The method is to divide the image by a certain pixel as the sampling step. The image block is used to detect the SIFT feature, and the detected feature is sparsely coded. Compared with the traditional vector quantization method, sparse coding has achieved very significant effect (14% correct rate) in reducing the quantization error, so we can get a more accurate semantic representation. After obtaining the sparse coding, the maximum pooling function is used to generate the statistical properties of the image features. The max pooling function has better robustness to the local transform. Finally, the image is segmented by space, and the spatial pyramid model of image semantics is constructed by using the underlying features. The image is classified by a linear SVM classifier. The code is written by MATLAB software, and the classification algorithm proposed by the paper is simulated. The classification accuracy of the classification algorithm is obtained and the corresponding confusion matrix is obtained.

Keywords: scene image classification, SIFT feature, sparse coding, spatial pyramid, SVM

第一章绪论

1.1研究的背景及意义

图像是对我们周围事物生动形象的描述，是人类社会活动中经常使用的一种信息载体，人们将很多信息以图片的形式存储。在互联网和云存储技术快速发展的现阶段，人们面临着处理越来越多的图像信息的问题。在什么都追求快速和高效的信息时代，如何利用计算机高效的检索和管理大量且不断增长的图像数据成为我们面临的重大任务。要达到这一目的，就要求计算机学习人类的辨识方式来解读图像信息，不同于文字载体，计算机对图像的解读能力与人类大相径庭。在计算机研究领域，利用计算机学习人类感知图像的能力，通过机器学习对增长迅猛的图像数据进行处理和分析是一大研究课题，在此研究背景下提出了图像分类技术来解决人们面临的问题。以图像分类技术为基础，对分类技术进行拓展，在计算机视觉领域中有很多实际应用：

（1）图像检索：图像检索是图像分类技术最直接的应用，如我们在浏览器上查找图片，输入图片关键字就可以得到我们想要的图片。关键字可以理解为图像的语义信息，图像检索就是根据语义信息进行查找，从图像数据库中找出与我们搜索的信息具有相似特性的其它图像。

您需要先支付 80元 才能查看全部内容！立即支付

注册

找回密码

基于Matlab的场景图像分类算法研究毕业论文

Abstract

目录

第一章绪论

1.1研究的背景及意义

您可能感兴趣的文章

最新文档

推荐栏目

登录

注册

找回密码

基于Matlab的场景图像分类算法研究毕业论文

Abstract

目 录

第一章 绪论

1.1研究的背景及意义

您可能感兴趣的文章

最新文档

推荐栏目

目录

第一章绪论