期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Multiple-Instance Learning with Instance Selection via Constructive Covering Algorithm

Yanping Zhang ;Heng Zhang ;Huazhen Wei ;Jie Tang ;Shu Zhao 《清华大学学报》2014,(3):285-292

Multiple-Instance Learning （MIL） is used to predict the unlabeled bags＇ label by learning the labeled positive training bags and negative training bags.Each bag is made up of several unlabeled instances.A bag is labeled positive if at least one of its instances is positive,otherwise negative.Existing multiple-instance learning methods with instance selection ignore the representative degree of the selected instances.For example,if an instance has many similar instances with the same label around it,the instance should be more representative than others.Based on this idea,in this paper,a multiple-instance learning with instance selection via constructive covering algorithm （MilCa） is proposed.In MilCa,we firstly use maximal Hausdorff to select some initial positive instances from positive bags,then use a Constructive Covering Algorithm （CCA） to restructure the structure of the original instances of negative bags.Then an inverse testing process is employed to exclude the false positive instances from positive bags and to select the high representative degree instances ordered by the number of covered instances from training bags.Finally,a similarity measure function is used to convert the training bag into a single sample and CCA is again used to classification for the converted samples.Experimental results on synthetic data and standard benchmark datasets demonstrate that MilCa can decrease the number of the selected instances and it is competitive with the state-of-the-art MIL algorithms. 相似文献

2.

Learnability of multi-instance multi-label learning

Wei Wang ZhiHua Zhou 《科学通报(英文版)》2012,57(19):2492-2495

Multi-instance multi-label learning(MIML) is a new machine learning framework where one data object is described by multiple instances and associated with multiple class labels.During the past few years,many MIML algorithms have been developed and many applications have been described.However,there lacks theoretical exploration to the learnability of MIML.In this paper,through proving a generalization bound for multi-instance single-label learner and viewing MIML as a number of multi-instance single-label learning subtasks with the correlation among the labels,we show that the MIML hypothesis class constructed from a multi-instance single-label hypothesis class is PAC-learnable. 相似文献

3.

Incremental frequent tree-structured pattern mining from semi-structured data

ChenEnhong LinLe WuGongqing 《高技术通讯(英文版)》2005,11(1):6-8

The paper studies the problem of incremental pattern mining from semi-structrued data. When a new dataset is added into the original dataset, it is difficult for existing pattern mining algorithms to incrementally update the mined results. To solve the problem, an incremental pattern mining algorithm based on the rightmost expansion technique is proposed here to improve the mining performance by utilizing the original mining results and information obtained in the previous mining process. To improve the efficiency, the algorithm adopts a pruning technique by using the frequent pattern expansion forest obtained in mining processes. Comparative experiments with different volume of initial datasets, incremental datasets and different minimum support thresholds demonstrate that the algorithm has a great improvement in the efficiency compared with that of non-incremental pattern mining algorithm. 相似文献

4.

A multi-label classification algorithm based on label-specific features

Huaqiao Qu Shichao Zhang Huawen Liu Jianmin Zhao 《武汉大学学报:自然科学英文版》2011,16(6):520-524

Aiming at the problem of multi-label classification,a multi-label classification algorithm based on label-specific features is proposed in this paper.In this algorithm,we compute feature density on the positive and negative instances set of each class firstly and then select mk features of high density from the positive and negative instances set of each class,respectively;the intersection is taken as the label-specific features of the corresponding class.Finally,multi-label data are classified on the basis of label-specific features.The algorithm can show the label-specific features of each class.Experiments show that our proposed method,the MLSF algorithm,performs significantly better than the other state-of-the-art multi-label learning approaches. 相似文献

5.

CFSBC： Clustering in High-Dimensional Space Based on Closed Frequent Item Set

NIWei-wei SUNZhi-hui 《武汉大学学报:自然科学英文版》2004,9(5):590-594

Clustering in high-dimensional space is an important domain in data mining. It is the process of discovering groups in a high-dimensional dataset, in such way, that the similarity between the elements of the same cluster is maximum and between different clusters is minimal. Many clustering algorithms are not applicable to high dimensional space for its sparseness and decline properties. Dimensionality reduction is an effective method to solve this problem. The paper proposes a novel clustering algorithm CFSBC based onclosed frequent hemsets derived from association rule mining. which can get the clustering attributes with high efficiency. The algorithm has several advantages. First, it deals effectively with the problem of dimensionality reduction. Second, it is applicable to different kinds of attributes, Third, it is suitable for very large data sets. Experiment shows that the proposed algorithm is effective and efficient 相似文献

6.

Mining Frequent Closed Itemsets in Large High Dimensional Data

余光柱曾宪辉邵世煌《东华大学学报(英文版)》2008,25(4):416-424

Large high-dimensional data have posed great challenges to existing algorithms for frequent itemsets mining. To solve the problem, a hybrid method, consisting of a novel row enumeration algorithm and a column enumeration algorithm, is proposed. The intention of the hybrid method is to decompose the mining task into two subtasks and then choose appropriate algorithms to solve them respectively. The novel algorithm, i.e., Intertransaction is based on the characteristic that there are few common items between or among long transactions. In addition, an optimization technique is adopted to improve the performance of the intersection of bit-vectors. Experiments on synthetic data show that our method achieves high performance in large high-dimensional data. 相似文献

7.

Intrusion detection based on rough set and artificial immune

《高技术通讯(英文版)》2016,(4):368-375

In order to increase intrusion detection rate and decrease false positive detection rate , a novel intrusion detection algorithm based on rough set and artificial immune ( RSAI-IDA) is proposed. Using artificial immune in intrusion detection , anomaly actions are detected adaptively , and with rough set , effective antibodies can be obtained .A scheme , in which antibodies are partly generated randomly and others are from the artificial immune algorithm , is applied to ensure the antibodies di-versity.Finally, simulations of RSAI-IDA and comparisons with other algorithms are given .The ex-perimental results illustrate that the novel algorithm achieves more effective performances on anomaly intrusion detection , where the algorithm ’ s time complexity decreases , the true positive detection rate increases , and the false positive detection rate is decreased . 相似文献

8.

Motion Planning Method for Obstacle Avoidance of 6-DOF Manipulator Based on Improved A* Algorithm

汪首坤朱磊《东华大学学报(英文版)》2015,32(1):79-85

The conventional A* algorithm may suffer from the infinite loop and a large number of search data in the process of motion planning for manipulator. To solve the problem,an improved A* algorithm is proposed in this paper by the means of selecting middle points and applying variable step segments searching during the searching process. In addition,a new method is proposed for collision detection in the workspace. In this paper,the MOTOMAN MH6 manipulator with 6-DOF is applied for motion plan. The algorithm is based on the basis of the simplification for the manipulator and obstacles by cylinder enveloping. Based on the analysis of collision detection,the free space can be achieved which makes it possible for the entire body to avoid collisions with obstacles. Compared with the Conventional A*,the improved algorithm deals with less searching points and performs more efficiently. The simulation developed in VC + + with OpenGL and the actual system experiments prove effectiveness and feasibility of this improved method. 相似文献

9.

Application of marching cubes algorithm in visualization of mineral deposits

Dewen Seng Zhongxue Li Cuiping Li Chunmin Li 《矿物冶金与材料学报》2005,12(3):203-207

An implementation scheme of the marching cubes (MC) algorithm was presented for the visualization of mineral deposits.The basic principles, processes and pitfalls of the MC algorithm were discussed. The asymptotic decider algorithm was employed to solve the ambiguity problem associated with the MC algorithm. The implementation scheme was applied to model and reconstruct the surfaces of mineral deposits, using the geological data obtained from an iron mine in China. Experimental results demonstrate the ability of the implementation scheme to solve the ambiguity problem, and illustrate the effectiveness and efficiency of the MC algorithm in the visualization of mineral deposits. 相似文献

10.

An Improved Algorithm for k-Nearest-Neighbor Finding and Surface Normals Estimation

赵灿孟祥林《清华大学学报》2009,(Z1):77-81

This paper is to improve the speed of k-nearest-neighbor search and put forward algorithms related to tangent plane estimation based on existing methods. Starting from the points cloud, the algorithm segments the whole data into many different small cubes in space, and the size of cube is related to the density of the points cloud. Considering the position of the point in the cube, the algorithm enlarges the area around the given point step by step until the k-nearest-neighbor is accomplished. The neighbor’s least-squares tangent plane is estimated. In order to orient the planes, the k-nearest-neighbor is introduced into the problem of seeking the minimum spanning trees instead of searching the whole data. The research proved that the algorithms put forward in this paper were effective in processing data in short time and with high precision. The theory was useful for the practical application in reverse engineering and other areas related. Solution for finding k-nearest-neighbor problem, which still costs much time in present, was provided, and a propagation algorithm for orienting the planes was also discussed. The algorithm chose the orientation among the k-nearest-neighbor of the current point. 相似文献

11.

彩色图像的边缘检测

李轩张红《科学技术与工程》2018,18(13)

彩色图像的边缘检测算法有很多,但检测结果仍存在细节丢失、抗噪性差的问题。本文提出一种新的彩色图像的边缘检测算法。首先,在RGB空间下,采用矢量梯度算法,分别在三个通道中求出像素的梯度值。然后,采用自适应双阈值算法,对梯度值进行判断,筛选出图像的边缘点。最后,只采用筛选后的边缘点的信息,构造新的模糊矩阵的隶属函数,从而形成新的模糊算法。利用该算法,检测出最终的图像边缘信息。实验证明,该算法得到的图像检测结果,不仅对比度高、边缘连续、细节保存完整,而且具有较强的抗噪性。相似文献

12.

时间表理论中LPT算法在装箱问题中的应用

江厚元《贵州科学》1992,10(4):25-31

本文利用二分搜索法和时间表理论中LPT算法求解装箱问题的近似最优解;给出了一个直观性算法,并研究这个算法的最坏情形,最后说明此算法在某些方面优于著名的FFD算法。相似文献

13.

自适应滤波算法综述 总被引：34，自引：0，他引：34

邹艳碧高鹰《广州大学学报(自然科学版)》2002,1(2):44-50

基于二阶统计量即相关函数的自适应滤波算法在自适应信号处理中有着广泛的应用，本文就国内外对这些算法的研究进行了综述。相似文献

14.

基于老化算法的分布式文件缓存算法

刘敏房至一王红斌徐鹏《吉林大学学报(理学版)》2011,49(5):895-900

为提升数据检索读的性能, 基于老化算法采取Cache方法, 通过设计合理的缓存结构, 给出一种新的分布式文件缓存算法. 该算法在缓存实现部分, 使用了LRU算法中常用的老化算法, 并将其由一个页面置换算法改进为一个文件缓存替换算法, 且在该过程中完好地继承了老化算法的优点. 评测结果显示了改进方法的有效性. 相似文献

15.

EM算法理论及其应用 总被引：3，自引：0，他引：3

杨基栋《安庆师范学院学报(自然科学版)》2009,15(4):30-35

EM算法是一种迭代算法,主要用来计算后验分布的众数或极大似然估计,广泛地应用于缺损数据、截尾数据、成群数据、带有讨厌参数的数据等所谓的不完全数据的统计推断问题。在介绍EM算法的基础上,针对EM算法收敛速度慢的缺陷,具体讨论了加速EM算法：EMB算法和MEMB算法;针对EM算法计算的局限性,给出了EM算法的推广：GEM和MCEM算法。最后给出了EM的实值实例,结果精确。相似文献

16.

基于Bresenham算法的四步画直线算法 总被引：12，自引：0，他引：12

林笠 Rong CHEN 《暨南大学学报(自然科学与医学版)》2003,24(5):19-22

通过分析计算机图形学中的画直线的Bresenham算法，以及由此改进的“对称算法”、“二步法”，提出将“对称算法”和“二步法”结合形成“4—点画线算法”，与Bresenham算法相比，该算法可以将画线效率提高近2倍。相似文献

17.

一种改进的KR模式匹配算法

尚俊平刘合兵《河南科学》2012,30(4):473-476

在分析BF、KMP和KR等模式匹配算法的基础上提出一种改进的KR算法(IKR),在产生哈希冲突时利用双向比较法进行匹配.实验结果表明,该算法可以快速有效地进行模式匹配. 相似文献

18.

几种模式匹配算法的效率分析

巫喜红《大庆师范学院学报》2007,27(2):50-52

分析几种模式匹配算法如KMP、BM、RK、SO。通过上机实验对这些算法的匹配时间进行测试,结果表明在这些模式匹配算法中BM算法是速度最快效率最高的算法。相似文献

19.

排课系统中混合算法的应用研究

马振飞刘晓丽《辽宁大学学报(自然科学版)》2007,34(3):237-240

排课系统比较复杂又具有智能特点,其算法主要有模拟手工算法、回溯算法、遗传算法、贪心算法等.在软件开发过程中,发挥每种算法优点以提高排课的科学性、高效性和合理性是个重要课题.结合成功研制排课系统的经验,阐述了不同算法的应用,提出了通过所有算法的混合应用解决排课问题的方法. 相似文献

20.

大数快速模幂算法的研究 总被引：1，自引：0，他引：1

陈艳波唐四云王学理《科学技术与工程》2006,6(5):625-627

大数模幂在现代密码学领域有着广泛的应用，它是RSA．ELGamal等公钥密码的基本运算。对目前具有典型代表的各种大数模幂算法进行分析，从基本设计原理和实现角度对这些模幂算法进行分类，归纳并给出了各类算法的实现方法、优缺点和研究现状。相似文献