期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Agnostic Science. Towards a Philosophy of Data Analysis

D. Napoletani M. Panza D. C. Struppa 《Foundations of Science》2011,16(1):1-20

In this paper we will offer a few examples to illustrate the orientation of contemporary research in data analysis and we will investigate the corresponding role of mathematics. We argue that the modus operandi of data analysis is implicitly based on the belief that if we have collected enough and sufficiently diverse data, we will be able to answer most relevant questions concerning the phenomenon itself. This is a methodological paradigm strongly related, but not limited to, biology, and we label it the microarray paradigm. In this new framework, mathematics provides powerful techniques and general ideas which generate new computational tools. But it is missing any explicit isomorphism between a mathematical structure and the phenomenon under consideration. This methodology used in data analysis suggests the possibility of forecasting and analyzing without a structured and general understanding. This is the perspective we propose to call agnostic science, and we argue that, rather than diminishing or flattening the role of mathematics in science, the lack of isomorphisms with phenomena liberates mathematics, paradoxically making more likely the practical use of some of its most sophisticated ideas. 相似文献

2.

Developments in Data Analysis, edited by A. Ferligoj and A. Kramberger

Dawn Iacobucci 《Journal of Classification》1999,16(1):144-146

相似文献

3.

B. Le Roux and H. Rouanet, Geometric Data Analysis, From Correspondence Analysis to Structured Data Analysis, Dordrecht, Kluwer, 2004, pp. xi + 475.

Fionn Murtagh 《Journal of Classification》2008,25(1):137-141

相似文献

4.

小资料

《中国科技术语》1999,1(1):33

1996年4月11日全国自然科学名词审定委员会(现称全国科学技术名词审定委员会)与中国天文学会联合召开新闻发布会,对21世纪起始年问题作了介绍,并提出了建议,现重新登载,供参考：关于21世纪起始年问题的意见随着21世纪的临近,关于其起始年是2000年还是2001年的问题已日益为各界关心,新闻界和其他媒体的各种报导、文章中出现的矛盾日趋严重,现经全国自然科学名词审定委员会与中国天文学会研究,就此问题向各位先生作些介绍并提出我们的意见。“世纪”(Century)的概念包含两个方面,一是指每100年为一个世纪;二是这100年从哪年到哪年。其中前者国内外均无异议,后者国内外一直有争议,目前没有明确、统一的国际法规。由此造成的影响相当严重,从中央党政领导重要讲话、重要社论,直至新闻、教学、科普教育、出版等方面,涉及面很宽,影响较大(见附件本资料中的附件这次均未附——本刊第1、5、6、19、20页)。全国名词委近6年来一直在组织有关学科的专家对此问题(属多学科交叉问题)进行调查、研究,1993年天文学名词审定委员会曾提出了“世纪”和“年代”都从“0”年起的建议(附件第6页)。最近,1996年3月15日,英国皇家格林尼治天文台发表公告称21世纪应始于2001年(附件第1、2、3、4页),此消息发布后使许多人不知所从,或误解为这是国际上结论性的意见。我们认为有必要进行科学性和实事求是的说明和引导。一、我们建议21世纪从“0”年开始公元从“1”年起始(无“0”年),这是公认的历史,按此类推“世纪”应从1～0,即21世纪为2001年至2100年(第14页)。但实际上有一系列问题,我们认为从客观实际出发,今后以“0”年为起始更好,其理由是：1.“世纪”和“年代”的纪元法协调一致为好。“年代”划分从“0”开始,已无异议。例如20世纪90年代从1990年至1999年底。如果“世纪”从“1”开始,则“世纪”交界的“00”年将无法归入任何一个“年代”。例如若21世纪定为2001年,则2000年既不能归入20世纪90年代,也不能归入21世纪初十年代(附件第13页)。2.一般量的起始原点为“0”(长度、重量等),“世纪”从“0”开始符合计量的一般规律。3.符合国际公众的观念和当前趋势(附件第1、5、10、11、24页)。4.适应计算机广泛使用的现代信息技术潮流。5.鉴于公元无0年,可认定公元一世纪仅含99年,以后就从“0”开始,每一世纪均为100年。二、当前几点意见1.在国际权威机构(例如联合国)未作正式规定之前,我们建议21世纪从2000年开始。2.我国重要的文件中(如公告、社论、中央文件、领导讲话等),涉及世纪时,一般要避免使用精确数字表达“世纪”起始问题(例如“再过×年就进入21世纪”;“本世纪末将达到什么指标”等等),而应尽量使用确切的起止年份(如1996年到2000年)。泛指性的表达,可继续使用“世纪”一词(如：“世纪之交”、“迎接新世纪”等等)。3.国际有关组织已确定的事件和计划,报导时与世纪起始有关的,按原意不作改动(如国际卫生组织确定的“世纪婴儿”等等)。4.促进国际协调一致“世纪”应是国际性的约定,某一个国家或学术团体的意见都不能作为国际制约性权威意见,这个问题又不宜各自为政,自行其是,建议我国有关部门多渠道促进国际(例如通过联合国)协调一致,力求形成一个国际性的权威意见,解决这个每一百年都会遇到的历史性问题。5.今后凡跨越公元前、后的历史事件和纪念活动,应注意公元无“0”年的历史事实(附件第12、14页)。下一个十年怎么称呼人类即将迈入一个新世纪。在此之前,每隔十年,算一个年代,如20世纪80年代、90年代。但下一个世纪头10年应该怎么称呼？迄今为止,世界上竟然没有任何一个国家想出一个既简单又好听的名字。21世纪头10年究竟应该如何简称,如果说成“21世纪头零零年代中期”或“21世纪头10年中期”,多拗口！为了解决这个世纪难题,“美国之音”最近发起一场全球性的竞赛,看谁能为下个世纪的头10年起一个好名字。历史学家肯·戴维斯建议就称作“头10年”。很多人或许会说,20世纪我们是怎么过来的？当初人们是怎么称呼它的,现在依葫芦画瓢不就行了？令人遗憾的是,谁也不知道当初人们是怎么简称20世纪头10年的。历史学家证实,在美国,直到第一次世界大战后,才开始以10年划分一个时间段。而在此之前,人们问日期,要么直接回答哪一年,要么就说谁执政时期。(《深圳商报》1999.2.7 秋凌文)本刊注：1993年以来全国名词委天文学名词审定委员会多次讨论21世纪起始年问题,同时研讨了下一个十年的称呼问题,提出的方案主要是“21世纪初十年代”。欢迎大家提出意见。21世纪从哪一年算起？2000年还是2001年？追根溯源,盖因公元无零年。欧洲人把耶酥诞生那年计为公元1年,它之前是公元前1年。为什么公元无零年呢？原来,古代欧洲人用罗马数码计数,而罗马数码只有Ⅰ、Ⅴ、Ⅹ、L、C、D和M七个,分别代表1、5、10、50、100、500和1000,用以组合任何自然数,如：Ⅳ—4,Ⅷ—8,ⅩL—40,ⅩCⅨ—99,CMⅩC—990,等等,“零”不是自然数,无罗马数码,用罗马数码计数的古欧洲人不可能有设置公元零年的念头。(石磬) * 本资料中的附件这次均未附——本刊相似文献

5.

Clustering Functional Data 总被引：1，自引：0，他引：1

Thaddeus Tarpey Kimberly K. J. Kinateder 《Journal of Classification》2003,20(1):093-114

相似文献

6.

Data Analysis and Information Systems: Statistical and Conceptual Approaches, edited by H.-H. Bock and W. Polasek

F.James Rohlf 《Journal of Classification》1999,16(1):139-140

相似文献

7.

科学数据共享平台之数据联盟模式初探 总被引：2，自引：0，他引：2

刘润达赵辉李大玲《中国基础科学》2010,12(6):27-32

数据资源建设是科学数据共享平台建设的重要内容之一,也是其难点之一。本文比较了我国科学数据共享平台资源建设的三种主要模式,指出数据交换模式更适合于当前目标定位,即建设开放共享的科学数据共享平台;进一步提出了科学数据联盟模式,由此构建的科学数据共享平台可灵活地解决数据资源建设和整合问题,从而提高数据的利用率,提升服务质量;并提出了当前构建科学数据联盟模式的科学数据共享平台的两点建议;最后以地球系统科学数据共享网的建设为例进行了实例说明。本研究将为我国科学数据共享平台建设提供参考。相似文献

8.

Advances in Data Science and Classification, edited by A. Rizzi, M. Vichi, and H.-H. Bock

Michael P. Windham 《Journal of Classification》2001,18(2):289-290

相似文献

9.

科学数据发布策略探讨与气象数据共享实践 总被引：2，自引：0，他引：2

李集明《中国基础科学》2003,(1):56-62

科学数据是国家最重要的信息资源,在宏观决策、经济建设、科技创新和社会发展中发挥着不可替代的作用。科学数据需要通过不同方式的发布才能流通、传播并被各领域应用,才能产生经济和社会效益,实现其价值。可以认为,科学数据发布策略的制相似文献

10.

Concavity in Data Analysis

Michael P. Windham 《Journal of Classification》2003,20(1):077-092

M -estimators, and building clustering objective functions. Finally, using the common thread of concavity, all three will be combined to build a comprehensive, flexible procedure for robust cluster analysis. 相似文献

11.

Accurate Tree-based Missing Data Imputation and Data Fusion within the Statistical Learning Paradigm

Antonio D’Ambrosio Massimo Aria Roberta Siciliano 《Journal of Classification》2012,29(2):227-258

Framework of this paper is statistical data editing, specifically how to edit or impute missing or contradictory data and how to merge two independent data sets presenting some lack of information. Assuming a missing at random mechanism, this paper provides an accurate tree-based methodology for both missing data imputation and data fusion that is justified within the Statistical Learning Theory of Vapnik. It considers both an incremental variable imputation method to improve computational efficiency as well as boosted trees to gain in prediction accuracy with respect to other methods. As a result, the best approximation of the structural risk (also known as irreducible error) is reached, thus reducing at minimum the generalization (or prediction) error of imputation. Moreover, it is distribution free, it holds independently of the underlying probability law generating missing data values. Performance analysis is discussed considering simulation case studies and real world applications. 相似文献

12.

中国地学大断面与深部地球物理资料整理工作进展

方盛明杨健孙国伟《中国基础科学》2002,(5):12-15

中国地学大断面与深部地球物理资料整理是一项重要的基础性工作,其重要意义在于抢救了投入巨资获得的相当珍贵的科技资料.本文简要介绍了中国地学大断面的有关情况,重点介绍了地学大断面与深部地球物理资料整理的工作进展和已取得的主要成果. 相似文献

13.

Combinatorial Data Analysis: Optimization by Dynamic Programming, by L. Hubert, P. Arabie, and J. Meulman

Michael J. Brusco 《Journal of Classification》2001,18(2):276-280

相似文献

14.

Unfolding Incomplete Data: Guidelines for Unfolding Row-Conditional Rank Order Data with Random Missings

Frank M. T. A. Busing Mark de Rooij 《Journal of Classification》2009,26(3):329-360

Unfolding creates configurations from preference information. In this paper, it is argued that not all preference information needs to be collected and that good solutions are still obtained, even when more than half of the data is missing. Simulation studies are conducted to compare missing data treatments, sources of missing data, and designs for the specification of missing data. Guidelines are provided and used in actual practice. 相似文献

15.

大数据算法的歧视本质

《自然辩证法研究》2017,(5):81-86

作为从海量数据中挖掘出知识的重要手段,大数据算法日益受到世人的重视。但大数据算法的负面效应——歧视性也日益凸显。本文先从文化、技术哲学和心理学等角度阐述了大数据算法的歧视内涵,并从三个不同层面讨论了大数据算法存在歧视性的内在必然性,最后给出缓解歧视性的三个方法。相似文献

16.

Classification of Asymmetric Proximity Data

Donatella Vicari 《Journal of Classification》2014,31(3):386-420

When clustering asymmetric proximity data, only the average amounts are often considered by assuming that the asymmetry is due to noise. But when the asymmetry is structural, as typically may happen for exchange flows, migration data or confusion data, this may strongly affect the search for the groups because the directions of the exchanges are ignored and not integrated in the clustering process. The clustering model proposed here relies on the decomposition of the asymmetric dissimilarity matrix into symmetric and skew-symmetric effects both decomposed in within and between cluster effects. The classification structures used here are generally based on two different partitions of the objects fitted to the symmetric and the skew-symmetric part of the data, respectively; the restricted case is also presented where the partition fits jointly both of them allowing for clusters of objects similar with respect to the average amounts and directions of the data. Parsimonious models are presented which allow for effective and simple graphical representations of the results. 相似文献

17.

Inequalities Between Similarities for Numerical Data

Matthijs J. Warrens 《Journal of Classification》2016,33(1):141-148

Similarity measures are entities that can be used to quantify the similarity between two vectors with real numbers. We present inequalities between seven well known similarities. The inequalities are valid if the vectors contain non-negative real numbers. 相似文献

18.

科学数据资源与共享 总被引：8，自引：1，他引：8

孙九林《中国基础科学》2003,(1):30-33

一、科学数据资源 1.几个基本概念 (1)数据:数据是人类为了生存,而与自然界进行斗争的产物,是人们为了认识和改造世界,而用于记录世界的一种符号。它借助于“数字”或其它符号去勾画和记录现实世界客体的本质、特征以及运动相似文献

19.

科学数据姿源的管理 总被引：4，自引：0，他引：4

董诚黄鼎成《中国基础科学》2006,8(6):20-24

本文从科学数据管理的整体角度出发,对科学数据共享的一些基本问题进行了讨论。提出了“科学数据作为资源的价值的发挥,在于对科学数据、科学数据资源、科学数据活动的有效管理”的理念;阐述了科学数据管理及其时代特点、管理的原则;论述了科学数据资源建设必须以需求为导向;并基于分类分级的共享机制,探讨了数据发布问题。相似文献

20.

科学数据共享标准体系框架 总被引：12，自引：0，他引：12

徐枫《中国基础科学》2003,(1):44-49

一、科学数据共享的内涵科学数据共享,旨在通过应用现代信息技术手段,从国家宏观层次上营造一种科学数据共享氛围,使政府部门、科技教育工作者和社会公众可以广泛、有效、便捷地获取所需要的科学数据,对科学数据进行有序管理,实现科学数据拥有者为政府部门和广相似文献