Displaying posts filed under

Data analysis

All posts related to data analysis

Dec
23
2009

VBA function for multiple comparison

In SAS, lsmeans treatment /pdiff of PROC mixed or GLM output mean values and a matrix of probability values of all pair wise comparison. However, we have to create the traditional comparison table by ourselves. In order to automate the process, I developed a VBA function to do the job. The source code is the [...]

Nov
4
2009

Construction of objective function in optimization problem

Many optimization procedures require a single objective function. How to construct an efficient objective function in optimization problem is a critical. Sometimes we have multiple goals, how to create a single function to integrate all the goals together? Sometimes, even though we have a single goal, but we have specific aspect of requests for the [...]

Oct
8
2009

Matlab Toolbox: Fuzzy Clustering and Data Analysis Toolbox

I googled “fuzzy clustering and matlab” and wanted to find some tools for data clustering today. I came across a very good MATLAB toolbox at http://www.fmt.vein.hu/softcomp/fclusttoolbox/. In the web page, you can find source code and a detailed documentation that describes several popular clustering methods and validation methods, too. The Toolbox is a collection of [...]

Sep
13
2009

Table with Excel – TWE 2.0

Introduction
Microsoft Excel is very popular spreadsheet software. It is widely used in education, in scientific research, in managing financial activities. Nowadays, the common versions of Microsoft Excel are version 2003 and 2007. The functions of these versions are very powerful and include enormous functions and possibilities.
Scientists and engineers use Microsoft Excel to record and keep [...]

Aug
27
2009

实用统计软件MINITAB讲座

1996年,中国农科院棉花研究所科研研究人员的基本科研素质需要提高, 我根据当时DOS版MINITAB小巧实用、功能齐全的特点, 编写了一本详细的MINITAB在农业试验统计中应用的讲座,并举办了为期一周的培训班,使科研研究人员掌握了其基本的统计功能, 能够独立地完成自己的试验统计工作。本讲座是在原讲稿的基础上浓缩加工而成, 下面列出了该讲座的主要内容。

统计分析与MINITAB. 计算机农业应用. 1996, (2):43-45
MINITAB命令概述. 计算机与农业. 1996, (3):40-43; 1996, (4):40-41
MINITAB软件数据输入、输出与编辑. 计算机与农业. 1997, (1):30-37
MINITAB的基本统计命令与假设测验. 计算机与农业. 1997, (3):29-33
MINITAB在方差分析中的应用. 计算机与农业. 1997, (4):28-34
MINITAB在常用试验统计分析中的应用. 计算机与农业. 1998, (1):26-31
MINITAB在回归和相关分析中的应用. 计算机与农业. 1998, (2):31
MINITAB在次数资料测验中的应用. 计算机与农业. 1998, (3):37,1998,(4):40
MINITAB在非参数测验中的应用. 计算机与农业. 1999, (1):34
MINITAB在回归设计试验结果统计分析中的应用. 计算机与农业. 1999, (2):32

感谢“计算机与农业”杂志编辑部的胡燕老师的支持和帮助, 使本讲座得以与全国的广大读者见面。在讲座连载期间,我收到很多朋友的来信,由于当时比较忙,有些问题并未给予详细回答。 在目前计算机操作系统广泛升级到WINDOWS的情况下,该软件仍然可以使用,对计算机硬件资源不是十分先进的单位和个人, 建议继续使用它。如果你有足够的资金,可以向MINITAB.COM购买最新的WINDOWS版本,目前是15版。对WINDOWS版的MINITAB,绝大多数程序仍然适用,只要将文中的程序复制到命令窗口,还可以达到同样的结果,说明除了MINITAB增强和改进的功能以外, 其基本的核心功能继承了下来。如你在使用过程中仍然有问题,请跟贴。为了方便用户,我将所有的讲座内容制作成一个PDF文档,你可以下载并打印作为参考手册使用。
为了学习方便,你可以下载MINITAB的DOS版本, 本讲座中的实例均是基于这个版本的。你也可以访问MINITAB.COM, 下载当前最新的Windows版本,本讲座的绝大多数实例仍然可以使用。
为了顺应时代的发展,我正在撰写一套全新的讲座,它不仅给出了MINITAB的实例,同时有SAS和R的实例。内容方面有极大的扩展,加入了多元统计方法和广泛使用的混合数据模型的分析方法,原来的讲义没有图形部分,新的讲座包括了各种各样统计图表。希望给广大的科技工作者提供一个系统的、全面的、即查即用的手册。新的讲座将首先以单独的文章形式发表在本站。

[...]

Aug
27
2009

An example for generating experimental layout

Dr. Roe asked me to generate a pot map for her growth chamber experiments. Since she will do the experiments again and again. One fixed map is not good. I wrote some VBA code in Excel and create an Excel Add-in. This program can generate pot layout randomly. All you have to do is put [...]

Aug
27
2009

Chi Sqaure test in Excel

It is pretty easy to do Chi Square Test in Excel. First you need arrange your dat in table with one or two categories, then calculate the theretical value for each interactions (cells). Then just put the actual data and theoretical data into CHITEST worksheet function.
Sometimes that you need deal with your raw data first, [...]

Aug
27
2009

Extracting data from the searching results of citation database

Today (Sept. 25, 2002), my major advisor asked me to extract citation data from one text file he created and sent me through email. After I looked at the data, I realized that the data came from the citation databases, such as AGRICOLA, Biological Abstract, CAB, and so on. The requirements that you asked me [...]

Aug
27
2009

Search maximum and minimum of a function in a given range and its corresponding value of independent variable

More often, we need find the maximum or minimum value of a function within a given range. In Excel, we can develop a user-defined worksheet function to fulfill this goal. Here provides a user-defined function called minmax. This function is very powerful because it can be applied to any polynomial equation.
The definition of the [...]

Aug
27
2009

Estimation of confidence interval and predicted values of the dependent variable in a linear equation

Here provides a user-defined function to estimate predicted values of the dependent variable and corresponding confidence interval for a given dataset with linear relationship. The function definition is the following:
yhat(Y, X, XNew, CI, alpha)
Where, Y includes values of the dependent variable; X includes values of independent variables (can includes multiple columns (variables)); XNew includes new [...]