The sample mean (sample average) or empirical mean (empirical average), and the sample covariance or empirical covariance are statistics computed from a sample of data on one or more random variables.

The sample mean is the average value (or mean value) of a sample of numbers taken from a larger population of numbers, where "population" refers not to a number of people but to the entirety of relevant data, whether collected or not. A sample of 40 companies' sales from the Fortune 500 might be used for convenience instead of looking at the population, all 500 companies' sales. The sample mean is used as an estimator for the population mean, the average value in the entire population, where the estimate is more likely to be close to the population mean if the sample is large and representative. The reliability of the sample mean is estimated using the standard error, which in turn is calculated using the variance of the sample. If the sample is random, the standard error falls with the size of the sample and the sample mean's distribution approaches the normal distribution as the sample size increases.

The term "sample mean" can also be used to refer to a vector of average values when the statistician is looking at the values of several variables in the sample, e.g. the sales, profits, and employees of a sample of Fortune 500 companies. In this case, there is not just a sample variance for each variable but a sample variance-covariance matrix (or simply covariance matrix) showing also the relationship between each pair of variables. This would be a 3×3 matrix when 3 variables are being considered. The sample covariance is useful in judging the reliability of the sample means as estimators and is also useful as an estimate of the population covariance matrix.

Due to their ease of calculation and other desirable characteristics, the sample mean and sample covariance are widely used in statistics to represent the location and dispersion of the distribution of values in the sample, and to estimate the values for the population.

Definition of the sample mean

The sample mean is the average of the values of a variable in a sample, which is the sum of those values divided by the number of values. Using mathematical notation, if a sample of N observations on variable X is taken from the population, the sample mean is:

$$\bar{x} = \frac{1}{N}\sum_{i=1}^{N} x_i.$$

Under this definition, if the sample (1, 4, 1) is taken from the population (1,1,3,4,0,2,1,0), then the sample mean is $\bar{x} = (1+4+1)/3 = 2$, as compared to the population mean of $\mu = (1+1+3+4+0+2+1+0)/8 = 12/8 = 1.5$. Even if a sample is random, it is rarely perfectly representative, and other samples would have other sample means even if the samples were all from the same population. The sample (2, 1, 0), for example, would have a sample mean of 1.
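
The arithmetic in this example can be checked with a few lines of Python. This is only a minimal sketch: the helper name `sample_mean` is introduced here for illustration, and exact fractions are used so that no rounding is involved.

```python
from fractions import Fraction

def sample_mean(values):
    """Sum of the values divided by the number of values, as an exact fraction."""
    values = list(values)
    if not values:
        raise ValueError("sample_mean requires at least one observation")
    return sum(map(Fraction, values)) / len(values)

population = [1, 1, 3, 4, 0, 2, 1, 0]

print(sample_mean([1, 4, 1]))   # 2
print(sample_mean([2, 1, 0]))   # 1
print(sample_mean(population))  # 3/2
```

Both samples come from the same population, yet their means (2 and 1) differ from each other and from the population mean of 1.5.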

If the statistician is interested in K variables rather than one, each observation having a value for each of those K variables, the overall sample mean consists of K sample means for individual variables. Let $x_{ij}$ be the ith independently drawn observation (i=1,...,N) on the jth random variable (j=1,...,K). These observations can be arranged into N column vectors, each with K entries, with the K×1 column vector giving the i-th observations of all variables being denoted $\mathbf{x}_i$ (i=1,...,N).

The sample mean vector $\bar{\mathbf{x}}$ is a column vector whose j-th element $\bar{x}_j$ is the average value of the N observations of the jth variable:

$$\bar{x}_j = \frac{1}{N}\sum_{i=1}^{N} x_{ij}, \quad j = 1, \ldots, K.$$

Thus, the sample mean vector contains the average of the observations for each variable, and is written

$$\bar{\mathbf{x}} = \frac{1}{N}\sum_{i=1}^{N} \mathbf{x}_i = \begin{bmatrix} \bar{x}_1 \\ \vdots \\ \bar{x}_j \\ \vdots \\ \bar{x}_K \end{bmatrix}.$$
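
As a rough illustration of the sample mean vector, the sketch below averages each of K variables over N observations. The data set and the function name `sample_mean_vector` are hypothetical and chosen only for this example.

```python
from fractions import Fraction

def sample_mean_vector(observations):
    """Return the K per-variable averages for N observations of length K."""
    n = len(observations)
    if n == 0:
        raise ValueError("at least one observation is required")
    num_vars = len(observations[0])
    if any(len(x) != num_vars for x in observations):
        raise ValueError("every observation must have the same number of variables")
    # Element j is the average of the j-th entry over all N observations.
    return [sum(Fraction(x[j]) for x in observations) / n for j in range(num_vars)]

# Hypothetical data: N = 4 observations on K = 3 variables.
observations = [
    (2, 10, 1),
    (4, 20, 0),
    (6, 30, 1),
    (8, 40, 0),
]

mean_vector = sample_mean_vector(observations)
print([str(m) for m in mean_vector])  # ['5', '25', '1/2']
```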

Definition of sample covariance

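The sample covariance matrix is a K-by-K matrix $\mathbf{Q} = [q_{jk}]$ with entries

$$q_{jk} = \frac{1}{N-1}\sum_{i=1}^{N} (x_{ij} - \bar{x}_j)(x_{ik} - \bar{x}_k),$$

where $q_{jk}$ is an estimate of the covariance between the jth variable and the kth variable of the population underlying the data. In terms of the observation vectors, the sample covariance is

$$\mathbf{Q} = \frac{1}{N-1}\sum_{i=1}^{N} (\mathbf{x}_i - \bar{\mathbf{x}})(\mathbf{x}_i - \bar{\mathbf{x}})^{\mathrm{T}}.$$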
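
The entrywise definition can be sketched in plain Python as follows. The function name `sample_covariance` and the data are assumptions made for illustration; the denominator is N − 1, as in the definition of the sample covariance.

```python
from fractions import Fraction

def sample_covariance(observations):
    """K-by-K sample covariance matrix (denominator N - 1) as nested lists."""
    n = len(observations)
    if n < 2:
        raise ValueError("at least two observations are required")
    num_vars = len(observations[0])
    means = [sum(Fraction(x[j]) for x in observations) / n for j in range(num_vars)]
    # Deviation of every observation from the sample mean vector.
    deviations = [[Fraction(x[j]) - means[j] for j in range(num_vars)]
                  for x in observations]
    # Entry (j, k) sums the products of deviations of variables j and k.
    return [[sum(d[j] * d[k] for d in deviations) / (n - 1)
             for k in range(num_vars)]
            for j in range(num_vars)]

# Hypothetical data: N = 4 observations on K = 3 variables.
observations = [(2, 10, 1), (4, 20, 0), (6, 30, 1), (8, 40, 0)]
Q = sample_covariance(observations)

for row in Q:
    print([str(q) for q in row])
```

The diagonal entries are the sample variances of the individual variables, and the matrix is symmetric because the products of deviations do not depend on the order of j and k.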

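Alternatively, arranging the observation vectors as the columns of a matrix, so that

$$\mathbf{F} = \begin{bmatrix} \mathbf{x}_1 & \mathbf{x}_2 & \cdots & \mathbf{x}_N \end{bmatrix},$$

which is a matrix of K rows and N columns. Here, the sample covariance matrix can be computed as

$$\mathbf{Q} = \frac{1}{N-1}\left(\mathbf{F} - \bar{\mathbf{x}}\,\mathbf{1}_N^{\mathrm{T}}\right)\left(\mathbf{F} - \bar{\mathbf{x}}\,\mathbf{1}_N^{\mathrm{T}}\right)^{\mathrm{T}},$$

where $\mathbf{1}_N$ is an N by 1 vector of ones. If the observations are arranged as rows instead of columns, so $\bar{\mathbf{x}}$ is now a 1×K row vector and $\mathbf{M} = \mathbf{F}^{\mathrm{T}}$ is an N×K matrix whose column j is the vector of N observations on variable j, then applying transposes in the appropriate places yields

$$\mathbf{Q} = \frac{1}{N-1}\left(\mathbf{M} - \mathbf{1}_N\bar{\mathbf{x}}\right)^{\mathrm{T}}\left(\mathbf{M} - \mathbf{1}_N\bar{\mathbf{x}}\right).$$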
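
The row-oriented matrix form can be sketched without any linear algebra library: centre the N×K data matrix by subtracting the row vector of column means from every row, then multiply the transpose of the centred matrix by the centred matrix and divide by N − 1. The name `covariance_from_rows` and the data are hypothetical.

```python
from fractions import Fraction

def covariance_from_rows(data):
    """Sample covariance of an N-by-K list of rows via the centred matrix product."""
    n = len(data)
    if n < 2:
        raise ValueError("at least two rows are required")
    columns = list(zip(*data))                         # K columns of length N
    x_bar = [sum(map(Fraction, col)) / n for col in columns]
    # Centred matrix: every row minus the row vector of column means.
    centred = [[Fraction(v) - m for v, m in zip(row, x_bar)] for row in data]
    centred_t = list(zip(*centred))                    # K-by-N transpose
    # (centred^T centred) / (N - 1), one inner product per entry.
    return [[sum(a * b for a, b in zip(col_j, col_k)) / (n - 1)
             for col_k in centred_t]
            for col_j in centred_t]

# Hypothetical N = 4 by K = 3 data matrix, one observation per row.
M = [[2, 10, 1], [4, 20, 0], [6, 30, 1], [8, 40, 0]]
Q_rows = covariance_from_rows(M)

for row in Q_rows:
    print([str(q) for q in row])
```

The result agrees entry by entry with the sum-of-products definition, since each entry of the matrix product is exactly one such sum.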

Like covariance matrices for random vectors, sample covariance matrices are positive semi-definite. To prove it, note that for any matrix $\mathbf{A}$ the matrix $\mathbf{A}^{\mathrm{T}}\mathbf{A}$ is positive semi-definite. Furthermore, a covariance matrix is positive definite if and only if the rank of the $\mathbf{x}_i - \bar{\mathbf{x}}$ vectors is K.
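
The positive semi-definiteness argument can be written out in one line. In this sketch, $\mathbf{v}$ is an arbitrary K×1 vector (a symbol introduced here for illustration), and the sample covariance is taken as the sum of outer products of the deviations of the observation vectors from the sample mean vector, divided by N − 1:

```latex
\mathbf{v}^{\mathrm{T}} \mathbf{Q}\, \mathbf{v}
  = \frac{1}{N-1} \sum_{i=1}^{N}
    \mathbf{v}^{\mathrm{T}} (\mathbf{x}_i - \bar{\mathbf{x}})
    (\mathbf{x}_i - \bar{\mathbf{x}})^{\mathrm{T}} \mathbf{v}
  = \frac{1}{N-1} \sum_{i=1}^{N}
    \left( \mathbf{v}^{\mathrm{T}} (\mathbf{x}_i - \bar{\mathbf{x}}) \right)^{2}
  \ge 0 .
```

The quadratic form is zero only when $\mathbf{v}$ is orthogonal to every deviation vector, which is impossible for nonzero $\mathbf{v}$ exactly when the deviations span all K dimensions.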

Unbiasedness

The sample mean and the sample covariance matrix are unbiased estimates of the mean and the covariance matrix of the random vector $\mathbf{X}$, a row vector whose jth element (j = 1, ..., K) is one of the random variables.[1] The sample covariance matrix has $N-1$ in the denominator rather than $N$ due to a variant of Bessel's correction: In short, the sample covariance relies on the difference between each observation and the sample mean, but the sample mean is slightly correlated with each observation since it is defined in terms of all observations. If the population mean $\operatorname{E}(\mathbf{X})$ is known, the analogous unbiased estimate

$$q_{jk} = \frac{1}{N}\sum_{i=1}^{N} \left(x_{ij} - \operatorname{E}(X_j)\right)\left(x_{ik} - \operatorname{E}(X_k)\right),$$

using the population mean, has $N$ in the denominator. This is an example of why in probability and statistics it is essential to distinguish between random variables (upper case letters) and realizations of the random variables (lower case letters).

The maximum likelihood estimate of the covariance

$$q_{jk} = \frac{1}{N}\sum_{i=1}^{N} (x_{ij} - \bar{x}_j)(x_{ik} - \bar{x}_k)$$

for the Gaussian distribution case has N in the denominator as well. The ratio of 1/N to 1/(N − 1) approaches 1 for large N, so the maximum likelihood estimate approximately equals the unbiased estimate when the sample is large.
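
The effect of the denominator can be checked exactly on a toy case with a single variable. The sketch below assumes a hypothetical population (1, 2, 3) and enumerates every equally likely ordered sample of size N = 2 drawn independently (with replacement). Averaging the N − 1 estimate over all samples recovers the population variance, while the estimate with N in the denominator is too small by the factor (N − 1)/N.

```python
from fractions import Fraction
from itertools import product

def variance_estimate(sample, denominator):
    """Sum of squared deviations from the sample mean, divided by `denominator`."""
    mean = Fraction(sum(sample), len(sample))
    return sum((x - mean) ** 2 for x in sample) / denominator

population = [1, 2, 3]   # hypothetical population
mu = Fraction(sum(population), len(population))
sigma2 = sum((x - mu) ** 2 for x in population) / len(population)

n = 2
samples = list(product(population, repeat=n))   # all 9 ordered samples of size 2

mean_unbiased = sum(variance_estimate(s, n - 1) for s in samples) / len(samples)
mean_ml = sum(variance_estimate(s, n) for s in samples) / len(samples)

print(sigma2, mean_unbiased, mean_ml)   # 2/3 2/3 1/3
```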

Distribution of the sample mean

For each random variable, the sample mean is a good estimator of the population mean, where a "good" estimator is defined as being efficient and unbiased. Of course the estimator will likely not be the true value of the population mean since different samples drawn from the same distribution will give different sample means and hence different estimates of the true mean. Thus the sample mean is a random variable, not a constant, and consequently has its own distribution.

Denoting with μ the population mean and with $\sigma^2$ the population variance, for a random sample of n independent observations drawn from the population, the expected value of the sample mean is

$$\operatorname{E}(\bar{x}) = \mu$$

and the variance of the sample mean is

$$\operatorname{var}(\bar{x}) = \frac{\sigma^2}{n}.$$
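
Both statements about the sample mean (its expected value is μ and its variance is σ²/n) can be verified exactly for a small case. The following sketch assumes a hypothetical population (1, 2, 3) and independent draws with replacement, and enumerates all ordered samples of size n; the function name `sample_mean_moments` is introduced here for illustration.

```python
from fractions import Fraction
from itertools import product

population = [1, 2, 3]   # hypothetical population
mu = Fraction(sum(population), len(population))
sigma2 = sum((x - mu) ** 2 for x in population) / len(population)

def sample_mean_moments(n):
    """Exact mean and variance of the sample mean over all ordered samples of size n."""
    means = [Fraction(sum(s), n) for s in product(population, repeat=n)]
    expected = sum(means) / len(means)
    variance = sum((m - expected) ** 2 for m in means) / len(means)
    return expected, variance

for n in (1, 2, 3, 4):
    expected, variance = sample_mean_moments(n)
    # The last two columns should agree: var(sample mean) versus sigma^2 / n.
    print(n, expected, variance, sigma2 / n)
```

The expected value stays at μ for every n, while the variance shrinks in proportion to 1/n.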

If the samples are not independent, but correlated, then special care has to be taken in order to avoid the problem of pseudoreplication.

If the population is normally distributed, then the sample mean is normally distributed as follows:

$$\bar{x} \sim N\left(\mu, \frac{\sigma^2}{n}\right).$$

If the population is not normally distributed, the sample mean is nonetheless approximately normally distributed if n is large and σ²/n < +∞. This is a consequence of the central limit theorem.

Weighted samples

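In a weighted sample, each vector $\mathbf{x}_i$ (each set of single observations on each of the K random variables) is assigned a weight $w_i \geq 0$. Without loss of generality, assume that the weights are normalized:

$$\sum_{i=1}^{N} w_i = 1.$$

(If they are not, divide the weights by their sum.) Then the weighted mean vector $\bar{\mathbf{x}}$ is given by

$$\bar{\mathbf{x}} = \sum_{i=1}^{N} w_i \mathbf{x}_i,$$

and the elements $q_{jk}$ of the weighted covariance matrix $\mathbf{Q}$ are [2]

$$q_{jk} = \frac{1}{1 - \sum_{i=1}^{N} w_i^2} \sum_{i=1}^{N} w_i (x_{ij} - \bar{x}_j)(x_{ik} - \bar{x}_k).$$

If all weights are the same, $w_i = 1/N$, the normalizing factor becomes $1/(1 - 1/N) = N/(N-1)$, so the weighted mean and covariance reduce to the sample mean and the (unbiased) sample covariance defined above.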
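
A minimal sketch of the weighted estimates follows. It assumes the weighted covariance uses the normalizing factor 1/(1 − Σ wᵢ²) applied to normalized weights; the function name `weighted_mean_and_covariance` and the data are hypothetical. With equal weights, the output can be compared against the ordinary sample mean and the N − 1 sample covariance.

```python
from fractions import Fraction

def weighted_mean_and_covariance(observations, weights):
    """Weighted mean vector and covariance matrix, under the stated assumption."""
    if len(observations) != len(weights):
        raise ValueError("one weight per observation is required")
    total = sum(map(Fraction, weights))
    w = [Fraction(wi) / total for wi in weights]       # normalize: weights sum to 1
    sum_sq = sum(wi ** 2 for wi in w)
    if sum_sq == 1:
        raise ValueError("covariance is undefined when one observation has all the weight")
    num_vars = len(observations[0])
    mean = [sum(wi * x[j] for wi, x in zip(w, observations)) for j in range(num_vars)]
    scale = 1 / (1 - sum_sq)
    cov = [[scale * sum(wi * (x[j] - mean[j]) * (x[k] - mean[k])
                        for wi, x in zip(w, observations))
            for k in range(num_vars)]
           for j in range(num_vars)]
    return mean, cov

# Hypothetical data with equal (unnormalized) weights.
observations = [(2, 10, 1), (4, 20, 0), (6, 30, 1), (8, 40, 0)]
w_mean, w_cov = weighted_mean_and_covariance(observations, [2, 2, 2, 2])

print([str(m) for m in w_mean])
for row in w_cov:
    print([str(q) for q in row])
```

Normalizing inside the function means callers may pass raw weights, such as counts, without dividing by their sum first.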

Criticism

The sample mean and sample covariance are not robust statistics, meaning that they are sensitive to outliers. As robustness is often a desired trait, particularly in real-world applications, robust alternatives may prove desirable, notably quantile-based statistics such as the sample median for location,[3] and interquartile range (IQR) for dispersion. Other alternatives include trimming and Winsorising, as in the trimmed mean and the Winsorized mean.
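
The sensitivity to outliers is easy to see on a small, made-up data set. The sketch below uses Python's standard `statistics` module and replaces a single value with an extreme one: the mean moves far from the bulk of the data, while the median does not move at all.

```python
from statistics import mean, median

clean = [10, 11, 12, 13, 14]              # hypothetical measurements
contaminated = clean[:-1] + [1000]        # same data with one value replaced by an outlier

print(mean(clean), median(clean))
print(mean(contaminated), median(contaminated))
```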

References

  1. ^ Richard Arnold Johnson; Dean W. Wichern (2007). Applied Multivariate Statistical Analysis. Pearson Prentice Hall. ISBN 978-0-13-187715-3. Retrieved 10 August 2012.
  2. ^ Mark Galassi, Jim Davies, James Theiler, Brian Gough, Gerard Jungman, Michael Booth, and Fabrice Rossi. GNU Scientific Library - Reference manual, Version 2.6, 2021. Section Statistics: Weighted Samples
  3. ^ The World Question Center 2006: The Sample Mean Archived 2025-08-14 at the Wayback Machine, Bart Kosko