How can you use quantiles and other statistics of your data to group observations automatically in SAS?
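Background: sometimes we want to compare users in different score ranges and see whether they behave differently on various features. When forming the groups, besides defining the intervals by hand with PROC FORMAT, you can also combine PROC RANK and PROC FORMAT to group and rank automatically, using quantiles or other statistics of the score (or of any other data).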
PROC RANK
proc rank data=test out=r_test; /* out= names the output data set */
var spend; /* the variable to rank */
ranks r_spend; /* the rank variable is named r_spend */
run;
PROC UNIVARIATE
proc univariate data=events noprint;
var neg_score;
output out=p pctlpre=P_ /* prefix for the percentile variable names */
pctlpts=10 to 100 by 10;
weight SamplingWeight;
run;
proc transpose data=p out=pt;
run;
proc sort data=pt
nodupkey force noequals;
by COL1;
run;
PROC RANK can also generate deciles, quartiles, percentiles, or other groups from numeric variables. The GROUPS= option specifies the binning: GROUPS=10 creates deciles, GROUPS=4 creates quartiles, and GROUPS=100 creates percentiles.
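As a minimal sketch of the GROUPS= option, the step below assigns each observation to a decile. It assumes the data set test and the variable spend from the PROC RANK example above; the output data set decile_test and the variable spend_decile are hypothetical names chosen for illustration.

```sas
/* Assumes data set TEST with numeric variable SPEND, as in the earlier example. */
/* DECILE_TEST and SPEND_DECILE are hypothetical names used only for this sketch. */
proc rank data=test out=decile_test groups=10;
    var spend;
    ranks spend_decile;   /* group number; with GROUPS=10 the values run from 0 to 9 */
run;

/* Inspect the spend range that falls into each decile. */
proc means data=decile_test n min max mean;
    class spend_decile;
    var spend;
run;
```

Group numbering starts at 0, so the lowest decile is 0 and the highest is 9. Heavily tied values can leave some groups smaller than others or empty.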