RandLA-Net亮点1------基于概率的训练样本选取和随机下采样

更新时间:2023-07-28 10:33:18 阅读：评论：0

RandLA-Net亮点1------基于概率的训练样本选取和随机下采样RandLA-Net 主要解决两个关键问题: FPS采样耗时和下采样带来的信息丢失.

解决策略: 随机采样(基于概率的训练样本选取和随机下采样) + 局部特征编码(LFA)

RandLA-Net 亮点2——local feature aggregation （LFA）:

RandLA-Net Decoder：

RandLA-Net训练Semantic3D数据:

策略1. 基于概率的训练样本选取和随机下采样

点云数据的预处理⼯作⼀直都很⼤, 很多⼯作都是对点云进⾏滑窗+下采样的⽅式.滑窗⽅式不可避免的对点云进⾏了切割，使得原本的物体被切割成⼏份，破坏了物体的⼏何形状.⽽且对于⼤场景点云数据来说, 滑窗采样⼯作量⾮常⼤.因此RandLA-Net⾮常巧妙的提出了⼀个随机采样策略，即能保证快速采样,⼜能保证均匀采样整个点集. 从⽽避免⼀个物体被分成多个部分.房屋防水补漏

实施细节: 先随机赋予每个点⼀个概率值，然后从最⼩概率对应的场景点云中的最⼩概率点开始，以其为中⼼,query周边最近的N个点作为⼀个输⼊样本点集(包含中⼼点本⾝). 对这N个点进⾏概率更新，使其概率变⼤，以保证不再采样到此处的点，从⽽保证在整个点集上均匀采样. 如下图所⽰

计算机组成部分1.预处理( data_prepare_mantic3D.py)

对原始数据(点和标签)进⾏⽹格采样, 并⽣成ply⽂件和KDTree⽂件. 同时保存投影信息.

对每个场景分别做数据预处理.

采样:

包括0.01降采样和0.06降采样. 将每个⽴⽅体内的点做均值，并统计该⽴⽅体内的每个类别的数量，将占⽐最⼤的类别作为采样后的类别.

# Subsample to save space

sub_points, sub_colors, sub_labels = DP.grid_sub_sampling(pc[:,:3].astype(np.float32),\

pc[:, 4:7].astype(np.uint8), labels, 0.01)

精彩瞬间grid_size = 0.06

sub_xyz, sub_colors, sub_labels = DP.grid_sub_sampling(sub_points, sub_colors, sub_labels, grid_size)

颜⾊归⼀化处理:

sub_colors = sub_colors / 255.0

存储为ply⽂件：原始输⼊点云为(x,y,z,i,r,g,b)7维信息,⽹络只使⽤其中的坐标和颜⾊信息，再添加⼀个类别信息⼀起保存，即(x,y,z,r,g,b,cls)，保存格式为ply.

write_ply(sub_ply_file, [sub_xyz, sub_colors, sub_labels], ['x', 'y', 'z', 'red', 'green', 'blue', 'class'])

存储KDTree：同时对0.06降采样后的点云数据构建KDTree，并保存为pkl⽂件.

arch_tree = KDTree(sub_xyz, leaf_size=50)

kd_tree_file = join(sub_pc_folder, file_name + '_KDTree.pkl')

存储投影信息:

proj_idx = np.squeeze(arch_tree.query(sub_points, return_distance=Fal))

proj_idx = proj_idx.astype(np.int32)

proj_save = join(sub_pc_folder, file_name + '_proj.pkl')

2. 样本集⽣成 (main_Semantic3D.py--->def get_batch_gen())

随机⽣成概率:

# Random initialize

for i, tree in enumerate(lf.input_trees[split]):

lf.possibility[split] += [np.random.rand(tree.data.shape[0]) * 1e-3]

lf.min_possibility[split] += [float(np.min(lf.possibility[split][-1]))]

最⼩概率场景和该场景下最⼩概率点:

# Choo the cloud with the lowest probability

cloud_idx = int(np.argmin(lf.min_possibility[split]))

# choo the point with the minimum of possibility in the cloud as query point

point_ind = np.argmin(lf.possibility[split][cloud_idx])

# Get all points within the cloud from tree structure

四季歌词points = np.array(lf.input_trees[split][cloud_idx].data, copy=Fal)

# Center point of input region

center_point = points[point_ind, :].reshape(1, -1)

搜索N个最近邻：先对中⼼点添加坐标扰动，然后搜索N个最近点(包含中⼼店⾃⾝)并打乱，然后从整个场景中取出这⽚点云. mantic3d数据的 N= 65536.打乱的⽬的是在点云随机采样的时候可以直接取前⾯的.

# Add noi to the center point

noi = al(i_init / 10, size=center_point.shape)

pick_point = center_point + noi.astype(center_point.dtype)

query_idx = lf.input_trees[split][cloud_idx].query(pick_point, k=cfg.num_points)[1][0]

# Shuffle index

query_idx = DP.shuffle_idx(query_idx)

# Get corresponding points and colors bad on the index

queried_pc_xyz = points[query_idx]

关于竹子的古诗

ueried_pc_xyz[:, 0:2] = queried_pc_xyz[:, 0:2] - pick_point[:, 0:2]

queried_pc_colors = lf.input_colors[split][cloud_idx][query_idx]

if split == 'test':

queried_pc_labels = np.zeros(queried_pc_xyz.shape[0])

queried_pt_weight = 1

el:

queried_pc_labels = lf.input_labels[split][cloud_idx][query_idx]

小孩出虚汗怎么办queried_pc_labels = np.array([lf.label_to_idx[l] for l in queried_pc_labels])

queried_pt_weight = np.array([lf.class_weight[split][0][n] for n in queried_pc_labels])

更新概率: 被取出过的点会根据距离信息得到⼀个概率，以免再次取到此区域

# Update the possibility of the lected points

砌体工程施工方案dists = np.sum(np.square((points[query_idx] - pick_point).astype(np.float32)), axis=1)

delta = np.square(1 - dists / np.max(dists)) * queried_pt_weight

lf.possibility[split][cloud_idx][query_idx] += delta

lf.min_possibility[split][cloud_idx] = float(np.min(lf.possibility[split][cloud_idx]))

上述过程即为收集⼀个训练样本的过程,.再次搜寻最⼩概率场景和最⼩点，重复num_per_epoch = train_steps * batch_size次，即可得到⼀个epoch所需要的样本数据.