【已解决】python解决replace(“n“,““)⽆法替换换⾏符
先看原数据,⼀直在想办法清除 “\n”和“/”这两个符号。
# 从提取出的⼏列来看,还有些细节需要再洗洗:例如为了统计和美观需要,“\n”和“/”这两个符号应去掉。
lendhou_content_split3 = lendhou_content_split2.iloc[:,[0,16,24,42,70,94]]
lendhou_lumns=['location_name','area','direction','houtype','stair_type','stairs']
春江花月夜书法
print("未使⽤replace前:\n",lendhou_content_split3.head(2),"\n")
# lendhou_content_split3 = lendhou_content_split3.map(lambda x: x.replace("/n",""))
# 报错 AttributeError: 'DataFrame' object has no attribute 'map'
lendhou_content_split3 = lendhou_place("/n","")
print("第⼀次使⽤replace:\n",lendhou_content_split3.head(2))
# 并没有替换成功,看 print 结果还是有 “/n” 这个符号在。
用粘土做动物
# 第⼀列 location_name 还需要再分列,下⾯先分列整理。
未使⽤replace前:
location_name area direction houtype stair_type stairs
0 黄埔-科学城-万科⾥享家\n 78㎡\n /南 3室2厅1卫中楼层(34层)
1 黄埔-科学城-沙湾新村\n 18㎡\n /南 4室2厅2卫低楼层(16层)家常炒藕片的做法
第⼀次使⽤replace:
location_name area direction houtype stair_type stairs
0 黄埔-科学城-万科⾥享家\n 78㎡\n /南 3室2厅1卫中楼层(34层)
1 黄埔-科学城-沙湾新村\n 18㎡\n /南 4室2厅2卫低楼层(16层)
# lendhou_content_split4 = pd.DataFrame(x.split("-") for x in lendhou_content_split3[0]) # 报错 KeyError: 0 —— 备注以对⽐参考。
lendhou_content_split4 = pd.DataFrame(x.split("-")for x in lendhou_content_split3['location_name'])
lendhou_lumns=['district','板块','name','none1']
lendhou_content_split4.head()
district板块name none1 0黄埔科学城万科⾥享家\n None 1黄埔科学城沙湾新村\n None 2番禺⽯碁雅苑青年公馆\n None 3仅剩4间\n None None None 4天河华景新城华景新城绿茵居\n None # 合并 lendhou_content_split3 和 lendhou_content_split4
lendhou_content_split5 = pd.merge(lendhou_content_split4.iloc[:,:3],lendhou_content_split3.iloc[:,1:6],
right_index=True, left_index=True)
print("得到 lendhou_content 的数据状态:\n",lendhou_content_split5.head())
# 接下来要想办法清除 “\n”和“/”这两个符号。
得到 lendhou_content 的数据状态:
district 板块 name area direction houtype stair_type stairs
0 黄埔科学城万科⾥享家\n 78㎡\n /南 3室2厅1卫中楼层(34层)
1 黄埔科学城沙湾新村\n 18㎡\n /南 4室2厅2卫低楼层(16层)
2 番禺⽯碁雅苑青年公馆\n 61㎡\n /北 1室1厅1卫中楼层(5层)
3 仅剩4间\n None None /\n None None
4 天河华景新城华景新城绿茵居\n 62㎡\n /南 2室1厅1卫低楼层(9层)
# lendhou_content_split5['area'] = lendhou_content_split5['area'].replace("\n","")
lendhou_content_split5 = lendhou_place("\r\n","")
print("第⼀次使⽤replace:\n",lendhou_content_split5.head(2))
# lendhou_content_split5['direction'] = lendhou_content_split5['direction'].replace("/","")
lendhou_content_split5 = lendhou_place("/","")
print("\n第⼆次使⽤replace:\n",lendhou_content_split5.head(2))
护国战争# 发现替换函数 replace 还是没有⽣效。那接下来看看能不能直接截取特定符号前⾯或者特定符号后⾯的字符串,作为新的内容。
# lendhou__excel(total_path+"\\lendhou_content_split5"+".xlsx", encoding='utf-8', index=Fal, header=True)
第⼀次使⽤replace:
district 板块 name area direction houtype stair_type stairs
适合学生看的励志电影0 黄埔科学城万科⾥享家\n 78㎡\n /南 3室2厅1卫中楼层(34层)
1 黄埔科学城沙湾新村\n 18㎡\n /南 4室2厅2卫低楼层(16层)
第⼆次使⽤replace:
district 板块 name area direction houtype stair_type stairs
0 黄埔科学城万科⾥享家\n 78㎡\n /南 3室2厅1卫中楼层(34层)
1 黄埔科学城沙湾新村\n 18㎡\n /南 4室2厅2卫低楼层(16层)
# import re
# # lendhou_content_split5['direction'] = re.findall(r'/*', lendhou_content_split5['direction'])
# # # 上述报错 error: nothing to repeat at position 0
# # print("第⼀次使⽤re.findall:\n",lendhou_content_split5.head(2))
# lendhou_content_split5['area'] = re.findall(r'*\n', lendhou_content_split5['area'])
# # 上述报错 error: nothing to repeat at position 0
# print("\n第⼆次使⽤re.findall:\n",lendhou_content_split5.head(2))
⽂中提到:
如果⾏尾符是 CR,则⽤replace("\r","")
如果⾏尾符是 LF,则⽤replace("\n","")
生活是多么广阔按照指引,查到了⾃⼰的⾏尾符(如下图):两种都有。
什么是出柜
接下来两种都替换,但结果还是没替换掉(如下)。额,尝试继续失败。
# lendhou_content_split5['area'] = lendhou_content_split5['area'].replace("\n","")
lendhou_content_split5 = lendhou_place("\r\n","")
lendhou_content_split5 = lendhou_place("\n","")
print("第⼀次使⽤replace:\n",lendhou_content_split5.head(2))
# lendhou_content_split5['direction'] = lendhou_content_split5['direction'].replace("/","")手足无措的意思是什么
lendhou_content_split5 = lendhou_place("/","")
print("\n第⼆次使⽤replace:\n",lendhou_content_split5.head(2))