Python Pandas——从 DataFrame 列中移除数字


若要从字符串中移除数字,我们可以使用 replace() 方法进行简单替换。首先导入所需库 −

import pandas as pd

使用学生记录创建 DataFrame。Id 列含有带数字的字符串 −

dataFrame = pd.DataFrame(
   {
      "Id": ['S01','S02','S03','S04','S05','S06','S07'],"Name": ['Jack', 'Robin', 'Ted', 'Robin', 'Scarlett', 'Kat', 'Ted'],"Result": ['Pass', 'Fail', 'Pass', 'Fail', 'Pass', 'Pass', 'Pass']
   }
)

从特定列(此处为“Id”)的字符串中移除数字 −

dataFrame['Id'] = dataFrame['Id'].str.replace('\d+', '')

示例

以下为代码 −

import pandas as pd

# Create DataFrame with student records
dataFrame = pd.DataFrame(
   {
      "Id": ['S01','S02','S03','S04','S05','S06','S07'],"Name": ['Jack', 'Robin', 'Ted', 'Robin', 'Scarlett', 'Kat', 'Ted'],"Result": ['Pass', 'Fail', 'Pass', 'Fail', 'Pass', 'Pass', 'Pass']
   }
)

print"DataFrame ...\n",dataFrame

# removing number from strings of a specific column
dataFrame['Id'] = dataFrame['Id'].str.replace('\d+', '')

print"\nUpdated DataFrame...\n", dataFrame

输出

将生成以下输出 −

DataFrame ...
     Id      Name   Result
0   S01      Jack     Pass
1   S02     Robin     Fail
2   S03       Ted     Pass
3   S04     Robin     Fail
4   S05  Scarlett     Pass
5   S06       Kat     Pass
6   S07       Ted     Pass

Updated DataFrame...
   Id      Name   Result
0   S      Jack     Pass
1   S     Robin     Fail
2   S       Ted     Pass
3   S     Robin     Fail
4   S  Scarlett     Pass
5   S       Kat     Pass
6   S       Ted     Pass

更新于:2021-09-20

3K+ 次浏览

开启您的 职业生涯

完成课程,获取认证

开始学习
广告
© . All rights reserved.