Python Pandas - 指出重复索引值,最后一出现值除外
若要指明重复的索引值(最后出现的值除外),请使用 index.duplicated()。将 keep 参数与值 last 一起使用。
首先,导入必需的库 −
import pandas as pd
使用一些重复值创建索引 −
index = pd.Index(['Car','Bike','Airplane','Ship','Airplane'])
显示索引 −
print("Pandas Index with duplicates...\n",index)指明重复的索引值(最后一出现的值除外)。将“keep”参数设置为“last” −
print("\nIndicating duplicate values except the last occurrence...\n", index.duplicated(keep='last'))示例
以下是代码 −
import pandas as pd
# Creating the index with some duplicates
index = pd.Index(['Car','Bike','Airplane','Ship','Airplane'])
# Display the index
print("Pandas Index with duplicates...\n",index)
# Return the dtype of the data
print("\nThe dtype object...\n",index.dtype)
# get the dimensions of the data
print("\nGet the dimensions...\n",index.ndim)
# Indicate duplicate index values as True, except the last occurrence
# Set the "keep" parameter as "last"
print("\nIndicating duplicate values except the last occurrence...\n", index.duplicated(keep='last'))输出
这将生成以下代码 −
Pandas Index with duplicates... Index(['Car', 'Bike', 'Airplane', 'Ship', 'Airplane'], dtype='object') The dtype object... object Get the dimensions... 1 Indicating duplicate values except the last occurrence... [False False True False False]
广告
数据结构
网络
RDBMS
操作系统
Java
iOS
HTML
CSS
Android
Python
C 编程
C++
C#
MongoDB
MySQL
Javascript
PHP