maxframe.dataframe.Index.drop_duplicates#

Index.drop_duplicates(keep='first', method='auto')#

返回去除重复值后的 Index。

参数:

keep ({'first', 'last', False}, default 'first') --

返回:

去重后的

返回类型:

Index

参见

示例

生成一个包含重复值的 pandas.Index。

>>> import maxframe.dataframe as md

>>> idx = md.Index(['lame', 'cow', 'lame', 'beetle', 'lame', 'hippo'])

keep 参数控制删除哪些重复值。'first' 值会保留每组重复项中的第一次出现的值。keep 的默认值是 'first'。

>>> idx.drop_duplicates(keep='first').execute()
Index(['lame', 'cow', 'beetle', 'hippo'], dtype='object')

'last' 值会保留每组重复项中的最后一次出现的值。

>>> idx.drop_duplicates(keep='last').execute()
Index(['cow', 'beetle', 'lame', 'hippo'], dtype='object')

False 值会丢弃所有重复项。

>>> idx.drop_duplicates(keep=False).execute()
Index(['cow', 'beetle', 'hippo'], dtype='object')