I have a DataFrame in Pandas like the following:
COL1 | COL2 | ... | COLn
------|------|------|-------
aaa | AA_x | ... | ...
bbb | AA_x | ... | ...
ggg | AA_x | ... | ...
ppp | AA_x | ... | ...
aaa | DD_x | ... | ...
ggg | DD_x | ... | ...
ppp | DD_x | ... | ...
bbb | DD_x | ... | ...
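.... | ... | ... | ...

COL1 has a string data type, and I need to sort the DataFrame by the values in COL1 in a custom order of my own (for example: aaa, bbb, ppp, ggg), so that the result looks like this: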
COL2 | COL1 | ... | COLn
-------|-------|-------|------
AA_x | aaa | ... | ...
AA_x | bbb | ... | ...
AA_x | ppp | ... | ...
AA_x | ggg | ... | ...
DD_x | aaa | ... | ...
DD_x | bbb | ... | ...
DD_x | ppp | ... | ...
DD_x | ggg | ... | ...
... | .... | ... |...

How can I do this in Python Pandas? I assume the sort order for this DataFrame has to be specified manually?
Posted on 2022-10-18 18:03:55
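IIUC, you want to sort by COL1 in your custom order while preserving the original relative order of the repeated aaa/bbb/etc. values.
You can use: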
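import pandas as pd

order = ['aaa', 'bbb', 'ppp', 'ggg']
# make COL1 an ordered categorical so that sorting follows the custom order
df['COL1'] = pd.Categorical(df['COL1'], categories=order, ordered=True)
# n numbers the occurrences of each COL1 value (0 for the first, 1 for the second, ...)
out = (df.assign(n=df.groupby('COL1').cumcount())
         .sort_values(by=['n', 'COL1'])
         .drop(columns='n')
      )

If you already have a secondary column to sort by (e.g. COL2):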
order = ['aaa', 'bbb', 'ppp', 'ggg']
df['COL1'] = pd.Categorical(df['COL1'], categories=order, ordered=True)
out = df.sort_values(by=['COL2', 'COL1'])

https://stackoverflow.com/questions/74115029
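To check both variants end to end, here is a minimal self-contained sketch. The sample data is hypothetical: it only mirrors the first two columns shown in the question. `observed=True` is also my addition, passed so the groupby result does not depend on the pandas version's default for categorical keys.

```python
import pandas as pd

# Hypothetical sample data mirroring COL1/COL2 from the question.
df = pd.DataFrame({
    "COL1": ["aaa", "bbb", "ggg", "ppp", "aaa", "ggg", "ppp", "bbb"],
    "COL2": ["AA_x"] * 4 + ["DD_x"] * 4,
})

# Custom order: an ordered categorical sorts by position in `order`,
# not alphabetically.
order = ["aaa", "bbb", "ppp", "ggg"]
df["COL1"] = pd.Categorical(df["COL1"], categories=order, ordered=True)

# Variant 1: no secondary column. Number each occurrence of a COL1 value,
# then sort by (occurrence number, COL1) and drop the helper column.
by_occurrence = (
    df.assign(n=df.groupby("COL1", observed=True).cumcount())
      .sort_values(by=["n", "COL1"])
      .drop(columns="n")
)

# Variant 2: a secondary column (COL2) already defines the blocks.
by_col2 = df.sort_values(by=["COL2", "COL1"])

print(by_col2[["COL2", "COL1"]].to_string(index=False))
```

On this sample both variants produce the same row order, because each COL1 value appears exactly once per COL2 block. They can differ on data where that does not hold, so pick the one that matches how your rows are actually grouped.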