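I have a pandas DataFrame that holds every person's score for each day. I want to add a column that counts how many times each person had the top score of the day (more than one person can share the top score, and some values are NaN).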
import pandas as pd
import numpy as np
data = np.array([['','day1','day2','day3','day4','day5'],
['larry',1,4,7,3,5],
['niko',2,-1,3,6,4],
['tin',np.nan,5,5, 6,7]])
df = pd.DataFrame(data=data[1:,1:],
index=data[1:,0],
columns=data[0,1:])
print(df)
Output:
      day1 day2 day3 day4 day5
larry    1    4    7    3    5
niko     2   -1    3    6    4
tin    nan    5    5    6    7
The expected result is (larry: 1 time, niko: 2 times, tin: 3 times):
       times_of_top day1 day2 day3 day4 day5
larry             1    1    4    7    3    5
niko              2    2   -1    3    6    4
tin               3  nan    5    5    6    7
niko scored highest on day1 and day4, so his times_of_top is 2.
tin scored highest on day2, day4 and day5 (tied with niko on day4), so his times_of_top is 3.
Posted on 2021-02-06 10:43:12
One way, using pandas.DataFrame.stack and a count per person:
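# The sample data are strings (object dtype), so convert to float first.
df = df.astype(float)
# Keep only cells equal to their column max, stack them, and count per name.
# (Series.count(level=...) was removed in pandas 2.0, hence the groupby.)
df["times_of_top"] = df[df == df.max()].stack().groupby(level=0).count()
print(df)
Output: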
       day1  day2  day3  day4  day5  times_of_top
larry   1.0   4.0   7.0   3.0   5.0             1
niko    2.0  -1.0   3.0   6.0   4.0             2
tin     NaN   5.0   5.0   6.0   7.0             3
https://stackoverflow.com/questions/66075873
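As a possible alternative to stacking, the same count can be sketched by summing a boolean mask across each row. This is not part of the original answer. It assumes the scores are already numeric, so the frame is rebuilt here from a dict instead of the string array in the question. The variable name is_top is illustrative only.

```python
import numpy as np
import pandas as pd

# Same scores as in the question, built directly as numbers
# (assumption: NaN marks a missing score).
df = pd.DataFrame(
    {
        "day1": [1, 2, np.nan],
        "day2": [4, -1, 5],
        "day3": [7, 3, 5],
        "day4": [3, 6, 6],
        "day5": [5, 4, 7],
    },
    index=["larry", "niko", "tin"],
)

# True where a score equals that day's maximum. NaN never compares equal,
# so missing scores are not counted; ties mark every tied person.
is_top = df.eq(df.max())

# Summing the booleans across each row gives the number of top days per person.
# Inserting at position 0 matches the column order in the expected result.
df.insert(0, "times_of_top", is_top.sum(axis=1))
print(df)
```

One difference worth noting: a person who never has the top score gets 0 with this approach, whereas a stack-and-count approach would leave that row without an entry, which becomes NaN on assignment.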