问如何在熊猫数据栏中插入字符串列到另一个字符串列？
EN

Stack Overflow用户

提问于 2022-11-15 06:12:08

回答 1查看 25关注 0票数 1

我有超过10万行和300列的数据集，

下面是示例数据集：

pd.options.display.max_colwidth = 1000

df = pd.DataFrame({'EVENT_DTL':['1. Name : John Johns \n2. Date : 05 March 2013 \n3. founded : 75075 Plano, Dallas Texas \n4. Charactor : Impersive \n5. Corona corelation : Cannot be found',
                               '1. Name : Mark Dwaine \n2. Date : 13 January 2020 \n3. founded : 45184 Miami, Florida \n4. Charactor : Slow learner \n5. Corona corelation : Suicide because of the economic difficulty',
                               '1. Name : Janny chung \n2. Date : 11 December 2011 \n3. founded : 77543 Bay area, San Fransisco \n4. Charactor : Always ambitious \n5. Corona corelation : Cannot be found but probably related to epidemic',
                               '1. Name : Sally \n2. Date : 11 December 2021 \n3. founded : 75074 Saginow, Fort Worth \n4. Charactor : energetic \n5. Corona corelation : Her friends guess it is because of corona'],
                   'EVENT_DTL_2':['He is always fast mover','He is brillient, smart','she is kind of person who is always eager to learn new subejct','he was a lunatic, his neighber said']})
df.loc[2,'EVENT_DTL_2'] = np.nan

df

我试图将'EVENT_DTL_2‘插入'EVENT_DTL’，但在\n4. Charactor : xxx子字符串旁边插入

期望的输出是：

df2 = pd.DataFrame({'EVENT_DTL':['1. Name : John Johns \n2. Date : 05 March 2013 \n3. founded : 75075 Plano, Dallas Texas \n4. Charactor : Impersive He is always fast mover\n5. Corona corelation : Cannot be found',
                               '1. Name : Mark Dwaine \n2. Date : 13 January 2020 \n3. founded : 45184 Miami, Florida \n4. Charactor : Slow learner He is brillient, smart\n5. Corona corelation : Suicide because of the economic difficulty',
                               '1. Name : Janny chung \n2. Date : 11 December 2011 \n3. founded : 77543 Bay area, San Fransisco \n4. Charactor : Always ambitious \n5. Corona corelation : Cannot be found but probably related to epidemic',
                               '1. Name : Sally \n2. Date : 11 December 2021 \n3. founded : 75074 Saginow, Fort Worth \n4. Charactor : energetic he was a lunatic, his neighber said\n5. Corona corelation : Her friends guess it is because of corona'],
                   'EVENT_DTL_2':['He is always fast mover','He is brillient, smart',np.nan,'he was a lunatic, his neighber said']})
df2

我需要一种有效的方法，因为我需要应用这个方法--非常大的数据集。

python

pandas

回答 1

Stack Overflow用户

回答已采纳

发布于 2022-11-15 07:42:34

您可以再次拆分和合并：

df2 = df['EVENT_DTL'].str.split('(?<=\n4\.)', expand=True)
df['EVENT_DTL'] = df2[0]+' '+df['EVENT_DTL_2']+' '+df2[1]

票数 0

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/74441249

复制

相似问题

问如何在熊猫数据栏中插入字符串列到另一个字符串列？
EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问如何在熊猫数据栏中插入字符串列到另一个字符串列？EN

回答 1

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问如何在熊猫数据栏中插入字符串列到另一个字符串列？
EN