文章/答案/技术大牛

发布

社区首页 >问答首页 >替换熊猫数据帧列中的字符串

问替换熊猫数据帧列中的字符串
EN

Stack Overflow用户

提问于 2022-10-24 11:54:06

回答 2查看 31关注 0票数 2

我有一个熊猫数据框架，列名为“内容”，包含文本。我想从本专栏中的每一篇文章中删除一些单词。我想用空字符串替换每个字符串，但是当我打印函数的结果时，我看到单词没有被删除。我的代码如下：

def replace_words(t):
  words = ['Livre', 'Chapitre', 'Titre', 'Chapter', 'Article' ]
  for i in t:
    if i in words:
      t.replace (i, '')
    else:
      continue
  print(t)


st = 'this is Livre and Chapitre and Titre and Chapter and Article'

replace_words(st)

理想结果的一个例子是：“这是和”

在下面的代码中，我想将上面的函数应用于列“content”中的每个文本：

df['content'].apply(lambda x: replace_words(x))

有人能帮我创建一个函数，删除我需要的所有单词，然后将这个函数应用到df列中的所有文本中吗？

python

pandas

function

replace

回答 2

Stack Overflow用户

回答已采纳

发布于 2022-10-24 12:10:20

您可以使用str.replace。

输入：

df = pd.DataFrame({
    'ID' : np.arange(4),
    'words' : ['this is Livre and Chapitre and Titre and Chapter and Article', 
               'this is car and Chapitre and bus and Chapter and Article',
              'this is Livre and Chapitre',
              'nothing to replace']
})

words = ['Livre', 'Chapitre', 'Titre', 'Chapter', 'Article']
pat = '|'.join(map(re.escape, words))
print(pat)
'Livre|Chapitre|Titre|Chapter|Article'

df['words'] = df['words'].str.replace(pat, '', regex=True)
print(df)

   ID                               words
0   0        this is  and  and  and  and 
1   1  this is car and  and bus and  and 
2   2                       this is  and 
3   3                  nothing to replace

票数 1

Stack Overflow用户

发布于 2022-10-24 12:00:28

两个问题：

如果使用word.

t.replace拆分，则每个i都是一个字母，而不是一个for i in t:不工作的

用这个：

def replace_words(t):
    words = ['Livre', 'Chapitre', 'Titre', 'Chapter', 'Article' ]
    for i in t.split(' '):
        # print(i) # remove to see problem 1
        if i in words:
            t= t.replace (i, '')
        else:
            continue
    # print(t)
    return t

编辑:您可以直接调用df['col'].apply(replace_words)。

票数 1

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/74180734

复制

相似问题

问替换熊猫数据帧列中的字符串
EN

回答 2

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问替换熊猫数据帧列中的字符串EN

回答 2

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问替换熊猫数据帧列中的字符串
EN