如果包含某些单词，则提取html表中的文本

如果包含某些单词，则提取HTML表中的文本可以通过使用Python的BeautifulSoup库来实现。BeautifulSoup是一个用于解析HTML和XML文档的库，它提供了一种简单的方式来遍历文档树、搜索特定标签以及提取所需的文本内容。

以下是一个示例代码，演示了如何使用BeautifulSoup提取HTML表中的文本：

from bs4 import BeautifulSoup

# 假设HTML表格内容保存在一个名为html的字符串变量中
html = '''
<table>
  <tr>
    <th>姓名</th>
    <th>年龄</th>
    <th>性别</th>
  </tr>
  <tr>
    <td>张三</td>
    <td>25</td>
    <td>男</td>
  </tr>
  <tr>
    <td>李四</td>
    <td>30</td>
    <td>女</td>
  </tr>
</table>
'''

# 创建BeautifulSoup对象并指定解析器
soup = BeautifulSoup(html, 'html.parser')

# 找到表格标签
table = soup.find('table')

# 遍历表格的每一行
for row in table.find_all('tr'):
    # 遍历行中的每一列
    for cell in row.find_all('td'):
        # 提取文本内容并打印
        print(cell.text)

运行上述代码，将会输出表格中每个单元格的文本内容：