Q: Python regex to detect one of several optional substrings following a string

Stack Overflow user

Asked 2018-10-23 02:16:06

Answers: 1 · Views: 86 · Followers: 0 · Votes: 0

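I need to match a pattern like the following: AAXX#

where:

* AA comes from a set (i.e. a list of alphabetic prefixes, 1-3 characters long),

* XX comes from a different predefined list of strings, and

* any single-digit numeral follows.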

AA strings: ['bo','h','fr','sam','pe']

XX strings: cl + ['x','n','r','nr','eaner'] //or just// ro

Desired result: a bool indicating whether any of the possible combinations matches the supplied string.

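Example test strings:

item = "boro1" - i.e. bo + ro + 1

item = "samcl2" - i.e. sam + cl + 2

item = "hcln3" - i.e. h + cln + 3

The best I could come up with is a loop, but I am having trouble with the basic regex. It works for the single-letter options cln, clx, clr, but not for the longer clnr, cleaner.

Code: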

import re

item = "hclnr2" #h + clnr + 2
out = False
arr = ['bo','h','fr','sam','pe']
for mnrl in arr:
    myrx = re.escape(mnrl) + r'cl[x|n|r|nr|eaner]\d'
    thisone = bool(re.search(myrx, item))
    print('mnrl: '+mnrl+' - ', thisone)
    if thisone: out = True

##########################################################################
# SKIP THIS - INCLUDED IN CASE S/O HAS A BETTER SOLUTION THAN A SECOND LOOP
# THE ABOVE FOR-LOOP handled THE CL[opts] TESTS, THIS LOOP DOES THE RO TESTS
##########################################################################
#if not out: #If not found a match amongst the "cl__" options, test for "ro"
#    for mnrl in arr:
#        myrx = re.escape(mnrl) + r'ro\d'
#        thisone = bool(re.search(myrx, item))
#        print('mnrl: '+mnrl+' - ', thisone)
#    if thisone: out = True
##########################################################################

print('result: ', out)

Prints:

mnrl: bo - False
mnrl: h - False <======
mnrl: fr - False
mnrl: sam - False
mnrl: pe - False

However, changing item to:

item = "hcln2" #h + cln + 2

prints:

mnrl: bo - False
mnrl: h - True <========
mnrl: fr - False
mnrl: sam - False
mnrl: pe - False

The same works for item = hclr5 or item = hclx9, but not for hcleaner9.

python

regex

1 Answer

Stack Overflow user

Accepted answer

Answered 2018-10-23 02:51:48

My approach would be:

import re

words = ['boro1', 'samcl2', 'hcln3', 'boro1+unwantedstuff']

p = r'(bo|h|fr|sam|pe)(cl(x|n|r|nr|eaner|)|ro)\d$'

for w in words:
    print(re.match(p, w))

Result:

<_sre.SRE_Match object; span=(0, 5), match='boro1'>
<_sre.SRE_Match object; span=(0, 6), match='samcl2'>    
<_sre.SRE_Match object; span=(0, 5), match='hcln3'>
None
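
A side note on why the loop in the question misses clnr and cleaner: square brackets form a character class, which matches exactly one character from the set, and a | inside the brackets is just a literal pipe. The sketch below contrasts that with a group, as used in the pattern above. The names char_class, group and samples are only illustrative, and the prefix is fixed to h to keep it short.

```python
import re

# Character class: matches exactly ONE character out of x, |, n, r, e, a.
char_class = re.compile(r'hcl[x|n|r|nr|eaner]\d')

# Non-capturing group with alternation: matches one whole alternative.
group = re.compile(r'hcl(?:x|n|r|nr|eaner)\d')

samples = ['hcln2', 'hclnr2', 'hcleaner9', 'hcl|2']

# Map each sample to (character-class result, group result).
results = {
    s: (bool(char_class.search(s)), bool(group.search(s)))
    for s in samples
}

for s, (with_class, with_group) in results.items():
    print(f'{s}: class={with_class} group={with_group}')
```

The last sample shows the other side of the same mistake: the character class accepts a literal | between cl and the digit, which the group does not.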

For the boolean output you want, you can simply convert the match object with bool().
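
If the prefix and suffix lists change often, the same pattern can be assembled from them instead of typed by hand. This is a minimal sketch and not part of the original answer: build_pattern and is_match are hypothetical helper names, the empty string in CL_SUFFIXES is an assumption made so that a bare cl (as in samcl2) is accepted, and re.fullmatch is swapped in for re.match with a trailing $.

```python
import re

PREFIXES = ['bo', 'h', 'fr', 'sam', 'pe']
# Assumption: '' is included so that plain "cl" with no suffix also matches.
CL_SUFFIXES = ['x', 'n', 'r', 'nr', 'eaner', '']


def build_pattern(prefixes, cl_suffixes):
    """Compile <prefix>(cl<suffix> | ro)<digit> from the given lists."""
    # re.escape keeps any regex metacharacters in the lists literal.
    prefix_alt = '|'.join(re.escape(p) for p in prefixes)
    suffix_alt = '|'.join(re.escape(s) for s in cl_suffixes)
    return re.compile(rf'(?:{prefix_alt})(?:cl(?:{suffix_alt})|ro)\d')


PATTERN = build_pattern(PREFIXES, CL_SUFFIXES)


def is_match(item):
    """Return True only if the whole string fits the pattern."""
    return PATTERN.fullmatch(item) is not None


for word in ['boro1', 'samcl2', 'hclnr2', 'hcleaner9', 'boro1+unwantedstuff']:
    print(word, is_match(word))
```

One design difference worth knowing: $ also matches just before a trailing newline, whereas fullmatch requires the digit to be the very last character of the string.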

Votes: 2

Original page content provided by Stack Overflow. Translation support provided by the Tencent Cloud Xiaowei IT-domain engine.

Original link:

https://stackoverflow.com/questions/52935405
