The pattern I am using is:
Pattern listPattern = Pattern.compile(
"\\s*'([^']*('')*)+'\\s*(,\\s*'([^']*('')*)+'\\s*)*"
+ "|"
+ "\\s*[0-9\\.\\-]+(,\\s*[0-9\\.-]+)*\\s*",
Pattern.MULTILINE|Pattern.CASE_INSENSITIVE);
I need this pattern to validate input that will be added to the in() clause of a SQL query. A sample value looks like this:
String value="'xyz2006201257200426888282d','xyz2006201300193058314082d'";
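Here I used only 2 ids, but when the number of ids (for example xyz2006201257200426888282d) is larger (roughly > 600), I get a StackOverflowError. Can anyone help fix the inefficiency in the regex pattern that causes the stack overflow?
Stack trace: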
Exception in thread "main" java.lang.StackOverflowError
at java.lang.Character.codePointAt(Character.java:4866)
at java.util.regex.Pattern$CharProperty.match(Pattern.java:3775)
at java.util.regex.Pattern$Curly.match0(Pattern.java:4250)
at java.util.regex.Pattern$Curly.match(Pattern.java:4234)
at java.util.regex.Pattern$GroupHead.match(Pattern.java:4658)
at java.util.regex.Pattern$Loop.match(Pattern.java:4785)
at java.util.regex.Pattern$GroupTail.match(Pattern.java:4717)
at java.util.regex.Pattern$GroupCurly.match0(Pattern.java:4485)
at java.util.regex.Pattern$GroupCurly.match(Pattern.java:4405)
at java.util.regex.Pattern$Curly.match0(Pattern.java:4272)
at java.util.regex.Pattern$Curly.match(Pattern.java:4234)
at java.util.regex.Pattern$GroupHead.match(Pattern.java:4658)
at java.util.regex.Pattern$Loop.matchInit(Pattern.java:4801)
at java.util.regex.Pattern$Prolog.match(Pattern.java:4741)
at java.util.regex.Pattern$BmpCharProperty.match(Pattern.java:3798)
at java.util.regex.Pattern$Curly.match0(Pattern.java:4279)
at java.util.regex.Pattern$Curly.match(Pattern.java:4234)
at java.util.regex.Pattern$BmpCharProperty.match(Pattern.java:3798)
at java.util.regex.Pattern$GroupHead.match(Pattern.java:4658)
at java.util.regex.Pattern$Loop.match(Pattern.java:4785)
at java.util.regex.Pattern$GroupTail.match(Pattern.java:4717)
at java.util.regex.Pattern$Curly.match0(Pattern.java:4279)
at java.util.regex.Pattern$Curly.match(Pattern.java:4234)
at java.util.regex.Pattern$BmpCharProperty.match(Pattern.java:3798)
at java.util.regex.Pattern$Loop.match(Pattern.java:4794)
at java.util.regex.Pattern$GroupTail.match(Pattern.java:4717)
at java.util.regex.Pattern$GroupCurly.match0(Pattern.java:4485)
at java.util.regex.Pattern$GroupCurly.match(Pattern.java:4405)
at java.util.regex.Pattern$Curly.match0(Pattern.java:4279)
at java.util.regex.Pattern$Curly.match(Pattern.java:4234)
at java.util.regex.Pattern$GroupHead.match(Pattern.java:4658)
at java.util.regex.Pattern$Loop.match(Pattern.java:4785)
at java.util.regex.Pattern$GroupTail.match(Pattern.java:4717)
at java.util.regex.Pattern$GroupCurly.match0(Pattern.java:4485)
at java.util.regex.Pattern$GroupCurly.match(Pattern.java:4405)
at java.util.regex.Pattern$Curly.match0(Pattern.java:4272)
at java.util.regex.Pattern$Curly.match(Pattern.java:4234)
at java.util.regex.Pattern$GroupHead.match(Pattern.java:4658)
at java.util.regex.Pattern$Loop.matchInit(Pattern.java:4801)
at java.util.regex.Pattern$Prolog.match(Pattern.java:4741)
at java.util.regex.Pattern$BmpCharProperty.match(Pattern.java:3798)
at java.util.regex.Pattern$Curly.match0(Pattern.java:4279)
at java.util.regex.Pattern$Curly.match(Pattern.java:4234)
at java.util.regex.Pattern$BmpCharProperty.match(Pattern.java:3798)
at java.util.regex.Pattern$GroupHead.match(Pattern.java:4658)
at java.util.regex.Pattern$Loop.match(Pattern.java:4785)
at java.util.regex.Pattern$GroupTail.match(Pattern.java:4717)
at java.util.regex.Pattern$Curly.match0(Pattern.java:4279)
at java.util.regex.Pattern$Curly.match(Pattern.java:4234)
at java.util.regex.Pattern$BmpCharProperty.match(Pattern.java:3798)
at java.util.regex.Pattern$Loop.match(Pattern.java:4794)
at java.util.regex.Pattern$GroupTail.match(Pattern.java:4717)
at java.util.regex.Pattern$GroupCurly.match0(Pattern.java:4485)
at java.util.regex.Pattern$GroupCurly.match(Pattern.java:4405)
at java.util.regex.Pattern$Curly.match0(Pattern.java:4279)
at java.util.regex.Pattern$Curly.match(Pattern.java:4234)
at java.util.regex.Pattern$GroupHead.match(Pattern.java:4658)
at java.util.regex.Pattern$Loop.match(Pattern.java:4785)
at java.util.regex.Pattern$GroupTail.match(Pattern.java:4717)
at java.util.regex.Pattern$GroupCurly.match0(Pattern.java:4485)
at java.util.regex.Pattern$GroupCurly.match(Pattern.java:4405)
at java.util.regex.Pattern$Curly.match0(Pattern.java:4272)
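Posted on 2018-07-19 04:01:48
I think your basic problem is this clause: ([^']*('')*)+
It probably adds far more steps than necessary. The outer + repeats a group that can already match any run of non-quote characters, so the engine has many equivalent ways to split the same text.
Update:
You can replace it with an unrolled-loop version, which significantly reduces the total number of steps: [^']*(?:''[^']*)*
Rewritten, the regex becomes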
"(\\s*'[^']*(?:''[^']*)*'(?:\\s*,\\s*'[^']*(?:''[^']*)*')*\\s*)|(\\s*[0-9.-]+(?:,\\s*[0-9.-]+)*\\s*)"
In this demo the target is 800 copies of 'xyz2006201257200426888282d' separated by commas. It takes 8010 steps.
https://regex101.com/r/WVrPBb/1
Give it a try; the worst case is that it still overflows the stack.
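To try the rewritten pattern from Java, something like the following minimal sketch may help. The class name UnrolledInListCheck and the helpers isValidInList and buildQuotedList are hypothetical names made up for illustration; only the regex string comes from the answer. The list size used in main is kept small on purpose, because whether a list of 600 or more ids still overflows depends on the thread stack size of your JVM.

```java
import java.util.regex.Pattern;

final class UnrolledInListCheck {
    // The rewritten (unrolled-loop) pattern, copied from the answer above.
    static final Pattern LIST_PATTERN = Pattern.compile(
        "(\\s*'[^']*(?:''[^']*)*'(?:\\s*,\\s*'[^']*(?:''[^']*)*')*\\s*)"
        + "|(\\s*[0-9.-]+(?:,\\s*[0-9.-]+)*\\s*)");

    // True only if the WHOLE string is a list of quoted strings or a list of numbers.
    static boolean isValidInList(String value) {
        return LIST_PATTERN.matcher(value).matches();
    }

    // Hypothetical helper: builds a comma-separated list of `count` quoted ids.
    static String buildQuotedList(int count) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < count; i++) {
            if (i > 0) {
                sb.append(',');
            }
            sb.append("'xyz2006201257200426888282d'");
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println(isValidInList(buildQuotedList(50)));  // 50 quoted ids
        System.out.println(isValidInList("'it''s', 'ok'"));      // doubled quote inside a value
        System.out.println(isValidInList("1, 2.5,-3"));          // numeric list
        System.out.println(isValidInList("'a', 'b"));            // unterminated quote
    }
}
```

The first three calls should report a valid list and the last one an invalid list. To reproduce the original scenario, raise the argument of buildQuotedList toward 800 and watch for a StackOverflowError.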
Readable version
( # (1 start)
\s*
'
[^']*
(?: '' [^']* )*
'
(?:
\s* , \s*
'
[^']*
(?: '' [^']* )*
'
)*
\s*
) # (1 end)
|
( # (2 start)
\s*
[0-9.-]+
(?:
, \s* [0-9.-]+
)*
\s*
) # (2 end)
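If the unrolled pattern still overflows the stack on very long lists, one further option is to make every quantifier possessive (*+ and ++). This is not part of the answer above; it is a swapped-in variant based on my understanding that OpenJDK's java.util.regex evaluates possessive repetition with a loop instead of recursion, so treat it as an assumption to verify on your own JVM. The class and method names below are hypothetical, and the capturing groups are dropped because only validation is needed. For this particular grammar the possessive form should accept the same inputs, since a quote followed by another quote can only ever be an escaped pair.

```java
import java.util.regex.Pattern;

final class PossessiveInListCheck {
    // Same structure as the unrolled pattern, but with possessive quantifiers,
    // so the matcher never keeps backtracking state for the repeated parts.
    static final Pattern LIST_PATTERN = Pattern.compile(
        "\\s*+'[^']*+(?:''[^']*+)*+'(?:\\s*+,\\s*+'[^']*+(?:''[^']*+)*+')*+\\s*+"
        + "|\\s*+[0-9.-]++(?:,\\s*+[0-9.-]++)*+\\s*+");

    // True only if the WHOLE string is a list of quoted strings or a list of numbers.
    static boolean isValidInList(String value) {
        return LIST_PATTERN.matcher(value).matches();
    }

    // Hypothetical helper: builds a comma-separated list of `count` quoted ids.
    static String buildQuotedList(int count) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < count; i++) {
            if (i > 0) {
                sb.append(',');
            }
            sb.append("'xyz2006201257200426888282d'");
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // A list far longer than the ~600 ids that triggered the original error.
        System.out.println(isValidInList(buildQuotedList(5000)));
        System.out.println(isValidInList("'it''s'"));
        System.out.println(isValidInList("'a''"));  // dangling escaped quote, not a valid value
    }
}
```

Regardless of which pattern is used, binding the ids as PreparedStatement parameters rather than concatenating them into the query text is generally the safer way to build the in() clause.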
https://stackoverflow.com/questions/51409201