blocks|key|1234751|text|如果您有GNU+Grep，则可以使用-P使匹配不贪婪：|type|unstyled|depth|inlineStyleRanges|offset|length|style|CODE|entityRanges|data|1234752|$+tr+-d+\\012+<+price.html+%7C+grep+-Po+'<tr>.*?</tr>'|code-block|syntax|javascript|1234753|-P选项允许Perl兼容的正则表达式(PCRE)，这是与?作为基本正则表达式(BRE)进行非贪婪匹配所必需的，而扩展正则表达式(ERE)不支持它。|1234754|如果使用的是-P，也可以使用环顾四周来避免打印匹配中的标记，如下所示：|1234755|$+tr+-d+\\012+<+price.html+%7C+grep+-Po+'(?<=<tr>).*?(?=</tr>)'|1234756|如果您没有GNU+grep，并且HTML格式良好，那么您可以这样做：|1234757|$+tr+-d+\\012+<+price.html+%7C+grep+-o+'<tr>[%5E<]*</tr>'|1234758|注意:上面的示例不适用于<tr>__中的嵌套标记。|1234759|entityMap|0|LINK|mutability|MUTABLE|url|http://www.regular-expressions.info/lookaround.html^0|4|8|I|2|0|0|0|2|S|1|0|6|2|E|4|0|0|0|5|8|0|0|C|4|0^^$0|@$1|2|3|4|5|6|7|14|8|@$9|15|A|16|B|C]|$9|17|A|18|B|C]]|D|@]|E|$]]|$1|F|3|G|5|H|7|19|8|@]|D|@]|E|$I|J]]|$1|K|3|L|5|6|7|1A|8|@$9|1B|A|1C|B|C]|$9|1D|A|1E|B|C]]|D|@]|E|$]]|$1|M|3|N|5|6|7|1F|8|@$9|1G|A|1H|B|C]]|D|@$9|1I|A|1J|1|1K]]|E|$]]|$1|O|3|P|5|H|7|1L|8|@]|D|@]|E|$I|J]]|$1|Q|3|R|5|6|7|1M|8|@$9|1N|A|1O|B|C]]|D|@]|E|$]]|$1|S|3|T|5|H|7|1P|8|@]|D|@]|E|$I|J]]|$1|U|3|V|5|6|7|1Q|8|@$9|1R|A|1S|B|C]]|D|@]|E|$]]|$1|W|3|-4|5|6|7|1T|8|@]|D|@]|E|$]]]|X|$Y|$5|Z|10|11|E|$12|13]]]]

If you have <code>GNU Grep</code> you can use <code>-P</code> to make the match non-greedy: 

<pre><code>$ tr -d \\012 &lt; price.html | grep -Po '&lt;tr&gt;.*?&lt;/tr&gt;'
</code></pre>

The <code>-P</code> option enables Perl Compliant Regular Expression (PCRE) which is needed for non-greedy matching with <code>?</code> as Basic Regular Expression (BRE) and Extended Regular Expression (ERE) do not support it.

If you are using <code>-P</code> you could also use <a href="http://www.regular-expressions.info/lookaround.html" rel="noreferrer">look arounds</a> to avoid printing the tags in the match like so:

<pre><code>$ tr -d \\012 &lt; price.html | grep -Po '(?&lt;=&lt;tr&gt;).*?(?=&lt;/tr&gt;)'
</code></pre>

<hr>

If you don't have <code>GNU grep</code> and the HTML is well formed you could just do:

<pre><code>$ tr -d \\012 &lt; price.html | grep -o '&lt;tr&gt;[^&lt;]*&lt;/tr&gt;'
</code></pre>

Note: The above example won't work with nested tags within <code>&lt;tr&gt;</code>.

blocks|key|2312156|text|.*?是一个Perl正则表达式。将grep更改为|type|unstyled|depth|inlineStyleRanges|offset|length|style|CODE|entityRanges|data|2312157|grep+-oP+'<tr>.*?</tr>'|code-block|syntax|javascript|2312158|entityMap^0|0|3|H|4|0|0^^$0|@$1|2|3|4|5|6|7|M|8|@$9|N|A|O|B|C]|$9|P|A|Q|B|C]]|D|@]|E|$]]|$1|F|3|G|5|H|7|R|8|@]|D|@]|E|$I|J]]|$1|K|3|-4|5|6|7|S|8|@]|D|@]|E|$]]]|L|$]]

<code>.*?</code> is a Perl regular expression. Change your <code>grep</code> to

<pre><code>grep -oP '&lt;tr&gt;.*?&lt;/tr&gt;'
</code></pre>

blocks|key|193718|text|尝试perl风格的regexp。|type|unstyled|depth|inlineStyleRanges|entityRanges|data|193719|$+grep+-Po+'<tr>.*?</tr>'+input
<tr>stuff</tr>
<tr>more+stuff</tr>|code-block|syntax|javascript|193720|entityMap^0|0|0^^$0|@$1|2|3|4|5|6|7|I|8|@]|9|@]|A|$]]|$1|B|3|C|5|D|7|J|8|@]|9|@]|A|$E|F]]|$1|G|3|-4|5|6|7|K|8|@]|9|@]|A|$]]]|H|$]]

Try perl-style-regexp

<pre><code>$ grep -Po '&lt;tr&gt;.*?&lt;/tr&gt;' input
&lt;tr&gt;stuff&lt;/tr&gt;
&lt;tr&gt;more stuff&lt;/tr&gt;
</code></pre>

blocks|key|2312194|text|非贪婪匹配不是grep+-E支持的扩展正则表达式语法的一部分。如果有，则使用grep+-P，或者切换到Perl+/+Python+/+Ruby+/什么东西。(哦，还有pcregrep.)|type|unstyled|depth|inlineStyleRanges|offset|length|style|CODE|entityRanges|data|2312195|当然，如果你真的是说|2312196|<tr>[%5E<>]*</tr>|code-block|syntax|javascript|2312197|相反，您应该这样说；那么普通的旧grep就可以正常工作了。|2312198|您可以(乏味地)扩展regex以接受嵌套标记，这些标记不是<tr>，但当然，最好使用适当的HTML解析器，而不是花费大量时间重新发现正则表达式为什么不是正确的工具。|2312199|entityMap^0|7|7|12|7|2B|8|0|0|0|G|4|0|T|4|0^^$0|@$1|2|3|4|5|6|7|S|8|@$9|T|A|U|B|C]|$9|V|A|W|B|C]|$9|X|A|Y|B|C]]|D|@]|E|$]]|$1|F|3|G|5|6|7|Z|8|@]|D|@]|E|$]]|$1|H|3|I|5|J|7|10|8|@]|D|@]|E|$K|L]]|$1|M|3|N|5|6|7|11|8|@$9|12|A|13|B|C]]|D|@]|E|$]]|$1|O|3|P|5|6|7|14|8|@$9|15|A|16|B|C]]|D|@]|E|$]]|$1|Q|3|-4|5|6|7|17|8|@]|D|@]|E|$]]]|R|$]]

Non-greedy matching is not part of the Extended Regular Expression syntax supported by <code>grep -E</code>. Use <code>grep -P</code> instead if you have that, or switch to Perl / Python / Ruby / what have you. (Oh, and <code>pcregrep</code>.)

Of course, if you really mean

<pre><code>&lt;tr&gt;[^&lt;&gt;]*&lt;/tr&gt;
</code></pre>

you should say that instead; then plain old <code>grep</code> will work fine.

You could (tediously) extend the regex to accept nested tags which are not <code>&lt;tr&gt;</code> but of course, it's better to use a proper HTML parser than spend a lot of time rediscovering why regular expressions are not the right tool for this.

I'm writing a bash script which analyses a html file and
I want to get the content of each single <code>&lt;tr&gt;...&lt;/tr&gt;</code>. So my command looks like:

<pre><code>$ tr -d \\012 &lt; price.html | grep -oE '&lt;tr&gt;.*?&lt;/tr&gt;'
</code></pre>

But it seems that <code>grep</code> gives me the result of:

<pre><code>$ tr -d \\012 &lt; price.html | grep -oE '&lt;tr&gt;.*&lt;/tr&gt;'
</code></pre>

How can I make <code>.*</code> non-greedy?

Non greedy matching using ? with grep

翻译质量差，导致语言生硬或混乱。

没有提供实际的解决方法或示例。

解答不清晰，无法理解或解决问题。

页面排版不美观，阅读体验差。

文章

问答

视频

学习中心

腾讯云实验室

直播

竞赛

腾讯云代码分析专区

腾讯iOA零信任安全管理系统专区

腾讯云架构师技术同盟交流圈

腾讯云数据库专区

腾讯云顾问专区

腾讯云原生专区

腾讯混元专区

腾讯云TCE专区

腾讯云Lighthouse专区

腾讯云HAI专区

腾讯云Edgeone专区

腾讯云存储专区

腾讯云智能专区

腾讯轻联专区 

腾讯云开发专区

TAPD专区

腾讯轻量云游戏服专区

腾讯云最具价值专家

腾讯云架构师技术同盟

腾讯云创作之星

腾讯云开发者先锋

腾讯云代码助手

云原生构建

TAPD 敏捷项目管理

Cloud Studio

SDK中心

API中心

命令行工具

涵盖代码开发、场景应用、自动测试全流程，助你从零构建专属AI助手

一站式MCP教程库，解锁AI应用新玩法

我正在编写一个bash脚本，它分析一个html文件，并希望获得每个<tr>...</tr>的内容。所以我的命令看起来是：$ tr -d \\012 < price.html | grep -oE '<tr>.*?</tr>'但是grep似乎给了我以下结果：$ tr -d \\012 < price.html | gre...

问非贪婪匹配使用？带着grep
EN

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问非贪婪匹配使用？带着grepEN