blocks|key|2600738|text|我想你可以用苏特|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|2600739|Custom+Use+(only+the+mentioned+tags+and+attributes+are+allowed,+nothing+else)
<%25=+sanitize+@article.body,+tags:+%25w(table+tr+td),+attributes:+%25w(id+class+style)+%25>|code-block|syntax|javascript|2600740|所以，像这样的东西应该能起作用：|2600741|sanitize+result_string,+tags:+%25w(em)|2600742|entityMap|0|LINK|mutability|MUTABLE|url|http://api.rubyonrails.org/classes/ActionView/Helpers/SanitizeHelper.html#method-i-sanitize^0|6|2|0|0|0|0|0^^$0|@$1|2|3|4|5|6|7|U|8|@]|9|@$A|V|B|W|1|X]]|C|$]]|$1|D|3|E|5|F|7|Y|8|@]|9|@]|C|$G|H]]|$1|I|3|J|5|6|7|Z|8|@]|9|@]|C|$]]|$1|K|3|L|5|F|7|10|8|@]|9|@]|C|$G|H]]|$1|M|3|-4|5|6|7|11|8|@]|9|@]|C|$]]]|N|$O|$5|P|Q|R|C|$S|T]]]]

I think you can use the <a href="http://api.rubyonrails.org/classes/ActionView/Helpers/SanitizeHelper.html#method-i-sanitize" rel="nofollow">sinitize</a>:

<pre><code>Custom Use (only the mentioned tags and attributes are allowed, nothing else)
&lt;%= sanitize @article.body, tags: %w(table tr td), attributes: %w(id class style) %&gt;
</code></pre>

So, something like that should work:

<pre><code>sanitize result_string, tags: %w(em)
</code></pre>

blocks|key|1364171|text|使用消毒的附加参数，您可以指定允许哪些标记。|type|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|data|1364172|在您的示例中，请尝试：|1364173|ActionController::Base.helpers.sanitize(+result_string,+tags:+%25w(em)+)+|code-block|syntax|javascript|1364174|它应该能起作用|1364175|entityMap|0|LINK|mutability|MUTABLE|url|http://apidock.com/rails/v4.0.2/ActionView/Helpers/SanitizeHelper/sanitize^0|2|2|0|0|0|0|0^^$0|@$1|2|3|4|5|6|7|U|8|@]|9|@$A|V|B|W|1|X]]|C|$]]|$1|D|3|E|5|6|7|Y|8|@]|9|@]|C|$]]|$1|F|3|G|5|H|7|Z|8|@]|9|@]|C|$I|J]]|$1|K|3|L|5|6|7|10|8|@]|9|@]|C|$]]|$1|M|3|-4|5|6|7|11|8|@]|9|@]|C|$]]]|N|$O|$5|P|Q|R|C|$S|T]]]]

With an additional parameter to <a href="http://apidock.com/rails/v4.0.2/ActionView/Helpers/SanitizeHelper/sanitize" rel="nofollow">sanitize</a>, you can specify which tags are allowed.

In your example, try:

<pre><code>ActionController::Base.helpers.sanitize( result_string, tags: %w(em) ) 
</code></pre>

It should do the trick

blocks|key|1137952|text|你可以打电话给gsub！若要丢弃所有标记，但只保留独立或不包含在html标记中的标记，请执行以下操作。|type|unstyled|depth|inlineStyleRanges|entityRanges|data|1137953|result_string.gsub!(/(<\/?[%5Ee][%5Em]>)%7C(<\w*<\/em>>)%7C(<\/\w*<\/em>>)/,+'')|code-block|syntax|javascript|1137954|会起作用|1137955|解释：|1137956|#+first+group+(<\/?[%5Ee][%5Em]>)+
#+find+all+html+tags+that+are+not++or+

#+second+group+(<\w*<\/em>>)
#+find+all+opening+tags+that+have+++inside+of+them+like:
#+<li>+++or+<ul>

#+third+group+(<\/\w*<\/em>>)
#+find+all+closing+tags+that+have+++inside+of+them:
#+</li>+++or++</ul>

#+and+gsub+replaces+all+of+this+with+empty+string|1137957|entityMap^0|0|0|0|0|0^^$0|@$1|2|3|4|5|6|7|O|8|@]|9|@]|A|$]]|$1|B|3|C|5|D|7|P|8|@]|9|@]|A|$E|F]]|$1|G|3|H|5|6|7|Q|8|@]|9|@]|A|$]]|$1|I|3|J|5|6|7|R|8|@]|9|@]|A|$]]|$1|K|3|L|5|D|7|S|8|@]|9|@]|A|$E|F]]|$1|M|3|-4|5|6|7|T|8|@]|9|@]|A|$]]]|N|$]]

You could call gsub! to discard all tags but keep only tags that are independent, or that are not included in html tag.

<pre><code>result_string.gsub!(/(&lt;\/?[^e][^m]&gt;)|(&lt;&lt;em&gt;\w*&lt;\/em&gt;&gt;)|(&lt;\/&lt;em&gt;\w*&lt;\/em&gt;&gt;)/, '')
</code></pre>

would do the trick

To explain:

<pre><code># first group (&lt;\/?[^e][^m]&gt;) 
# find all html tags that are not &lt;em&gt; or &lt;/em&gt;

# second group (&lt;&lt;em&gt;\w*&lt;\/em&gt;&gt;)
# find all opening tags that have &lt;em&gt; &lt;/em&gt; inside of them like:
# &lt;&lt;em&gt;li&lt;/em&gt;&gt; or &lt;&lt;em&gt;ul&lt;/em&gt;&gt;

# third group (&lt;\/&lt;em&gt;\w*&lt;\/em&gt;&gt;)
# find all closing tags that have &lt;em&gt; &lt;/em&gt; inside of them:
# &lt;/&lt;em&gt;li&lt;/em&gt;&gt; or &lt;/&lt;em&gt;ul&lt;/em&gt;&gt;

# and gsub replaces all of this with empty string
</code></pre>

I am trying to sanitalize Solr search results, cause it has html tags inside:

<code>ActionController::Base.helpers.sanitize( result_string )</code> 

It is easy to sanitalize not highlighted string like: <code>I know &lt;ul&gt;&lt;li&gt;ruby&lt;/li&gt; &lt;li&gt;rails&lt;/li&gt;&lt;/ul&gt;</code>.

But when results is highlighted I have additional important tags inside - <code>&lt;em&gt;</code> and <code>&lt;/em&gt;</code>:

<code>I &lt;em&gt;know&lt;/em&gt; &lt;&lt;em&gt;ul&lt;/em&gt;&gt;&lt;&lt;em&gt;li&lt;/em&gt;&gt;&lt;em&gt;ruby&lt;/em&gt;&lt;/&lt;em&gt;li&lt;/em&gt;&gt; &lt;&lt;em&gt;li&lt;/em&gt;&gt;&lt;em&gt;rails&lt;/em&gt;&lt;/&lt;em&gt;li&lt;/em&gt;&gt;&lt;/&lt;em&gt;ul&lt;/em&gt;&gt;</code>.

So, when I sanitalize string with nested html and highlighting tags, I get string with peaces of htmls tags. And it is bad :)

How can I sanitalize highlighted string with <code>&lt;em&gt;</code> tags inside to get correct result (string with <code>&lt;em&gt;</code> tags only)? 

I found the way, but it's slow and not pretty:

<pre><code>string = 'I &lt;em&gt;know&lt;/em&gt; &lt;&lt;em&gt;ul&lt;/em&gt;&gt;&lt;&lt;em&gt;li&lt;/em&gt;&gt;&lt;em&gt;ruby&lt;/em&gt;&lt;/&lt;em&gt;li&lt;/em&gt;&gt; &lt;&lt;em&gt;li&lt;/em&gt;&gt;&lt;em&gt;rails&lt;/em&gt;&lt;/&lt;em&gt;li&lt;/em&gt;&gt;&lt;/&lt;em&gt;ul&lt;/em&gt;&gt;'

['p', 'ul', 'li', 'ol', 'span', 'b', 'br'].each do |tag| 
 string.gsub!( "&lt;&lt;em&gt;#{tag}&lt;/em&gt;&gt;", '' )
 string.gsub!( "&lt;/&lt;em&gt;#{tag}&lt;/em&gt;&gt;", '' )
end

string = ActionController::Base.helpers.sanitize string, tags: %w(em)
</code></pre>

How can I optimize it or do it using some better solution?
to write some regex and remove html_tags, but keep <code>&lt;em&gt;</code> and <code>&lt;/em&gt;</code> e.g.

Please help, thanks.

How to sanitalize string with nested html tags but keep tag?

翻译质量差，导致语言生硬或混乱。

没有提供实际的解决方法或示例。

解答不清晰，无法理解或解决问题。

页面排版不美观，阅读体验差。

文章

问答

视频

学习中心

腾讯云实验室

直播

竞赛

腾讯云代码分析专区

腾讯iOA零信任安全管理系统专区

腾讯云架构师技术同盟交流圈

腾讯云数据库专区

腾讯云顾问专区

腾讯云原生专区

腾讯混元专区

腾讯云TCE专区

腾讯云Lighthouse专区

腾讯云HAI专区

腾讯云Edgeone专区

腾讯云存储专区

腾讯云智能专区

腾讯轻联专区 

腾讯云开发专区

TAPD专区

腾讯轻量云游戏服专区

腾讯云最具价值专家

腾讯云架构师技术同盟

腾讯云创作之星

腾讯云开发者先锋

腾讯云代码助手

云原生构建

TAPD 敏捷项目管理

Cloud Studio

SDK中心

API中心

命令行工具

涵盖代码开发、场景应用、自动测试全流程，助你从零构建专属AI助手

一站式MCP教程库，解锁AI应用新玩法

我试图对Solr搜索结果进行清理，因为其中包含html标记：ActionController::Base.helpers.sanitize( result_string )很容易清除突出显示的字符串，比如：I know <ul><li>ruby</li> <li>rails</li></ul>。但是当结果突出显示时，我...

问如何使用嵌套的html标记清除字符串，但保留<em>标记？
EN

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问如何使用嵌套的html标记清除字符串，但保留<em>标记？EN