<pre><code>import re

output_string = re.sub(r'[^\d\s-]', '', input_string)
</code></pre>

The pattern <code>[^\d\s-]</code> matches any character that is not a digit, a hyphen, or whitespace, so replacing every match with an empty string removes everything except digits, minus signs, and whitespace.
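As a quick check of what that pattern keeps, here is a minimal sketch run on a fragment of the question's third sample line. The variable names `sample` and `cleaned` are only for illustration.

```python
import re

# Fragment of the third sample line from the question.
sample = "(2003 2 ! -6 021 0"

# Delete every character that is not a digit, whitespace, or a hyphen.
cleaned = re.sub(r'[^\d\s-]', '', sample)
print(cleaned)  # 2003 2  -6 021 0
```

The minus sign in `-6` survives, but the space on each side of the removed `!` is kept, so the result contains a doubled space.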

If you want to keep just digits, plus and minus signs, and all whitespace, the simplest approach might be:

<pre><code>import re
 ...
line = re.sub(r'[^\d\s+-]+', '', line)
</code></pre>

which reads "replace each run of one or more characters that are not digits, whitespace, plus signs, or minus signs with nothing".

Faster would be the <code>translate</code> method of strings, but it is quite a bit less simple to set up, so, since you ask for "straightforward", I suggest the <code>re</code> approach (now brace for the sure-to-come screeches of the <code>re</code>-haters...;-).
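For comparison, here is one way the <code>translate</code> setup could look. This is only a sketch, assuming Python 3 (where <code>str.translate</code> takes a mapping from code points to replacements); the <code>KeepOnly</code> helper is a hypothetical name, not something from the standard library, and I have not timed it against the <code>re</code> version.

```python
import string


class KeepOnly(dict):
    """Translation table that deletes every character not listed in `keep`."""

    def __init__(self, keep):
        # Map each kept code point to itself.
        super().__init__((ord(ch), ch) for ch in keep)

    def __missing__(self, code_point):
        # str.translate deletes characters whose table entry is None.
        return None


table = KeepOnly(string.digits + string.whitespace + '+-')

line = "16 ` C38# 26535"
print(line.translate(table))  # 16  38 26535
```

The table is built once and can be reused for every line, which is where the extra setup pays off.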

<pre><code>import string
''.join([x for x in s if x in string.digits + string.whitespace])
</code></pre>

or if what you really want is a list of the numbers:

<pre><code>import re
re.findall(r'\d+', s)
</code></pre>
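Note that <code>\d+</code> on its own drops any sign in front of a number. If signed values matter, a variant with an optional sign is one possibility. This is a sketch under that assumption; the <code>[+-]?</code> prefix is an addition of mine, not part of the answer above.

```python
import re

s = "(2003 2 ! -6 021 0"

# Optional leading sign followed by one or more digits.
numbers = re.findall(r'[+-]?\d+', s)
print(numbers)  # ['2003', '2', '-6', '021', '0']
```

The matches come back as strings, so leading zeros such as `021` are preserved until you convert them yourself.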

LOL @Alex's regex comment... hopefully there aren't too many haters. That said, although regexes are faster because they're executed in C, they aren't my first choice... perhaps I've been biased by the <a href="http://regex.info/blog/2006-09-15/247" rel="nofollow noreferrer">famous jwz quote</a>: "Some people, when confronted with a problem, think 'I know, I'll use regular expressions.' Now they have two problems."

I will say that solving this homework exercise is tricky because solutions are fraught with errors, as seen in the existing solutions so far. Perhaps this is serendipity because it requires the OP to debug and correct those suggestions instead of just cutting-and-pasting them verbatim into their assignment solution.

As far as the problems go, they include but are not limited to:

<ul>
<li>leaving successive spaces</li>
<li>removing negative signs, and</li>
<li>merging multiple numbers together</li>
</ul>
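Two of these problems are easy to reproduce on the first sample line from the question. The second substitution below is only a sketch of one possible fix, assuming a single space between numbers is acceptable and leading or trailing whitespace does not matter; the names `deleted` and `spaced` are illustrative.

```python
import re

line = "16 ` C38# 26535 2010 4 14 2 7 7 3 8^@1 2"

# Deleting unwanted characters outright merges "8^@1" into "81"
# and leaves a doubled space where "`" was removed.
deleted = re.sub(r'[^\d\s+-]+', '', line)
print(deleted)  # 16  38 26535 2010 4 14 2 7 7 3 81 2

# Replacing each run of unwanted characters (whitespace included) with one
# space keeps neighbouring numbers apart and avoids repeated spaces.
spaced = re.sub(r'[^\d+-]+', ' ', line).strip()
print(spaced)  # 16 38 26535 2010 4 14 2 7 7 3 8 1 2
```

The second result matches the first line of the expected output in the question, though a stray `+` or `-` that is not attached to a number would still be kept.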

Bottom line... which solutions do I like best? I would start with one of the following and debug from there:

For regex, I'll pick:

@Alex's solution or @Matt's if I want just the data instead of the "golden" string

For string processing, I'll modify @Matt's solution to:

<pre><code>import string

keep = set(string.whitespace + string.digits + '+-')
line = ''.join(x for x in line if x in keep)
</code></pre>
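Applied line by line to a file, that could look roughly like the following. It is a sketch only: `clean_lines` is a hypothetical helper name, and `io.StringIO` stands in for an open text file so the example runs on its own.

```python
import io
import string

keep = set(string.whitespace + string.digits + '+-')


def clean_lines(lines):
    """Yield each line with every character outside `keep` removed."""
    for line in lines:
        yield ''.join(ch for ch in line if ch in keep)


# io.StringIO stands in for an open text file here.
fake_file = io.StringIO(" 15 100 140 30 $ 14^]\n0$ ^@0 0 0\n")
result = list(clean_lines(fake_file))
print(result)  # [' 15 100 140 30  14\n', '0 0 0 0\n']
```

Newlines count as whitespace, so each line keeps its line ending, and the doubled space where `$` was removed is still there.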

Finally, @Greg has a good point. Without a clear spec, these are just partial solutions.

How can I remove special characters and letters from a line read from a text file while preserving the whitespace? Let's say we have the following contents in a file:

16 ` C38# 26535 2010 4 14 2 7 7 3 8^@1 2
 15 100 140 30 $ 14^]
 (2003 2 ! -6 �021 0 � 14 ! 2 3! 1 0 35454
 0$ ^@0 0 0 "0 "63 194 (56 188 26 27" 24 0 0 10� 994! 8 58
 0 0 " � 0 0 32�47 32767 32767 ! 1

The output basically should be:

16 38 26535 2010 4 14 2 7 7 3 8 1 2
 15 100 140 30 14 
 2003 2 -6 021 0 14 2 3 1 0 35454
 0 0 0 0 0 63 194 56 188 26 27 24 0 0 10 994 8 58
 0 0 0 0 32 47 32767 32767 1

What's the most straightforward way to do this?

How to remove special characters and letters from a line read from a text file in Python?
