entityMap|blocks|key|2b7mo|text|这是因为您选择了错误的编码。|type|unstyled|depth|inlineStyleRanges|entityRanges|data|9mhok|由于您使用的是Windows计算机，因此只需替换|otcu|Past=pd.read_csv("C:/Users/.../Past.csv",encoding='utf-8')+|code-block|syntax|javascript|51tng|使用|atvic|Past=pd.read_csv("C:/Users/.../Past.csv",encoding='cp1252')|5igs3|应该可以解决这个问题。|ebc0^0|0|0|0|0|0|0^^$0|$]|1|@$2|3|4|5|6|7|8|Q|9|@]|A|@]|B|$]]|$2|C|4|D|6|7|8|R|9|@]|A|@]|B|$]]|$2|E|4|F|6|G|8|S|9|@]|A|@]|B|$H|I]]|$2|J|4|K|6|7|8|T|9|@]|A|@]|B|$]]|$2|L|4|M|6|G|8|U|9|@]|A|@]|B|$H|I]]|$2|N|4|O|6|7|8|V|9|@]|A|@]|B|$]]|$2|P|4|-4|6|7|8|W|9|@]|A|@]|B|$]]]]

This happens because you chose the wrong encoding.
Since you are working on a Windows machine, just replacing
<pre><code>Past=pd.read_csv(&quot;C:/Users/.../Past.csv&quot;,encoding='utf-8') 
</code></pre>
with
<pre><code>Past=pd.read_csv(&quot;C:/Users/.../Past.csv&quot;,encoding='cp1252')
</code></pre>
should solve the problem.

entityMap|0|type|LINK|mutability|MUTABLE|data|url|https://docs.python.org/3/howto/unicode.html#the-unicode-type|blocks|key|7d3lr|text|使用此解决方案，它将剥离(忽略)字符并返回不带字符的字符串。只有当你需要剥离它们而不是转换它们时才使用它。|unstyled|depth|inlineStyleRanges|entityRanges|9kvd1|with+open(path,+encoding="utf8",+errors='ignore')+as+f:|code-block|syntax|javascript|ar65l|使用errors='ignore'只会丢失一些字符。但是，如果你不关心他们，因为他们似乎是额外的字符起源于一个错误的格式和编程的客户端连接到我的套接字服务器。然后这是一个简单直接的解决方案。reference|offset|length|style|CODE|ashst^0|0|0|2|F|2N|9|0|0^^$0|$1|$2|3|4|5|6|$7|8]]]|9|@$A|B|C|D|2|E|F|U|G|@]|H|@]|6|$]]|$A|I|C|J|2|K|F|V|G|@]|H|@]|6|$L|M]]|$A|N|C|O|2|E|F|W|G|@$P|X|Q|Y|R|S]]|H|@$P|Z|Q|10|A|11]]|6|$]]|$A|T|C|-4|2|E|F|12|G|@]|H|@]|6|$]]]]

Use this solution it will strip out (ignore) the characters and return the string without them. Only use this if your need is to strip them not convert them.

<pre><code>with open(path, encoding="utf8", errors='ignore') as f:
</code></pre>

Using <code>errors='ignore'</code> You'll just lose some characters. but if your don't care about them as they seem to be extra characters originating from a the bad formatting and programming of the clients connecting to my socket server. Then its a easy direct solution. <a href="https://docs.python.org/3/howto/unicode.html#the-unicode-type" rel="noreferrer">reference</a>

entityMap|blocks|key|2du2n|text|尝试使用：|type|unstyled|depth|inlineStyleRanges|entityRanges|data|6jvp6|pd.read_csv("Your+filename",+encoding="ISO-8859-1")|code-block|syntax|javascript|cgntl|我从一些网站解析的代码被转换成这种编码，而不是标准的默认UTF-8编码。|4r3mi^0|0|0|0^^$0|$]|1|@$2|3|4|5|6|7|8|K|9|@]|A|@]|B|$]]|$2|C|4|D|6|E|8|L|9|@]|A|@]|B|$F|G]]|$2|H|4|I|6|7|8|M|9|@]|A|@]|B|$]]|$2|J|4|-4|6|7|8|N|9|@]|A|@]|B|$]]]]

Try using :
<pre><code>pd.read_csv(&quot;Your filename&quot;, encoding=&quot;ISO-8859-1&quot;)
</code></pre>
The code that I parsed from some website was converted in this encoding instead of default UTF-8 encoding which is standard.

entityMap|blocks|key|5kfvu|text|下面这些对我来说非常好用：|type|unstyled|depth|inlineStyleRanges|entityRanges|data|4j9o4|encoding+=+'latin1'|code-block|syntax|javascript|61k27^0|0|0^^$0|$]|1|@$2|3|4|5|6|7|8|I|9|@]|A|@]|B|$]]|$2|C|4|D|6|E|8|J|9|@]|A|@]|B|$F|G]]|$2|H|4|-4|6|7|8|K|9|@]|A|@]|B|$]]]]

The following works very well for me:

<pre><code>encoding = 'latin1'
</code></pre>

entityMap|blocks|key|5bgse|text|使用下面的代码对我来说很有效：|type|unstyled|depth|inlineStyleRanges|entityRanges|data|a9bef|with+open(keeniz_dir+%2B+'/world_cities.csv',++'r',+encoding='latin1')+as+input:|code-block|syntax|javascript|beonq^0|0|0^^$0|$]|1|@$2|3|4|5|6|7|8|I|9|@]|A|@]|B|$]]|$2|C|4|D|6|E|8|J|9|@]|A|@]|B|$F|G]]|$2|H|4|-4|6|7|8|K|9|@]|A|@]|B|$]]]]

Using the code bellow works for me:

<pre><code>with open(keeniz_dir + '/world_cities.csv', 'r', encoding='latin1') as input:
</code></pre>

entityMap|0|type|LINK|mutability|MUTABLE|data|url|https://docs.python.org/3/library/codecs.html#standard-encodings|1|https://stackoverflow.com/questions/4255305/how-to-determine-encoding-table-of-a-text-file|blocks|key|46htv|text|这是一个古老的问题，但在搜索此错误的解决方案时出现了。因此，我想要回答所有那些仍然在这个帖子上跌跌撞撞的人。在为编码参数传递正确的值之前，可以检查文件的编码。要获得编码，Windows中的一个简单选项是在Notepad%2B%2B中打开文件并查看编码。然后，可以在the+python+documentation中找到编码参数的正确值。有关获取文件编码的不同可能性的更多详细信息，请查看此question+and+the+answers+on+stackoverflow。|unstyled|depth|inlineStyleRanges|entityRanges|offset|length|ehdp4^0|3K|O|0|5A|15|1|0^^$0|$1|$2|3|4|5|6|$7|8]]|9|$2|3|4|5|6|$7|A]]]|B|@$C|D|E|F|2|G|H|N|I|@]|J|@$K|O|L|P|C|Q]|$K|R|L|S|C|T]]|6|$]]|$C|M|E|-4|2|G|H|U|I|@]|J|@]|6|$]]]]

Its an old question but shows up while searching for solutions to this error. So I thought to answer for all who still stumble on this thread.
The encoding for the file can be checked before passing the correct value for the encoding argument.
To get the encoding, a simple option in Windows is to open the file in Notepad++ and look at the encoding. The correct value for the encoding argument can then be found in <a href="https://docs.python.org/3/library/codecs.html#standard-encodings" rel="nofollow noreferrer">the python documentation</a>.
Look at this <a href="https://stackoverflow.com/questions/4255305/how-to-determine-encoding-table-of-a-text-file">question and the answers on stackoverflow</a> for more details on different possibilities to get the file encoding.

entityMap|blocks|key|deu9n|text|除非您确定文件编码，否则不要传递编码选项。默认值encoding=None将errors="replace“传递给调用的open()函数。有编码错误的字符将被替换，然后您可以找出正确的编码或只使用生成的Dataframe。如果提供了错误的编码，pd将把errors="strict“传递给open()，如果编码不正确，则获取ValueError。|type|unstyled|depth|inlineStyleRanges|entityRanges|data|fat9n^0|0^^$0|$]|1|@$2|3|4|5|6|7|8|D|9|@]|A|@]|B|$]]|$2|C|4|-4|6|7|8|E|9|@]|A|@]|B|$]]]]

Don't pass encoding option unless you are sure about file encoding. Default value encoding=None passes errors=&quot;replace&quot; to open() function called. Characters with encoding errors will be substituted with replacements, you can then figure out correct encoding or just use the resulting Dataframe. If wrong encoding is provided pd will pass errors=&quot;strict&quot; to open() and get ValueError if encoding is incorrect.

I am new to Python, I am trying to read csv file using below script.

<pre><code>Past=pd.read_csv("C:/Users/Admin/Desktop/Python/Past.csv",encoding='utf-8')
</code></pre>

But, getting error "UnicodeDecodeError: 'utf-8' codec can't decode byte 0x96 in position 35: invalid start byte", Please help me to know issue here, I used encoding in script thought it will resolve error.

UnicodeDecodeError: 'utf-8' codec can't decode byte 0x96 in position 35: invalid start byte

翻译质量差，导致语言生硬或混乱。

没有提供实际的解决方法或示例。

解答不清晰，无法理解或解决问题。

页面排版不美观，阅读体验差。

文章

问答

视频

学习中心

腾讯云实验室

直播

竞赛

腾讯云架构师技术同盟交流圈

腾讯云数据库专区

腾讯云顾问专区

腾讯云原生专区

腾讯混元专区

腾讯云TCE专区

腾讯云Lighthouse专区

腾讯云HAI专区

腾讯云Edgeone专区

腾讯云存储专区

腾讯云智能专区

腾讯轻联专区 

腾讯云开发专区

TAPD专区

腾讯轻量云游戏服专区

腾讯云最具价值专家

腾讯云架构师技术同盟

腾讯云创作之星

腾讯云开发者先锋 

腾讯云代码助手

CODING DevOps

Cloud Studio

SDK中心

API中心

命令行工具

 我是Python的新手，我正在尝试使用下面的脚本读取csv文件。 Past=pd.read_csv("C:/Users/Admin/Desktop/Python/Past.csv",encoding='utf-8') 但是，得到错误"UnicodeDecodeError：'utf-8‘编解码器无法解码字节0x96在位置35:无效的开始字节“，请帮助我了解这里的问题，我在脚本中使用编码，认为它可以

问UnicodeDecodeError：'utf-8‘编解码器无法解码位置35处的字节0x96 :无效的起始字节
EN

回答 7

Stack Overflow用户

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问UnicodeDecodeError：'utf-8‘编解码器无法解码位置35处的字节0x96 :无效的起始字节EN

回答 7

Stack Overflow用户

Stack Overflow用户

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问UnicodeDecodeError：'utf-8‘编解码器无法解码位置35处的字节0x96 :无效的起始字节
EN