开发者社区

文档建议反馈控制台

最新优惠活动

文章/答案/技术大牛

发布

正在读取CSV & Columns KeyError：“[Int64Index([0，1，2，3]，dtype='int64')]都不在[columns]中”

这个错误是由于读取CSV文件时，指定的列索引在文件的列名中找不到所引起的。下面是对这个错误的完善且全面的答案：

CSV文件是一种常用的文本文件格式，用于存储表格数据。在读取CSV文件时，我们通常需要指定要读取的列索引或列名。然而，当指定的列索引在文件的列名中找不到时，就会出现"KeyError: '[Int64Index([0, 1, 2, 3], dtype='int64')] not in [columns]'"的错误。

要解决这个错误，我们可以采取以下步骤：

检查CSV文件的列名：首先，我们需要确保CSV文件的列名与我们指定的列索引或列名匹配。可以使用文本编辑器或CSV文件阅读器查看文件的内容，并确认列名是否正确。
检查读取代码：如果列名正确，那么可能是读取代码中出现了问题。请检查读取CSV文件的代码，确保正确指定了要读取的列索引或列名。可以使用Python的pandas库来读取CSV文件，示例代码如下：

import pandas as pd

# 读取CSV文件
data = pd.read_csv('file.csv')

# 指定要读取的列索引或列名
columns = [0, 1, 2, 3]  # 或者 columns = ['column1', 'column2', 'column3', 'column4']

# 检查指定的列索引或列名是否存在于文件的列名中
missing_columns = [col for col in columns if col not in data.columns]

if missing_columns:
    print(f"The following columns are missing: {missing_columns}")
else:
    # 执行其他操作，如数据处理、分析等
    pass

在上述代码中，我们使用pandas库的read_csv函数读取CSV文件，并指定要读取的列索引或列名。然后，我们检查指定的列索引或列名是否存在于文件的列名中，如果有缺失的列，则输出缺失的列名。

检查CSV文件的格式：如果以上步骤都没有问题，那么可能是CSV文件的格式有误。请确保CSV文件的格式正确，每列之间使用逗号分隔，并且没有其他格式错误。

总结起来，当出现"KeyError: '[Int64Index([0, 1, 2, 3], dtype='int64')] not in [columns]'"的错误时，我们需要检查CSV文件的列名、读取代码和文件格式，以确定问题的根源并进行修复。

腾讯云提供了一系列与云计算相关的产品和服务，包括云服务器、云数据库、云存储等。您可以访问腾讯云官方网站（https://cloud.tencent.com/）了解更多关于腾讯云的产品和服务信息。

页面内容是否对你有帮助？

有帮助

没帮助

相关·内容

Pandas 第一轮零基础扫盲

3 f 7 dtype: int64 获取数组中多个数据「连续的」 In [14]: data[1:3] # 也可以有步长 Out[14]: k 3 x 5 dtype: int64...([2, 3], dtype='int64'), 'Kings': Int64Index([4, 6, 7], dtype='int64'), 'Riders': Int64Index([0, 1,...8, 11], dtype='int64'), 'Royals': Int64Index([9, 10], dtype='int64'), 'kings': Int64Index([5], dtype...CSV 文件「默认会把文件的第一行，变成标题」https://aiyc.lanzoux.com/iSU8ufj79af data = pd.read_csv('rating.csv') 读取 CSV...文件，不要标题行「取消第一行为标题」 data = pd.read_csv('rating.csv', header=None) 读取 CSV 文件，自定义标题行 data = pd.read_csv(

2K0 0

Pandas

别的同事会给你一个excel文件或者csv文件 #2. 使用pandas读取csv文件 movies = pd.read_csv('..../aaaa.csv',index=False) #### 保存数据到一个文件中 pd.read_csv('....([48, 49], dtype='int64'), '克里夫兰骑士队': Int64Index([70], dtype='int64'), '华盛顿子弹队': Int64Index([32], dtype...dtype='int64'), '塞拉库斯民族队': Int64Index([9], dtype='int64'), '多伦多猛龙队': Int64Index([73], dtype='int64...='int64'), '西雅图超音速队': Int64Index([33], dtype='int64'), '费城76人队': Int64Index([21, 37], dtype='int64'

1.5K1 1

多因子模型之因子（信号）测试平台----python中Pandas做处理时内存节省的技巧

1.查看dataframe占用空间例如，我们读取之前的所有行情和因子数据： data = pd.read_csv('total_data.csv', index_col=0) data.info...(memory_usage='deep') 首先，我们读取total_data.csv这个数据，并制定第一列是index，然后，我们获取一下这个dataframe这个对象在内存中的情况。...csv读取进来的时候，默认时间是str格式，这一格式在pandas中被存储为object格式，还是很占内存的。...3.修改数字其实，pandas在读取csv的时候，可以定义读取每一列的类型的，我们看到上面默认是float64，对于整数，默认是int64，知道一点计算机知识的都明白，很多时候我们是不需要这么float64...假设，我们一开始就定义好浮点数列的数据类型为float16 data = pd.read_csv('total_data.csv', index_col=0, dtype={'open': 'float16

1K4 0

关于巧克力数据集的数据分析数据读取数据预处理问题分析探索分析

数据集来自kaggle import numpy as np import pandas as pd 数据读取 dataset = pd.read_csv("..../flavors_of_cacao.csv") dataset.columns = dataset.columns.map(lambda x:x.replace("\n"," ")) dataset.columns...].map(lambda x:float(x.strip('%')) / 100) dataset_nona.info() Int64Index...x:x.replace(" ","")) best_coco.info() Int64Index: 1793 entries...2012 3.181701 2013 3.197011 2014 3.189271 2015 3.246491 2016 3.226027 2017 3.312500 dtype

1.1K7 0

科学计算库-Pandas随笔【附网络隐私闲谈】

([1, 2, 3, 4, 5], dtype='int64') 怎么取值？...(np.arange(12).reshape(4,3),index=[0,1,2,3],columns=['b','c','d']) df2 = DataFrame(np.arange(12).reshape...data = pd.read_csv('demo.CSV',skiprows=3) Out： Empty DataFrame Columns: [13, 433, 2] Index: [] 不加中括号，...6）指定读取行数【读大文件预览用】这里指定读取2行， data = pd.read_csv('demo.CSV',nrows=2) 7）转存为data.CSV文件，且替换默认分隔符为’|‘ data...②pandas CSV文件处理方法中谈到的索引默认指的是列索引【不是绝对的，Dataframe 有些方法既有index、又有 columns 时，index 表示行】。

2.9K18 0

实践应用|Python自动化连接FTP批量下载指定文件

pandas.csv()读取数据后，我们使用info可以发现原始日志包含了71个字段，同时单个文件200MB+38万条数据。。... (total 71 columns): # Column Non-Null Count Dtype --- ------ ...14 non-null int64 dtypes: float64(7), int64(54), object(10) memory usage: 7.9+ KB 选择需要用到的列实际上我们在后续处理中需要用到的列比较少...: 14 entries, 117184 to 384421 Data columns (total 5 columns): # Column Non-Null Count Dtype...>>>runfile('D:/ftp资源下载/ftp批量下载文件.py', wdir='D:/ftp资源下载') 正在读取原始对局日志......

9652 0

对不起！《唐人街探案3》和《你好，李焕英》相比，我更推荐《你好，李焕英》！

('D:\数据小刀\爬虫④\豆瓣_影评\唐人街探案3.csv') df2 = pd.read_csv('D:\数据小刀\爬虫④\豆瓣_影评\你好，李焕英.csv') ?...: 498 entries, 0 to 499 Data columns (total 5 columns): # Column Non-Null Count Dtype --- ----...0 to 499 Data columns (total 5 columns): # Column Non-Null Count Dtype --- ------ ----------...: 487 entries, 0 to 499 Data columns (total 5 columns): # Column Non-Null Count Dtype --- ----...十条短评中评分为‘推荐’（四星）的只有一个。 2、评分占比各个评分占比中，过一半占比为很差和较差，共占比69.88%： ?

3622 0

《Pandas 1.x Cookbook · 第二版》第03章创建和持久化DataFrame

使用dtype参数，设置读取的数值类型： >>> diamonds2 = pd.read_csv( ... "data/diamonds.csv", ......dtype('O') 因为CSV文件中包含日期的列，它是字符串。...-1 -1 更多如果压缩文件中只有一个文件，则read_csv方法还可以读取GZIP、BZ2和XZ文件。...HTML表格可以使用Pandas读取HTML中的表格： ?...url, match="List of studio albums", na_values="—" ... ) >>> len(dfs) 1 >>> dfs[0].columns Int64Index(

1.3K3 0

文本文件比对_文本文件格式有哪些

print('索引上+1就是比对的参数值') print('------data1数据源------') print(data1.columns...print(data1.ix[0:10]) print('------data2数据源------') print(data2.columns.../shell/merge.sh 2.txt 1 3.txt 1 result.csv debug 索引上+1就是比对的参数值 ------data1数据源------ Int64Index([0], dtype...='int64') 0 0 111-1116-3782 1 111-1120-5765 2 111-1114-6846 3 111-1121-1087 4...([0], dtype='int64') 0 0 111-1127-3269 1 111-1123-1863 2 111-1125-5555 3 111-

9352 0

【Python】这25个Pandas高频实用技巧，不得不服！

int64(1), object(1) memory usage: 13.6 KB 通过仅读取用到的两列，我们将DataFrame的空间大小缩小至13.6KB。...从剪贴板中创建DataFrame 假设你将一些数据储存在Excel或者Google Sheet中，你又想要尽快地将他们读取至DataFrame中。你需要选择这些数据并复制至剪贴板。...然后，你可以使用read_clipboard()函数将他们读取至DataFrame中： df = pd.read_clipboard() df 和read_csv()类似，read_clipboard...='int64', length=734) 或者"moives_2": movies_2.index.sort_values() Int64Index([ 1, 3, 4, 10, 12...136 Name: genre, dtype: int64 事实上我们在该Series中需要的是索引： counts.nlargest(3).index Index(['Drama', 'Comedy

6.4K4 0

pandas中的index对象详解

Int64Index([1, 2, 3, 4], dtype='int64') # 区间的长度 >>> a.length Int64Index([1, 1, 1, 1], dtype='int64')...([2020, 2020, 2020, 2020], dtype='int64') >>> a.month Int64Index([1, 1, 1, 1], dtype='int64') >>> a.day...Int64Index([1, 2, 3, 4], dtype='int64') >>> a.hour Int64Index([0, 0, 0, 0], dtype='int64') >>> a.minute...Int64Index([0, 0, 0, 0], dtype='int64') >>> a.second Int64Index([0, 0, 0, 0], dtype='int64') 5....([1, 2, 3, 4], dtype='int64') >>> a.month Int64Index([1, 1, 1, 1], dtype='int64') >>> a.year Int64Index

6.2K3 0

数据分析 ——— pandas基础（四）

([0, 6, 7, 10], dtype='int64'), 2: Int64Index([1, 2, 8, 11], dtype='int64'), 3: Int64Index([3, 4], dtype...([3], dtype='int64'), ('Kings', 2014): Int64Index([4], dtype='int64'), ('Kings', 2016): Int64Index...([6], dtype='int64'), ('Kings', 2017): Int64Index([7], dtype='int64'), ('Riders', 2014): Int64Index...([0], dtype='int64'), ('Riders', 2015): Int64Index([1], dtype='int64'), ('Riders', 2016): Int64Index...([8], dtype='int64'), ('Riders', 2017): Int64Index([11], dtype='int64'), ('Royals', 2014): Int64Index

1.1K4 0

《Pandas Cookbook》第02章 DataFrame基本操作1. 选取多个DataFrame列2. 对列名进行排序3. 在整个DataFrame上操作4. 串联DataFrame方法5. 在

3 object 11 dtype: int64 # 使用select_dtypes()，选取整数列 In[7]: movie.select_dtypes...对列名进行排序 # 读取movie数据集 In[12]: movie = pd.read_csv('data/movie.csv') In[13]: movie.head() Out[13]: ?...() Out[34]: bool 28 dtype: int64 更多 # movie数据集的对象数据包含缺失值。...: int64 # 统计每行的非缺失值个数 In[66]: college_ugds_.count(axis='columns').head() Out[66]: INSTNM Alabama...: int64 # 除了统计每行的非缺失值个数，也可以求和加以确认 In[67]: college_ugds_.sum(axis='columns').head() Out[67]: INSTNM

4.5K4 0

长文：一文掌握Pandas

(column labels) arguments. pd.DataFrame(data=None, index=None, columns=None, dtype=None...)...其表现如下代码片段所示 >>> index = pd.Index([2, 3, 5, 7, 11]) >>> index Int64Index([2, 3, 5, 7, 11], dtype='int64...') # operates like an array >>> index[::2] Int64Index([2, 5, 11], dtype='int64') # like numpy ndarray...([3, 5, 7], dtype='int64') >>> indexA.intersection(indexB) Int64Index([3, 5, 7], dtype='int64') Index...: int64 >>> s[(s > 0) & (s < 2)] 2 1 dtype: int64 # isin. the isin() method of Series returns a boolean

8214 0

Pandas 2.2 中文官方教程和指南（二十四）

使用pandas.read_csv()，您可以指定usecols来限制读入内存的列。并非所有可以被 pandas 读取的文件格式都提供读取子集列的选项。...: int64 一些读取器，比如pandas.read_csv()，在读取单个文件时提供了控制chunksize的参数。...使用pandas.read_csv()，您可以指定usecols来限制读入内存的列。并非所有可以被 pandas 读取的文件格式都提供了读取子集列的选项。...: int64 一些读取器，如pandas.read_csv()，在读取单个文件时提供控制chunksize的参数。..._check_indexing_error(key) KeyError: 'a' 要解决这个问题，可以制作一份副本，这样变异就不会应用于正在迭代的容器。

2710 0

机器学习测试笔记（2）——Pandas

子集分解等操作；直观地合并（merge）、**连接（join）**数据集；灵活地重塑（reshape）、**透视（pivot）**数据集；轴支持结构化标签：一个刻度支持多个标签；成熟的 IO 工具：读取文本文件...: int64 #1.3 通过序列创建DataFrame df1 = pd.DataFrame(s1,columns=["number"]) #指定列名 print("DataFrame1...([3, 4, 5, 6], dtype='int64') DataFrame 列名: Index(['A', 'B'], dtype='object') 3 排序数据 def sort_df(df...’) axis:若axis=0或’index’，则按照指定列中数据大小排序；若axis=1或’columns’，则按照指定索引中数据大小排序，默认axis=0 ascending:是否按指定列的数组升序排列...load_file(): data = pd.read_csv('my.csv') print("my.csv:\n",data) data.to_csv('my.csv',

1.5K3 0

《Pandas Cookbook》第04章选取数据子集1. 选取Series数据2. 选取DataFrame的行3. 同时选取DataFrame的行和列4. 用整数和标签选取数据5. 快速选取标量6

选取Series数据 # 读取college数据集，查看CITY的前5行 In[2]: college = pd.read_csv('data/college.csv', index_col='INSTNM...选取DataFrame的行 # 还是读取college数据集 In[14]: college = pd.read_csv('data/college.csv', index_col='INSTNM')...: int64 4....用整数和标签选取数据 # 读取college数据集，行索引命名为INSTNM In[33]: college = pd.read_csv('data/college.csv', index_col='...惰性行切片 # 读取college数据集；从行索引10到20，每隔一个取一行 In[50]: college = pd.read_csv('data/college.csv', index_col='

3.5K1 0

《Pandas 1.x Cookbook · 第二版》第02章 DataFrame基础运算

"director_name", ... ] >>> movie_actor_director = movies[cols] 如果没有使用列表，则会报KeyError错误。...=shorten) >>> movies.dtypes.value_counts() float64 13 int64 3 object 12 dtype: int64 使用....先读取数据，缩短列名： >>> movies = pd.read_csv("data/movie.csv") >>> def shorten(col): ......"_for_reviews", "" ... ) >>> movies = movies.rename(columns=shorten) 对下面的列名进行 >>> movies.columns...: int64

7011 0

数据分析实际案例之：pandas在餐厅评分数据中的使用

简介为了更好的熟练掌握pandas在实际数据分析中的应用，今天我们再介绍一下怎么使用pandas做美国餐厅评分数据的分析。...包含了一千多条数据，有5个属性，分别是： userID：用户ID placeID：餐厅ID rating：总体评分 food_rating：食物评分 service_rating：服务评分我们使用pandas来读取数据.../data/restaurant_rating_final.csv' df = pd.read_csv(path) df userID placeID rating food_rating service_rating...132715 1 1 0 1158 U1068 132733 1 1 0 1159 U1068 132594 1 1 1 1160 U1068 132660 0 0 0 1161 rows × 5 columns...>= 4] active_place Int64Index([132560, 132561, 132564, 132572, 132583, 132584, 132594, 132608,

1.6K2 0

《Pandas 1.x Cookbook · 第二版》第06章选取数据子集

6.1 选取Series数据读取大学数据集，使用校名作为索行引： >>> import pandas as pd >>> import numpy as np >>> college = pd.read_csv...---- 6.2 选取DataFrame行这一节和上节有点像，还是先读取数据： >>> college = pd.read_csv( ......: int64 更多下面两种操作等价： college.iloc[:10] college.iloc[:10, :] ---- 6.4 用整数和标签选取数据先读取数据： >>> college =..."data/college.csv", index_col="INSTNM" ... ) 使用.get_loc找到某一列的序号： >>> col_start = college.columns.get_loc...] ---- 6.5 按字母顺序切分先读取数据： >>> college = pd.read_csv( ...

3092 0

点击加载更多

扫码

添加站长进交流群

领取专属 10元无门槛券

手把手带您无忧上云

扫码加入开发者社群

相关资讯

热门标签

活动推荐

运营活动

活动名称

广告关闭