文章/答案/技术大牛

发布

社区首页 >问答首页 >xarray -将字符串存储为“string”数据类型，而不是Python2.7的“char”(n维字符数组)。

问xarray -将字符串存储为“string”数据类型，而不是Python2.7的“char”(n维字符数组)。
EN

Stack Overflow用户

提问于 2018-02-24 03:54:06

回答 1查看 1.6K关注 0票数 2

我使用xarray将文本文件转换为netCDF格式。当我使用netCDF4格式和Python3时，它将字符串变量存储为字符串，但当我使用Python2时，它将它们存储为n维字符数组。我尝试在编码中设置dtype='str‘，这并没有什么区别。是否有办法使这些变量具有字符串数据类型使用Python2？任何想法都将不胜感激。

这是我的代码：

import pandas as pd
import xarray as xr

column_names = ['timestamp', 'air_temp', 'vtempdiff', 'rh', 'pressure', 'wind_dir', 'wind_spd']

df = pd.read_csv(args.input_file, skiprows = 1, header=None, names = column_names)
ds = xr.Dataset.from_dataframe(df)

encoding = {'timestamp': {'dtype': 'str'},
            'air_temp': {'_FillValue': 9.96921e+36, 'dtype': 'f4'}
            }

ds.to_netcdf(op_file.nc, format = 'NETCDF4', unlimited_dims={'time':True}, encoding = encoding)

当我使用Python3.6对op_file.nc进行ncdump时，我得到：

netcdf op_file {
dimensions:
    time = UNLIMITED ; // (24 currently)
variables:
    string timestamp(time) ;
    float air_temp(time) ;
    .
    .
    .

当我使用Python2.7时，我得到：

netcdf op_file {
dimensions:
    time = UNLIMITED ; // (24 currently)
    string20 = 20 ;
variables:
    char timestamp(time, string20) ;
        timestamp:_Encoding = "utf-8" ;
    float air_temp(time) ;
    .
    .
    .

示例输入文件如下所示：

# Fields: stamp,AGO-4.air_temp,AGO-4.vtempdiff,AGO-4.rh,AGO-4.pressure,AGO-4.wind_dir,AGO-4.wind_spd
2016-11-30T00:00:00Z,-36.50,,56.00,624.60,269.00,5.80
2016-11-30T01:00:00Z,-35.70,,55.80,624.70,265.00,5.90

netcdf

python-xarray

netcdf4

xarray

python

Stack Overflow用户

回答已采纳

发布于 2018-03-01 17:04:12

Xarray将Python2的str/bytes类型映射到NetCDF的NC_CHAR类型。这两种类型都表示单个字节字符数据(通常是ASCII)，因此这具有一定的意义。

要获得netCDF字符串NC_STRING，您需要传递unicode数据(在Python3上的str)。您可以通过使用.astype(unicode)或在encoding中传递{'dtype': unicode}显式地将时间戳列强制到unicode来实现这一点。

票数 5

查看全部 1 条回答

页面原文内容由Stack Overflow提供。腾讯云小微IT领域专用引擎提供翻译支持

原文链接：

https://stackoverflow.com/questions/48959201

复制

相似问题

问xarray -将字符串存储为“string”数据类型，而不是Python2.7的“char”(n维字符数组)。
EN

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问xarray -将字符串存储为“string”数据类型，而不是Python2.7的“char”(n维字符数组)。EN

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问xarray -将字符串存储为“string”数据类型，而不是Python2.7的“char”(n维字符数组)。
EN