我试图将一个变量中的所有路径同时提供给snakemake中的一个snakemake脚本,如下所示:
rule neo4j:
input:
script = 'python/neo4j.py',
path_to_cl = 'results/clusters/umap/{sample}_umap_clusters.csv',
path_to_umap = 'results/umap/{sample}_umap.csv',
path_to_mtx = 'data_files/normalized/{sample}.csv'
output: 'results/neo4j/{sample}/cells.csv', 'results/neo4j/{sample}/genes.csv',
'results/neo4j/{sample}/cl_nodes.csv', 'results/neo4j/{sample}/cl_contains.csv',
'results/neo4j/{sample}/cl_isin.csv', 'results/neo4j/{sample}/expr_by.csv',
'results/neo4j/{sample}/expr_ess.csv'
shell:
"python {input.script} -path_to_cl {input.path_to_cl} -path_to_umap {input.path_to_umap} -path_to_mtx {input.path_to_mtx} -output {output}"当我在访问output脚本中的python参数时,它只看到第一个路径:'results/neo4j/{sample}/cells.csv'。我也试着命名每条路,但它没有解决这个问题。如何将规则的output中的所有路径作为数组或字典传递,以便以后能够在python中访问它们
发布于 2018-08-31 14:12:06
如果我正确理解您的问题,您的问题是neo4j.py脚本不能为其-output参数接受多个文件: shell命令可能以完整的文件列表(使用snakemake的-p选项检查)结束,但是脚本只考虑了第一个文件。
如果确实如此,一种可能更简洁的方法是修改neo4j.py脚本的接口,以便它为每个输出文件使用一个参数。
然后,您将修改规则如下:
rule neo4j:
input:
script = 'python/neo4j.py',
path_to_cl = 'results/clusters/umap/{sample}_umap_clusters.csv',
path_to_umap = 'results/umap/{sample}_umap.csv',
path_to_mtx = 'data_files/normalized/{sample}.csv'
output:
cells = 'results/neo4j/{sample}/cells.csv',
genes = 'results/neo4j/{sample}/genes.csv',
nodes = 'results/neo4j/{sample}/cl_nodes.csv',
contains = 'results/neo4j/{sample}/cl_contains.csv',
isin = 'results/neo4j/{sample}/cl_isin.csv',
by = 'results/neo4j/{sample}/expr_by.csv',
ess = 'results/neo4j/{sample}/expr_ess.csv'
shell:
"""
python {input.script} \\
--path_to_cl {input.path_to_cl} \\
--path_to_umap {input.path_to_umap} \\
--path_to_mtx {input.path_to_mtx} \\
--cells {output.cells} \\
--genes {output.genes} \\
--nodes {output.nodes} \\
--contains {output.contains} \\
--isin {output.isin} \\
--by {output.by} \\
--ess {output.ess}
"""一些可能有用的python模块用于设置脚本的接口:
编辑
如果不希望将每个输入文件作为单独的参数传递,则只需传递输出目录,并让脚本从这个单个参数构建输出路径。给定您想要的文件名,这似乎是可能的:
rule neo4j:
input:
script = 'python/neo4j.py',
path_to_cl = 'results/clusters/umap/{sample}_umap_clusters.csv',
path_to_umap = 'results/umap/{sample}_umap.csv',
path_to_mtx = 'data_files/normalized/{sample}.csv'
output:
'results/neo4j/{sample}/cells.csv',
'results/neo4j/{sample}/genes.csv',
'results/neo4j/{sample}/cl_nodes.csv',
'results/neo4j/{sample}/cl_contains.csv',
'results/neo4j/{sample}/cl_isin.csv',
'results/neo4j/{sample}/expr_by.csv',
'results/neo4j/{sample}/expr_ess.csv'
shell:
"""
python {input.script} \\
--path_to_cl {input.path_to_cl} \\
--path_to_umap {input.path_to_umap} \\
--path_to_mtx {input.path_to_mtx} \\
--out_dir results/neo4j/{wildcards.sample}
"""发布于 2018-08-30 07:03:10
rule hello:
output:
"woot", "hoot"
run:
for f in output:
print(f)
print(output[1])印上“呜呜”、“呜呜”、“呜呜”。
https://stackoverflow.com/questions/52088953
复制相似问题