blocks|key|1751392|text|$+cat+file.txt
AA,A=14,B=356,C=845,D=4516
BB,A=65,C=255,D=841,E=5133,F=1428
CC,A=88,B=54,C=549,F=225

$+perl+-F,+-le+'@k=(A..F);
+++$op[0]=$F[0];+@op[1..6]=("-")x6;
+++$j=0;+for($i=1;$i<=$#F;){+if($F[$i]+=~+m/$k[$j%2B%2B]=/){$op[$j]=$F[$i];+$i%2B%2B}+}
+++print+join(",",@op)
+++'+file.txt
AA,A=14,B=356,C=845,D=4516,-,-
BB,A=65,-,C=255,D=841,E=5133,F=1428
CC,A=88,B=54,C=549,-,-,F=225|type|code-block|depth|inlineStyleRanges|entityRanges|data|syntax|javascript|1751393|-F,在,上拆分输入行并保存到@F数组|unordered-list-item|offset|length|style|CODE|1751394|-l从输入行中删除换行符，将换行符添加到输出|1751395|@k=(A..F);用A、B等对@k数组进行初始化，直至F|1751396|$op[0]=$F[0];+@op[1..6]=("-")x6;使用@F的第一个元素和剩下的6个元素作为-来初始化@op数组|1751397|循环遍历@F数组，如果元素与相应索引中的@k数组元素匹配，则更改@op元素。|1751398|print+join(",",@op)打印以,为分隔符的@op数组|1751399|unstyled|entityMap^0|0|0|3|4|1|F|2|0|0|2|0|0|A|B|1|D|1|G|2|S|1|0|0|W|Y|2|1G|1|1L|3|0|4|2|K|2|W|3|0|0|J|M|1|S|3|0^^$0|@$1|2|3|4|5|6|7|X|8|@]|9|@]|A|$B|C]]|$1|D|3|E|5|F|7|Y|8|@$G|Z|H|10|I|J]|$G|11|H|12|I|J]|$G|13|H|14|I|J]]|9|@]|A|$]]|$1|K|3|L|5|F|7|15|8|@$G|16|H|17|I|J]]|9|@]|A|$]]|$1|M|3|N|5|F|7|18|8|@$G|19|H|1A|I|J]|$G|1B|H|1C|I|J]|$G|1D|H|1E|I|J]|$G|1F|H|1G|I|J]|$G|1H|H|1I|I|J]]|9|@]|A|$]]|$1|O|3|P|5|F|7|1J|8|@$G|1K|H|1L|I|J]|$G|1M|H|1N|I|J]|$G|1O|H|1P|I|J]|$G|1Q|H|1R|I|J]]|9|@]|A|$]]|$1|Q|3|R|5|F|7|1S|8|@$G|1T|H|1U|I|J]|$G|1V|H|1W|I|J]|$G|1X|H|1Y|I|J]]|9|@]|A|$]]|$1|S|3|T|5|F|7|1Z|8|@$G|20|H|21|I|J]|$G|22|H|23|I|J]|$G|24|H|25|I|J]]|9|@]|A|$]]|$1|U|3|-4|5|V|7|26|8|@]|9|@]|A|$]]]|W|$]]

<pre><code>$ cat file.txt
AA,A=14,B=356,C=845,D=4516
BB,A=65,C=255,D=841,E=5133,F=1428
CC,A=88,B=54,C=549,F=225

$ perl -F, -le '@k=(A..F);
 $op[0]=$F[0]; @op[1..6]=("-")x6;
 $j=0; for($i=1;$i&lt;=$#F;){ if($F[$i] =~ m/$k[$j++]=/){$op[$j]=$F[$i]; $i++} }
 print join(",",@op)
 ' file.txt
AA,A=14,B=356,C=845,D=4516,-,-
BB,A=65,-,C=255,D=841,E=5133,F=1428
CC,A=88,B=54,C=549,-,-,F=225
</code></pre>

<ul>
<li><code>-F,</code> split input line on <code>,</code> and save to <code>@F</code> array</li>
<li><code>-l</code> removes newline from input line, adds newline to output</li>
<li><code>@k=(A..F);</code> initialize <code>@k</code> array with <code>A</code>, <code>B</code>, etc upto <code>F</code></li>
<li><code>$op[0]=$F[0]; @op[1..6]=("-")x6;</code> initalize <code>@op</code> array with first element of <code>@F</code> and remaining six elements as <code>-</code></li>
<li>for-loop iterates over <code>@F</code> array, if element matches with <code>@k</code> array element in corresponding index followed by <code>=</code>, change <code>@op</code> element</li>
<li><code>print join(",",@op)</code> print the <code>@op</code> array with <code>,</code> as separator</li>
</ul>

blocks|key|1751406|text|Perl来救我！|type|unstyled|depth|inlineStyleRanges|entityRanges|data|1751407|您还没有指定如何获取标头信息，因此在下面的脚本中，@header数组将直接填充。|1751408|%25to_idx散列将列名映射到它们的索引(A+=>+0、B+=>+1等)。|offset|length|style|CODE|1751409|每一行被分割成字段，每个字段与预期的字段($next)进行比较，并在需要时打印破折号。对于缺失的拖尾字段，情况也是如此。|1751410|#!/usr/bin/perl
use+warnings;
use+strict;

my+@header+=+qw(+A+B+C+D+E+F+);

my+%25to_idx+=+map+%2B($header[$_]+=>+$_),+0+..+$#header;

open+my+$IN,+'<',+shift+or+die+$!;
while+(<$IN>)+{
++++chomp;
++++my+@fields+=+split+/,/;
++++print+shift+@fields;
++++my+$next+=+0;
++++for+my+$field+(@fields)+{
++++++++my+($name,+$value)+=+split+/=/,+$field;
++++++++print+',-'+x+($to_idx{$name}+-+$next);
++++++++print+",$name=$value";
++++++++$next+=+$to_idx{$name}+%2B+1;
++++}
++++print+',-'+x+(1+%2B+$#header+-+$next);++#+Missing+trailing+fields.
++++print+"\n"
}|code-block|syntax|javascript|1751411|entityMap^0|0|0|0|7|0|L|5|0|0^^$0|@$1|2|3|4|5|6|7|S|8|@]|9|@]|A|$]]|$1|B|3|C|5|6|7|T|8|@]|9|@]|A|$]]|$1|D|3|E|5|6|7|U|8|@$F|V|G|W|H|I]]|9|@]|A|$]]|$1|J|3|K|5|6|7|X|8|@$F|Y|G|Z|H|I]]|9|@]|A|$]]|$1|L|3|M|5|N|7|10|8|@]|9|@]|A|$O|P]]|$1|Q|3|-4|5|6|7|11|8|@]|9|@]|A|$]]]|R|$]]

Perl to the rescue!

You haven't specified how to obtain the header information, so in the following script, the @header array is populated directly.

<code>%to_idx</code> hash maps the column names to their indices (A => 0, B => 1 etc.).

Each lines is split into fields, each field is compared to the expected one (<code>$next</code>) and dashes are printed if needed. The same happens for missing trailing fields.

<pre><code>#!/usr/bin/perl
use warnings;
use strict;

my @header = qw( A B C D E F );

my %to_idx = map +($header[$_] =&gt; $_), 0 .. $#header;

open my $IN, '&lt;', shift or die $!;
while (&lt;$IN&gt;) {
 chomp;
 my @fields = split /,/;
 print shift @fields;
 my $next = 0;
 for my $field (@fields) {
 my ($name, $value) = split /=/, $field;
 print ',-' x ($to_idx{$name} - $next);
 print ",$name=$value";
 $next = $to_idx{$name} + 1;
 }
 print ',-' x (1 + $#header - $next); # Missing trailing fields.
 print "\n"
}
</code></pre>

blocks|key|2400306|text|BEGIN+{++++++++++++++++++++++++++++++++++
++++PROCINFO["sorted_in"]="@ind_str_asc"+#+order+for+for(i+in+a)
++++for(i=65;i<=90;i%2B%2B)++++++++++++++++++#+create+the+whole+alphabet+to+array+a[]
++++++++a[sprintf("%25c",+i)]++++++++++++++#+you+could+read+the+header+and+use+that+as+well
}
{
++++split($0,b,",")++++++++++++++++++++++#+split+record+by+","
++++printf+"%25s",+b[1]++++++++++++++++++++#+printf+first+element+(AA,+BB...)
++++delete+b[1]++++++++++++++++++++++++++#+get+rid+of+it
++++for(i+in+b)+
++++++++b[substr(b[i],1,1)]=b[i]+++++++++#+take+the+first+letter+to+use+as+index+(A=12)
++++for(i+in+a)++++++++++++++++++++++++++#+go+thru+alphabet+and+printf+from+b[]
++++++++printf+"%25s%25s",+OFS,+(i+in+b?b[i]:"-");+print+""
}

awk+-v+OFS=\,+-f+parsing.awk+tbparsed.txt
AA,A=14,B=356,C=845,D=4516,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-
BB,A=65,-,C=255,D=841,E=5133,F=1428,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-
CC,A=88,B=54,C=549,-,-,F=225,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-|type|code-block|depth|inlineStyleRanges|entityRanges|data|syntax|javascript|2400307|它为记录中没有找到的每一个字母打印"-“。如果数据有头部，可以将split转换为二维数组b[NR]，并将for(i+in+a)更改为for(i+in+b[1])+...+printf+...+b[NR][b[1][i]]+...，如果不需要静态第一列，则删除第一个printf和delete。|unstyled|offset|length|style|CODE|2400308|entityMap^0|0|W|5|18|5|1G|B|1U|1C|3O|6|3V|6|0^^$0|@$1|2|3|4|5|6|7|M|8|@]|9|@]|A|$B|C]]|$1|D|3|E|5|F|7|N|8|@$G|O|H|P|I|J]|$G|Q|H|R|I|J]|$G|S|H|T|I|J]|$G|U|H|V|I|J]|$G|W|H|X|I|J]|$G|Y|H|Z|I|J]]|9|@]|A|$]]|$1|K|3|-4|5|F|7|10|8|@]|9|@]|A|$]]]|L|$]]

<pre><code>BEGIN { 
 PROCINFO["sorted_in"]="@ind_str_asc" # order for for(i in a)
 for(i=65;i&lt;=90;i++) # create the whole alphabet to array a[]
 a[sprintf("%c", i)] # you could read the header and use that as well
}
{
 split($0,b,",") # split record by ","
 printf "%s", b[1] # printf first element (AA, BB...)
 delete b[1] # get rid of it
 for(i in b) 
 b[substr(b[i],1,1)]=b[i] # take the first letter to use as index (A=12)
 for(i in a) # go thru alphabet and printf from b[]
 printf "%s%s", OFS, (i in b?b[i]:"-"); print ""
}

awk -v OFS=\, -f parsing.awk tbparsed.txt
AA,A=14,B=356,C=845,D=4516,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-
BB,A=65,-,C=255,D=841,E=5133,F=1428,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-
CC,A=88,B=54,C=549,-,-,F=225,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-,-
</code></pre>

It prints "-" for each letter not found in the record. If the data had a header, you could <code>split</code> to 2-D array <code>b[NR]</code> and change the <code>for(i in a)</code> to <code>for(i in b[1]) ... printf ... b[NR][b[1][i]] ...</code> and if you don't need the static first column, remove the first <code>printf</code> and <code>delete</code>.

I have file like:

<pre><code>AA,A=14,B=356,C=845,D=4516
BB,A=65,C=255,D=841,E=5133,F=1428
CC,A=88,B=54,C=549,F=225
</code></pre>

I never know if in the row missing A,B,C or D value. But I need to transform this file like:

<pre><code>AA,A=14,B=356,C=845,D=4516,-,-
BB,A=65,-,C=255,D=841,E=5133,F=1428
CC,A=88,B=54,C=549,-,-,F=225
</code></pre>

So if any value missing print just <code>-</code> mark. My plan is have the same number of columns to easy parsing. I am prefer awk solution. Thank you for any advice or help.

My first try was:

<pre><code>awk '{gsub(/[,]/, "\t")}; BEGIN{ FS = OFS = "\t" } { for(i=1; i&lt;=NF; i++) if($i ~ /^ *$/) $i = "-" }; {print $0}'
</code></pre>

But then I notice, that some values are missing.

EDIT:

From my header I know that there is value A,B,C,D,E,F...

How to find and print specific character in bash

翻译质量差，导致语言生硬或混乱。

没有提供实际的解决方法或示例。

解答不清晰，无法理解或解决问题。

页面排版不美观，阅读体验差。

文章

问答

视频

学习中心

腾讯云实验室

直播

竞赛

腾讯云代码分析专区

腾讯iOA零信任安全管理系统专区

腾讯云架构师技术同盟交流圈

腾讯云数据库专区

腾讯云顾问专区

腾讯云原生专区

腾讯混元专区

腾讯云TCE专区

腾讯云Lighthouse专区

腾讯云HAI专区

腾讯云Edgeone专区

腾讯云存储专区

腾讯云智能专区

腾讯轻联专区 

腾讯云开发专区

TAPD专区

腾讯轻量云游戏服专区

腾讯云最具价值专家

腾讯云架构师技术同盟

腾讯云创作之星

腾讯云开发者先锋

腾讯云代码助手

云原生构建

TAPD 敏捷项目管理

Cloud Studio

SDK中心

API中心

命令行工具

涵盖代码开发、场景应用、自动测试全流程，助你从零构建专属AI助手

一站式MCP教程库，解锁AI应用新玩法

我有这样的档案：AA,A=14,B=356,C=845,D=4516BB,A=65,C=255,D=841,E=5133,F=1428CC,A=88,B=54,C=549,F=225我从来不知道在行中是否遗漏了A，B，C或D值。但我需要转换这个文件如下：AA,A=14,B=356,C=845,D=4516,-,-BB,...

问如何在bash中查找和打印特定字符
EN

Stack Overflow用户

社区

活动

圈层

关于

腾讯云开发者

热门产品

热门推荐

更多推荐

问如何在bash中查找和打印特定字符EN