如何改进这个字符转换脚本？

Question

αԋɱҽԃ αмєяιcαη

Asked: 2020-02-14 08:23:11 +0800 CST2020-02-14 08:23:11 +0800 CST 2020-02-14 08:23:11 +0800 CST

根据第一列比较 2 个文件并打印不匹配的

772

文件#1：

test1,1
test2,2
test3

文件#2：

test2
test1
test4

期望的输出：

test4

5 个回答

Voted

terdon · Answer 1 · 2020-02-14T09:17:22+08:00

您可以grep为此使用：

$ grep -vwf <(cut -d, -f1 file1) file2
test4

解释

grep选项：

-v, --invert-match
      Invert the sense of matching, to select non-matching lines.
-w, --word-regexp
      Select  only  those  lines  containing  matches  that form 
      whole words.  
-f FILE, --file=FILE
      Obtain patterns from FILE, one per line.

因此，结合起来，grep -vwf patternFile inputFile意味着“从 patternFile 中找到那些在 inputFile 中永远不会作为整个单词出现的行”。

<(command)：这称为进程替换，在支持它的 shell（例如 bash）中，它本质上就像一个文件。这使我们能够将cut命令的输出用作 grep-f选项的“文件”。
cut -d, -f1 file1: 仅打印 file1 的第一个逗号分隔字段。

请注意，您可能希望使用-x（匹配整行）而不是仅-w当您的数据确实如您显示的那样：

  -x, --line-regexp
          Select  only  those  matches  that exactly match the whole line.

所以：

$ grep -vxf <(cut -d, -f1 file1) file2
test4

此外，如果您file1可以包含任何正则表达式字符（.、等） *，?您可能还想使用-F：

  -F, --fixed-strings
          Interpret PATTERNS as fixed strings, not regular expressions.

所以：

$ grep -Fvxf <(cut -d, -f1 file1) file2
test4

Freddy · Answer 2 · 2020-02-14T09:12:41+08:00

Freddy

2020-02-14T09:12:41+08:002020-02-14T09:12:41+08:00

使用cut和grep：

grep -F -x -v -f <(cut -d',' -f1 file1) file2

cut -d',' -f1 file1打印第一个字段file1并将grep输出用作模式输入文件（选项-f）。选项-F和-x用于匹配固定字符串和整行并-v反转匹配项。

2

francois P · Answer 3 · 2020-02-14T09:09:08+08:00

francois P

2020-02-14T09:09:08+08:002020-02-14T09:09:08+08:00

:~$ cat > toto
a b
c d
e f
:~$ cat > titi
a b
d e
f g
:~$ awk 'NR==FNR{c[$1]++;next};c[$1] == 0' toto titi
d e
f g

这只是我从示例列表中获得的一个示例，您可以使用它来解决您自己的需要。

1

bu5hman · Answer 4 · 2020-02-14T09:13:46+08:00

bu5hman

2020-02-14T09:13:46+08:002020-02-14T09:13:46+08:00

awk假设第一个字段包含file1文件名并且字段分隔符始终是,

awk -F"," 'NR==FNR{test[$1]=1}NR!=FNR{if (!test[$1]) print $1}' file1 file2

（见评论中@Terdon 精简版，然后结合我的

awk -F"," 'NR==FNR{test[$1]++}!test[$1]{print $1}' file1 file2

)

替代使用join

join -t, -v2 <(sort file1) <(sort file2)

1

RudiC · Answer 5 · 2020-02-14T08:29:11+08:00

RudiC

2020-02-14T08:29:11+08:002020-02-14T08:29:11+08:00

对于这个设置，

grep -ffile2 -v file1
test3

会做。但是 - 请注意例如需要采取额外措施的误报。

0

根据第一列比较 2 个文件并打印不匹配的

解释

模块 i915 可能缺少固件 /lib/firmware/i915/*

无法获取 jessie backports 存储库

如何将 GPG 私钥和公钥导出到文件

我们如何运行存储在变量中的命令？

如何配置 systemd-resolved 和 systemd-networkd 以使用本地 DNS 服务器来解析本地域和远程 DNS 服务器来解析远程域？

dist-upgrade 后 Kali Linux 中的 apt-get update 错误 [重复]

如何从 systemctl 服务日志中查看最新的 x 行

Nano - 跳转到文件末尾

grub 错误：你需要先加载内核

如何下载软件包而不是使用 apt-get 命令安装它？

根据第一列比较 2 个文件并打印不匹配的

5 个回答

解释

相关问题