在另一个文件之后逐行追加行

Question

Ramón Wilhelm

Asked: 2022-01-30 05:33:55 +0800 CST2022-01-30 05:33:55 +0800 CST 2022-01-30 05:33:55 +0800 CST

AWK：在源词之后插入目标词的快速方法

772

我不熟悉awk。为了在 198058 随机行中的源术语之后插入单个目标术语，我在此处有此代码

awk -i inplace '(NR==FNR){a[$1];next}
    (FNR in a) && gsub(/\<Source Term\>/,"& Target Term")
     1
    ' <(shuf -n 198058 -i 1-$(wc -l < file)) file

file包含这样的句子行

David has to eat his vegetables .
This weather is very cold .
Can you please stop this music ? This is terrible music .
The teddy bear is very plushy .
I must be going !

例如，如果我想在“天气”之后插入“Wetter”这个词，那么某行会是这样的

This weather Wetter is very cold .

如何重写代码，所以我只需要包含两个不同的文件，其中包含源术语和目标术语的列表？

假设源术语文件被调用sourceterms，目标术语文件被调用targetterms。

如果sourceterms包含这些术语的列表

vegetables
weather
terrible
plushy
going

并targetterms包含这些条款

Gemüse
Wetter
schreckliche
flauschig
gehen

我希望我的代码检查每一行file是否包含源术语并在其后插入目标术语，因此我的代码file如下所示：

David has to eat his vegetables Gemüse .
This weather Wetter is very cold .
Can you please stop this music ? This is terrible schreckliche music .
The teddy bear is very plushy flauschig.
I must be going gehen!

是否可以重写上面的代码？

1 个回答

Voted

Ed Morton · Answer 1 · 2022-01-30T06:56:29+08:00

Best Answer

Ed Morton

2022-01-30T06:56:29+08:002022-01-30T06:56:29+08:00

将 GNU awk（OP 正在使用）用于 ARGIND 和字边界：

$ cat tst.awk
ARGIND == 1 { olds[FNR] = "\\<" $1 "\\>"; next }
ARGIND == 2 { map[olds[FNR]] = "& " $1; next }
{
    for ( old in map ) {
        new = map[old]
        gsub(old,new)
    }
    print
}

$ awk -f tst.awk sourceterms targetterms file
David has to eat his vegetables Gemüse .
This weather Wetter is very cold .
Can you please stop this music ? This is terrible schreckliche music .
The teddy bear is very plushy flauschig .
I must be going gehen !

以上假设您的源不包含任何正则表达式元字符，并且您的替换文本不包含&反向引用元字符。它还假设如果相同的单词同时出现在源和目标中，您并不关心替换发生的顺序。

2

AWK：在源词之后插入目标词的快速方法

模块 i915 可能缺少固件 /lib/firmware/i915/*

无法获取 jessie backports 存储库

如何将 GPG 私钥和公钥导出到文件

我们如何运行存储在变量中的命令？

如何配置 systemd-resolved 和 systemd-networkd 以使用本地 DNS 服务器来解析本地域和远程 DNS 服务器来解析远程域？

dist-upgrade 后 Kali Linux 中的 apt-get update 错误 [重复]

如何从 systemctl 服务日志中查看最新的 x 行

Nano - 跳转到文件末尾

grub 错误：你需要先加载内核

如何下载软件包而不是使用 apt-get 命令安装它？

AWK：在源词之后插入目标词的快速方法

1 个回答

相关问题