Question

$infracritical$

Asked: 2024-01-09 23:48:17 +0800 CST

Match an exact string (despite variants) and delete only that string

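I need to delete a specific, exact string from a file as part of a cleanup process I am implementing. The problem is that the file also contains variants that resemble the string I want to delete but are not identical to it.

For example, here is a sample of the file "sample":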

tmp2
tmp3
tmp0
tmp1
tmp3
tmp3
tmp3
tmp1.1
tmp3
tmp2
tmp3
tmp1.2
tmp4

I want to delete only "tmp1", not "tmp1.1" or "tmp1.2".

I am using a Perl one-liner:

perl -i -nle 'print if !/tmp1/' ./sample

Obviously, the one-liner is not quite right. It does delete "tmp1", but it also deletes "tmp1.1" and "tmp1.2".

Any ideas?

4 Answers

Bork · Answer 1 · 2024-01-09T23:50:07+08:00

Use anchors: ^ matches the start of the line and $ matches the end of the line.

$ perl -i -nle 'print if !/^tmp1$/' ./sample
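
For comparison, the effect of the anchors can be sketched in Python. This is only an illustration and not part of the original answer; the list is an abbreviated, assumed stand-in for the lines of the sample file.

```python
import re

# Abbreviated stand-in for the lines of the sample file (assumption).
lines = ["tmp0", "tmp1", "tmp1.1", "tmp1.2", "tmp3"]

# Unanchored: drops every line that merely contains "tmp1".
unanchored = [s for s in lines if not re.search(r"tmp1", s)]

# Anchored: ^ and $ force the pattern to cover the whole line,
# so only the line that is exactly "tmp1" is dropped.
anchored = [s for s in lines if not re.search(r"^tmp1$", s)]

print(unanchored)
print(anchored)
```

The unanchored filter loses tmp1.1 and tmp1.2, which is the behaviour described in the question; the anchored one keeps them.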

Ed Morton · Answer 2 · 2024-01-10T01:31:11+08:00

Using any awk in any shell on every Unix box, the following does a full-line string comparison and omits the lines that are equal to the string:

$ awk '$0 != "tmp1"' sample
tmp2
tmp3
tmp0
tmp3
tmp3
tmp3
tmp1.1
tmp3
tmp2
tmp3
tmp1.2
tmp4

Or using a variable:

$ awk -v str='tmp1' '$0 != str' sample
tmp2
tmp3
tmp0
tmp3
tmp3
tmp3
tmp1.1
tmp3
tmp2
tmp3
tmp1.2
tmp4

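See "How do I use shell variables in an awk script?" for more information.

Note that the above does a literal string comparison, so it works even if your target string contains regex metacharacters, for example: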

$ cat file
foo.bar1
foo.bar
foo bar

$ awk '$0 != "foo.bar"' file
foo.bar1
foo bar
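
The difference between a literal comparison and a regex match can also be sketched in Python, as an illustration that is not part of the original answer. It reuses the three lines of the example file above; the variable names are made up for the sketch.

```python
import re

lines = ["foo.bar1", "foo.bar", "foo bar"]
target = "foo.bar"

# Used as a regex, "." matches any character, so "foo bar" is dropped too.
as_regex = [s for s in lines if not re.fullmatch(target, s)]

# Plain equality treats the target literally.
as_literal = [s for s in lines if s != target]

# re.escape gives the same literal behaviour when a regex is required.
as_escaped = [s for s in lines if not re.fullmatch(re.escape(target), s)]

print(as_regex)
print(as_literal)
print(as_escaped)
```

Only the literal and the escaped variants keep "foo bar", matching the awk output above.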

Naval · Answer 3 · 2024-01-10T01:59:19+08:00

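In addition to the shell- and Perl-based commands, you could also try Python.

data = ["temp1", "temp1.1", "temp1.2", "temp2", "another_temp1.1"]
# Compare for equality; a substring test ("temp1" not in item) would also drop temp1.1 and temp1.2.
filtered_data = [item for item in data if item != "temp1"]
print(filtered_data)

The output will be: ['temp1.1', 'temp1.2', 'temp2', 'another_temp1.1']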
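
To apply the same idea to a file rather than a list, something along these lines might work. It is a minimal sketch, not part of the original answer: the helper name remove_exact_lines is hypothetical, and it assumes the file is small enough to read into memory and that lines end with "\n".

```python
from pathlib import Path


def remove_exact_lines(path, target):
    """Rewrite the file at `path` without the lines equal to `target`.

    Returns the number of lines removed. Hypothetical helper for illustration.
    """
    p = Path(path)
    lines = p.read_text().splitlines()
    # Whole-line equality, so "tmp1.1" is not affected when target is "tmp1".
    kept = [line for line in lines if line != target]
    p.write_text("".join(line + "\n" for line in kept))
    return len(lines) - len(kept)
```

Calling remove_exact_lines("sample", "tmp1") would then rewrite the sample file in place, roughly like the -i switch in the Perl one-liner.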

jubilatious1 · Answer 4 · 2024-01-10T03:46:53+08:00

Using Raku (formerly known as Perl_6)

Using Raku's m/…/ match operator:

~$ raku -ne '.put unless m/^ tmp1 $/;' sample.txt > tmp

Or, since you said "match and delete", you may prefer the s/// or S/// substitution operators (substituting with nothing):

~$ raku -e 'for lines.join("\n") {S:g/^^ tmp1 $$ \n//.put};'  sample.txt > tmp

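Raku is a programming language in the Perl family that provides advanced built-in support for Unicode. Two answers are given above, but as with Perl itself, TMTOWTDI applies, and other answers are conceivable.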

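As mentioned in the other answers, the key here is to use zero-width anchors: ^ for start of string, $ for end of string, ^^ for start of line, and $$ for end of line. More regex advice can be found at the links at the bottom.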
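
The string-versus-line distinction has a rough analogue in Python's re module, sketched here for readers who do not know Raku. It is an illustration only, not an equivalent of the Raku code above, and the text is a shortened, assumed stand-in for the sample file.

```python
import re

# Shortened stand-in for the contents of the sample file (assumption).
text = "tmp0\ntmp1\ntmp1.1\ntmp3\n"

# Default: ^ matches only at the start of the string and $ only at its end
# (or just before a final newline), so nothing is removed here.
whole_string = re.sub(r"^tmp1$\n?", "", text)

# re.MULTILINE: ^ and $ anchor at every line, comparable to ^^ and $$ in Raku.
per_line = re.sub(r"^tmp1$\n?", "", text, flags=re.MULTILINE)

print(repr(whole_string))
print(repr(per_line))
```

With the multi-line flag, only the tmp1 line and its newline are removed; tmp1.1 is untouched because $ cannot match before the dot.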

Sample input:

tmp2
tmp3
tmp0
tmp1
tmp3
tmp3
tmp3
tmp1.1
tmp3
tmp2
tmp3
tmp1.2
tmp4

Sample output:

tmp2
tmp3
tmp0
tmp3
tmp3
tmp3
tmp1.1
tmp3
tmp2
tmp3
tmp1.2
tmp4

https://docs.raku.org/language/regexes
https://docs.raku.org/language/regexes-best-practices
https://raku.org
