从文本文件传递变量的奇怪问题

Question

Kintaro

Asked: 2018-11-30 04:01:42 +0800 CST2018-11-30 04:01:42 +0800 CST 2018-11-30 04:01:42 +0800 CST

grep 使用数组值并使其更快

772

array[1] 是从 30k 行 CSV 中提取的字符串：示例：

samsung black 2014

我需要将这些行与数组（arrayItems）中包含的值之一匹配。

arrayItems 包含 221 个值，例如：

apple
sony
samsung

实际脚本：

while IFS=$';' read -r -a array
do
    mapfile -t arrayItems < $itemsFile
    ## now loop through the above array
    for itemToFind in "${arrayItems[@]}"
    do
       itemFound=""
       itemFound="$(echo ${array[1]} | grep -o '^$itemToFind')"
       if [ -n "$itemFound" ] 
       then 
          echo $itemFound 
          # so end to search in case the item is found
          break
       fi
    done
   # here I do something with ${array[2]}, ${array[4]} line by line and so on, 
   # so I can't match the whole file $file_in at once but online line by line.
done < $file_in

问题是 grep 不匹配。

但如果我尝试像这样对 $itemToFind 进行硬编码：

itemFound="$(echo ${array[1]} | grep -o '^samsung')"

另一件事是......如何更快地做到这一点，因为 $file_in 是 30k 行 CSV？

2 个回答

Voted

apapillon · Answer 1 · 2018-11-30T04:19:07+08:00

Best Answer

apapillon

2018-11-30T04:19:07+08:002018-11-30T04:19:07+08:00

您可以将 grep 与文件模式选项 (-f) 一起使用

例子：

$ echo -e "apple\nsony\nsamsung" > file_pattern
$ grep -f file_pattern your.csv

编辑：针对您的新限制：

sed 's/^/\^/g' $itemsFile > /tmp/pattern_file
while IFS=$';' read -r -a array
do
    echo ${array[1]} | grep -q -f /tmp/pattern_file.txt
    if [ $? -eq 0 ]; then 
        # here I do something with ${array[2]}, ${array[4]} line by line and so on, 
        # so I can't match the whole file $file_in at once but online line by line.
    fi
done < $file_in

2

lauhub · Answer 2 · 2018-11-30T04:18:08+08:00

lauhub

2018-11-30T04:18:08+08:002018-11-30T04:18:08+08:00

您的脚本中有两个错误：

grep 尝试匹配字符串$itemToFind，因为您将它放在单引号之间'。请改用双引号。
您正在使用索引 1 中的数组，同时help read告诉它从零开始。

这应该给出：

while IFS=$';' read -r -a array
do
    mapfile -t arrayItems < $itemsFile
    ## now loop through the above array
    for itemToFind in "${arrayItems[@]}"
    do
       itemFound=""
       itemFound=$(echo ${array[0]} | grep -o "$itemToFind")
       if [ -n "$itemFound" ] 
       then 
          echo $itemFound 
          # so end to search in case the item is found
          break
       fi
    done
done < $file_in

编辑：

如果你想让它更快，你可以使用扩展的正则表达式：

grep -E 'apple|sony|samsung' $file_in

如果您只想显示品牌：

grep -E 'apple|sony|samsung' $file_in | awk '{print $1}'

1

grep 使用数组值并使其更快

如何将 GPG 私钥和公钥导出到文件

ssh 无法协商：“找不到匹配的密码”，正在拒绝 cbc

我们如何运行存储在变量中的命令？

如何配置 systemd-resolved 和 systemd-networkd 以使用本地 DNS 服务器来解析本地域和远程 DNS 服务器来解析远程域？

如何卸载内核模块“nvidia-drm”？

dist-upgrade 后 Kali Linux 中的 apt-get update 错误 [重复]

如何从 systemctl 服务日志中查看最新的 x 行

Nano - 跳转到文件末尾

grub 错误：你需要先加载内核

如何下载软件包而不是使用 apt-get 命令安装它？

grep 使用数组值并使其更快

2 个回答

相关问题