我有一个大约有 15 列的文本文件。字段之间用逗号分隔。作为描述的一列被双引号括起来,并且还有一些单词被双引号括起来。我需要保留开头和结尾的双引号,并仅删除内部双引号。
像这样的事情:
"Hi there, we are from XYZ team, we have an "Opportunity" at our organization"
我需要输出为:
"Hi there, we are from XYZ team, we have an Opportunity at our organization"
我不想继续Python编程。我一直在寻找 awk 命令或任何其他最佳选择。
该文件可能有 100 行数据,但此描述列对几行而非所有 100 行使用双引号。
这是一些示例数据:
invoice number,invoice date,vendor number,vendor site ID,supplier site CODE,invoice description,invoice currency code,invoice total amount,line number,line amount,line description,account code,business unit,business center,department,issue code,project,task number
1686,2024-03-28,258,9845,NEWYORK,CA Project: Content,USD,538,1,26,279.6,"Review new applications, and instruct the same.The deposits. Review correspondence applications. Review and applications. Research "Material Included" and artwork , and email. Communications with team website. Call, and communications.",230,,,,,295,10
我必须删除行描述中“包含材料”的双引号。
请注意:我需要整个文件并保留所有列,但只需删除行描述值中的内部双引号。只有行描述字段具有此类内部双引号值。就目前而言,只有一个内部双引号单词用于文件的行描述,我们还没有注意到超过一个。