sed 中的行号和否定搜索组合

Question

Ryan

Asked: 2024-11-21 05:17:19 +0800 CST2024-11-21 05:17:19 +0800 CST 2024-11-21 05:17:19 +0800 CST

检测字符变量文本字符串中的单词并根据该单词的存在创建变量 SAS

772

您好，抱歉标题太长了！我正在处理一些包含长文本字符串的数据（一些观察结果最多有 2000 个字符）。这些字符串中可能有一个单词（AB/CD），该单词可能位于字符串中的任何位置。我试图检测文本字符串中的 AB/CD，并创建一个二进制变量（ABCD_present），如果该单词出现在文本中。

以下是一些示例数据

data test;
length status $175;
infile datalines dsd dlm="|" truncover;
input ID Status$;

datalines;
1|This is example text I am using instead of real data. I am making the length of this text longer to mimic the long text strings of my data AB/CD
2|This is example AB/CD text I am using instead of real data. I am making the length of this text longer to mimic the long text strings of my data
3|This is example text I am using instead of real data. I AB/CD am making the length of this text longer to mimic the long text strings of my data
4|This is example text I am using instead of real data. I am making the length of this text longer to mimic the long text strings of my data
5|This is example text I am using instead of real data. I am making the length of this text longer to mimic the long text strings of my data
6|This is example text I am using instead of real data. I am making the length of this text longer to AB/CD mimic the long text strings of my data

;
run;

任何有关这方面的指导都非常好！我没有太多使用长文本字符串的经验。

先感谢您

2 个回答

Voted

Stu Sztukowski · Answer 1 · 2024-11-21T05:27:08+08:00

Best Answer

Stu Sztukowski

2024-11-21T05:27:08+08:002024-11-21T05:27:08+08:00

您可以使用该find功能。

data want;
    set test;
    flag_abcd = (find(status, 'AB/CD') > 0);
run;

Status ID   flag_abcd
...    1    1
...    2    1
...    3    1
...    4    0
...    5    0
...    6    1

1

Richard · Answer 2 · 2024-11-21T19:19:10+08:00

Richard

2024-11-21T19:19:10+08:002024-11-21T19:19:10+08:00

另外两个检测子字符串是否存在的函数INDEX是PRXMATCH

flag = index (status, 'AB/CD') > 0 ;
flag = prxmatch ('m/AB\/CD/', status) > 0 ;

0

检测字符变量文本字符串中的单词并根据该单词的存在创建变量 SAS

Vue 3：创建时出错“预期标识符但发现‘导入’”[重复]

为什么这个简单而小的 Java 代码在所有 Graal JVM 上的运行速度都快 30 倍，但在任何 Oracle JVM 上却不行？

具有指定基础类型但没有枚举器的“枚举类”的用途是什么？

如何修复未手动导入的模块的 MODULE_NOT_FOUND 错误？

`(表达式，左值) = 右值` 在 C 或 C++ 中是有效的赋值吗？为什么有些编译器会接受/拒绝它？

何时应使用 std::inplace_vector 而不是 std::vector？

在 C++ 中，一个不执行任何操作的空程序需要 204KB 的堆，但在 C 中则不需要

PowerBI 目前与 BigQuery 不兼容：Simba 驱动程序与 Windows 更新有关

AdMob：MobileAds.initialize() - 对于某些设备，“java.lang.Integer 无法转换为 java.lang.String”

我正在尝试仅使用海龟随机和数学模块来制作吃豆人游戏

检测字符变量文本字符串中的单词并根据该单词的存在创建变量 SAS

2 个回答

相关问题