AskOverflow.Dev

AskOverflow.Dev Logo AskOverflow.Dev Logo

AskOverflow.Dev Navigation

  • 主页
  • 系统&网络
  • Ubuntu
  • Unix
  • DBA
  • Computer
  • Coding
  • LangChain

Mobile menu

Close
  • 主页
  • 系统&网络
    • 最新
    • 热门
    • 标签
  • Ubuntu
    • 最新
    • 热门
    • 标签
  • Unix
    • 最新
    • 标签
  • DBA
    • 最新
    • 标签
  • Computer
    • 最新
    • 标签
  • Coding
    • 最新
    • 标签
主页 / computer / 问题 / 1735120
Accepted
DynV
DynV
Asked: 2022-08-04 03:49:48 +0800 CST2022-08-04 03:49:48 +0800 CST 2022-08-04 03:49:48 +0800 CST

忽略时间戳,如何删除非聊天重复项?

  • 772

忽略时间戳,如何删除以下非聊天重复项?聊天有 2 种格式

  1. 以尖括号包围的昵称开头,并且
  2. 以昵称开头,后跟“告诉您:”。

如果它是在 Notepad++ 中完成的,我更喜欢,但由于 Cygwin,我还可以访问多个实用程序。

原来的

[16:29] You see a sheep; it looks like it weighs about 98.
[16:30] You see a sheep; it looks like it weighs about 100.
[16:52] anonymized tells you: Do you know the bank yet?
[17:11] Only anonymized may access the corpse for now.
[17:12] Only anonymized may access the corpse for now.
[17:14] <anonymized> You can do it later.
[17:14] <anonymized> The dagger for example
[17:15] <anonymized> The dagger for example
[17:15] <dynv> hi
[17:32] gnome has been killed by anonymized and dynv
[17:32] The corpse is too far away.
[17:32] The corpse is too far away.
[17:33] anonymized: now is gets dangerous

期望的结果

[16:29] You see a sheep; it looks like it weighs about 98.
[16:30] You see a sheep; it looks like it weighs about 100.
[16:52] anonymized tells you: Do you know the bank yet?
[17:11] Only anonymized may access the corpse for now.
[17:14] <anonymized> You can do it later.
[17:14] <anonymized> The dagger for example
[17:15] <anonymized> The dagger for example
[17:15] <dynv> hi
[17:32] gnome has been killed by anonymized and dynv
[17:32] The corpse is too far away.
[17:33] anonymized: now is gets dangerous

非常感谢你

notepad++ cygwin
  • 2 2 个回答
  • 35 Views

2 个回答

  • Voted
  1. Best Answer
    Toto
    2022-08-04T06:48:31+08:002022-08-04T06:48:31+08:00
    • Ctrl+H
    • 找什么:^\[.+?] (?!<\w+>|\w+ tells you:)(.+)\K\R\[.+?] \1
    • 用。。。来代替:EMPTY
    • TICK 环绕
    • 选择 正则表达式
    • 取消勾选 . matches newline
    • Replace all

    解释:

    ^                   # beginning of line
        \[.+?]              # time stamp in square brackets followed by a space
        (?!                 # negative lookahead, make sure we haven't after:
            <\w+>               # nickname surrounded by angle brackets 
          |                   # OR
            \w+ tells you:      # nickname followed by " tells you:"
        )                   # end lookahead
        (.+)                # group 1, 1 or more any character but newline, (the text)
        \K                  # forget all we have seen until this position
    \R                  # any kind of linebreak
        \[.+?]              # time stamp in square brackets followed by a space
        \1                  # back reference to group 1 (the text)
    

    截图(之前):

    在此处输入图像描述

    截图(之后):

    在此处输入图像描述

    • 2
  2. user1686
    2022-08-04T20:27:29+08:002022-08-04T20:27:29+08:00

    我通常认为程序方法比完全基于正则表达式的魔法更容易理解,即使它需要不止一行:

    #!/usr/bin/env python3
    import re
    import sys
    
    prev = None
    for line in sys.stdin:
        line = line.strip()
        if m := re.search(r"^\[\d+:\d+\] (.+)$", line):
            text = m.group(1)
            if re.search(r"^<\S+> ", text):
                print(line)
            elif re.search(r"^\S+ tells you: ", text):
                print(line)
            elif text != prev:
                print(line)
            prev = text
        else:
            print(line)
    
    #!/usr/bin/env perl
    
    my $last = "";
    while (my $line = <>) {
        if ($line =~ /^\[\d+:\d+\] (.+)/) {
            my $text = $1;
            if ($text =~ /^<\S+> / || $text =~ /^\S+ tells you:/) {
                print $line;
            } else {
                print $line unless ($text eq $last);
            }
            $last = $text;
        } else {
            print $line;
        }
    }
    
    • 0

相关问题

  • 在 Notepad++“另存为...”对话框中保留当前文件类型/扩展名

  • 计算文本字符串中的逗号并添加新行

  • 删除信息

  • Notepad++ 删除直到冒号替换所有行

  • NotePad++ 用户定义语言不显示条件语句

Sidebar

Stats

  • 问题 205573
  • 回答 270741
  • 最佳答案 135370
  • 用户 68524
  • 热门
  • 回答
  • Marko Smith

    如何减少“vmmem”进程的消耗?

    • 11 个回答
  • Marko Smith

    从 Microsoft Stream 下载视频

    • 4 个回答
  • Marko Smith

    Google Chrome DevTools 无法解析 SourceMap:chrome-extension

    • 6 个回答
  • Marko Smith

    Windows 照片查看器因为内存不足而无法运行?

    • 5 个回答
  • Marko Smith

    支持结束后如何激活 WindowsXP?

    • 6 个回答
  • Marko Smith

    远程桌面间歇性冻结

    • 7 个回答
  • Marko Smith

    子网掩码 /32 是什么意思?

    • 6 个回答
  • Marko Smith

    鼠标指针在 Windows 中按下的箭头键上移动?

    • 1 个回答
  • Marko Smith

    VirtualBox 无法以 VERR_NEM_VM_CREATE_FAILED 启动

    • 8 个回答
  • Marko Smith

    应用程序不会出现在 MacBook 的摄像头和麦克风隐私设置中

    • 5 个回答
  • Martin Hope
    Saaru Lindestøkke 为什么使用 Python 的 tar 库时 tar.xz 文件比 macOS tar 小 15 倍? 2021-03-14 09:37:48 +0800 CST
  • Martin Hope
    CiaranWelsh 如何减少“vmmem”进程的消耗? 2020-06-10 02:06:58 +0800 CST
  • Martin Hope
    Jim Windows 10 搜索未加载,显示空白窗口 2020-02-06 03:28:26 +0800 CST
  • Martin Hope
    v15 为什么通过电缆(同轴电缆)的千兆位/秒 Internet 连接不能像光纤一样提供对称速度? 2020-01-25 08:53:31 +0800 CST
  • Martin Hope
    andre_ss6 远程桌面间歇性冻结 2019-09-11 12:56:40 +0800 CST
  • Martin Hope
    Riley Carney 为什么在 URL 后面加一个点会删除登录信息? 2019-08-06 10:59:24 +0800 CST
  • Martin Hope
    zdimension 鼠标指针在 Windows 中按下的箭头键上移动? 2019-08-04 06:39:57 +0800 CST
  • Martin Hope
    jonsca 我所有的 Firefox 附加组件突然被禁用了,我该如何重新启用它们? 2019-05-04 17:58:52 +0800 CST
  • Martin Hope
    MCK 是否可以使用文本创建二维码? 2019-04-02 06:32:14 +0800 CST
  • Martin Hope
    SoniEx2 更改 git init 默认分支名称 2019-04-01 06:16:56 +0800 CST

热门标签

windows-10 linux windows microsoft-excel networking ubuntu worksheet-function bash command-line hard-drive

Explore

  • 主页
  • 问题
    • 最新
    • 热门
  • 标签
  • 帮助

Footer

AskOverflow.Dev

关于我们

  • 关于我们
  • 联系我们

Legal Stuff

  • Privacy Policy

Language

  • Pt
  • Server
  • Unix

© 2023 AskOverflow.DEV All Rights Reserve