문제

I create a Report List with Winrar.
Inside this list i have a text list like this

<tag>Adventures of Shuggy</tag>
!Shuggy.png
!Sound Bank.txt
4.lwav
5.lwav
6.lwav
88.lwav
89.lwav
<tag>Adventures of Jack</tag>
90.lwav
91.lwav
92.lwav
93.lwav
!Sound Bank.xsb

I want remove duplicates extensions inside every tag and have a text like this:

<tag>Adventures of Shuggy</tag>
!Shuggy.png
!Sound Bank.txt
4.lwav
<tag>Adventures of Jack</tag>
90.lwav
!Sound Bank.xsb

or even better

<tag>Adventures of Shuggy</tag>
.png
.txt
.lwav
 <tag>Adventures of Jack</tag>
.xsb

Is there a regular expression to use on notepad++ to remove lines with identical .txt or identical .lwav inside every tag like <title>?
Can i use excel for this?

도움이 되었습니까?

해결책

Tossed this up real quick. Should work fine in notepad++. Make sure to set the global flag (if there is one) and the multiline one.

/^.+(\.[^.]+)$(?=\s*(?:(?!<tag>)[^.])+\1)|^(?!<tag>)[^.]+/gm

You should (most likely) not use the flags and delimiters like I have above. I'm sure you'll figure it out.

Explanation + demo : http://regex101.com/r/lC0lD1

라이센스 : CC-BY-SA ~와 함께 속성
제휴하지 않습니다 StackOverflow
scroll top