(已解析)Regex用于选择(替换)单行中任何字符串的第三个和更多重复项

oxosxuxt 于 2023-01-27 发布在其他

关注(0)|答案(1)|浏览(135)

Regex是为文本到语音转换程序设计的，该程序需要处理包含延伸单词或可发音的场景中断的作品，如AAAAAAAAARRRRRGGHHH！！！！或XXXXXXXXXXXXXXXXXX，虽然阅读不是问题，但文本到语音转换程序在放弃发音后会读出每个字母。
文本到语音有一个支持正则表达式的发音调整，因为简单的查找和替换是不够的。
正则表达式需要找到重复3次或更多次的字符串，但实际上只选择（并因此替换）第三个或更多个这样的示例。
https://regex101.com/r/Z6zVOg/2我管理得最好的是这个(?|(?'a'.*)\k'a'(\1))\1我有很多采样线，每一个都应该匹配下面的行，但是似乎只有一部分工作，

The quick brown fox jumps over the lazy lazy lazy dog.
The quick brown fox jumps over the lazy lazy dog.
Attack Attack Attack Attack Attack Attack
Attack Attack 
Attack!!!!!!
Attack!!
WAAAAAAGGGGGGHHHHHH!!!
WAAGGHH!!
Attack Whatever Attack Attack
Attack Whatever Attack Attack

The quick brown fox jumps over the lazy lazy lazy dog.
The quick brown fox jumps over the lazy lazy dog.
Attack Attack
Attack Attack 
Attack!!
Attack!!
WAAGGHH!!!
WAAGGHH!!
Attack Whatever Attack Attack
Attack Whatever Attack Attack
This This This Friend Friend Friend
This This Friend Friend

编辑：虽然给出的两个解决方案确实能在Regex 101中工作，但它们似乎不能在@voice中工作，因此我目前正试图弄清楚它使用的是regex的哪一个变体。