Not sure why Regex.Replace() does not work when using a dictionary of regex patterns (capture groups involved)

6l7fqoea, posted on 2022-12-14 in Other

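I want to use a regular expression to replace "100¢" with "100 cents". The pattern I am using is (\d+)(¢). I also want to replace other things, so I need a Dictionary that holds all the regex patterns as keys and the replacement strings as values.
The code I currently have is: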

var replacementsMap = new Dictionary<string, string>()
{
    {@"(\d+)(¢)", "$1 cents"}
};

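The dictionary will contain more entries, but for simplicity I have added only one pattern-value pair. I use a backreference so that the first capture group is followed by "cents" instead of the symbol.
For example: 5¢ -> 5 cents
To do the replacement, I do this: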

string input = "100¢";
Console.WriteLine(input); // showing original input

var regex = new Regex(String.Join("|", replacementsMap.Keys));

var newStr = regex.Replace(input, m => replacementsMap[m.Value]);
Console.WriteLine(newStr); // showing new input

This is the error I get, and I am not sure what is wrong with my implementation:

Unhandled exception. System.Collections.Generic.KeyNotFoundException: The given key '100¢' was not present in the dictionary.
   at System.Collections.Generic.Dictionary`2.get_Item(TKey key)
   at Program.<>c__DisplayClass1_0.<Main>b__0(Match m) in Program.cs:line 79
   at System.Text.RegularExpressions.Regex.<>c.<Replace>b__99_0(ValueTuple`5& state, Match match)
   at System.Text.RegularExpressions.Regex.RunAllMatchesWithCallback[TState](String inputString, ReadOnlySpan`1 inputSpan, Int32 startat, TState& state, MatchCallback`1 callback, RegexRunnerMode mode, Boolean reuseMatchObject)
   at System.Text.RegularExpressions.Regex.RunAllMatchesWithCallback[TState](String input, Int32 startat, TState& state, MatchCallback`1 callback, RegexRunnerMode mode, Boolean reuseMatchObject)   
   at System.Text.RegularExpressions.Regex.Replace(MatchEvaluator evaluator, Regex regex, String input, Int32 count, Int32 startat)
   at System.Text.RegularExpressions.Regex.Replace(String input, MatchEvaluator evaluator)
   at Program.Main() in Program.cs:line 79

f87krz0w #1

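The problem is that a match carries no information about which of the original patterns produced it. m.Value is the matched text ("100¢"), not the pattern (@"(\d+)(¢)") that you used as the dictionary key, so the dictionary lookup fails.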
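To make the failed lookup concrete, here is a minimal sketch. The class and method names (MatchValueDemo, KeyUsedForLookup) are hypothetical and only serve the illustration; the pattern and dictionary entry are taken from the question.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

public static class MatchValueDemo
{
    // Returns the string the question's evaluator would use as the dictionary key.
    public static string KeyUsedForLookup(string input, string pattern)
    {
        Match m = Regex.Match(input, pattern);
        return m.Value; // the matched text, not the pattern
    }

    public static void Main()
    {
        var replacementsMap = new Dictionary<string, string>
        {
            { @"(\d+)(¢)", "$1 cents" }
        };

        string key = KeyUsedForLookup("100¢", @"(\d+)(¢)");

        // The dictionary is keyed by pattern, so the matched text is not found.
        Console.WriteLine(replacementsMap.ContainsKey(key));            // False
        Console.WriteLine(replacementsMap.ContainsKey(@"(\d+)(¢)"));    // True
    }
}
```

This is why the indexer in the question throws KeyNotFoundException for the key '100¢'.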
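Solution: when combining the patterns into a single pattern, wrap each one in a named capture group. The name is derived from the pattern's index in the list of patterns.
In the evaluator you can then find which named group succeeded, use the auto-generated name to retrieve the original pattern and its replacement from the list, and apply that single pattern to the matched value.
Sample code: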

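using System;
using System.Collections.Generic;
using System.Linq;
using System.Text;
using System.Text.RegularExpressions;

string input = "I have 5$ and 4€ and 6¢";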

// Use a List instead of Dictionary so we can retrieve the entries by index
List<(string pattern, string replacement)> replacementInstructions = new List<(string pattern, string replacement)> {
    (@"(\d+)(¢)", "$1 cents"),
    (@"(\d+)(€)", "$1 euros"),
    (@"(\d+)(\$)", "$1 dollars"),
};

// Create combined pattern with auto-named groups
StringBuilder builder = new StringBuilder();

for(int i=0; i < replacementInstructions.Count; i++)
{
    if(i > 0) builder.Append("|");

    var (pattern, _) = replacementInstructions[i];

    string groupName = "GN" + i;
    builder.Append("(?<" + groupName + ">" + pattern + ")");
}

string combinedPattern = builder.ToString();
Console.WriteLine("Combined Pattern: " + combinedPattern);

// Declare callback that will do the replacement
MatchEvaluator evaluator = (Match match) =>
{
    // Get named group that matched and its name
    Group group = (from Group g in match.Groups
                   where g.Success &&
                   g.Name.StartsWith("GN")
                   select g).First();
    string groupName = group.Name;

    // Get number from groupname 
    // and then entry from replacementInstructions based on number
    string numberString = groupName.Substring(2);
    int number = int.Parse(numberString);
    var (pattern, replacement) = replacementInstructions[number];

    // Apply replacement pattern on match
    return Regex.Replace(match.Value, pattern, replacement);
};

// Replace
string result = Regex.Replace(input, combinedPattern, evaluator);

Console.WriteLine("Result: " + result);

Output:

Combined Pattern: (?<GN0>(\d+)(¢))|(?<GN1>(\d+)(€))|(?<GN2>(\d+)(\$))
Result: I have 5 dollars and 4 euros and 6 cents
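One detail in the evaluator above is worth spelling out: the string returned by a MatchEvaluator is inserted literally, so substitutions such as $1 are not expanded in it. That is why the callback runs Regex.Replace again on match.Value with the single pattern instead of returning the replacement string directly. A minimal sketch of the difference, using hypothetical names (EvaluatorDemo, LiteralReturn, ReapplyPattern):

```csharp
using System;
using System.Text.RegularExpressions;

public static class EvaluatorDemo
{
    // The evaluator returns the replacement string as-is: "$1" is NOT expanded.
    public static string LiteralReturn(string input) =>
        Regex.Replace(input, @"(\d+)(¢)", m => "$1 cents");

    // The evaluator re-applies the single pattern to the matched text,
    // so "$1" is resolved against that pattern's own groups.
    public static string ReapplyPattern(string input) =>
        Regex.Replace(input, @"(\d+)(¢)",
            m => Regex.Replace(m.Value, @"(\d+)(¢)", "$1 cents"));

    public static void Main()
    {
        Console.WriteLine(LiteralReturn("6¢"));   // $1 cents
        Console.WriteLine(ReapplyPattern("6¢"));  // 6 cents
    }
}
```

Re-applying the single pattern also sidesteps the fact that group numbers shift once the patterns are joined into one combined pattern. Note that this assumes each pattern still matches when applied to the matched text alone, which may not hold for patterns that rely on anchors or lookarounds.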
