Swift Regex Cheat Sheet

          
    let bitcoinAddress_v1 = /([13][a-km-zA-HJ-NP-Z0-9]{26,33})/
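A minimal usage sketch for this literal. The sample message and the names `bitcoinAddress`, `message` and `found` are made up for illustration. The `#/.../#` delimiters are used only so the snippet compiles without the bare-slash regex flag; the pattern itself is unchanged.

```swift
// Same pattern as the literal above, written with extended delimiters.
let bitcoinAddress = #/([13][a-km-zA-HJ-NP-Z0-9]{26,33})/#

// Hypothetical input text containing one address-shaped token.
let message = "Send to 1A1zP1eP5QGefi2DMPTfTL5SLmv7DivfNa today"

// `.1` is the first capture group; `.0` would be the whole match.
let found = message.firstMatch(of: bitcoinAddress).map { String($0.1) }
print(found ?? "no match")
```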

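| Expression | Description | Example (matches in [ ], `␣` is a space) |
| --- | --- | --- |
| `[aeiou]` | Matches any character in the set | On[e]V’s␣D[e]n␣[i]s␣[a]␣bl[o]g. |
| `[^aeiou]` | Matches any character not in the set | [On]e[V’s␣D]e[n␣]i[s␣]a[␣bl]o[g.] |
| `[A-Z]` | Matches any character in the range | [O]ne[V]’s␣[D]en␣is␣a␣blog. |
| `.` | Any character except a line break. Equivalent to `[^\n\r]` | [OneV’s␣Den␣is␣a␣blog.] |
| `\s` | Whitespace characters (including tabs and line breaks) | OneV’s[␣]Den[␣]is[␣]a[␣]blog. |
| `\S` | Non-whitespace characters | [OneV’s]␣[Den]␣[is]␣[a]␣[blog.] |
| `[\s\S]` | Whitespace or non-whitespace, that is, any character. Equivalent to `[^]` in JavaScript regular expressions | [OneV’s␣Den␣is␣a␣blog.] |
| `\w` | Word characters: letters, digits and the underscore. For ASCII text, equivalent to `[A-Za-z0-9_]` | [OneV]’[s]␣[Den]␣[is]␣[a]␣[blog]. |
| `\W` | Non-word characters. For ASCII text, equivalent to `[^A-Za-z0-9_]` | OneV[’]s[␣]Den[␣]is[␣]a[␣]blog[.] |
| `\d` | Digits. For ASCII text, equivalent to `[0-9]` | +([81])[021]-[1234]-[5678] |
| `\D` | Non-digits. For ASCII text, equivalent to `[^0-9]` | [+(]81[)]021[-]1234[-]5678 |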
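The character classes can be tried out directly with `matches(of:)`. This is a small sketch, not part of the original sheet; it uses the `#/.../#` delimiter form (equivalent to `/.../`) so that it compiles without the bare-slash regex flag, and a straight apostrophe in the sample text.

```swift
let text = "OneV's Den is a blog."

// Every match of a capture-free regex has a `Substring` output.
let vowels = text.matches(of: #/[aeiou]/#).map { String($0.output) }
let capitals = text.matches(of: #/[A-Z]/#).map { String($0.output) }
let spaceCount = text.matches(of: #/\s/#).count
let digitRuns = "+(81)021-1234-5678".matches(of: #/\d+/#).map { String($0.output) }

print(vowels)     // ["e", "e", "i", "a", "o"]
print(capitals)   // ["O", "V", "D"]
print(spaceCount) // 4
print(digitRuns)  // ["81", "021", "1234", "5678"]
```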

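| Expression | Description | Example | Result (matches in [ ]) |
| --- | --- | --- | --- |
| `+` | One or more | `b\w+` | b [be] [bee] [beer] [beers] |
| `*` | Zero or more | `b\w*` | [b] [be] [bee] [beer] [beers] |
| `{2,3}` | Between 2 and 3 repetitions | `b\w{2,3}` | b be [bee] [beer] [beer]s |
| `?` | Zero or one | `colou?r` | [color] [colour] |
| quantifier + `?` | Makes the preceding quantifier lazy (match as few as possible) | `b\w+?` | b [be] [be]e [be]er [be]ers |
| `\|` | Alternation: matches one of the options | `b(a\|e\|i)d` | [bad] bud bod [bed] [bid] |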
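The difference between greedy, lazy and bounded quantifiers is easiest to see side by side. A short sketch under the same sample text as the table; the helper `allMatches` is a hypothetical convenience, not a standard library API.

```swift
let words = "b be bee beer beers"

// Hypothetical helper: collects every match of a capture-free regex as a String.
func allMatches(_ regex: Regex<Substring>, in text: String) -> [String] {
    text.matches(of: regex).map { String($0.output) }
}

let greedy = allMatches(#/b\w+/#, in: words)        // as many word characters as possible
let reluctant = allMatches(#/b\w+?/#, in: words)    // as few as possible
let bounded = allMatches(#/b\w{2,3}/#, in: words)   // two or three word characters
let optionalU = allMatches(#/colou?r/#, in: "color colour")

print(greedy)    // ["be", "bee", "beer", "beers"]
print(reluctant) // ["be", "be", "be", "be"]
print(bounded)   // ["bee", "beer", "beer"]
print(optionalU) // ["color", "colour"]
```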

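| Expression | Description | Example | Result (matches in [ ]) |
| --- | --- | --- | --- |
| `^` | Start of the string | `^\w+` | [she] sells seashells |
| `$` | End of the string | `\w+$` | she sells [seashells] |
| `\b` | Word boundary: a position between a `\w` and a non-`\w` character | `s\b` | she sell[s] seashell[s] |
| `\B` | Any position that is not a word boundary | `s\B` | [s]he [s]ells [s]ea[s]hells |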
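Anchors match positions rather than characters, which a quick count makes visible. A minimal sketch using the table's sample text, with `#/.../#` literals so no compiler flag is needed.

```swift
let text = "she sells seashells"

// `^` and `$` pin the match to the start and end of the string.
let firstWord = text.firstMatch(of: #/^\w+/#).map { String($0.output) }
let lastWord = text.firstMatch(of: #/\w+$/#).map { String($0.output) }

// `\b` keeps only an "s" that ends a word; `\B` keeps only an "s" followed by another word character.
let trailingS = text.matches(of: #/s\b/#).count
let innerS = text.matches(of: #/s\B/#).count

print(firstWord ?? "", lastWord ?? "") // she seashells
print(trailingS, innerS)               // 2 4
```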

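| Expression | Description | Example (matches in [ ]) |
| --- | --- | --- |
| `(OneV)+` | Captures what the parentheses match; the group appears in the match output | [OneV]’s Den is a blog. |
| `(?<name>OneV)+` | Named capture; the group can be referenced by name in the match output | |
| `(?:OneV)+` | Groups without capturing: a quantifier can be applied, but nothing is added to the match output | |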
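The three group forms differ mainly in the output type of the resulting regex. The sketch below spells the types out; the sample text repeats `OneV` twice (a made-up variation) so that the quantifier has something to repeat.

```swift
let text = "OneVOneV's Den is a blog."

// Capturing: the output is a tuple of the whole match plus the group.
let capturing: Regex<(Substring, Substring)> = #/(OneV)+/#
// Non-capturing: the group only scopes the quantifier, so the output is just the match.
let nonCapturing: Regex<Substring> = #/(?:OneV)+/#
// Named: the group is also reachable by its name.
let named = #/(?<name>OneV)+/#

let whole = text.firstMatch(of: capturing).map { String($0.0) }
let lastRepetition = text.firstMatch(of: capturing).map { String($0.1) }
let plain = text.firstMatch(of: nonCapturing).map { String($0.output) }
let byName = text.firstMatch(of: named).map { String($0.name) }

print(whole ?? "", lastRepetition ?? "") // OneVOneV OneV
print(plain ?? "", byName ?? "")         // OneVOneV OneV
```

A capture under `+` keeps only the last repetition, which is why `.1` is a single `OneV`.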

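| Expression | Description | Example (matches in [ ]) |
| --- | --- | --- |
| `\d(?=px)` | `?=`: positive lookahead. The main pattern matches only if the text that follows matches the lookahead | 1pt [2]px 3em [4]px |
| `\d(?!px)` | `?!`: negative lookahead. The main pattern matches only if the text that follows does not match the lookahead | [1]pt 2px [3]em 4px |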
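A lookahead checks the following text without consuming it, so only the digit ends up in the match. A short sketch over the table's sample text:

```swift
let sizes = "1pt 2px 3em 4px"

// The "px" is inspected but not included in the match.
let beforePx = sizes.matches(of: #/\d(?=px)/#).map { String($0.output) }
let notBeforePx = sizes.matches(of: #/\d(?!px)/#).map { String($0.output) }

print(beforePx)    // ["2", "4"]
print(notBeforePx) // ["1", "3"]
```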

          
    let bitcoinAddress_v1 = /([13][a-km-zA-HJ-NP-Z0-9]{26,33})/

          
    import RegexBuilder

    let bitcoinAddress_v1 = Regex {
      Capture {
        // First character: "1" or "3"
        One(.anyOf("13"))
        // Then 26 to 33 characters from the allowed set
        Repeat(26...33) {
          CharacterClass(
            ("a"..."k"),
            ("m"..."z"),
            ("A"..."H"),
            ("J"..."N"),
            ("P"..."Z"),
            ("0"..."9")
          )
        }
      }
    }

| Literal | Equivalent `RegexComponent` |
| --- | --- |
| `[aeiou]` | `.anyOf("aeiou")`. For readability, consider wrapping it in a quantifier: `One(.anyOf("aeiou"))` |
| `[^aeiou]` | `CharacterClass.anyOf("aeiou").inverted` |
| `[A-Z]` | `("A"..."Z")` |
| `.` | `.anyNonNewline` (`.any` also matches line breaks) |
| `\s` | `.whitespace` |
| `\S` | `.whitespace.inverted` |
| `[\s\S]` | `CharacterClass(.whitespace, .whitespace.inverted)` |
| `\w` | `.word` |
| `\W` | `.word.inverted` |
| `\d` | `.digit` |
| `\D` | `.digit.inverted` |
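A `CharacterClass` is itself a `RegexComponent`, so it can be passed straight to the matching methods. A minimal sketch; the variable names are illustrative, and the explicit `CharacterClass` annotation on `capital` is there only to pick the RegexBuilder range operator unambiguously.

```swift
import RegexBuilder

let text = "OneV's Den is a blog."
let vowel = CharacterClass.anyOf("aeiou")   // [aeiou]
let capital: CharacterClass = "A"..."Z"     // [A-Z]

let vowels = text.matches(of: vowel).map { String($0.output) }
// `inverted` gives [^aeiou]; OneOrMore groups neighbouring non-vowels into runs.
let nonVowelRuns = text.matches(of: OneOrMore(vowel.inverted)).map { String($0.output) }
let capitals = text.matches(of: capital).map { String($0.output) }
let digitRuns = "+(81)021-1234-5678".matches(of: OneOrMore(.digit)).map { String($0.output) }

print(vowels)       // ["e", "e", "i", "a", "o"]
print(nonVowelRuns) // ["On", "V's D", "n ", "s ", " bl", "g."]
print(capitals)     // ["O", "V", "D"]
print(digitRuns)    // ["81", "021", "1234", "5678"]
```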

| Literal (example) | Equivalent `RegexComponent` (`↵` marks a line break) |
| --- | --- |
| `+` ( `b\w+` ) | `OneOrMore(.word)` |
| `*` ( `b\w*` ) | `ZeroOrMore(.word)` |
| `{2,3}` ( `b\w{2,3}` ) | `Repeat(2...3) { .word }` |
| `?` ( `colou?r` ) | `Optionally { "u" }` |
| quantifier + `?` ( `b\w+?` ) | `OneOrMore(.word, .reluctant)` |
| `\|` ( `b(a\|e\|i)d` ) | `ChoiceOf { "a" ↵ "e" ↵ "i" }` |
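Put together inside `Regex { ... }`, the quantifier components behave like their literal counterparts. The following is a sketch of that correspondence; `allMatches` is a hypothetical helper and `CharacterClass.word` is written out in full inside the `Repeat` closure to keep type inference simple.

```swift
import RegexBuilder

// Hypothetical helper: collects every match of a capture-free regex as a String.
func allMatches(_ regex: Regex<Substring>, in text: String) -> [String] {
    text.matches(of: regex).map { String($0.output) }
}

let greedy = Regex {            // b\w+
    "b"
    OneOrMore(.word)
}
let reluctant = Regex {         // b\w+?
    "b"
    OneOrMore(.word, .reluctant)
}
let bounded = Regex {           // b\w{2,3}
    "b"
    Repeat(2...3) { CharacterClass.word }
}
let choice = Regex {            // b(?:a|e|i)d
    "b"
    ChoiceOf {
        "a"
        "e"
        "i"
    }
    "d"
}

let words = "b be bee beer beers"
print(allMatches(greedy, in: words))    // ["be", "bee", "beer", "beers"]
print(allMatches(reluctant, in: words)) // ["be", "be", "be", "be"]
print(allMatches(bounded, in: words))   // ["bee", "beer", "beer"]
print(allMatches(choice, in: "bad bud bod bed bid")) // ["bad", "bed", "bid"]
```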

| Literal (example) | Equivalent `RegexComponent` (`↵` marks a line break) |
| --- | --- |
| `^` ( `^\w+` ) | `Regex { Anchor.startOfSubject ↵ OneOrMore(.word) }` |
| `$` ( `\w+$` ) | `Regex { OneOrMore(.word) ↵ Anchor.endOfSubject }` |
| `\b` ( `s\b` ) | `Regex { "s" ↵ Anchor.wordBoundary }` |
| `\B` ( `s\B` ) | `Regex { "s" ↵ Anchor.wordBoundary.inverted }` |
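The anchor rows can be checked the same way as the literal versions. A brief sketch with illustrative names:

```swift
import RegexBuilder

let text = "she sells seashells"

let leadingWord = Regex {       // ^\w+
    Anchor.startOfSubject
    OneOrMore(.word)
}
let wordFinalS = Regex {        // s\b
    "s"
    Anchor.wordBoundary
}
let wordInnerS = Regex {        // s\B
    "s"
    Anchor.wordBoundary.inverted
}

let firstWord = text.firstMatch(of: leadingWord).map { String($0.output) }
let finalCount = text.matches(of: wordFinalS).count
let innerCount = text.matches(of: wordInnerS).count

print(firstWord ?? "")        // she
print(finalCount, innerCount) // 2 4
```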

| Literal | Equivalent `RegexComponent` (`↵` marks a line break) |
| --- | --- |
| `(OneV)+` | `OneOrMore { Capture { "OneV" } }` |
| `(?<name>OneV)+` | `let name = Reference(Substring.self)` ↵ `OneOrMore { Capture(as: name) { "OneV" } }` |
| `(?:OneV)+` | `OneOrMore { "OneV" }` |

          
    Regex {
      TryCapture(as: kind) {
        OneOrMore(.word)
      } transform: {
        Transaction.Kind($0)
      } // yields a strongly typed `Kind` value
    }
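A self-contained sketch of both capture styles: a `Reference`-bound `Capture` and a `TryCapture` that converts its text while matching. The input string and the names `site` and `year` are made up for illustration, and `Int` stands in for a custom type such as `Kind`.

```swift
import RegexBuilder

let site = Reference(Substring.self)
let year = Reference(Int.self)

let welcome = Regex {
    "Welcome to "
    Capture(as: site) {
        OneOrMore(.any, .reluctant)
    }
    ", since "
    // If the transform returns nil, this part of the pattern fails to match.
    TryCapture(as: year) {
        OneOrMore(.digit)
    } transform: {
        Int($0)
    }
}

let match = "Welcome to OneV's Den, since 2011".firstMatch(of: welcome)
let siteName = match.map { String($0[site]) }
let sinceYear = match.map { $0[year] }   // already an Int, no parsing needed afterwards

print(siteName ?? "", sinceYear ?? 0) // OneV's Den 2011
```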

| Literal | Equivalent `RegexComponent` (`↵` marks a line break) |
| --- | --- |
| `\d(?=px)` | `Regex { .digit ↵ Lookahead { "px" } }` |
| `\d(?!px)` | `Regex { .digit ↵ NegativeLookahead { "px" } }` |
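The builder lookaheads can be exercised with the same sample text as the literal versions. A minimal sketch; `One(.digit)` is used instead of a bare `.digit` only to make the intent explicit.

```swift
import RegexBuilder

let sizes = "1pt 2px 3em 4px"

let digitBeforePx = Regex {       // \d(?=px)
    One(.digit)
    Lookahead { "px" }
}
let digitNotBeforePx = Regex {    // \d(?!px)
    One(.digit)
    NegativeLookahead { "px" }
}

let pxDigits = sizes.matches(of: digitBeforePx).map { String($0.output) }
let otherDigits = sizes.matches(of: digitNotBeforePx).map { String($0.output) }

print(pxDigits)    // ["2", "4"]
print(otherDigits) // ["1", "3"]
```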

| Parser type | Method | Parsable examples |
| --- | --- | --- |
| `Date.ParseStrategy` | `date(_:locale:timeZone:calendar:)` | Oct 21, 2015; 10/21/2015; etc. |
| `Date.ParseStrategy` | `date(format:locale:timeZone:calendar:twoDigitStartDate:)` | 05_04_22 |
| `Date.ParseStrategy` | `dateTime(date:time:locale:timeZone:calendar:)` | 10/17/2020, 9:54:29 PM |
| `Date.ISO8601FormatStyle` | `iso8601(timeZone:...)` | 2021-06-21T211015 |
| `Date.ISO8601FormatStyle` | `iso8601Date(timeZone:dateSeparator:)` | 2015-11-14 |
| `Date.ISO8601FormatStyle` | `iso8601WithTimeZone(...)` | 2021-06-21T21:10:15+0800 |
| `Decimal.FormatStyle.Currency` | `localizedCurrency(code:locale:)` | $52,249.98 -> `Decimal` |
| `Decimal.FormatStyle` | `localizedDecimal(locale:)` | 1.234, 1E5 -> `Decimal` |
| `FloatingPointFormatStyle<Double>` | `localizedDouble(locale:)` | 1.234, 1E5 -> `Double` |
| `FloatingPointFormatStyle<Double>.Percent` | `localizedDoublePercentage(locale:)` | 15.4%, -200% -> `Double` |
| `IntegerFormatStyle<Int>` | `localizedInteger(locale:)` | 199, 1.234 -> `Int` |
| `IntegerFormatStyle<Int>.Currency` | `localizedIntegerCurrency(code:locale:)` | $52,249.98 -> `Int` |
| `IntegerFormatStyle<Int>.Percent` | `localizedIntegerPercentage(locale:)` | 15.4%, -200% -> `Int` |
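These parsers can be dropped into a builder as ordinary components, and their captures come out already typed. A rough sketch using `localizedInteger(locale:)` from the table; the input string, the `"Total: "` prefix and the `en_US` locale are assumptions chosen for the example, and the snippet assumes a platform where Foundation provides these regex conformances.

```swift
import Foundation
import RegexBuilder

let enUS = Locale(identifier: "en_US")

let totalLine = Regex {
    "Total: "
    // The capture type is Int, produced by Foundation's integer parser.
    Capture(.localizedInteger(locale: enUS))
}

let total = "Total: 199".firstMatch(of: totalLine).map { $0.1 }
print(total.map(String.init) ?? "no match")
```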

          
    func consuming(
        _ input: String,
        startingAt index: String.Index,
        in bounds: Range<String.Index>
    ) throws -> (upperBound: String.Index, output: Self.RegexOutput)?
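To illustrate the contract of `consuming`, here is a pure-Swift sketch of a custom component. `ASCIIIntParser` is a hypothetical type invented for this example: it consumes a run of ASCII digits starting at `index`, returns the position where it stopped together with the parsed value, and returns `nil` when nothing can be consumed.

```swift
import RegexBuilder

// Hypothetical component: parses a run of ASCII digits into an Int.
struct ASCIIIntParser: CustomConsumingRegexComponent {
    typealias RegexOutput = Int

    func consuming(
        _ input: String,
        startingAt index: String.Index,
        in bounds: Range<String.Index>
    ) throws -> (upperBound: String.Index, output: Int)? {
        // Advance over digits without leaving the searchable bounds.
        var end = index
        while end < bounds.upperBound, input[end].isASCII, input[end].isNumber {
            end = input.index(after: end)
        }
        // No digits (or a value too large for Int) means no match here.
        guard end > index, let value = Int(input[index..<end]) else { return nil }
        return (end, value)
    }
}

let idField = Regex {
    "id="
    Capture(ASCIIIntParser())
}

let parsedID = "id=42&x=1".firstMatch(of: idField).map { $0.1 }
print(parsedID ?? -1) // 42
```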

          
    import Darwin

    struct CDoubleParser: CustomConsumingRegexComponent {
        typealias RegexOutput = Double

        func consuming(
            _ input: String, startingAt index: String.Index, in bounds: Range<String.Index>
        ) throws -> (upperBound: String.Index, output: Double)? {
            input[index...].withCString { startAddress in
                // strtod parses the longest numeric prefix and sets endAddress just past it
                var endAddress: UnsafeMutablePointer<CChar>!
                let output = strtod(startAddress, &endAddress)
                // Nothing was consumed, so there is no match at this position
                guard endAddress > startAddress else { return nil }
                // The C string is UTF-8, so the byte distance is an offset in the UTF-8 view
                let parsedLength = startAddress.distance(to: endAddress)
                let upperBound = input.utf8.index(index, offsetBy: parsedLength)
                return (upperBound, output)
            }
        }
    }

          
    extension RegexComponent where Self == CDoubleParser {
        // Enables the `.cDouble` shorthand wherever a RegexComponent is expected
        static var cDouble: Self { CDoubleParser() }
    }

          
    // Find every match and return all of them
    input.matches(of: regex) // [Regex<Output>.Match]
    // Return the first match
    input.firstMatch(of: regex) // Regex<Output>.Match?
    // Return a result only when the entire string matches
    input.wholeMatch(of: regex) // Regex<Output>.Match?
    // Return a result when the beginning of the string matches
    // To only check whether it matches, use `starts(with:)`
    input.prefixMatch(of: regex) // Regex<Output>.Match?
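The four methods differ only in where the match may sit. A small sketch with a made-up input string shows the results side by side:

```swift
let input = "2011 was the start, 2024 is now"
let year = #/\d{4}/#

let all = input.matches(of: year).map { String($0.output) }
let first = input.firstMatch(of: year).map { String($0.output) }
let whole = input.wholeMatch(of: year)              // the string is more than a year
let prefixHit = input.prefixMatch(of: year).map { String($0.output) }
let startsWithYear = input.starts(with: year)       // Bool only, no match object

print(all)            // ["2011", "2024"]
print(first ?? "")    // 2011
print(whole == nil)   // true
print(prefixHit ?? "", startsWithYear) // 2011 true
```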

          
    let regex = /Welcome to (.+?), a personal blog from (\d+)/
    let text = "Welcome to OneV's Den, a personal blog from 2011"
    if let result = text.wholeMatch(of: regex) {
        // Captures are addressed by position: .1 is the first group, .2 the second
        print("Title: \(result.1)") // OneV's Den
        print("Year: \(result.2)")  // 2011
    }

          
    let regex = /Welcome to (?<name>.+?), a personal blog from (?<year>\d+)/
    let text = "Welcome to OneV's Den, a personal blog from 2011"
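With named groups, the captures can be read by name as well as by position. A self-contained sketch along the lines of the regex above (written with `#/.../#` so it compiles without the bare-slash flag; `title` and `since` are illustrative names):

```swift
let namedRegex = #/Welcome to (?<name>.+?), a personal blog from (?<year>\d+)/#
let sample = "Welcome to OneV's Den, a personal blog from 2011"

let result = sample.wholeMatch(of: namedRegex)
// The group names become labels on the output tuple.
let title = result.map { String($0.name) }
let since = result.map { String($0.year) }

print(title ?? "", since ?? "") // OneV's Den 2011
```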

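Overview

Common literals

Character classes

Quantifiers

Anchors

Capture groups

Lookahead

Builder DSL

Character classes

Quantifiers

Anchors

Captures

Lookahead

Common parsers

Custom parsers and CustomConsumingRegexComponent

Matching

Common matching methods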