iOS 中:正则表达式,又称正规表示法、常规表示法。(英语:Regular Expression,在代码中常简写为regex、regexp或RE),计算机科学的一个概念。正则表达式使用单个字符串来描述、匹配一系列符合某个句法规则。在很多文本编辑器里,正则表达式通常被用来检索、替换那些符合某个模式的文本。
先看一下头文件:
NSRegularExpression:头文件的枚举
typedef NS_OPTIONS(NSUInteger, NSRegularExpressionOptions) {
NSRegularExpressionCaseInsensitive = 1 << 0, // 不区分大小写的
NSRegularExpressionAllowCommentsAndWhitespace = 1 << 1, // 忽略空格和# (注释符)
NSRegularExpressionIgnoreMetacharacters = 1 << 2, // 整体化
NSRegularExpressionDotMatchesLineSeparators = 1 << 3, // 匹配任何字符,包括行分隔符
NSRegularExpressionAnchorsMatchLines = 1 << 4, // 允许^和$在匹配的开始和结束行
NSRegularExpressionUseUnixLineSeparators = 1 << 5, // (查找范围为整个的话无效)
NSRegularExpressionUseUnicodeWordBoundaries = 1 << 6 // (查找范围为整个的话无效)
};
这些枚举是针对表达式的,例如options选择为:NSRegularExpressionCaseInsensitive,那么表达式Pattern可以@"Yu"匹配字符串中的yu。
下面2个枚举貌似都没什么意义,除了在block方法中,一般情况下,直接给0吧
typedef NS_OPTIONS(NSUInteger, NSMatchingOptions) {
NSMatchingReportProgress = 1 << 0, //找到最长的匹配字符串后调用block回调
NSMatchingReportCompletion = 1 << 1, //找到任何一个匹配串后都回调一次block
NSMatchingAnchored = 1 << 2, //从匹配范围的开始处进行匹配
NSMatchingWithTransparentBounds = 1 << 3, //允许匹配的范围超出设置的范围
NSMatchingWithoutAnchoringBounds = 1 << 4 //禁止^和$自动匹配行还是和结束
};
此枚举值只在5.block方法中用到
typedef NS_OPTIONS(NSUInteger, NSMatchingFlags) {
NSMatchingProgress = 1 << 0, //匹配到最长串是被设置
NSMatchingCompleted = 1 << 1, //全部分配完成后被设置
NSMatchingHitEnd = 1 << 2, //匹配到设置范围的末尾时被设置
NSMatchingRequiredEnd = 1 << 3, //当前匹配到的字符串在匹配范围的末尾时被设置
NSMatchingInternalError = 1 << 4 //由于错误导致的匹配失败时被设置
};
下面是一些方法:
1. 返回所有匹配结果的集合(适合,从一段字符串中提取我们想要匹配的所有数据)
* - (NSArray *)matchesInString:(NSString *)string options:(NSMatchingOptions)options range:(NSRange)range;
2. 返回正确匹配的个数(通过等于0,来验证邮箱,电话什么的,代替NSPredicate)
* - (NSUInteger)numberOfMatchesInString:(NSString *)string options:(NSMatchingOptions)options range:(NSRange)range;
3. 返回第一个匹配的结果。注意,匹配的结果保存在 NSTextCheckingResult 类型中
* - (NSTextCheckingResult *)firstMatchInString:(NSString *)string options:(NSMatchingOptions)options range:(NSRange)range;
4. 返回第一个正确匹配结果字符串的NSRange
* - (NSRange)rangeOfFirstMatchInString:(NSString *)string options:(NSMatchingOptions)options range:(NSRange)range;
5. block方法
* - (void)enumerateMatchesInString:(NSString *)string options:(NSMatchingOptions)options range:(NSRange)range usingBlock:(void (^)(NSTextCheckingResult *result, NSMatchingFlags flags, BOOL *stop))block;
替换方法:
- (NSString *)stringByReplacingMatchesInString:(NSString *)string options:(NSMatchingOptions)options range:(NSRange)range withTemplate:(NSString *)templ;
- (NSUInteger)replaceMatchesInString:(NSMutableString *)string options:(NSMatchingOptions)options range:(NSRange)range withTemplate:(NSString *)templ;
- (NSString *)replacementStringForResult:(NSTextCheckingResult *)result inString:(NSString *)string offset:(NSInteger)offset template:(NSString *)templ;
e.g.
找出并替换
NSString *testStr3 = @"msdfdmmms谁人56dfQs89dfmmmdddmmddfd";
NSString *regex3 = @"谁人";//@"[ms]"
NSRegularExpression *regularE3 = [[NSRegularExpression alloc] initWithPattern:regex3 options:NSRegularExpressionCaseInsensitive error:nil];
NSString *modifiedStr = [regularE3 stringByReplacingMatchesInString:testStr3 options:NSMatchingReportProgress range:NSMakeRange(0, testStr.length) withTemplate:@"&&&"];
NSLog(@"replace:%@",modifiedStr);
基本的NSRegularExpression匹配方法
一个Block遍历方法,即调用的时候指定一个Block,进而在执行时,会对目标string上找到的每个匹配的子串执行此Block:
//匹配出来的格式为 +[类名 方法名] 或者 -[类名 方法名]
NSString *regularExpStr = @"[-//|+]\\[.+\\]";
NSRegularExpression *regularExp = [[NSRegularExpression alloc] initWithPattern:regularExpStr options:NSRegularExpressionCaseInsensitive error:nil];
// NSString *testStr = @"2 wgj_avoidCrash 0x0000000106c50534 -[ViewController viewDidLoad] + 212";
NSString *testStr = @"dfadf-[ViewController viewDidLoad]";
__block NSString *resultS = nil;
[regularExp enumerateMatchesInString:testStr options:(NSMatchingReportProgress) range:NSMakeRange(0, testStr.length) usingBlock:^(NSTextCheckingResult * _Nullable result, NSMatchingFlags flags, BOOL * _Nonnull stop) {
if (result) {
NSString *resultStr = [testStr substringWithRange:result.range];
resultS = resultStr;
}
}];
NSLog(@"%@",resultS);
此方法为NSRegularExpression最基本的匹配方法,其遍历string的每个match,并对每个match进行block中指定的操作,且可在任何执行步骤停止。
返回所有匹配子串 的array
NSString *testStr = @"msdfdmmms56dfQs89dfmmmdddmmddfd";
NSString *regex = @"ms";//@"[ms]"
NSRegularExpression *regularE = [[NSRegularExpression alloc] initWithPattern:regex options:NSRegularExpressionCaseInsensitive error:nil];
NSArray *matches = [regularE matchesInString:testStr options:NSMatchingReportProgress range:NSMakeRange(0, testStr.length)];
NSLog(@"%@",matches);
所有匹配子串数目
NSUInteger numOfMatches = [regularE numberOfMatchesInString:testStr options:NSMatchingReportProgress range:NSMakeRange(0, testStr.length)];
NSLog(@"num:%ld",numOfMatches);
//第一个匹配的子串
NSTextCheckingResult *firstMatch = [regularE firstMatchInString:testStr options:NSMatchingReportProgress range:NSMakeRange(0, testStr.length)];
if (firstMatch) {
NSRange range = [firstMatch range];
NSRange firstRange = [firstMatch rangeAtIndex:0];
// NSLog(@"range:%d %d",firstRange,range);
}
//找出第一个匹配的位置和串
NSRange rangeOfFirst = [regularE rangeOfFirstMatchInString:testStr options:NSMatchingReportProgress range:NSMakeRange(0, testStr.length)];
if (!NSEqualRanges(rangeOfFirst, NSMakeRange(NSNotFound, 0))) {
NSString *subStringForFirstMatch = [testStr substringWithRange:rangeOfFirst];
NSLog(@"matchStr:%@",subStringForFirstMatch);
}
//找出并替换
NSString *modifiedStr = [regularE stringByReplacingMatchesInString:testStr options:NSMatchingReportProgress range:NSMakeRange(0, testStr.length) withTemplate:@"&&&"];
NSLog(@"replace:%@",modifiedStr);
返回所有的匹配结果(正则匹配)
// //returns all the matching results
NSString *testStr2 = @"yuiie7gok8deed78fdsafsd78ree";
NSString *re = @"Yu";
NSRegularExpression *regular2 = [NSRegularExpression regularExpressionWithPattern:re options:(NSRegularExpressionCaseInsensitive) error:nil];
NSArray *matchesArr = [regular2 matchesInString:testStr2
options:0
range:NSMakeRange(0, [testStr2 length])];
for (NSTextCheckingResult *match in matchesArr) {
// NSRange matchRange = [match range];
// NSRange firstHalfRange = [match rangeAtIndex:1];
// NSRange secondHalfRange = [match rangeAtIndex:2];
}
NSString *checkStr = @"thfef我们dfdtel:15236262956fdfd";
NSString *checkRe = @"\\d{3,4}[- ]?\\d{7,8}";//匹配10到12位连续数字,或者带连字符/空格的固话号,空格和连字符可以省略。
// NSString *checkRe = @"[0-9]";
//正则匹配
NSRange stringRange = NSMakeRange(0, checkStr.length);
NSError *error;
NSRegularExpression *regexps = [NSRegularExpression regularExpressionWithPattern:checkRe options:0 error:&error];
NSArray *checkResultArr = [regexps matchesInString:checkStr options:0 range:stringRange];
if (checkResultArr) {
for (NSTextCheckingResult *result in checkResultArr) {
NSString *testS = [checkStr substringWithRange:result.range];
NSDataDetector *detector = [NSDataDetector dataDetectorWithTypes:NSTextCheckingTypePhoneNumber | NSTextCheckingTypeLink error:&error];
NSTextCheckingResult *matchesd = [detector firstMatchInString:testS options:0 range:NSMakeRange(0, testS.length)];
if (matchesd) {
NSLog(@"phone:%@",matchesd.phoneNumber);
}
}
}
NSDataDetector继承于NSRegularExpression,自带一些过滤功能
NSError *error = nil;
NSString *phhstr = @"www.baidu.comdhfjdfj 15236262956";
NSRange inputRange = NSMakeRange(0, phhstr.length);
NSDataDetector *detector = [NSDataDetector dataDetectorWithTypes:NSTextCheckingTypePhoneNumber | NSTextCheckingTypeLink error:&error];
NSArray *matchesPPP = [detector matchesInString:phhstr options:0 range:inputRange];
for (NSTextCheckingResult *match in matchesPPP) {
//
if ([match resultType] == NSTextCheckingTypePhoneNumber) {
NSRange matchRange = [match range];
NSLog(@"ssss:%@",[phhstr substringWithRange:matchRange]);
}
}
每个单独的匹配是由NSTextCheckingResult类实例所描述的,其包含整体的匹配range的信息(通过range属性), 每个捕获的Group的range(通过rangeAtIndex:方法)。对于基本的NSRegularExpression对象,这些匹配结果是 NSTextCheckingTypeRegularExpression类型,NSRegularExpression子类型可以使用其他类型。
所以要获取某个字符串匹配中的某个group,需要使用 [NSTextCheckingResult rangeAtIndex:groupindex] (0 总是代表整个匹配的range)