正则笔记

正则生成

调用RegExp对象的构造函数

var reg = new RegExp('^[a-z]$', 'gi')

使用正则表达式字面值，将匹配模式封闭在两个斜杠中

var reg = /^[a-z]$/gi

正则的组成

正则组成

标示字符

限定符

转义字符

* + ? | \ / { [ ( ) ] }^ $ . 这些字符匹配本身需要转义

匹配问号： \?        /a\?b/.test('a?b');
匹配星号:  \*        /a\*b/.test(’a*b');

正则实例

test():

接受一个字符串参数，如果正则表达式与指定的字符串匹配返回 true 否则返回 false；示例：/javascript/.test('hello,javascript');

exec():

接受一个字符串为参数，返回一个数组，其中存放匹配的结果。如果未找到匹配，则返回值为 null;

示例：

var str = "a and b and c";
var reg = /(a and )?(b and )?c/;
reg.exec(str);

结果打印

有g，test指针移动

正则的相关方法

search()

接受参数可以是被检索字符串中的子串，也可以是需要检索的 RegExp 对象，返回被检索的字符串中第一个与正则表达式相匹配的子串的起始位置，否则返回-1；

'hello,javascript'.search('javascript'); //6
'hello,javascript'.search(/javascript/); //6
'hello,javascript,javascript'.search(/javascript/g);  //6

1. xxsearch() 方法不执行全局匹配，它将忽略标志 g；
2. 它同时忽略 regexp 的 lastIndex 属性；
3. 总是从字符串的开始进行检索；

match()

接受参数可以是被检索字符串中的子串，也可以是需要检索的 RegExp 对象，返回存放匹配结果的数组，如果没有找到则返回null,(该数组的内容依赖于 regexp 是否具有全局标志 g)；

'hello,javascript'.match('javascript');
'hello,javascript'.match(/javascript/);

如果 regexp 没有标志 g，那么 match() 方法就只能在 stringObject 中执行一次匹配；
如果 regexp 具有标志 g，则 match() 方法将执行全局检索，找到被检索字符串中的所有匹配的子串，它的数组元素中存放的是被检索字符串中所有的匹配的子串，而且也没有 index 属性或 input 属性；

var str = “a and b and c”;
var reg = /(a and )?(b and )?c/; str.match(reg);

标识符有g

var str = "a and b and c"; 
var reg = /(a and )?(b and )?c/g; str.match(reg);

标识符没g

replace()

接受两个参数，第一个是要被替换的文本，可以是正则也可以是字符串。第二个是替换成的文本，可以是字符串或者函数，字符串可以使用一些特殊的变量来替代前面捕获到的子串；

'hello,javascript'.replace('javascript', 'nodeJs'); 
'hello,javascript'.replace(/javascript/, 'nodeJs');       
'javascript nodeJs'.replace(/(\w+)\s(\w+)/, '$2 $1');

replace()中 $用法

replace()中参数

结果

split()

接受两个参数，返回一个数组。第一个是用来分割字符串的字符或者正则，如果是空字符串则会将元字符串中的每个字符以数组形式返回，第二个参数可选作为限制分割多少个字符，也是返回的数组的长度限制。

'hello,javascript'.split(','); //["hello", "javascript"]
'hello,javascript'.split(/,/);    //["hello", "javascript"]
'hello,javascript'.split(/,/, 1); //["hello"]
'hello,javascript'.split(/(,)/);  //["hello", ",", "javascript"]

用捕获括号的时候会将匹配结果也包含在返回的数组中

正则匹配

精确匹配

/hello/.test('hello');

模糊匹配

"123 1234 12345 123456".match(/\d{2,5}/g);  //["123", "1234", "12345", "12345"]
"a0b a1b a2b a3b a4b".match(/a[123]b/g);   // ["a1b", "a2b", "a3b"]

排除字符

'baby,back,bad,good'.match(/\b([^b,]\w+)\b/g)

分支匹配

分支结构是惰性的，即当前面的匹配上了，后面的就不再尝试了

'beacher'.match(/beach|beacher/g); //beach

分组匹配

使用括号来提供分组功能

"ab abb abab".match(/(ab)+/g);  //["ab", "ab", "abab"]

分组引用

"2017-06-12".replace(/(\d{4})-(\d{2})-(\d{2})/,  "$2/$3/$1")

反向应用

可以在正则本身里引用分组。但只能引用之前出现的分组，即反向引用

var a = “2017-06-12”;
var b = “2017/06/12”;
var c = “2017.06.12”;
/\d{4}(-|\/|\.)\d{2}\1\d{2}/.test(a);

括号嵌套以左开括号为准；'1231231233'.match(/^((\d)(\d(\d)))\1\2\3\4$/)
引用不存在的分组，正则不会报错，只是匹配反向引用的字符本身;
反向分组后面有量词的话，反向分组最终捕获到的数据是最后一次的匹配; var reg = /(\d)+ \1/;"12345 1".match(reg);

反向应用

位置匹配

"hello".replace(/^|$/g, '#');      // #hello#
"hello".replace(/\b/, '#');          //#hello
"hello".replace(/(?=l)/g, '#')      //he#l#lo
"hello".replace(/(?!l)/g, '#')       //#h#ell#o#

贪婪模式

'a123bbb'.match(/a.*b/g);  //["a123bbb"]
'123456'.match(/\d+/g);   //["123456"]

非贪婪模式

在限定符后面直接加上一个问号？就是非贪婪模式

'a123bbb'.match(/a.*?b/g);     //["a123b"]
'123456'.match(/\d+?/g);      //["1", "2", "3", "4", "5", "6"]

捕获分组

以小括号()来实现，捕获性分组工作模式()会把每个分组里匹配的值保存起来

/([a-z]+)(\d+)/.exec('a133');     // ["a133", "a", "133", index: 0, input: "a133”]

非捕获分组

分组(?:)会作为匹配校验，并出现在匹配结果字符里面，但不作为子匹配返回

/(?:[a-z]+)(?:\d+)/.exec('a133');  // ["a133", index: 0, input: "a133”]

常用正则

1、匹配汉字：^[\u4e00-\u9fa5]{0,}$
2、Email地址：^\w+([-+.]\w+)*@\w+([-.]\w+)*\.\w+([-.]\w+)*$
3、日期格式：^\d{4}-\d{1,2}-\d{1,2}

4、时间：/^(0?[0-9]|1[0-9]|[2][0-3]):(0?[0-9]|[1-5][0-9])$/ 
5、一年的12个月(01～09和1～12)：^(0?[1-9]|1[0-2])$
6、一个月的31天(01～09和1～31)：^((0?[1-9])|((1|2)[0-9])|30|31)$
7、中国邮政编码：[1-9]\d{5}(?!\d)
8、IP地址：((?:(?:25[0-5]|2[0-4]\\d|[01]?\\d?\\d)\\.){3}(?:25[0-5]|2[0-4]\\d|[01]?\\d?\\d))

9、匹配id: '<div id="container" class="main"></div>’.match(/id="[^"]*"/ );

10、格式化货币：function format (num) { return 
num.toFixed(2).replace(/\B(?=(\d{3})+\b)/, ",").replace(/^/, "$$ "); }; 

11、去掉左右空格： '    javascript   '.replace(/^\s+|\s+$/g, ''); 

12、每个单词的首字母转换为大写 ： function format(str) { return str.toLowerCase().replace(/(?:^|\s)\w/g, function (c) { return c.toUpperCase(); }); } 

13、匹配标签： "aaa<title>hello</title>" .match(/<([^>]+)>[\d\D]*<\/\1>/)

14、身份证号码：/^(\d{15}|\d{17}[\dxX])$/

正则回溯

'"abc"de'.match(/".*"/)

回溯过程

不回溯

'"abc"de'.match /"[^"]*"/