Python3中正则模块re.compile、re.match及re.search函数用法详解

更新时间：2018年06月11日 14:24:29 作者：Citizen_Wang

这篇文章主要介绍了Python3中正则模块re.compile、re.match及re.search函数用法,结合实例形式较为详细的分析了re模块中re.compile、re.match及re.search函数的功能、参数、具体使用技巧与注意事项,需要的朋友可以参考下

本文实例讲述了Python3中正则模块re.compile、re.match及re.search函数用法。分享给大家供大家参考，具体如下：

re模块 re.compile、re.match、 re.search

re 模块官方说明文档

正则匹配的时候，第一个字符是 r，表示 raw string 原生字符，意在声明字符串中间的特殊字符不用转义。

比如表示 ‘\n'，可以写 r'\n'，或者不适用原生字符 ‘\n'。

推荐使用 re.match

re.compile() 函数

编译正则表达式模式，返回一个对象。可以把常用的正则表达式编译成正则表达式对象，方便后续调用及提高效率。

re.compile(pattern, flags=0)

pattern 指定编译时的表达式字符串
flags 编译标志位，用来修改正则表达式的匹配方式。支持 re.L|re.M 同时匹配

flags 标志位参数

re.I(re.IGNORECASE)
使匹配对大小写不敏感

re.L(re.LOCAL)
做本地化识别（locale-aware）匹配

re.M(re.MULTILINE)
多行匹配，影响 ^ 和 $

re.S(re.DOTALL)
使 . 匹配包括换行在内的所有字符

re.U(re.UNICODE)
根据Unicode字符集解析字符。这个标志影响 \w, \W, \b, \B.

re.X(re.VERBOSE)
该标志通过给予你更灵活的格式以便你将正则表达式写得更易于理解。

示例：

import re
content = 'Citizen wang , always fall in love with neighbour，WANG'
rr = re.compile(r'wan\w', re.I) # 不区分大小写
print(type(rr))
a = rr.findall(content)
print(type(a))
print(a)

findall 返回的是一个 list 对象

<class '_sre.SRE_Pattern'>
<class 'list'>
['wang', 'WANG']

re.match() 函数

总是从字符串‘开头曲匹配'，并返回匹配的字符串的 match 对象 <class '_sre.SRE_Match'>。

re.match(pattern, string[, flags=0])

pattern 匹配模式，由 re.compile 获得
string 需要匹配的字符串

import re
pattern = re.compile(r'hello')
a = re.match(pattern, 'hello world')
b = re.match(pattern, 'world hello')
c = re.match(pattern, 'hell')
d = re.match(pattern, 'hello ')
if a:
  print(a.group())
else:
  print('a 失败')
if b:
  print(b.group())
else:
  print('b 失败')
if c:
  print(c.group())
else:
  print('c 失败')
if d:
  print(d.group())
else:
  print('d 失败')

hello
b 失败
c 失败
hello

match 的方法和属性

参考链接

import re
str = 'hello world! hello python'
pattern = re.compile(r'(?P<first>hell\w)(?P<symbol>\s)(?P<last>.*ld!)') # 分组，0 组是整个 hello world!, 1组 hello，2组 ld!
match = re.match(pattern, str)
print('group 0:', match.group(0)) # 匹配 0 组，整个字符串
print('group 1:', match.group(1)) # 匹配第一组，hello
print('group 2:', match.group(2)) # 匹配第二组，空格
print('group 3:', match.group(3)) # 匹配第三组，ld!
print('groups:', match.groups())  # groups 方法，返回一个包含所有分组匹配的元组
print('start 0:', match.start(0), 'end 0:', match.end(0)) # 整个匹配开始和结束的索引值
print('start 1:', match.start(1), 'end 1:', match.end(1)) # 第一组开始和结束的索引值
print('start 2:', match.start(1), 'end 2:', match.end(2)) # 第二组开始和结束的索引值
print('pos 开始于：', match.pos)
print('endpos 结束于：', match.endpos) # string 的长度
print('lastgroup 最后一个被捕获的分组的名字：', match.lastgroup)
print('lastindex 最后一个分组在文本中的索引：', match.lastindex)
print('string 匹配时候使用的文本：', match.string)
print('re 匹配时候使用的 Pattern 对象：', match.re)
print('span 返回分组匹配的 index （start(group),end(group))：', match.span(2))

返回结果：

group 0: hello world!
group 1: hello
group 2:
group 3: world!
groups: ('hello', ' ', 'world!')
start 0: 0 end 0: 12
start 1: 0 end 1: 5
start 2: 0 end 2: 6
pos 开始于： 0
endpos 结束于： 25
lastgroup 最后一个被捕获的分组的名字： last
lastindex 最后一个分组在文本中的索引： 3
string 匹配时候使用的文本： hello world! hello python
re 匹配时候使用的 Pattern 对象： re.compile('(?P<first>hell\\w)(?P<symbol>\\s)(?P<last>.*ld!)')
span 返回分组匹配的 index （start(group),end(group))： (5, 6)

re.search 函数

对整个字符串进行搜索匹配，返回第一个匹配的字符串的 match 对象。

re.search(pattern, string[, flags=0])

pattern 匹配模式，由 re.compile 获得
string 需要匹配的字符串

import re
str = 'say hello world! hello python'
pattern = re.compile(r'(?P<first>hell\w)(?P<symbol>\s)(?P<last>.*ld!)') # 分组，0 组是整个 hello world!, 1组 hello，2组 ld!
search = re.search(pattern, str)
print('group 0:', search.group(0)) # 匹配 0 组，整个字符串
print('group 1:', search.group(1)) # 匹配第一组，hello
print('group 2:', search.group(2)) # 匹配第二组，空格
print('group 3:', search.group(3)) # 匹配第三组，ld!
print('groups:', search.groups())  # groups 方法，返回一个包含所有分组匹配的元组
print('start 0:', search.start(0), 'end 0:', search.end(0)) # 整个匹配开始和结束的索引值
print('start 1:', search.start(1), 'end 1:', search.end(1)) # 第一组开始和结束的索引值
print('start 2:', search.start(1), 'end 2:', search.end(2)) # 第二组开始和结束的索引值
print('pos 开始于：', search.pos)
print('endpos 结束于：', search.endpos) # string 的长度
print('lastgroup 最后一个被捕获的分组的名字：', search.lastgroup)
print('lastindex 最后一个分组在文本中的索引：', search.lastindex)
print('string 匹配时候使用的文本：', search.string)
print('re 匹配时候使用的 Pattern 对象：', search.re)
print('span 返回分组匹配的 index （start(group),end(group))：', search.span(2))

注意 re.search 和 re.match 匹配的 str 的区别

打印结果：

group 0: hello world!
group 1: hello
group 2:
group 3: world!
groups: ('hello', ' ', 'world!')
start 0: 4 end 0: 16
start 1: 4 end 1: 9
start 2: 4 end 2: 10
pos 开始于： 0
endpos 结束于： 29
lastgroup 最后一个被捕获的分组的名字： last
lastindex 最后一个分组在文本中的索引： 3
string 匹配时候使用的文本： say hello world! hello python
re 匹配时候使用的 Pattern 对象： re.compile('(?P<first>hell\\w)(?P<symbol>\\s)(?P<last>.*ld!)')
span 返回分组匹配的 index （start(group),end(group))： (9, 10)

PS：这里再为大家提供2款非常方便的正则表达式工具供大家参考使用：

JavaScript正则表达式在线测试工具：
http://tools.jb51.net/regex/javascript

正则表达式在线生成工具：
http://tools.jb51.net/regex/create_reg

更多关于Python相关内容可查看本站专题：《Python正则表达式用法总结》、《Python数据结构与算法教程》、《Python函数使用技巧总结》、《Python字符串操作技巧汇总》、《Python入门与进阶经典教程》及《Python文件与目录操作技巧汇总》

希望本文所述对大家Python程序设计有所帮助。

您可能感兴趣的文章:

举例讲解Linux系统下Python调用系统Shell的方法
这篇文章主要介绍了举例讲解Linux系统下Python调用系统Shell的方法,包括用Python和shell读取文件某一行的实例,需要的朋友可以参考下
2015-11-11
使用 Python 读取电子表格中的数据实例详解
这篇文章主要介绍了使用 Python 读取电子表格中的数据,本文通过实例代码给大家介绍的非常详细，对大家的学习或工作具有一定的参考借鉴价值，需要的朋友可以参考下
2020-04-04
如何使用python读取Excel指定范围并转为数组
python处理数据文件的途径有很多种,下面这篇文章主要给大家介绍了关于如何使用python读取Excel指定范围并转为数组的相关资料,文中通过图文以及实例代码介绍的非常详细,需要的朋友可以参考下
2022-11-11
Python解惑之整数比较详解
这篇文章主要给大家介绍了Python中整数比较的相关资料，文中通过示例代码介绍的非常详细，详细会对大家学习python的整数具有一定的参考价值，需要的朋友下面跟着小编一起来学习学习吧。
2017-04-04
Python实现的数据结构与算法之双端队列详解
这篇文章主要介绍了Python实现的数据结构与算法之双端队列,详细讲述了双端队列的概念、功能、定义及Python实现与使用双端队列的相关技巧,非常具有实用价值,需要的朋友可以参考下
2015-04-04
Tensorflow Summary用法学习笔记
这篇文章主要介绍了Tensorflow Summary用法学习笔记，文中通过示例代码介绍的非常详细，对大家的学习或者工作具有一定的参考学习价值，需要的朋友们下面随着小编来一起学习学习吧
2020-01-01
python实现合并两个有序列表的示例代码
这篇文章主要介绍了python实现合并两个有序列表的示例代码，文中通过示例代码介绍的非常详细，对大家的学习或者工作具有一定的参考学习价值，需要的朋友们下面随着小编来一起学习学习吧
2021-04-04
python抓取文件夹的所有文件
这篇文章主要为大家详细介绍了python抓取文件夹的所有文件,包括子文件夹和子文件夹的文件，具有一定的参考价值，感兴趣的小伙伴们可以参考一下
2018-02-02
Python自动化测试selenium指定截图文件名方法
这篇文章主要介绍了Python自动化测试selenium指定截图文件名方法，Selenium 支持 Web 浏览器的自动化，它提供一套测试函数，用于支持 Web 自动化测试，下文基于python实现指定截图文件名方法，需要的小伙伴可以参考一下
2022-05-05
python简单实现图片文字分割
这篇文章主要为大家详细介绍了python简单实现图片文字分割，文中示例代码介绍的非常详细，具有一定的参考价值，感兴趣的小伙伴们可以参考一下
2021-09-09