当前位置：首页 > 编程笔记 > 正文

已解决

第7章模式匹配与正则表达式

来自网友在路上 11108110提问提问时间：2023-11-20 12:43:35阅读次数： 110

最佳答案问答题库1108位专家为你答疑解惑

1. 不用正则表达式来查找文本模式
2. 用正则表达式来查找文本模式
- 2.1 创建正则表达式（Regex）对象
- 2.2 匹配Regex对象
3. 用正则表达式匹配更多模式
- 3.1 利用括号分组
- 3.2 用管道匹配多个分组
- 3.3 用问号实现可选匹配
- 3.4 用星号匹配零次或多次
- 3.5 用加号匹配一次或多次
- 3.6 用花括号匹配特定次数
4. 贪心和非贪心匹配
5. findall() 方法
6. 字符分类
7. 建立自己的字符分类
8. 插入字符和美元字符
9. 通配字符
- 9.1 用点-星匹配所有字符
- 9.2 用句点字符匹配换行符
10. 不区分大小写的匹配
11. 用 sub() 方法替换字符串

1. 不用正则表达式来查找文本模式

def isPhoneNumber(text):if len(text) != 11:return Falsefor i in range(0, len(text)):if (i == 3 or i == 7) and text[i] != "-":return Falseelif i != 3 and i != 7 and not text[i].isdecimal():return Falsereturn Truetext = "123-456-789"
print(text)
print(isPhoneNumber(text))

2. 用正则表达式来查找文本模式

2.1 创建正则表达式（Regex）对象

import retext = re.compile(r'\d\d\d-\d\d\d-\d\d\d')

2.2 匹配Regex对象

import retext = re.compile(r'\d\d\d-\d\d\d-\d\d\d')
match = text.search("~~~123-456-789~~~")
print(match.group())

3. 用正则表达式匹配更多模式

3.1 利用括号分组

import retext = re.compile(r'(\d\d\d)-(\d\d\d-\d\d\d)')
match = text.search("~~~123-456-789~~~")
print(match.group(1))
# 123
print(match.group(2))
# 456-789
print(match.groups())
# ('123', '456-789')

3.2 用管道匹配多个分组

| ：管道

import retext = re.compile(r'456|123')
match = text.search("123-456-789")
print(match.group())
# 123

3.3 用问号实现可选匹配

import retext = re.compile(r'\d\d\d(~)?\d\d\d')
match = text.search("123123")
print(match.group())
# 123123match = text.search("123~123")
print(match.group())
# 123~123

3.4 用星号匹配零次或多次

import retext = re.compile(r'\d\d\d(~)*\d\d\d')
match = text.search("123123")
print(match.group())
# 123123match = text.search("123~~~123")
print(match.group())
# 123~~~123

3.5 用加号匹配一次或多次

import retext = re.compile(r'\d\d\d(~)+\d\d\d')
match = text.search("123~123")
print(match.group())
# 123~123match = text.search("123~~~123")
print(match.group())
# 123~~~123

3.6 用花括号匹配特定次数

import retext = re.compile(r'\d\d\d(~){3,5}\d\d\d')
match = text.search("123~~~123")
print(match.group())
# 123~~~123match = text.search("123~~~~~123")
print(match.group())
# 123~~~~~123

4. 贪心和非贪心匹配

贪心匹配：尽可能匹配最长的字符串
非贪心匹配：尽可能匹配最短的字符串

import retext = re.compile(r'(123 ){2,4}')
match = text.search("123 123 123 123 123 ")
print(match.group())
# 123 123 123 123text = re.compile(r'(123 ){2,4}?')
match = text.search("123 123 123 123 123 ")
print(match.group())
# 123 123

5. findall() 方法

import retext = re.compile(r'\d\d\d-\d\d\d-\d\d\d')
match = text.search("~~~123-456-789~~~111-222-333~~~")
print(match.group())
# 123-456-789match = text.findall("~~~123-456-789~~~111-222-333~~~")
print(match)
# ['123-456-789', '111-222-333']

6. 字符分类

编写字符分类表示\d0~9的任何数字\D除0~9的数字以外的任何字符\w任何字母、数字和下划线字符\W除字母、数字和下划线以外的任何字符\s空格、制表符或换行符\S除空格、制表符和换行符以外的任何字符

7. 建立自己的字符分类

import retext = re.compile(r'[0-5]')
match = text.findall("1a2b3c4d")
print(match)
# ['1', '2', '3', '4']text = re.compile(r'[abc]')
match = text.findall("1a2b3c4d")
print(match)
# ['a', 'b', 'c']text = re.compile(r'[^abc]')
match = text.findall("1a2b3c4d")
print(match)
# ['1', '2', '3', '4', 'd']

8. 插入字符和美元字符

^ ：以指定文本开始
$ ：以指定文本结束

import retext = re.compile(r'^\d\d\d')
match = text.search("123abc456")
print(match)
# <re.Match object; span=(0, 3), match='123'>text = re.compile(r'\d\d\d$')
match = text.search("123abc456")
print(match)
# <re.Match object; span=(6, 9), match='456'>

9. 通配字符

. ：匹配换行符之外的所有字符

import retext = re.compile(r'..23')
match = text.findall("123abc23")
print(match)
# ['bc23']

9.1 用点-星匹配所有字符

import retext = re.compile(r'123(.*)456(.*)')
match = text.findall("123abc456def")
print(match)
# [('abc', 'def')]

9.2 用句点字符匹配换行符

re.DOTALL ：让句点字符匹配所有字符（包括换行符）

import retext = re.compile(r'.*')
match = text.search("123abc\n456def")
print(match.group())
# 123abctext = re.compile(r'.*', re.DOTALL)
match = text.search("123abc\n456def")
print(match.group())
# 123abc\n456def

10. 不区分大小写的匹配

re.I ：不区分大小写

import retext = re.compile(r'abc', re.I)
match = text.findall("abcABC")
print(match)
# ['abc', 'ABC']

11. 用 sub() 方法替换字符串

import retext = re.compile(r'ABC\w*')
match = text.sub("abc", "ABC : 123")
print(match)
# abc : 123

查看全文

99%的人还看了

相似问题

猜你感兴趣

版权申明

本文"第7章模式匹配与正则表达式"：http://eshow365.cn/6-40346-0.html 内容来自互联网，请自行判断内容的正确性。如有侵权请联系我们，立即删除！

上一篇: pytorch文本分类（一）：文本预处理
下一篇: 高效案例检索工具，Alpha案例库智慧检索成为律师检索工具首选

晴海小常识分享

晴海小常识分享

第7章模式匹配与正则表达式

最佳答案问答题库1108位专家为你答疑解惑

目录

1. 不用正则表达式来查找文本模式

2. 用正则表达式来查找文本模式

2.1 创建正则表达式（Regex）对象

2.2 匹配Regex对象

3. 用正则表达式匹配更多模式

3.1 利用括号分组

3.2 用管道匹配多个分组

3.3 用问号实现可选匹配

3.4 用星号匹配零次或多次

3.5 用加号匹配一次或多次

3.6 用花括号匹配特定次数

4. 贪心和非贪心匹配

5. findall() 方法

6. 字符分类

7. 建立自己的字符分类

8. 插入字符和美元字符

9. 通配字符

9.1 用点-星匹配所有字符

9.2 用句点字符匹配换行符

10. 不区分大小写的匹配

11. 用 sub() 方法替换字符串

99%的人还看了

相似问题

猜你感兴趣

版权申明

推荐回答

晴海小常识分享

晴海小常识分享

第7章 模式匹配与正则表达式

最佳答案 问答题库1108位专家为你答疑解惑

目录

1. 不用正则表达式来查找文本模式

2. 用正则表达式来查找文本模式

2.1 创建正则表达式（Regex）对象

2.2 匹配Regex对象

3. 用正则表达式匹配更多模式

3.1 利用括号分组

3.2 用管道匹配多个分组

3.3 用问号实现可选匹配

3.4 用星号匹配零次或多次

3.5 用加号匹配一次或多次

3.6 用花括号匹配特定次数

4. 贪心和非贪心匹配

5. findall() 方法

6. 字符分类

7. 建立自己的字符分类

8. 插入字符和美元字符

9. 通配字符

9.1 用点-星匹配所有字符

9.2 用句点字符匹配换行符

10. 不区分大小写的匹配

11. 用 sub() 方法替换字符串

99%的人还看了

相似问题

猜你感兴趣

版权申明

推荐回答

第7章模式匹配与正则表达式

最佳答案问答题库1108位专家为你答疑解惑