Python - 受约束搜索

搜索结果出来后，我们经常需要对现有搜索结果的某一部分进行更深层次的搜索。例如，在给定的文本主体中，我们的目标是获取网址，并提取网址的不同部分（如协议、域名等）。在这种情况下，我们需要借助分组函数，该函数可根据分配的正则表达式将搜索结果划分为不同的组。我们通过使用圆括号将可搜索部分与需要匹配的固定单词分离开来，创建此类组表达式。

import re
text = "The web address is https://tutorialspoint.com"

# Taking "://" and "." to separate the groups 
result = re.search('([\w.-]+)://([\w.-]+)\.([\w.-]+)', text)
if result :
    print "The main web Address: ",result.group()
    print "The protocol: ",result.group(1)
    print "The doman name: ",result.group(2) 
    print "The TLD: ",result.group(3)

当我们运行上述程序时，会得到以下输出：

The main web Address:  https://tutorialspoint.com
The protocol:  https
The doman name:  www.tutorialspoint
The TLD:  com

打印页面

上一个

下一个