Original blog post (reference): https://blog.csdn.net/YZXnuaa/article/details/79346963
<div class="htmledit_views" id="content_views">
<pre>Basic usage of re.findall (returns all substrings of string that match pattern, as a list)
Syntax:
re.findall(pattern, string, flags=0)
import re
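The re.findall function in Python's re module returns all non-overlapping matches of the pattern as a list of substrings.
help(re.findall)        # print the built-in documentation for findall
print(dir(re.findall))  # list the attributes of the function object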
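To make the signature concrete, here is a minimal sketch of how the three parameters fit together. The sample text and variable names are made up for illustration and do not come from the original post; re.IGNORECASE is one of the standard values accepted by the flags parameter.

```python
import re

# Hypothetical sample text, chosen only for this illustration.
text = "Python python PYTHON"

# Default flags=0: the match is case-sensitive.
case_sensitive = re.findall(r"python", text)

# Passing re.IGNORECASE through flags makes the match case-insensitive.
case_insensitive = re.findall(r"python", text, flags=re.IGNORECASE)

print(case_sensitive)    # ['python']
print(case_insensitive)  # ['Python', 'python', 'PYTHON']

# When nothing matches, findall returns an empty list rather than None.
no_match = re.findall(r"java", text)
print(no_match)  # []
```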
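findall returns every match, not just the first one. The r prefix marks a raw string literal, so backslashes in the pattern reach the regex engine unchanged.
print(regular_v1)  # ['docs']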
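The assignment that produced regular_v1 is missing from the listing above. The following is a minimal sketch that reproduces the printed result, assuming the pattern was the literal text docs and the input was the Python documentation URL used elsewhere in this post. Both are assumptions.

```python
import re

# Assumed input: the documentation URL used elsewhere in this post.
url = "https://docs.python.org/3/whatsnew/3.6.html"

# Assumed pattern: a plain literal, matched wherever it occurs in the string.
regular_v1 = re.findall(r"docs", url)
print(regular_v1)  # ['docs']
```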
The ^ symbol anchors the match at the start of the string, so https is returned only when the string begins with https:
print(regular_v2)  # ['https']
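The line that built regular_v2 is also missing. A sketch under the same assumptions (the pattern r"^https" is inferred from the description and the printed output, not quoted from the post), together with a counterexample showing that the anchor rejects a match that is not at the start:

```python
import re

# Assumed input: the documentation URL used elsewhere in this post.
url = "https://docs.python.org/3/whatsnew/3.6.html"

# ^ requires the match to begin at the very start of the string.
regular_v2 = re.findall(r"^https", url)
print(regular_v2)  # ['https']

# Hypothetical counterexample: https is present but not at the start.
not_at_start = re.findall(r"^https", "see " + url)
print(not_at_start)  # []
```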
The $ symbol anchors the match at the end of the string, so html is returned only when the string ends with html:</pre>
regular_v3 = re.findall(r"html$", "https://docs.python.org/3/whatsnew/3.6.html")
print(regular_v3)  # ['html']
[...] matches any single one of the characters listed inside the brackets:
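A short sketch of both rules, the end anchor and the character class. The pattern r"[tw]h" is an assumption chosen to reproduce the ['th', 'wh'] result; the exact pattern from the post did not survive. Note that a comma written inside the brackets would itself be treated as one of the characters to match.

```python
import re

url = "https://docs.python.org/3/whatsnew/3.6.html"

# $ requires the match to end at the end of the string.
ends_with_html = re.findall(r"html$", url)
print(ends_with_html)  # ['html']

# Hypothetical counterexample: a trailing slash means html is no longer at the end.
with_slash = re.findall(r"html$", url + "/")
print(with_slash)  # []

# [tw] matches exactly one character, either t or w (assumed pattern).
regular_v4 = re.findall(r"[tw]h", url)
print(regular_v4)  # ['th', 'wh']
```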
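print(regular_v4)  # ['th', 'wh']
\d is the regex escape that matches a single digit from 0 to 9; findall returns the matches as a list.
print(regular_v5)  # ['3', '3', '6']
print(regular_v6)  # ['123']
Lowercase \d matches digits; uppercase \D matches any character that is not a digit, so everything except the digits is returned.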
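The calls behind these results are not shown, so the following is only a sketch. The patterns are inferred from the descriptions, and the input for the three-digit case ("abc1234") is hypothetical, since the original string for regular_v6 is unknown.

```python
import re

url = "https://docs.python.org/3/whatsnew/3.6.html"

# \d matches one digit at a time.
regular_v5 = re.findall(r"\d", url)
print(regular_v5)  # ['3', '3', '6']

# Three \d in a row match three consecutive digits (hypothetical input).
regular_v6 = re.findall(r"\d\d\d", "abc1234")
print(regular_v6)  # ['123']

# \D matches every character that is not a digit.
regular_v7 = re.findall(r"\D", url)
print("".join(regular_v7))  # https://docs.python.org//whatsnew/..html
```

Joining the \D matches back together shows the URL with its digits removed, which is easier to read than the character-by-character list.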
print(regular_v7)  # ['h', 't', 't', 'p', 's', ':', '/', '/', 'd', 'o', 'c', 's', '.', 'p', 'y', 't', 'h', 'o', 'n', '.', 'o', 'r', 'g', '/', '/', 'w', 'h', 'a', 't', 's', 'n', 'e', 'w', '/', '.', '.', 'h', 't', 'm', 'l']
\w matches word characters: lowercase a to z, uppercase A to Z, the digits 0 to 9, and the underscore.
print(regular_v8)  # ['h', 't', 't', 'p', 's', 'd', 'o', 'c', 's', 'p', 'y', 't', 'h', 'o', 'n', 'o', 'r', 'g', '3', 'w', 'h', 'a', 't', 's', 'n', 'e', 'w', '3', '6', 'h', 't', 'm', 'l']
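A sketch of the \w case, again assuming the same URL as input. The second call uses a made-up string to show that the underscore counts as a word character while the hyphen does not.

```python
import re

url = "https://docs.python.org/3/whatsnew/3.6.html"

# \w matches letters, digits, and the underscore, one character at a time.
regular_v8 = re.findall(r"\w", url)
print("".join(regular_v8))  # httpsdocspythonorg3whatsnew36html

# Hypothetical input: the underscore is matched, the hyphen is skipped.
underscore_demo = re.findall(r"\w", "a_b-c")
print(underscore_demo)  # ['a', '_', 'b', 'c']
```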
\W matches any character that is not a word character, that is, symbols other than letters, digits, and the underscore.
print(regular_v9)  # [':', '/', '/', '.', '.', '/', '/', '/', '.', '.']
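A sketch of the \W case under the same assumption about the input URL. Because \w and \W are complements, every character of the string lands in exactly one of the two result lists, which the last line checks.

```python
import re

url = "https://docs.python.org/3/whatsnew/3.6.html"

# \W matches every character that \w does not.
regular_v9 = re.findall(r"\W", url)
print(regular_v9)  # [':', '/', '/', '.', '.', '/', '/', '/', '.', '.']

# Each character is either a word character or a non-word character.
word_chars = re.findall(r"\w", url)
covers_everything = len(word_chars) + len(regular_v9) == len(url)
print(covers_everything)  # True
```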