
<pre><code>&gt;&gt;&gt; re.split('(\W)', 'foo/bar spam\neggs')
['foo', '/', 'bar', ' ', 'spam', '\n', 'eggs']
</code></pre>
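As a small sketch of why the capturing group matters (the variable names here are only for illustration): when the pattern contains a capturing group, `re.split` also returns the text matched by that group, so nothing is dropped and the pieces join back into the input.

```python
import re

text = 'foo/bar spam\neggs'

# Without a capturing group the separators are discarded.
without_group = re.split(r'\W', text)

# With a capturing group the separators come back as list items.
with_group = re.split(r'(\W)', text)

print(without_group)  # ['foo', 'bar', 'spam', 'eggs']
print(with_group)     # ['foo', '/', 'bar', ' ', 'spam', '\n', 'eggs']

# Nothing was dropped, so the pieces rebuild the original string.
assert ''.join(with_group) == text
```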


If you are splitting on newline, use <a href="https://docs.python.org/3/library/stdtypes.html#str.splitlines" rel="noreferrer"><code>splitlines(True)</code></a>.

<pre><code>&gt;&gt;&gt; 'line 1\nline 2\nline without newline'.splitlines(True)
['line 1\n', 'line 2\n', 'line without newline']
</code></pre>

(Not a general solution, but adding this here in case someone comes here not realizing this method existed.)
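For illustration, a short sketch using the keyword form `keepends=True` (equivalent to passing `True` positionally). The sample text is made up; it mixes `\n` and `\r\n` to show that each line ending is kept exactly as it was.

```python
text = 'line 1\nline 2\r\nline without newline'

# keepends=True leaves each line terminator attached to its line.
lines = text.splitlines(keepends=True)
print(lines)  # ['line 1\n', 'line 2\r\n', 'line without newline']

# The pieces join back into the original text.
assert ''.join(lines) == text
```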


Another example: split on non-alphanumeric characters and keep the separators.

<pre><code>import re
a = "foo,bar@candy*ice%cream"
re.split('([^a-zA-Z0-9])',a)
</code></pre>

Output:

<pre><code>['foo', ',', 'bar', '@', 'candy', '*', 'ice', '%', 'cream']
</code></pre>

Explanation:

<pre><code>re.split('([^a-zA-Z0-9])',a)

()         &lt;- capturing group: keeps the separators in the result
[]         &lt;- character class: matches any single character listed inside
^a-zA-Z0-9 &lt;- negation: everything except letters (upper/lower case) and digits
</code></pre>
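One detail worth knowing, sketched below with a made-up input string: the pattern above matches one character at a time, so two adjacent separators leave an empty string between them. Adding `+` after the character class is one possible variation that captures a whole run of separators as a single item.

```python
import re

a = "foo, bar@@candy"

# One character per separator: adjacent separators leave '' between them.
single = re.split(r'([^a-zA-Z0-9])', a)
print(single)   # ['foo', ',', '', ' ', 'bar', '@', '', '@', 'candy']

# With +, a run of separator characters is captured as one item.
grouped = re.split(r'([^a-zA-Z0-9]+)', a)
print(grouped)  # ['foo', ', ', 'bar', '@@', 'candy']

# Either way the pieces rebuild the original string.
assert ''.join(single) == ''.join(grouped) == a
```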


If you have only one separator, you can use list comprehensions:

<pre><code>text = 'foo,bar,baz,qux' 
sep = ','
</code></pre>

Appending/prepending separator:

<pre><code>result = [x+sep for x in text.split(sep)]
#['foo,', 'bar,', 'baz,', 'qux,']
# to get rid of the trailing separator
result[-1] = result[-1].strip(sep)
#['foo,', 'bar,', 'baz,', 'qux']

result = [sep+x for x in text.split(sep)]
#[',foo', ',bar', ',baz', ',qux']
# to get rid of the leading separator
result[0] = result[0].strip(sep)
#['foo', ',bar', ',baz', ',qux']
</code></pre>

Separator as its own element:

<pre><code>result = [u for x in text.split(sep) for u in (x, sep)]
#['foo', ',', 'bar', ',', 'baz', ',', 'qux', ',']
result = result[:-1]  # to get rid of the trailing separator
</code></pre>


Another no-regex solution that works well on Python 3

<pre><code># Split strings and keep separator
test_strings = ['&lt;Hello&gt;', 'Hi', '&lt;Hi&gt; &lt;Planet&gt;', '&lt;', '']

def split_and_keep(s, sep):
    if not s: return ['']  # consistent with string.split()

    # Find replacement character that is not used in string
    # i.e. just use the highest available character plus one
    # Note: This fails if ord(max(s)) = 0x10FFFF (ValueError)
    p = chr(ord(max(s)) + 1)

    return s.replace(sep, sep + p).split(p)

for s in test_strings:
    print(split_and_keep(s, '&lt;'))


# If the unicode limit is reached it will fail explicitly
unicode_max_char = chr(1114111)
ridiculous_string = '&lt;Hello&gt;'+unicode_max_char+'&lt;World&gt;'
print(split_and_keep(ridiculous_string, '&lt;'))
</code></pre>


You can also split a string with an array of strings instead of a regular expression, like this:
<pre><code>def tokenizeString(aString, separators):
    # separators is a list of strings used to split the string.
    # Sort separators by ascending length; the loop below keeps the last
    # match, so the longest matching separator wins.
    separators.sort(key=len)
    listToReturn = []
    i = 0
    while i &lt; len(aString):
        theSeparator = &quot;&quot;
        for current in separators:
            if current == aString[i:i+len(current)]:
                theSeparator = current
        if theSeparator != &quot;&quot;:
            listToReturn += [theSeparator]
            i = i + len(theSeparator)
        else:
            if listToReturn == []:
                listToReturn = [&quot;&quot;]
            if(listToReturn[-1] in separators):
                listToReturn += [&quot;&quot;]
            listToReturn[-1] += aString[i]
            i += 1
    return listToReturn
 

print(tokenizeString(aString = &quot;\&quot;\&quot;\&quot;hi\&quot;\&quot;\&quot; hello + world += (1*2+3/5) '''hi'''&quot;, separators = [&quot;'''&quot;, '+=', '+', &quot;/&quot;, &quot;*&quot;, &quot;\\'&quot;, '\\&quot;', &quot;-=&quot;, &quot;-&quot;, &quot; &quot;, '&quot;&quot;&quot;', &quot;(&quot;, &quot;)&quot;]))
</code></pre>


<pre><code># This keeps all separators in the result
import re
st = "%%(c+dd+e+f-1523)%%7"
sh = re.compile(r'[+\-/*&lt;&gt;%]')

def splitStringFull(sh, st):
    ls = sh.split(st)
    lo = []
    start = 0
    for l in ls:
        if not l:
            continue
        k = st.find(l, start)  # search from the current position, not from 0
        llen = len(l)
        if k &gt; start:
            lo.append(st[start:k])  # the separators skipped before this piece
        lo.append(l)
        start = k + llen
    if start &lt; len(st):
        lo.append(st[start:])  # trailing separators
    return lo

li = splitStringFull(sh, st)
# li == ['%%', '(c', '+', 'dd', '+', 'e', '+', 'f', '-', '1523)', '%%', '7']
</code></pre>


A lazy and simple solution

Assume your regex pattern is <code>split_pattern = r'(!|\?)'</code>.

First, append a marker string that does not occur in your text, such as '[cut]', after each separator:

<code>new_string = re.sub(split_pattern, '\\1[cut]', your_string)</code>

Then split on the marker: <code>new_string.split('[cut]')</code>
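Put together, the two steps might look like the sketch below. The sample sentence and the `marker` variable are assumptions made for this example, and the approach only works if the marker never occurs in the input.

```python
import re

split_pattern = r'(!|\?)'
marker = '[cut]'  # assumed not to occur anywhere in the input
your_string = 'Hello! How are you? Fine'

# Step 1: re-insert each matched separator followed by the marker.
new_string = re.sub(split_pattern, r'\1' + marker, your_string)

# Step 2: split on the marker; each separator stays attached to the piece before it.
parts = new_string.split(marker)
print(parts)  # ['Hello!', ' How are you?', ' Fine']

assert ''.join(parts) == your_string
```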


<ol>
<li>replace all <code>seperator: (\W)</code> with <code>seperator + new_seperator: (\W;)</code></li>
<li>split by the <code>new_seperator: (;)</code></li>
</ol>

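<pre class="lang-py prettyprint-override"><code>import re

def split_and_keep(seperator, s):
    # Append ';' to every match, then split on ';' (assumes s contains no ';').
    return re.split(';', re.sub(seperator, lambda match: match.group() + ';', s))

print(split_and_keep(r'\W', 'foo/bar spam\neggs'))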
</code></pre>


Here is a simple <code>.split</code> solution that works without regex.
This is an answer to <a href="https://stackoverflow.com/questions/7866128/python-split-without-removing-the-delimiter">Python split() without removing the delimiter</a>, so it is not exactly what the original post asks, but the other question was closed as a duplicate of this one.
<pre class="lang-py prettyprint-override"><code>def splitkeep(s, delimiter):
    split = s.split(delimiter)
    return [substr + delimiter for substr in split[:-1]] + [split[-1]]
</code></pre>
Random tests:
<pre class="lang-py prettyprint-override"><code>import random

CHARS = [&quot;.&quot;, &quot;a&quot;, &quot;b&quot;, &quot;c&quot;]
assert splitkeep(&quot;&quot;, &quot;X&quot;) == [&quot;&quot;] # 0 length test
for delimiter in ('.', '..'):
    for _ in range(100000):
        length = random.randint(1, 50)
        s = &quot;&quot;.join(random.choice(CHARS) for _ in range(length))
        assert &quot;&quot;.join(splitkeep(s, delimiter)) == s
</code></pre>


To split a string with a regex and keep the separators without using a capturing group:

<pre><code>import re

def finditer_with_separators(regex, s):
    matches = []
    prev_end = 0
    for match in regex.finditer(s):
        match_start = match.start()
        if (prev_end != 0 or match_start &gt; 0) and match_start != prev_end:
            matches.append(s[prev_end:match.start()])
        matches.append(match.group())
        prev_end = match.end()
    if prev_end &lt; len(s):
        matches.append(s[prev_end:])
    return matches

# the separator characters go inside the brackets
regex = re.compile(r"[]")
matches = finditer_with_separators(regex, s)
</code></pre>

If the regex is already wrapped in a capturing group:

<pre><code>def split_with_separators(regex, s):
    matches = list(filter(None, regex.split(s)))
    return matches

regex = re.compile(r"([])")
matches = split_with_separators(regex, s)
</code></pre>

Both approaches also drop empty strings from the result, which are useless and annoying in most cases.
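To make the point about empty strings concrete, here is a small sketch. The pattern `([,;])` and the input are hypothetical, chosen only for this example. `re.split` yields an empty string wherever a separator sits at the start of the input or directly after another separator, and `filter(None, ...)` drops those.

```python
import re

s = ',a;;b'
regex = re.compile(r'([,;])')  # hypothetical pattern for this example only

# Raw split: '' appears before the leading ',' and between the two ';'.
raw = regex.split(s)
print(raw)      # ['', ',', 'a', ';', '', ';', 'b']

# filter(None, ...) keeps only the non-empty items.
cleaned = list(filter(None, raw))
print(cleaned)  # [',', 'a', ';', ';', 'b']

assert ''.join(cleaned) == s
```

After filtering, text and separators no longer strictly alternate, so code that relies on even and odd positions should work on the unfiltered list.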


May I just leave this here:
<pre><code>s = 'foo/bar spam\neggs'
print(s.replace('/', '+++/+++').replace(' ', '+++ +++').replace('\n', '+++\n+++').split('+++'))

['foo', '/', 'bar', ' ', 'spam', '\n', 'eggs']
</code></pre>


I had a similar issue trying to split a file path and struggled to find a simple answer.
This worked for me and didn't involve having to substitute delimiters back into the split text:

<code>my_path = 'folder1/folder2/folder3/file1'</code>

<code>import re</code>

<code>re.findall('[^/]+/|[^/]+', my_path)</code>

returns:

<code>['folder1/', 'folder2/', 'folder3/', 'file1']</code>
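One caveat, sketched below: with the pattern above, a leading `/` (as in an absolute path) matches neither alternative and is silently dropped. The variant pattern `[^/]*/|[^/]+` is a suggestion of mine, not part of the original answer; it lets a slash match on its own so that the pieces still rebuild the input.

```python
import re

my_path = '/usr/local/bin'

# Original pattern: the leading '/' is skipped.
original = re.findall(r'[^/]+/|[^/]+', my_path)
print(original)  # ['usr/', 'local/', 'bin']

# Variant (an assumption for this sketch): zero or more non-slashes before '/'.
variant = re.findall(r'[^/]*/|[^/]+', my_path)
print(variant)   # ['/', 'usr/', 'local/', 'bin']

assert ''.join(variant) == my_path
```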


I found this generator-based approach more satisfying:

<pre><code>def split_keep(string, sep):
    """Usage:
    &gt;&gt;&gt; list(split_keep("a.b.c.d", "."))
    ['a.', 'b.', 'c.', 'd']
    """
    start = 0
    while True:
        end = string.find(sep, start) + 1
        if end == 0:
            break
        yield string[start:end]
        start = end
    yield string[start:]
</code></pre>

It avoids the need to figure out the correct regex and should, in theory, be fairly cheap: it makes no intermediate copies of the whole string and delegates most of the iteration work to the efficient <code>find</code> method.

... and in Python 3.8 it can be as short as:

<pre><code>def split_keep(string, sep):
    start = 0
    while (end := string.find(sep, start) + 1) &gt; 0:
        yield string[start:end]
        start = end
    yield string[start:]
</code></pre>
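Note that `find(...) + 1` advances by a single character, so the generator above assumes a one-character separator. A possible adaptation for longer separators is sketched below; the name `split_keep_multi` is hypothetical, and this is one way to do it, not the original author's code.

```python
def split_keep_multi(string, sep):
    """Yield pieces of string, each ending with sep (except possibly the last)."""
    if not sep:
        raise ValueError("empty separator")
    start = 0
    while True:
        idx = string.find(sep, start)
        if idx == -1:
            break
        end = idx + len(sep)  # step past the whole separator
        yield string[start:end]
        start = end
    yield string[start:]


print(list(split_keep_multi("a--b--c", "--")))  # ['a--', 'b--', 'c']
print(list(split_keep_multi("a.b.", ".")))      # ['a.', 'b.', '']
```

As with the original, a string that ends with the separator yields a final empty string.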

Here's the simplest way to explain this. Here's what I'm using:

<pre><code>re.split('\W', 'foo/bar spam\neggs')
-&gt; ['foo', 'bar', 'spam', 'eggs']
</code></pre>

Here's what I want:

<pre><code>someMethod('\W', 'foo/bar spam\neggs')
-&gt; ['foo', '/', 'bar', ' ', 'spam', '\n', 'eggs']
</code></pre>

The reason is that I want to split a string into tokens, manipulate it, then put it back together again.

In Python, how do I split a string and keep the separators?
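For context, the split, manipulate, rejoin workflow described above could look like the following sketch. It uses `re.split` with a capturing group; upper-casing the words is only a stand-in for whatever manipulation is actually needed.

```python
import re

text = 'foo/bar spam\neggs'

# The capturing group makes re.split return the separators too.
tokens = re.split(r'(\W)', text)

# With a single capturing group, words sit at even indices and
# separators at odd indices, so only the words are changed here.
changed = [tok.upper() if i % 2 == 0 else tok for i, tok in enumerate(tokens)]

result = ''.join(changed)
print(repr(result))  # 'FOO/BAR SPAM\nEGGS'
```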
