I’m looking for a RegEx pattern again. It should search a Unicode string, no matter if with russian or arabic letters, for capital letters and then divide it. Only one string without spaces before or after is used as source.
Example:Dim s As String = "helloWorldisACoolThing"
To be split as in: “hello”, “Worldis”, “A”, “Cool”, “Thing”.
\p matches based on Unicode properties. {Ll} is lowercase letters, {Lu} is uppercase. If you have RegExRX, you can see the complete list of scripts and properties.
Looking good for my sample, but if I modify my source to “HhelloWorldisACoolThing” then the pattern doesn’t work correct. Same for “HHHhelloWorldisACoolThing”.
Hi Kem,
I have tested the pattern in detail once. Unfortunately it ignores punctuation marks within a string like dot, comma, semicolon, hyphen, underscore, Numbers etc. (e.g. helloWorldisACoolThing1,;:^"&%). What is missing in the pattern?