function _biblio_get_utf8_regex in Bibliography Module 6.2
Same name and namespace in other branches
- 5 biblio.module \_biblio_get_utf8_regex()
- 6 biblio_theme.inc \_biblio_get_utf8_regex()
- 7 includes/biblio_theme.inc \_biblio_get_utf8_regex()
- 7.2 includes/biblio.theme.inc \_biblio_get_utf8_regex()
1 call to _biblio_get_utf8_regex()
- _biblio_get_regex_patterns in includes/
biblio_theme.inc
File
- includes/
biblio_theme.inc, line 427
Code
function _biblio_get_utf8_regex() {
// Matches Unicode letters & digits:
$alnum = "\\p{Ll}\\p{Lu}\\p{Lt}\\p{Lo}\\p{Nd}";
// Unicode-aware equivalent of "[:alnum:]"
// Matches Unicode letters:
$alpha = "\\p{Ll}\\p{Lu}\\p{Lt}\\p{Lo}";
// Unicode-aware equivalent of "[:alpha:]"
// Matches Unicode control codes & characters not in other categories:
$cntrl = "\\p{C}";
// Unicode-aware equivalent of "[:cntrl:]"
// Matches Unicode dashes & hyphens:
$dash = "\\p{Pd}";
// Matches Unicode digits:
$digit = "\\p{Nd}";
// Unicode-aware equivalent of "[:digit:]"
// Matches Unicode printing characters (excluding space):
$graph = "^\\p{C}\t\n\f\r\\p{Z}";
// Unicode-aware equivalent of "[:graph:]"
// Matches Unicode lower case letters:
$lower = "\\p{Ll}\\p{M}";
// Unicode-aware equivalent of "[:lower:]"
// Matches Unicode printing characters (including space):
$print = "\\P{C}";
// same as "^\p{C}", Unicode-aware equivalent of "[:print:]"
// Matches Unicode punctuation (printing characters excluding letters & digits):
$punct = "\\p{P}";
// Unicode-aware equivalent of "[:punct:]"
// Matches Unicode whitespace (separating characters with no visual representation):
$space = "\t\n\f\r\\p{Z}";
// Unicode-aware equivalent of "[:space:]"
// Matches Unicode upper case letters:
$upper = "\\p{Lu}\\p{Lt}";
// Unicode-aware equivalent of "[:upper:]"
// Matches Unicode "word" characters:
$word = "_\\p{Ll}\\p{Lu}\\p{Lt}\\p{Lo}\\p{Nd}";
// Unicode-aware equivalent of "[:word:]" (or "[:alnum:]" plus "_")
// Defines the PCRE pattern modifier(s) to be used in conjunction with the above variables:
// More info: <http://www.php.net/manual/en/reference.pcre.pattern.modifiers.php>
$patternModifiers = "u";
// the "u" (PCRE_UTF8) pattern modifier causes PHP/PCRE to treat pattern strings as UTF-8
return array(
$alnum,
$alpha,
$cntrl,
$dash,
$digit,
$graph,
$lower,
$print,
$punct,
$space,
$upper,
$word,
$patternModifiers,
);
}