You are here

constant PREG_CLASS_CJK in Drupal 8

Same name and namespace in other branches
  1. 4 modules/search.module \PREG_CLASS_CJK
  2. 5 modules/search/search.module \PREG_CLASS_CJK
  3. 6 modules/search/search.module \PREG_CLASS_CJK
  4. 7 modules/search/search.module \PREG_CLASS_CJK
  5. 9 core/modules/search/search.module \PREG_CLASS_CJK

Matches CJK (Chinese, Japanese, Korean) letter-like characters.

This list is derived from the "East Asian Scripts" section of http://www.unicode.org/charts/index.html, as well as a comment on http://unicode.org/reports/tr11/tr11-11.html listing some character ranges that are reserved for additional CJK ideographs.

The character ranges do not include numbers, punctuation, or symbols, since these are handled separately in search. Note that radicals and strokes are considered symbols. (See http://www.unicode.org/Public/UNIDATA/extracted/DerivedGeneralCategory.txt)

See also

search_expand_cjk()

2 uses of PREG_CLASS_CJK
search_excerpt in core/modules/search/search.module
Returns snippets from a piece of text, with search keywords highlighted.
search_simplify in core/modules/search/search.module
Simplifies and preprocesses text for searching.

File

core/modules/search/search.module, line 63
Enables site-wide keyword searching.

Code

define('PREG_CLASS_CJK', '\\x{1100}-\\x{11FF}\\x{3040}-\\x{309F}\\x{30A1}-\\x{318E}' . '\\x{31A0}-\\x{31B7}\\x{31F0}-\\x{31FF}\\x{3400}-\\x{4DBF}\\x{4E00}-\\x{9FCF}' . '\\x{A000}-\\x{A48F}\\x{A4D0}-\\x{A4FD}\\x{A960}-\\x{A97F}\\x{AC00}-\\x{D7FF}' . '\\x{F900}-\\x{FAFF}\\x{FF21}-\\x{FF3A}\\x{FF41}-\\x{FF5A}\\x{FF66}-\\x{FFDC}' . '\\x{20000}-\\x{2FFFD}\\x{30000}-\\x{3FFFD}');