function drupal_substr in Drupal 6
Same name and namespace in other branches
- 4 includes/unicode.inc \drupal_substr()
- 5 includes/unicode.inc \drupal_substr()
- 7 includes/unicode.inc \drupal_substr()
Cut off a piece of a string based on character indices and counts. Follows the same behavior as PHP's own substr() function.
Note that for cutting off a string at a known character/substring location, the usage of PHP's normal strpos/substr is safe and much faster.
5 calls to drupal_substr()
- blogapi_metaweblog_new_media_object in modules/
blogapi/ blogapi.module - Blogging API callback. Inserts a file into Drupal.
- drupal_ucfirst in includes/
unicode.inc - Capitalize the first letter of a UTF-8 string.
- search_expand_cjk in modules/
search/ search.module - Basic CJK tokenizer. Simply splits a string into consecutive, overlapping sequences of characters ('minimum_word_size' long).
- theme_username in includes/
theme.inc - Format a username.
- truncate_utf8 in includes/
unicode.inc - Truncate a UTF-8-encoded string safely to a number of characters.
File
- includes/
unicode.inc, line 475
Code
function drupal_substr($text, $start, $length = NULL) {
global $multibyte;
if ($multibyte == UNICODE_MULTIBYTE) {
return $length === NULL ? mb_substr($text, $start) : mb_substr($text, $start, $length);
}
else {
$strlen = strlen($text);
// Find the starting byte offset
$bytes = 0;
if ($start > 0) {
// Count all the continuation bytes from the start until we have found
// $start characters
$bytes = -1;
$chars = -1;
while ($bytes < $strlen && $chars < $start) {
$bytes++;
$c = ord($text[$bytes]);
if ($c < 0x80 || $c >= 0xc0) {
$chars++;
}
}
}
else {
if ($start < 0) {
// Count all the continuation bytes from the end until we have found
// abs($start) characters
$start = abs($start);
$bytes = $strlen;
$chars = 0;
while ($bytes > 0 && $chars < $start) {
$bytes--;
$c = ord($text[$bytes]);
if ($c < 0x80 || $c >= 0xc0) {
$chars++;
}
}
}
}
$istart = $bytes;
// Find the ending byte offset
if ($length === NULL) {
$bytes = $strlen - 1;
}
else {
if ($length > 0) {
// Count all the continuation bytes from the starting index until we have
// found $length + 1 characters. Then backtrack one byte.
$bytes = $istart;
$chars = 0;
while ($bytes < $strlen && $chars < $length) {
$bytes++;
$c = ord($text[$bytes]);
if ($c < 0x80 || $c >= 0xc0) {
$chars++;
}
}
$bytes--;
}
else {
if ($length < 0) {
// Count all the continuation bytes from the end until we have found
// abs($length) characters
$length = abs($length);
$bytes = $strlen - 1;
$chars = 0;
while ($bytes >= 0 && $chars < $length) {
$c = ord($text[$bytes]);
if ($c < 0x80 || $c >= 0xc0) {
$chars++;
}
$bytes--;
}
}
}
}
$iend = $bytes;
return substr($text, $istart, max(0, $iend - $istart + 1));
}
}