function decode_entities in Drupal 4

Same name and namespace in other branches

5 includes/unicode.inc \decode_entities()
6 includes/unicode.inc \decode_entities()
7 includes/unicode.inc \decode_entities()

Decode all HTML entities (including numerical ones) to regular UTF-8 bytes. Double-escaped entities will only be decoded once ("&lt;" becomes "<", not "<").

Parameters

$text: The text to decode entities in.

$exclude: An array of characters which should not be decoded. For example, array('<', '&', '"'). This affects both named and numerical entities.

5 calls to decode_entities()

db_connect in includes/database.pgsql.inc: Initialize a database connection.
filter_xss_bad_protocol in modules/filter.module: Processes an HTML attribute value and ensures it does not contain an URL with a disallowed protocol (e.g. javascript:)
format_rss_channel in includes/common.inc: Formats an RSS channel.
search_simplify in modules/search.module: Simplifies a string according to indexing rules.
_comment_form_submit in modules/comment.module

File

includes/unicode.inc, line 276

Code

function decode_entities($text, $exclude = array()) {
  static $table;

  // We store named entities in a table for quick processing.
  if (!isset($table)) {

    // Get all named HTML entities.
    $table = array_flip(get_html_translation_table(HTML_ENTITIES));

    // PHP gives us ISO-8859-1 data, we need UTF-8.
    $table = array_map('utf8_encode', $table);

    // Add apostrophe (XML)
    $table['&apos;'] = "'";
  }
  $newtable = array_diff($table, $exclude);

  // Use a regexp to select all entities in one pass, to avoid decoding double-escaped entities twice.
  return preg_replace('/&(#x?)?([A-Za-z0-9]+);/e', '_decode_entities("$1", "$2", "$0", $newtable, $exclude)', $text);
}

You are here

function decode_entities in Drupal 4

Parameters

File

Code

API Navigation