Using mb_substr in PHP

mb_substr

  • (PHP 4 >= 4.0.6, PHP 5, PHP 7)
  • mb_substr — Get part of string

Description

string mb_substr ( 
    string $str ,
    int $start [,
    int $length = NULL [, 
    string $encoding = mb_internal_encoding() ]] 
    )
// Performs a multi-byte safe substr() operation based on the number of characters. Positions are counted
// from the beginning of str. The first character's position is 0, the second character's position is 1, and so on.
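The difference from the byte-based substr() is easiest to see on a string that contains multi-byte characters. The sketch below assumes the mbstring extension is loaded and that the source file is saved as UTF-8; the variable names are illustrative only.

```php
<?php
// Assumption: mbstring is enabled and this file is UTF-8 encoded.
$text = "你好, world";

// substr() counts bytes. Each of these Chinese characters takes 3 bytes in
// UTF-8, so taking 2 bytes cuts the first character apart.
$bytes = substr($text, 0, 2);

// mb_substr() counts characters, so it returns two whole characters.
$chars = mb_substr($text, 0, 2, 'UTF-8');

echo $chars, PHP_EOL;                          // 你好
echo strlen($chars), PHP_EOL;                  // 6 (two 3-byte characters)
echo strlen($bytes), PHP_EOL;                  // 2
var_dump(mb_check_encoding($bytes, 'UTF-8'));  // bool(false): truncated sequence
```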

Parameters

str

  • The string to extract the substring from.

start

  • If start is non-negative, the returned string starts at position start in str, counting from zero. For instance, in the string 'abcdef', the character at position 0 is 'a', the character at position 2 is 'c', and so forth.

  • If start is negative, the returned string starts that many characters before the end of str. For instance, in the string 'abcdef', a start of -1 begins at 'f' and a start of -3 begins at 'd'.
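The two cases for start can be checked with the 'abcdef' string used above. This is a minimal sketch, assuming PHP 5.4.8 or later so that a NULL length means "to the end of the string"; the variable names are made up for illustration.

```php
<?php
$str = 'abcdef';

// Non-negative start: counted from the beginning, first character is 0.
$first = mb_substr($str, 0, 1, 'UTF-8');
$third = mb_substr($str, 2, 1, 'UTF-8');

// Negative start: counted back from the end of the string.
$last      = mb_substr($str, -1, null, 'UTF-8');
$lastThree = mb_substr($str, -3, null, 'UTF-8');

// A start past the end of the string yields an empty string.
$beyond = mb_substr($str, 10, null, 'UTF-8');

echo $first, PHP_EOL;      // a
echo $third, PHP_EOL;      // c
echo $last, PHP_EOL;       // f
echo $lastThree, PHP_EOL;  // def
var_dump($beyond);         // string(0) ""
```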

length

  • Maximum number of characters to use from str. If omitted or NULL, all characters up to the end of the string are extracted.
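A short sketch of how length behaves, using the same sample string as the Examples section. The negative-length case is not described in the parameter text above; it is shown here because the examples in this article rely on it (a negative length leaves off that many characters from the end). Variable names are illustrative.

```php
<?php
$str = '0123456789你好';   // 12 characters

// Positive length: at most that many characters.
$three = mb_substr($str, 8, 3, 'UTF-8');

// NULL length (PHP >= 5.4.8): everything up to the end of the string.
$toEnd = mb_substr($str, 8, null, 'UTF-8');

// A length larger than what is left is simply capped at the end.
$capped = mb_substr($str, 8, 100, 'UTF-8');

// Negative length: stop that many characters before the end.
$dropLast = mb_substr($str, 8, -1, 'UTF-8');

echo $three, PHP_EOL;     // 89你
echo $toEnd, PHP_EOL;     // 89你好
echo $capped, PHP_EOL;    // 89你好
echo $dropLast, PHP_EOL;  // 89你
```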

encoding

  • The character encoding of str. If omitted, the internal character encoding (see mb_internal_encoding()) is used.
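Because the default depends on the runtime configuration, passing the encoding explicitly makes the result predictable. The sketch below contrasts 'UTF-8' with the '8bit' encoding, which treats every byte as one character; it assumes the source file is saved as UTF-8, and the variable names are illustrative.

```php
<?php
$str = '你好';

// Interpreted as UTF-8, the string has 2 characters.
$firstChar = mb_substr($str, 0, 1, 'UTF-8');

// Interpreted as '8bit', every byte is a character, so this is the first byte only.
$firstByte = mb_substr($str, 0, 1, '8bit');

echo $firstChar, PHP_EOL;               // 你
echo bin2hex($firstByte), PHP_EOL;      // e4
echo mb_strlen($str, 'UTF-8'), PHP_EOL; // 2
echo mb_strlen($str, '8bit'), PHP_EOL;  // 6
```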

Return Values

  • mb_substr() returns the portion of str specified by the start and length parameters.

Changelog

  • 5.4.8 - Passing NULL as length extracts all characters to the end of the string. Prior to this version NULL was treated the same as 0.

Examples

<?php
/**
 * Created by PhpStorm.
 * User: zhangrongxiang
 * Date: 2018/1/30
 * Time: 8:51 PM
 */

$string = "0123456789你好";
/** start > 0  length > 0*/
$mystring = mb_substr( $string, 5, 1 );
echo $mystring . PHP_EOL; // 5
$mystring = mb_substr( $string, 5, 2 );
echo $mystring . PHP_EOL; // 56
$mystring = mb_substr( $string, 10, 2 );
echo $mystring . PHP_EOL; // 你好

/** start < 0  length > 0*/
$mystring = mb_substr( $string, - 2, 2 );
echo $mystring . PHP_EOL; // 你好
echo 'mb_strlen : ' . mb_strlen( $string ) . PHP_EOL;//12
$mystring = mb_substr( $string, - mb_strlen( $string ), 2 );
echo $mystring . PHP_EOL; // 01
$mystring = mb_substr( $string, - 3, 2 );
echo $mystring . PHP_EOL; // 9你

/** start > 0  length <  0*/
$mystring = mb_substr( $string, 5, - 1 );
echo $mystring . PHP_EOL; // 56789你
$mystring = mb_substr( $string, 0, - mb_strlen( $string ) + 1 );
echo $mystring . PHP_EOL; // 0
$mystring = mb_substr( $string, 5, - 5 );
echo $mystring . PHP_EOL; // 56

/** start < 0  length <  0*/
$mystring = mb_substr( $string, - 10, - 1 );
echo $mystring . PHP_EOL; // 23456789你
$mystring = mb_substr( $string, - 5, - 1 );
echo $mystring . PHP_EOL; // 789你

function mb_ucfirst( $str, $enc = 'utf-8' ) {
    return mb_strtoupper( mb_substr( $str, 0, 1, $enc ), $enc ) . mb_substr( $str, 1, mb_strlen( $str, $enc ), $enc );
}

echo mb_ucfirst( "hello world 你好 中国" ) . PHP_EOL; //Hello world 你好 中国

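/**
 * Report which classes of characters (digits, letters, currency signs, ...) occur in a string.
 *
 * @param string $string   The string to inspect.
 * @param string $encoding Character encoding of $string.
 *
 * @return array Map of class name to class name, one entry per class found.
 */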
function get_character_classes( $string, $encoding = "UTF-8" ) {
    $current_encoding = mb_internal_encoding();
    mb_internal_encoding( $encoding );
    $has          = array();
    $stringlength = mb_strlen( $string, $encoding );
    for ( $i = 0; $i < $stringlength; $i ++ ) {
        $c = mb_substr( $string, $i, 1, $encoding );
        if ( ( $c >= "0" ) && ( $c <= "9" ) ) {
            $has['numeric'] = "numeric";
        } else if ( ( $c >= "a" ) && ( $c <= "z" ) ) {
            $has['alpha']      = "alpha";
            $has['alphalower'] = 'alphalower';
        } else if ( ( $c >= "A" ) && ( $c <= "Z" ) ) {
            $has['alpha']      = "alpha";
            $has['alphaupper'] = "alphaupper";
        } else if ( ( $c == "$" ) || ( $c == "£" ) ) {
            $has['currency'] = "currency";
        } else if ( ( $c == "." ) && isset( $has['decimal'] ) ) {
            $has['decimals'] = "decimals";
        } else if ( $c == "." ) {
            $has['decimal'] = "decimal";
        } else if ( $c == "," ) {
            $has['comma'] = "comma";
        } else if ( $c == "-" ) {
            $has['dash'] = "dash";
        } else if ( $c == " " ) {
            $has['space'] = "space";
        } else if ( $c == "/" ) {
            $has['slash'] = "slash";
        } else if ( $c == ":" ) {
            $has['colon'] = "colon";
        } else if ( ( $c >= " " ) && ( $c <= "~" ) ) {
            $has['ascii'] = "ascii";
        } else {
            $has['binary'] = "binary";
        }
    }
    mb_internal_encoding( $current_encoding );
    
    return $has;
}

$string = "1234asdfA£^_{}|}~žščř";
foreach ( get_character_classes( $string ) as $k => $v ) {
    echo $k . " : " . $v . PHP_EOL;
}
//numeric : numeric
//alpha : alpha
//alphalower : alphalower
//alphaupper : alphaupper
//currency : currency
//ascii : ascii
//binary : binary
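
The loop in get_character_classes() calls mb_substr() once per character. An alternative, sketched here under the assumption that the input is valid UTF-8 and that PHP 7 or later is available, is to split the string into characters once and iterate over the resulting array. The helper name utf8_chars is hypothetical and not part of the original example.

```php
<?php
/**
 * Hypothetical helper (not from the original article): split a UTF-8
 * string into an array of single characters.
 */
function utf8_chars(string $string): array
{
    // The /u modifier makes PCRE treat the subject as UTF-8, so the empty
    // pattern matches between characters rather than between bytes.
    $chars = preg_split('//u', $string, -1, PREG_SPLIT_NO_EMPTY);

    // preg_split() returns false on failure, e.g. for malformed UTF-8.
    return $chars === false ? [] : $chars;
}

$chars = utf8_chars('9你好ž');
echo count($chars), PHP_EOL;         // 4
echo implode('|', $chars), PHP_EOL;  // 9|你|好|ž
```

On PHP 7.4 and later, the built-in mb_str_split() serves the same purpose.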
