public final class UTF16BEEncoding extends UnicodeEncoding
| Modifier and Type | Field and Description |
|---|---|
protected static CaseFoldCodeItem[] |
EMPTY_FOLD_CODES |
static UTF16BEEncoding |
INSTANCE |
| Modifier | Constructor and Description |
|---|---|
protected |
UTF16BEEncoding() |
| Modifier and Type | Method and Description |
|---|---|
protected void |
asciiApplyAllCaseFold(int flag,
ApplyAllCaseFoldFunction fun,
Object arg) |
protected CaseFoldCodeItem[] |
asciiCaseFoldCodesByString(int flag,
byte[] bytes,
int p,
int end) |
protected int |
asciiMbcCaseFold(int flag,
byte[] bytes,
IntHolder pp,
int end,
byte[] lower) |
int |
codeToMbc(int code,
byte[] bytes,
int p)
Extracts code point into it's multibyte representation
|
int |
codeToMbcLength(int code)
Returns character length given a code point
Oniguruma equivalent:
code_to_mbclen |
int[] |
ctypeCodeRange(int ctype,
IntHolder sbOut)
onigenc_utf16_32_get_ctype_code_range
|
protected boolean |
isCodeCTypeInternal(int code,
int ctype)
ONIGENC_IS_XXXXXX_CODE_CTYPE
|
boolean |
isNewLine(byte[] bytes,
int p,
int end)
onigenc_is_mbc_newline_0x0a / used also by multibyte encodings
|
boolean |
isReverseMatchAllowed(byte[] bytes,
int p,
int end)
Returns true if it's safe to use reversal Boyer-Moore search fail fast algorithm
Oniguruma equivalent:
is_allowed_reverse_match |
int |
leftAdjustCharHead(byte[] bytes,
int p,
int s,
int end)
Seeks the previous character head in a stream
Oniguruma equivalent:
left_adjust_char_head |
int |
length(byte[] bytes,
int p,
int end)
Returns character length given stream, character position and stream end
returns
1 for singlebyte encodings or performs sanity validations for multibyte ones
and returns the character length, missing characters in the stream otherwise |
int |
mbcCaseFold(int flag,
byte[] bytes,
IntHolder pp,
int end,
byte[] fold)
onigenc_ascii_mbc_case_fold
|
int |
mbcToCode(byte[] bytes,
int p,
int end)
Returns code point for a character
Oniguruma equivalent:
mbc_to_code |
applyAllCaseFold, caseFoldCodesByString, ctypeCodeRange, getCharsetName, isCodeCType, propertyNameToCTypelength, lengthForTwoUptoFour, mb2CodeToMbc, mb2CodeToMbcLength, mb2IsCodeCType, mb4CodeToMbc, mb4CodeToMbcLength, mb4IsCodeCType, mbnMbcCaseFold, mbnMbcToCode, missing, missing, safeLengthForUptoFour, safeLengthForUptoFourGreatedThan127, safeLengthForUptoThree, safeLengthForUptoTwo, strCodeAt, strLengthasciiToLower, asciiToUpper, digitVal, equals, getCharset, getIndex, getName, hashCode, isAlnum, isAlpha, isAscii, isAscii, isAsciiCompatible, isBlank, isCntrl, isDigit, isDummy, isFixedWidth, isGraph, isLower, isMbcAscii, isMbcCrnl, isMbcHead, isMbcWord, isNewLine, isPrint, isPunct, isSbWord, isSingleByte, isSpace, isUnicode, isUpper, isUTF8, isWord, isWordGraphPrint, isXDigit, load, maxLength, maxLengthDistance, mbcodeStartPosition, minLength, odigitVal, prevCharHead, rightAdjustCharHead, rightAdjustCharHeadWithPrev, setDummy, setName, setName, step, stepBack, strByteLengthNull, strLengthNull, strNCmp, toLowerCaseTable, toString, xdigitValpublic static final UTF16BEEncoding INSTANCE
protected static final CaseFoldCodeItem[] EMPTY_FOLD_CODES
public int length(byte[] bytes,
int p,
int end)
Encoding1 for singlebyte encodings or performs sanity validations for multibyte ones
and returns the character length, missing characters in the stream otherwisepublic boolean isNewLine(byte[] bytes,
int p,
int end)
public int mbcToCode(byte[] bytes,
int p,
int end)
Encodingmbc_to_codepublic int codeToMbcLength(int code)
Encodingcode_to_mbclencodeToMbcLength in class Encodingpublic int codeToMbc(int code,
byte[] bytes,
int p)
Encodingpublic int mbcCaseFold(int flag,
byte[] bytes,
IntHolder pp,
int end,
byte[] fold)
mbcCaseFold in class UnicodeEncodingflag - case fold flagpp - an IntHolder that points at character headfold - a buffer where to extract case folded character
Oniguruma equivalent: mbc_case_foldpublic int[] ctypeCodeRange(int ctype,
IntHolder sbOut)
ctypeCodeRange in class Encodingpublic int leftAdjustCharHead(byte[] bytes,
int p,
int s,
int end)
Encodingleft_adjust_char_headleftAdjustCharHead in class Encodingbytes - byte streamp - positions - stopend - endpublic boolean isReverseMatchAllowed(byte[] bytes,
int p,
int end)
Encodingis_allowed_reverse_matchisReverseMatchAllowed in class Encodingprotected final boolean isCodeCTypeInternal(int code,
int ctype)
protected final int asciiMbcCaseFold(int flag,
byte[] bytes,
IntHolder pp,
int end,
byte[] lower)
protected final void asciiApplyAllCaseFold(int flag,
ApplyAllCaseFoldFunction fun,
Object arg)
protected final CaseFoldCodeItem[] asciiCaseFoldCodesByString(int flag, byte[] bytes, int p, int end)
Copyright © 2016. All Rights Reserved.