public abstract class FixedWidthUnicodeEncoding extends UnicodeEncoding

| Modifier and Type | Field and Description |
|---|---|
| protected static CaseFoldCodeItem[] | EMPTY_FOLD_CODES |
| protected int | shift |

| Modifier | Constructor and Description |
|---|---|
| protected | FixedWidthUnicodeEncoding(String name, int width) |

| Modifier and Type | Method and Description |
|---|---|
| protected void | asciiApplyAllCaseFold(int flag, ApplyAllCaseFoldFunction fun, Object arg) |
| protected CaseFoldCodeItem[] | asciiCaseFoldCodesByString(int flag, byte[] bytes, int p, int end) |
| protected int | asciiMbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] lower) |
| int | codeToMbcLength(int code): Returns the character length for a given code point. Oniguruma equivalent: code_to_mbclen |
| int[] | ctypeCodeRange(int ctype, IntHolder sbOut): onigenc_utf16_32_get_ctype_code_range |
| protected boolean | isCodeCTypeInternal(int code, int ctype): ONIGENC_IS_XXXXXX_CODE_CTYPE |
| boolean | isNewLine(byte[] bytes, int p, int end): onigenc_is_mbc_newline_0x0a; also used by multibyte encodings |
| boolean | isReverseMatchAllowed(byte[] bytes, int p, int end): Returns true if it is safe to use the reverse Boyer-Moore fail-fast search algorithm. Oniguruma equivalent: is_allowed_reverse_match |
| int | leftAdjustCharHead(byte[] bytes, int p, int s, int end): Seeks the previous character head in a stream. Oniguruma equivalent: left_adjust_char_head |
| int | length(byte c): Returns the character length given the character head. Returns 1 for single-byte encodings, or performs a direct length-table lookup for multibyte ones. |
| int | length(byte[] bytes, int p, int end): Returns the character length given the stream, the character position and the stream end. Returns 1 for single-byte encodings; for multibyte ones, performs sanity validations and returns the character length, or the missing characters in the stream otherwise. |
| int | strCodeAt(byte[] bytes, int p, int end, int index) |
| int | strLength(byte[] bytes, int p, int end) |
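
The strLength and strCodeAt entries above carry no description. As a rough illustration of what a fixed-width encoding makes possible, the sketch below counts and indexes characters in UTF-32BE data with plain arithmetic. It is not the library's code: the class FixedWidthDemo and its bodies are hypothetical, the 4-byte width and big-endian layout are assumptions chosen for the example, and it relies on the JDK's own "UTF-32BE" charset (present on common JDK builds) rather than on this class.

```java
import java.nio.charset.Charset;

// Hypothetical demo class; it only mirrors the method names listed in the table.
final class FixedWidthDemo {
    static final int WIDTH = 4; // assumption: UTF-32, 4 bytes per code point

    // Number of characters in bytes[p, end): a plain division for fixed-width data.
    static int strLength(byte[] bytes, int p, int end) {
        return (end - p) / WIDTH;
    }

    // Code point of the index-th character, read big-endian from a computed offset.
    static int strCodeAt(byte[] bytes, int p, int end, int index) {
        int at = p + index * WIDTH;
        if (index < 0 || at + WIDTH > end) {
            throw new IndexOutOfBoundsException("index " + index);
        }
        return ((bytes[at] & 0xff) << 24) | ((bytes[at + 1] & 0xff) << 16)
             | ((bytes[at + 2] & 0xff) << 8) | (bytes[at + 3] & 0xff);
    }

    public static void main(String[] args) {
        // 'a', U+00E9 and U+1F600 (written as a surrogate pair in the Java literal)
        byte[] bytes = "a\u00e9\ud83d\ude00".getBytes(Charset.forName("UTF-32BE"));
        System.out.println(strLength(bytes, 0, bytes.length));                         // 3
        System.out.println(Integer.toHexString(strCodeAt(bytes, 0, bytes.length, 2))); // 1f600
    }
}
```

No scan of the stream is needed in either method, which is the practical difference from variable-width encodings such as UTF-8.
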

Methods inherited from class UnicodeEncoding:
applyAllCaseFold, caseFoldCodesByString, ctypeCodeRange, getCharsetName, isCodeCType, mbcCaseFold, propertyNameToCType

Methods inherited from class MultiByteEncoding:
lengthForTwoUptoFour, mb2CodeToMbc, mb2CodeToMbcLength, mb2IsCodeCType, mb4CodeToMbc, mb4CodeToMbcLength, mb4IsCodeCType, mbnMbcCaseFold, mbnMbcToCode, missing, missing, safeLengthForUptoFour, safeLengthForUptoFourGreatedThan127, safeLengthForUptoThree, safeLengthForUptoTwo

Methods inherited from class Encoding:
asciiToLower, asciiToUpper, codeToMbc, digitVal, equals, getCharset, getIndex, getName, hashCode, isAlnum, isAlpha, isAscii, isAscii, isAsciiCompatible, isBlank, isCntrl, isDigit, isDummy, isFixedWidth, isGraph, isLower, isMbcAscii, isMbcCrnl, isMbcHead, isMbcWord, isNewLine, isPrint, isPunct, isSbWord, isSingleByte, isSpace, isUnicode, isUpper, isUTF8, isWord, isWordGraphPrint, isXDigit, load, maxLength, maxLengthDistance, mbcodeStartPosition, mbcToCode, minLength, odigitVal, prevCharHead, rightAdjustCharHead, rightAdjustCharHeadWithPrev, setDummy, setName, setName, step, stepBack, strByteLengthNull, strLengthNull, strNCmp, toLowerCaseTable, toString, xdigitVal

protected final int shift

protected static final CaseFoldCodeItem[] EMPTY_FOLD_CODES

protected FixedWidthUnicodeEncoding(String name, int width)
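
The page does not say how the shift field relates to the constructor's width argument. One plausible reading, offered here only as an assumption, is that shift is the base-2 logarithm of width, so that conversions between byte counts and character counts become bit shifts. The class ShiftSketch and all of its methods are hypothetical names introduced for this illustration.

```java
// Hypothetical sketch; not taken from the library source.
final class ShiftSketch {
    // Assumed relation: shift = log2(width), e.g. 2 -> 1 and 4 -> 2.
    static int shiftFor(int width) {
        if (width <= 0 || Integer.bitCount(width) != 1) {
            throw new IllegalArgumentException("width must be a power of two: " + width);
        }
        return Integer.numberOfTrailingZeros(width);
    }

    // Byte count to character count: equivalent to byteCount / width.
    static int charsIn(int byteCount, int shift) {
        return byteCount >>> shift;
    }

    // Character index to byte offset: equivalent to charIndex * width.
    static int byteOffset(int charIndex, int shift) {
        return charIndex << shift;
    }

    public static void main(String[] args) {
        int shift = shiftFor(4);
        System.out.println(shift + " " + charsIn(12, shift) + " " + byteOffset(3, shift));
    }
}
```

The power-of-two check is a design choice of the sketch: a shift can only stand in for the division when the width is a power of two, which holds for 2-byte and 4-byte code units.
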

public final int length(byte c)
Description copied from class: Encoding
Returns the character length given the character head. Returns 1 for single-byte encodings, or performs a direct length-table lookup for multibyte ones.
length in class MultiByteEncoding
Parameters:
c - Character head
Oniguruma equivalent: mbc_enc_len
To be deprecated very soon (use the length(byte[] bytes, int p, int end) version).

public final int length(byte[] bytes, int p, int end)
Description copied from class: Encoding
Returns the character length given the stream, the character position and the stream end. Returns 1 for single-byte encodings; for multibyte ones, performs sanity validations and returns the character length, or the missing characters in the stream otherwise.

public final int strLength(byte[] bytes, int p, int end)
strLength in class MultiByteEncoding

public final int strCodeAt(byte[] bytes, int p, int end, int index)
strCodeAt in class MultiByteEncoding

public final int codeToMbcLength(int code)
Description copied from class: Encoding
Returns the character length for a given code point.
Oniguruma equivalent: code_to_mbclen
codeToMbcLength in class Encoding

public final int[] ctypeCodeRange(int ctype, IntHolder sbOut)
ctypeCodeRange in class Encoding

public final int leftAdjustCharHead(byte[] bytes, int p, int s, int end)
Description copied from class: Encoding
Seeks the previous character head in a stream.
Oniguruma equivalent: left_adjust_char_head
leftAdjustCharHead in class Encoding
Parameters:
bytes - byte stream
p - position
s - stop
end - end

public final boolean isReverseMatchAllowed(byte[] bytes, int p, int end)
Description copied from class: Encoding
Returns true if it is safe to use the reverse Boyer-Moore fail-fast search algorithm.
Oniguruma equivalent: is_allowed_reverse_match
isReverseMatchAllowed in class Encoding

protected final boolean isCodeCTypeInternal(int code, int ctype)
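
For leftAdjustCharHead, "seeks the previous character head" has a particularly simple form when every character has the same width: round the position down to the nearest character boundary. The sketch below assumes that p marks the start of the stream and s the position to adjust, following the argument order of Oniguruma's left_adjust_char_head; the class CharHeadSketch and the explicit width parameter are hypothetical and do not appear in this API.

```java
// Hypothetical sketch of head alignment for a fixed-width encoding.
final class CharHeadSketch {
    // Returns the first byte of the character containing position s,
    // where characters start at p and are `width` bytes wide.
    static int leftAdjustCharHead(int p, int s, int width) {
        if (s <= p) {
            return s; // at or before the start of the stream: nothing to adjust
        }
        return s - ((s - p) % width); // round down to a character boundary
    }

    public static void main(String[] args) {
        // With 4-byte characters starting at offset 0, byte 5 lies inside
        // the character that begins at byte 4.
        System.out.println(leftAdjustCharHead(0, 5, 4));
    }
}
```

Measuring the remainder from p rather than from zero keeps the result correct when the text starts at a non-aligned offset inside a larger buffer.
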

public boolean isNewLine(byte[] bytes, int p, int end)

protected final int asciiMbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] lower)

protected final void asciiApplyAllCaseFold(int flag, ApplyAllCaseFoldFunction fun, Object arg)

protected final CaseFoldCodeItem[] asciiCaseFoldCodesByString(int flag, byte[] bytes, int p, int end)

Copyright © 2016. All Rights Reserved.