public final class NonStrictEUCJPEncoding extends EucEncoding
| Modifier and Type | Field and Description |
|---|---|
protected static CaseFoldCodeItem[] |
EMPTY_FOLD_CODES |
static NonStrictEUCJPEncoding |
INSTANCE |
| Modifier | Constructor and Description |
|---|---|
protected |
NonStrictEUCJPEncoding() |
| Modifier and Type | Method and Description |
|---|---|
void |
applyAllCaseFold(int flag,
ApplyAllCaseFoldFunction fun,
Object arg)
onigenc_ascii_apply_all_case_fold / used also by multibyte encodings
|
protected void |
asciiApplyAllCaseFold(int flag,
ApplyAllCaseFoldFunction fun,
Object arg) |
protected CaseFoldCodeItem[] |
asciiCaseFoldCodesByString(int flag,
byte[] bytes,
int p,
int end) |
protected int |
asciiMbcCaseFold(int flag,
byte[] bytes,
IntHolder pp,
int end,
byte[] lower) |
CaseFoldCodeItem[] |
caseFoldCodesByString(int flag,
byte[] bytes,
int p,
int end)
onigenc_ascii_get_case_fold_codes_by_str / used also by multibyte encodings
|
int |
codeToMbc(int code,
byte[] bytes,
int p)
Extracts code point into it's multibyte representation
|
int |
codeToMbcLength(int code)
Returns character length given a code point
Oniguruma equivalent:
code_to_mbclen |
int[] |
ctypeCodeRange(int ctype,
IntHolder sbOut)
Returns code range for a given character type
Oniguruma equivalent:
get_ctype_code_range |
boolean |
isCodeCType(int code,
int ctype)
Perform a check whether given code is of given character type (e.g.
|
protected boolean |
isCodeCTypeInternal(int code,
int ctype)
ONIGENC_IS_XXXXXX_CODE_CTYPE
|
protected boolean |
isLead(int c) |
boolean |
isNewLine(byte[] bytes,
int p,
int end)
onigenc_is_mbc_newline_0x0a / used also by multibyte encodings
|
boolean |
isReverseMatchAllowed(byte[] bytes,
int p,
int end)
Returns true if it's safe to use reversal Boyer-Moore search fail fast algorithm
Oniguruma equivalent:
is_allowed_reverse_match |
int |
length(byte[] bytes,
int p,
int end)
Returns character length given stream, character position and stream end
returns
1 for singlebyte encodings or performs sanity validations for multibyte ones
and returns the character length, missing characters in the stream otherwise |
int |
mbcCaseFold(int flag,
byte[] bytes,
IntHolder pp,
int end,
byte[] lower)
onigenc_ascii_mbc_case_fold
|
int |
mbcToCode(byte[] bytes,
int p,
int end)
Returns code point for a character
Oniguruma equivalent:
mbc_to_code |
int |
propertyNameToCType(byte[] bytes,
int p,
int end)
onigenc_minimum_property_name_to_ctype
notably overridden by unicode encodings
|
leftAdjustCharHeadlength, lengthForTwoUptoFour, mb2CodeToMbc, mb2CodeToMbcLength, mb2IsCodeCType, mb4CodeToMbc, mb4CodeToMbcLength, mb4IsCodeCType, mbnMbcCaseFold, mbnMbcToCode, missing, missing, safeLengthForUptoFour, safeLengthForUptoFourGreatedThan127, safeLengthForUptoThree, safeLengthForUptoTwo, strCodeAt, strLengthasciiToLower, asciiToUpper, digitVal, equals, getCharset, getCharsetName, getIndex, getName, hashCode, isAlnum, isAlpha, isAscii, isAscii, isAsciiCompatible, isBlank, isCntrl, isDigit, isDummy, isFixedWidth, isGraph, isLower, isMbcAscii, isMbcCrnl, isMbcHead, isMbcWord, isNewLine, isPrint, isPunct, isSbWord, isSingleByte, isSpace, isUnicode, isUpper, isUTF8, isWord, isWordGraphPrint, isXDigit, load, maxLength, maxLengthDistance, mbcodeStartPosition, minLength, odigitVal, prevCharHead, rightAdjustCharHead, rightAdjustCharHeadWithPrev, setDummy, setName, setName, step, stepBack, strByteLengthNull, strLengthNull, strNCmp, toLowerCaseTable, toString, xdigitValpublic static final NonStrictEUCJPEncoding INSTANCE
protected static final CaseFoldCodeItem[] EMPTY_FOLD_CODES
public int length(byte[] bytes,
int p,
int end)
Encoding1 for singlebyte encodings or performs sanity validations for multibyte ones
and returns the character length, missing characters in the stream otherwisepublic int mbcToCode(byte[] bytes,
int p,
int end)
Encodingmbc_to_codepublic int codeToMbcLength(int code)
Encodingcode_to_mbclencodeToMbcLength in class Encodingpublic int codeToMbc(int code,
byte[] bytes,
int p)
Encodingpublic int mbcCaseFold(int flag,
byte[] bytes,
IntHolder pp,
int end,
byte[] lower)
flag - case fold flagpp - an IntHolder that points at character headlower - a buffer where to extract case folded character
Oniguruma equivalent: mbc_case_foldprotected boolean isLead(int c)
isLead in class EucEncodingpublic boolean isReverseMatchAllowed(byte[] bytes,
int p,
int end)
Encodingis_allowed_reverse_matchisReverseMatchAllowed in class Encodingpublic int propertyNameToCType(byte[] bytes,
int p,
int end)
public boolean isCodeCType(int code,
int ctype)
EncodingisCodeCType in class Encodingcode - a code point of a characterctype - a character type to check against
Oniguruma equivalent: is_code_ctypepublic int[] ctypeCodeRange(int ctype,
IntHolder sbOut)
Encodingget_ctype_code_rangectypeCodeRange in class Encodingprotected final boolean isCodeCTypeInternal(int code,
int ctype)
public boolean isNewLine(byte[] bytes,
int p,
int end)
protected final int asciiMbcCaseFold(int flag,
byte[] bytes,
IntHolder pp,
int end,
byte[] lower)
protected final void asciiApplyAllCaseFold(int flag,
ApplyAllCaseFoldFunction fun,
Object arg)
public void applyAllCaseFold(int flag,
ApplyAllCaseFoldFunction fun,
Object arg)
applyAllCaseFold in class Encodingflag - case fold flagfun - case folding functor (look at: ApplyCaseFold)arg - case folding functor argument (look at: ApplyCaseFoldArg)
Oniguruma equivalent: apply_all_case_foldprotected final CaseFoldCodeItem[] asciiCaseFoldCodesByString(int flag, byte[] bytes, int p, int end)
public CaseFoldCodeItem[] caseFoldCodesByString(int flag, byte[] bytes, int p, int end)
caseFoldCodesByString in class EncodingCopyright © 2016. All Rights Reserved.