public final class EUCJPEncoding extends EucEncoding
| Modifier and Type | Field and Description |
|---|---|
protected static CaseFoldCodeItem[] |
EMPTY_FOLD_CODES |
static EUCJPEncoding |
INSTANCE |
| Modifier | Constructor and Description |
|---|---|
protected |
EUCJPEncoding() |
| Modifier and Type | Method and Description |
|---|---|
void |
applyAllCaseFold(int flag,
ApplyAllCaseFoldFunction fun,
Object arg)
onigenc_ascii_apply_all_case_fold / used also by multibyte encodings
|
protected void |
asciiApplyAllCaseFold(int flag,
ApplyAllCaseFoldFunction fun,
Object arg) |
protected CaseFoldCodeItem[] |
asciiCaseFoldCodesByString(int flag,
byte[] bytes,
int p,
int end) |
protected int |
asciiMbcCaseFold(int flag,
byte[] bytes,
IntHolder pp,
int end,
byte[] lower) |
CaseFoldCodeItem[] |
caseFoldCodesByString(int flag,
byte[] bytes,
int p,
int end)
onigenc_ascii_get_case_fold_codes_by_str / used also by multibyte encodings
|
int |
codeToMbc(int code,
byte[] bytes,
int p)
Extracts code point into it's multibyte representation
|
int |
codeToMbcLength(int code)
Returns character length given a code point
Oniguruma equivalent:
code_to_mbclen |
int[] |
ctypeCodeRange(int ctype,
IntHolder sbOut)
Returns code range for a given character type
Oniguruma equivalent:
get_ctype_code_range |
String |
getCharsetName() |
boolean |
isCodeCType(int code,
int ctype)
Perform a check whether given code is of given character type (e.g.
|
protected boolean |
isCodeCTypeInternal(int code,
int ctype)
ONIGENC_IS_XXXXXX_CODE_CTYPE
|
protected boolean |
isLead(int c) |
boolean |
isNewLine(byte[] bytes,
int p,
int end)
onigenc_is_mbc_newline_0x0a / used also by multibyte encodings
|
boolean |
isReverseMatchAllowed(byte[] bytes,
int p,
int end)
Returns true if it's safe to use reversal Boyer-Moore search fail fast algorithm
Oniguruma equivalent:
is_allowed_reverse_match |
int |
length(byte[] bytes,
int p,
int end)
Returns character length given stream, character position and stream end
returns
1 for singlebyte encodings or performs sanity validations for multibyte ones
and returns the character length, missing characters in the stream otherwise |
int |
mbcCaseFold(int flag,
byte[] bytes,
IntHolder pp,
int end,
byte[] lower)
onigenc_ascii_mbc_case_fold
|
int |
mbcToCode(byte[] bytes,
int p,
int end)
Returns code point for a character
Oniguruma equivalent:
mbc_to_code |
int |
propertyNameToCType(byte[] bytes,
int p,
int end)
onigenc_minimum_property_name_to_ctype
notably overridden by unicode encodings
|
leftAdjustCharHeadlength, lengthForTwoUptoFour, mb2CodeToMbc, mb2CodeToMbcLength, mb2IsCodeCType, mb4CodeToMbc, mb4CodeToMbcLength, mb4IsCodeCType, mbnMbcCaseFold, mbnMbcToCode, missing, missing, safeLengthForUptoFour, safeLengthForUptoFourGreatedThan127, safeLengthForUptoThree, safeLengthForUptoTwo, strCodeAt, strLengthasciiToLower, asciiToUpper, digitVal, equals, getCharset, getIndex, getName, hashCode, isAlnum, isAlpha, isAscii, isAscii, isAsciiCompatible, isBlank, isCntrl, isDigit, isDummy, isFixedWidth, isGraph, isLower, isMbcAscii, isMbcCrnl, isMbcHead, isMbcWord, isNewLine, isPrint, isPunct, isSbWord, isSingleByte, isSpace, isUnicode, isUpper, isUTF8, isWord, isWordGraphPrint, isXDigit, load, maxLength, maxLengthDistance, mbcodeStartPosition, minLength, odigitVal, prevCharHead, rightAdjustCharHead, rightAdjustCharHeadWithPrev, setDummy, setName, setName, step, stepBack, strByteLengthNull, strLengthNull, strNCmp, toLowerCaseTable, toString, xdigitValpublic static final EUCJPEncoding INSTANCE
protected static final CaseFoldCodeItem[] EMPTY_FOLD_CODES
public int length(byte[] bytes,
int p,
int end)
Encoding1 for singlebyte encodings or performs sanity validations for multibyte ones
and returns the character length, missing characters in the stream otherwisepublic String getCharsetName()
getCharsetName in class Encodingpublic int mbcToCode(byte[] bytes,
int p,
int end)
Encodingmbc_to_codepublic int codeToMbcLength(int code)
Encodingcode_to_mbclencodeToMbcLength in class Encodingpublic int codeToMbc(int code,
byte[] bytes,
int p)
Encodingpublic int mbcCaseFold(int flag,
byte[] bytes,
IntHolder pp,
int end,
byte[] lower)
flag - case fold flagpp - an IntHolder that points at character headlower - a buffer where to extract case folded character
Oniguruma equivalent: mbc_case_foldprotected boolean isLead(int c)
isLead in class EucEncodingpublic boolean isReverseMatchAllowed(byte[] bytes,
int p,
int end)
Encodingis_allowed_reverse_matchisReverseMatchAllowed in class Encodingpublic int propertyNameToCType(byte[] bytes,
int p,
int end)
public boolean isCodeCType(int code,
int ctype)
EncodingisCodeCType in class Encodingcode - a code point of a characterctype - a character type to check against
Oniguruma equivalent: is_code_ctypepublic int[] ctypeCodeRange(int ctype,
IntHolder sbOut)
Encodingget_ctype_code_rangectypeCodeRange in class Encodingprotected final boolean isCodeCTypeInternal(int code,
int ctype)
public boolean isNewLine(byte[] bytes,
int p,
int end)
protected final int asciiMbcCaseFold(int flag,
byte[] bytes,
IntHolder pp,
int end,
byte[] lower)
protected final void asciiApplyAllCaseFold(int flag,
ApplyAllCaseFoldFunction fun,
Object arg)
public void applyAllCaseFold(int flag,
ApplyAllCaseFoldFunction fun,
Object arg)
applyAllCaseFold in class Encodingflag - case fold flagfun - case folding functor (look at: ApplyCaseFold)arg - case folding functor argument (look at: ApplyCaseFoldArg)
Oniguruma equivalent: apply_all_case_foldprotected final CaseFoldCodeItem[] asciiCaseFoldCodesByString(int flag, byte[] bytes, int p, int end)
public CaseFoldCodeItem[] caseFoldCodesByString(int flag, byte[] bytes, int p, int end)
caseFoldCodesByString in class EncodingCopyright © 2016. All Rights Reserved.