Class EUCKREncoding
java.lang.Object
org.jcodings.Encoding
org.jcodings.AbstractEncoding
org.jcodings.MultiByteEncoding
org.jcodings.EucEncoding
org.jcodings.specific.EUCKREncoding
- All Implemented Interfaces:
Cloneable
- Direct Known Subclasses:
GB2312Encoding
-
Field Summary
FieldsModifier and TypeFieldDescription(package private) static final int[]private static final int[][]static final EUCKREncoding -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionintcodeToMbc(int code, byte[] bytes, int p) Extracts code point into it's multibyte representationintcodeToMbcLength(int code) Returns character length given a code point Oniguruma equivalent:code_to_mbclenint[]ctypeCodeRange(int ctype, IntHolder sbOut) Returns code range for a given character type Oniguruma equivalent:get_ctype_code_rangebooleanisCodeCType(int code, int ctype) Perform a check whether given code is of given character type (e.g.protected booleanisLead(int c) booleanisReverseMatchAllowed(byte[] bytes, int p, int end) Returns true if it's safe to use reversal Boyer-Moore search fail fast algorithm Oniguruma equivalent:is_allowed_reverse_matchintlength(byte[] bytes, int p, int end) Returns character length given stream, character position and stream end returns1for singlebyte encodings or performs sanity validations for multibyte ones and returns the character length, missing characters in the stream otherwiseintmbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] lower) onigenc_ascii_mbc_case_foldintmbcToCode(byte[] bytes, int p, int end) Returns code point for a character Oniguruma equivalent:mbc_to_codeMethods inherited from class EucEncoding
leftAdjustCharHeadMethods inherited from class MultiByteEncoding
caseMap, isInRange, length, lengthForTwoUptoFour, mb2CodeToMbc, mb2CodeToMbcLength, mb2IsCodeCType, mb4CodeToMbc, mb4CodeToMbcLength, mb4IsCodeCType, mbnMbcCaseFold, mbnMbcToCode, missing, missing, safeLengthForUptoFour, safeLengthForUptoThree, safeLengthForUptoTwo, strCodeAt, strLengthMethods inherited from class AbstractEncoding
applyAllCaseFold, asciiApplyAllCaseFold, asciiCaseFoldCodesByString, asciiMbcCaseFold, caseFoldCodesByString, isCodeCTypeInternal, isNewLine, propertyNameToCTypeMethods inherited from class Encoding
asciiToLower, asciiToUpper, digitVal, equals, getCharset, getCharsetName, getIndex, getName, hashCode, isAlnum, isAlpha, isAscii, isAscii, isAsciiCompatible, isBlank, isCntrl, isDigit, isDummy, isFixedWidth, isGraph, isLower, isMbcAscii, isMbcCrnl, isMbcHead, isMbcWord, isNewLine, isPrint, isPunct, isSbWord, isSingleByte, isSpace, isUnicode, isUpper, isUTF8, isWord, isWordGraphPrint, isXDigit, load, load, maxLength, maxLengthDistance, mbcodeStartPosition, minLength, odigitVal, prevCharHead, rightAdjustCharHead, rightAdjustCharHeadWithPrev, setDummy, setName, setName, step, stepBack, strByteLengthNull, strLengthNull, strNCmp, toLowerCaseTable, toString, xdigitVal
-
Field Details
-
EUCKREncLen
static final int[] EUCKREncLen -
EUCKRTrans
private static final int[][] EUCKRTrans -
INSTANCE
-
-
Constructor Details
-
EUCKREncoding
protected EUCKREncoding() -
EUCKREncoding
-
-
Method Details
-
length
public int length(byte[] bytes, int p, int end) Description copied from class:EncodingReturns character length given stream, character position and stream end returns1for singlebyte encodings or performs sanity validations for multibyte ones and returns the character length, missing characters in the stream otherwise -
mbcToCode
-
codeToMbcLength
public int codeToMbcLength(int code) Description copied from class:EncodingReturns character length given a code point Oniguruma equivalent:code_to_mbclen- Specified by:
codeToMbcLengthin classEncoding
-
codeToMbc
-
mbcCaseFold
Description copied from class:AbstractEncodingonigenc_ascii_mbc_case_fold- Overrides:
mbcCaseFoldin classAbstractEncoding- Parameters:
flag- case fold flagpp- anIntHolderthat points at character headlower- a buffer where to extract case folded character Oniguruma equivalent:mbc_case_fold
-
isCodeCType
public boolean isCodeCType(int code, int ctype) Description copied from class:EncodingPerform a check whether given code is of given character type (e.g. used by isWord(someByte) and similar methods)- Specified by:
isCodeCTypein classEncoding- Parameters:
code- a code point of a characterctype- a character type to check against Oniguruma equivalent:is_code_ctype
-
ctypeCodeRange
Description copied from class:EncodingReturns code range for a given character type Oniguruma equivalent:get_ctype_code_range- Specified by:
ctypeCodeRangein classEncoding
-
isLead
protected boolean isLead(int c) - Specified by:
isLeadin classEucEncoding
-
isReverseMatchAllowed
public boolean isReverseMatchAllowed(byte[] bytes, int p, int end) Description copied from class:EncodingReturns true if it's safe to use reversal Boyer-Moore search fail fast algorithm Oniguruma equivalent:is_allowed_reverse_match- Specified by:
isReverseMatchAllowedin classEncoding
-