Class BaseEUCJPEncoding
java.lang.Object
org.jcodings.Encoding
org.jcodings.AbstractEncoding
org.jcodings.MultiByteEncoding
org.jcodings.EucEncoding
org.jcodings.specific.BaseEUCJPEncoding
- All Implemented Interfaces:
Cloneable
- Direct Known Subclasses:
EUCJPEncoding, NonStrictEUCJPEncoding
-
Field Summary
FieldsModifier and TypeFieldDescriptionprivate static final int[]private static final int[]private static final int[]private static final int[]private static final int[]private static final int[]private static final CaseInsensitiveBytesHash<Integer> (package private) static final int[]private static final int[][] -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionintcodeToMbc(int code, byte[] bytes, int p) Extracts code point into it's multibyte representationintcodeToMbcLength(int code) Returns character length given a code point Oniguruma equivalent:code_to_mbclenint[]ctypeCodeRange(int ctype, IntHolder sbOut) Returns code range for a given character type Oniguruma equivalent:get_ctype_code_rangeprivate static intgetLowerCase(int code) booleanisCodeCType(int code, int ctype) Perform a check whether given code is of given character type (e.g.protected booleanisLead(int c) booleanisReverseMatchAllowed(byte[] bytes, int p, int end) Returns true if it's safe to use reversal Boyer-Moore search fail fast algorithm Oniguruma equivalent:is_allowed_reverse_matchintmbcCaseFold(int flag, byte[] bytes, IntHolder pp, int end, byte[] lower) onigenc_ascii_mbc_case_foldintmbcToCode(byte[] bytes, int p, int end) Returns code point for a character Oniguruma equivalent:mbc_to_codeintpropertyNameToCType(byte[] bytes, int p, int end) onigenc_minimum_property_name_to_ctype notably overridden by unicode encodingsMethods inherited from class EucEncoding
leftAdjustCharHeadMethods inherited from class MultiByteEncoding
caseMap, isInRange, length, lengthForTwoUptoFour, mb2CodeToMbc, mb2CodeToMbcLength, mb2IsCodeCType, mb4CodeToMbc, mb4CodeToMbcLength, mb4IsCodeCType, mbnMbcCaseFold, mbnMbcToCode, missing, missing, safeLengthForUptoFour, safeLengthForUptoThree, safeLengthForUptoTwo, strCodeAt, strLengthMethods inherited from class AbstractEncoding
applyAllCaseFold, asciiApplyAllCaseFold, asciiCaseFoldCodesByString, asciiMbcCaseFold, caseFoldCodesByString, isCodeCTypeInternal, isNewLineMethods inherited from class Encoding
asciiToLower, asciiToUpper, digitVal, equals, getCharset, getCharsetName, getIndex, getName, hashCode, isAlnum, isAlpha, isAscii, isAscii, isAsciiCompatible, isBlank, isCntrl, isDigit, isDummy, isFixedWidth, isGraph, isLower, isMbcAscii, isMbcCrnl, isMbcHead, isMbcWord, isNewLine, isPrint, isPunct, isSbWord, isSingleByte, isSpace, isUnicode, isUpper, isUTF8, isWord, isWordGraphPrint, isXDigit, length, load, load, maxLength, maxLengthDistance, mbcodeStartPosition, minLength, odigitVal, prevCharHead, rightAdjustCharHead, rightAdjustCharHeadWithPrev, setDummy, setName, setName, step, stepBack, strByteLengthNull, strLengthNull, strNCmp, toLowerCaseTable, toString, xdigitVal
-
Field Details
-
CR_Hiragana
private static final int[] CR_Hiragana -
CR_Katakana
private static final int[] CR_Katakana -
CR_Han
private static final int[] CR_Han -
CR_Latin
private static final int[] CR_Latin -
CR_Greek
private static final int[] CR_Greek -
CR_Cyrillic
private static final int[] CR_Cyrillic -
PropertyList
private static final int[][] PropertyList -
CTypeNameHash
-
EUCJPEncLen
static final int[] EUCJPEncLen
-
-
Constructor Details
-
BaseEUCJPEncoding
protected BaseEUCJPEncoding(int[][] Trans)
-
-
Method Details
-
mbcToCode
-
codeToMbcLength
public int codeToMbcLength(int code) Description copied from class:EncodingReturns character length given a code point Oniguruma equivalent:code_to_mbclen- Specified by:
codeToMbcLengthin classEncoding
-
codeToMbc
-
getLowerCase
private static int getLowerCase(int code) -
mbcCaseFold
Description copied from class:AbstractEncodingonigenc_ascii_mbc_case_fold- Overrides:
mbcCaseFoldin classAbstractEncoding- Parameters:
flag- case fold flagpp- anIntHolderthat points at character headlower- a buffer where to extract case folded character Oniguruma equivalent:mbc_case_fold
-
isLead
protected boolean isLead(int c) - Specified by:
isLeadin classEucEncoding
-
isReverseMatchAllowed
public boolean isReverseMatchAllowed(byte[] bytes, int p, int end) Description copied from class:EncodingReturns true if it's safe to use reversal Boyer-Moore search fail fast algorithm Oniguruma equivalent:is_allowed_reverse_match- Specified by:
isReverseMatchAllowedin classEncoding
-
propertyNameToCType
public int propertyNameToCType(byte[] bytes, int p, int end) Description copied from class:AbstractEncodingonigenc_minimum_property_name_to_ctype notably overridden by unicode encodings- Overrides:
propertyNameToCTypein classAbstractEncoding
-
isCodeCType
public boolean isCodeCType(int code, int ctype) Description copied from class:EncodingPerform a check whether given code is of given character type (e.g. used by isWord(someByte) and similar methods)- Specified by:
isCodeCTypein classEncoding- Parameters:
code- a code point of a characterctype- a character type to check against Oniguruma equivalent:is_code_ctype
-
ctypeCodeRange
Description copied from class:EncodingReturns code range for a given character type Oniguruma equivalent:get_ctype_code_range- Specified by:
ctypeCodeRangein classEncoding
-