Class BytesRef
- All Implemented Interfaces:
Cloneable
,Comparable<BytesRef>
bytes
member should never be null; use EMPTY_BYTES
if necessary.
Important note: Unless otherwise noted, Lucene uses this class to represent terms that
are encoded as UTF8 bytes in the index. To convert them to a Java String
(which is
UTF16), use utf8ToString()
. Using code like new String(bytes, offset, length)
to
do this is wrong, as it does not respect the correct character set and may return wrong
results (depending on the platform's defaults)!
BytesRef
implements Comparable
. The underlying byte arrays are sorted
lexicographically, numerically treating elements as unsigned. This is identical to Unicode
codepoint order.
-
Field Summary
Modifier and TypeFieldDescriptionbyte[]
The contents of the BytesRef.static final byte[]
An empty byte array for convenienceint
Length of used bytes.int
Offset of first valid byte. -
Constructor Summary
ConstructorDescriptionBytesRef()
Create a BytesRef withEMPTY_BYTES
BytesRef
(byte[] bytes) This instance will directly reference bytes w/o making a copy.BytesRef
(byte[] bytes, int offset, int length) This instance will directly reference bytes w/o making a copy.BytesRef
(int capacity) Create a BytesRef pointing to a new array of sizecapacity
.BytesRef
(CharSequence text) Initialize the byte[] from the UTF8 bytes for the provided String. -
Method Summary
Modifier and TypeMethodDescriptionboolean
bytesEquals
(BytesRef other) Expert: compares the bytes against another BytesRef, returning true if the bytes are equal.clone()
Returns a shallow clone of this instance (the underlying bytes are not copied and will be shared by both the returned object and this object.int
Unsigned byte order comparisonstatic BytesRef
deepCopyOf
(BytesRef other) Creates a new BytesRef that points to a copy of the bytes fromother
boolean
int
hashCode()
Calculates the hash code as required by TermsHash during indexing.boolean
isValid()
Performs internal consistency checks.toString()
Returns hex encoded bytes, eg [0x6c 0x75 0x63 0x65 0x6e 0x65]Interprets stored bytes as UTF8 bytes, returning the resulting string
-
Field Details
-
EMPTY_BYTES
public static final byte[] EMPTY_BYTESAn empty byte array for convenience -
bytes
public byte[] bytesThe contents of the BytesRef. Should never benull
. -
offset
public int offsetOffset of first valid byte. -
length
public int lengthLength of used bytes.
-
-
Constructor Details
-
BytesRef
public BytesRef()Create a BytesRef withEMPTY_BYTES
-
BytesRef
public BytesRef(byte[] bytes, int offset, int length) This instance will directly reference bytes w/o making a copy. bytes should not be null. -
BytesRef
public BytesRef(byte[] bytes) This instance will directly reference bytes w/o making a copy. bytes should not be null -
BytesRef
public BytesRef(int capacity) Create a BytesRef pointing to a new array of sizecapacity
. Offset and length will both be zero. -
BytesRef
Initialize the byte[] from the UTF8 bytes for the provided String.- Parameters:
text
- This must be well-formed unicode text, with no unpaired surrogates.
-
-
Method Details
-
bytesEquals
Expert: compares the bytes against another BytesRef, returning true if the bytes are equal.- Parameters:
other
- Another BytesRef, should not be null.- NOTE: This API is for internal purposes only and might change in incompatible ways in the next release.
-
clone
Returns a shallow clone of this instance (the underlying bytes are not copied and will be shared by both the returned object and this object. -
hashCode
public int hashCode()Calculates the hash code as required by TermsHash during indexing.This is currently implemented as MurmurHash3 (32 bit), using the seed from
StringHelper.GOOD_FAST_HASH_SEED
, but is subject to change from release to release. -
equals
-
utf8ToString
Interprets stored bytes as UTF8 bytes, returning the resulting string -
toString
Returns hex encoded bytes, eg [0x6c 0x75 0x63 0x65 0x6e 0x65] -
compareTo
Unsigned byte order comparison- Specified by:
compareTo
in interfaceComparable<BytesRef>
-
deepCopyOf
Creates a new BytesRef that points to a copy of the bytes fromother
The returned BytesRef will have a length of other.length and an offset of zero.
-
isValid
public boolean isValid()Performs internal consistency checks. Always returns true (or throws IllegalStateException)
-