Bit Streams…

pspeed · December 20, 2012, 3:55pm

In the course of developing my network layer for Mythruna, I wrote some bit stream classes and thought I would share them. By this point, they are pretty well debugged and if there are any surprise bugs then I’d like to know!

BitOutputStream:
[java]
public class BitOutputStream {
private OutputStream out;
private int currentByte = 0;
private int bits = 8;

public BitOutputStream( OutputStream out )  {
    this.out = out;
}

public int getPendingBits()  {
    return bits;
}

public void writeBits( int value, int count ) throws IOException {
    if( count == 0 )
        throw new IllegalArgumentException( "Cannot write 0 bits." );

    // Make sure the value is clean of extra high bits
    value = value & (0xffffffff >>> (32 - count));

    int remaining = count;
    while( remaining > 0 ) {
        int bitsToCopy = bits < remaining ? bits : remaining;

        int sourceShift = remaining - bitsToCopy;
        int targetShift = bits - bitsToCopy;
        
        currentByte |= (value >>> sourceShift) << targetShift;
        
        remaining -= bitsToCopy;
        bits -= bitsToCopy;                      
        
        value = value & (0xffffffff >>> (32 - remaining));
                    
        // If there are no more bits left to write to in our
        // working byte then write it out and clear it.
        if( bits == 0 ) {
            flush();                
        }
    }   
}

public void writeLongBits( long value, int count ) throws IOException {
    if( count == 0 )
        throw new IllegalArgumentException( "Cannot write 0 bits." );

    // Make sure the value is clean of extra high bits
    value = value & (0xffffffffffffffffL >>> (64 - count));

    int remaining = count;
    while( remaining > 0 ) {
        int bitsToCopy = bits < remaining ? bits : remaining;

        int sourceShift = remaining - bitsToCopy;
        int targetShift = bits - bitsToCopy;
        
        currentByte |= (value >>> sourceShift) << targetShift;
        
        remaining -= bitsToCopy;
        bits -= bitsToCopy;                      

        value = value & (0xffffffffffffffffL >>> (64 - remaining));
                    
        // If there are no more bits left to write to in our
        // working byte then write it out and clear it.
        if( bits == 0 ) {
            flush();                
        }
    }   
}

protected void flush() throws IOException {
    out.write(currentByte);
    bits = 8;
    currentByte = 0;
}

public void close() throws IOException {
    flush();
    out.close();
}

}
[/java]

BitInputStream
[java]
public class BitInputStream {
private InputStream in;
private int lastByte;
private int bits = 0;

public BitInputStream( InputStream in ) {
    this.in = in;
}

public long readLongBits( int count ) throws IOException {
    if( count == 0 )
        throw new IllegalArgumentException( "Cannot read 0 bits." );

    if( count > 64 )
        throw new IllegalArgumentException( "Bit count overflow:" + count );
        
    long result = 0;

    // While we still have bits remaining...       
    int remainingCount = count;
    while( remainingCount > 0 ) {
        // See if we need to refill the current read byte
        if( bits == 0 ) {
            int b = in.read();
            if( b < 0 )
                throw new IOException( "End of stream reached." );
            lastByte = b;
            bits = 8;                    
        }

        // Copy the smaller of the two: remaining bits
        // or bits left in lastByte.
        int bitsToCopy = bits < remainingCount ? bits : remainingCount;
        
        // How much do we have to shift the read byte to just
        // get the high bits we want?
        int sourceShift = bits - bitsToCopy;
        
        // And how much do we have to shift those bits to graft
        // them onto our result?
        int targetShift = remainingCount - bitsToCopy;

        // Copy the bits           
        result |= ((long)lastByte >> sourceShift) << targetShift;

        // Keep track of how many bits we have left
        remainingCount -= bitsToCopy;
        bits -= bitsToCopy;
                  
        // Now we need to mask off the bits we just copied from
        // lastByte.  Just keep the bits that are left.
        lastByte = lastByte & (0xff >> (8 - bits));           
    }
        
    return result;
}

public int readBits( int count ) throws IOException {
    if( count == 0 )
        throw new IllegalArgumentException( "Cannot read 0 bits." );
        
    if( count > 32 )
        throw new IllegalArgumentException( "Bit count overflow:" + count );
    
    int result = 0;

    // While we still have bits remaining...       
    int remainingCount = count;
    while( remainingCount > 0 ) {
        // See if we need to refill the current read byte
        if( bits == 0 ) {
            int b = in.read();
            if( b < 0 )
                throw new IOException( "End of stream reached." );
            lastByte = b;
            bits = 8;                    
        }

        // Copy the smaller of the two: remaining bits
        // or bits left in lastByte.
        int bitsToCopy = bits < remainingCount ? bits : remainingCount;
        
        // How much do we have to shift the read byte to just
        // get the high bits we want?
        int sourceShift = bits - bitsToCopy;
        
        // And how much do we have to shift those bits to graft
        // them onto our result?
        int targetShift = remainingCount - bitsToCopy;

        // Copy the bits           
        result |= (lastByte >> sourceShift) << targetShift;

        // Keep track of how many bits we have left
        remainingCount -= bitsToCopy;
        bits -= bitsToCopy;
                  
        // Now we need to mask off the bits we just copied from
        // lastByte.  Just keep the bits that are left.
        lastByte = lastByte & (0xff >> (8 - bits));           
    }
        
    return result;
}

public void close() throws IOException {
    in.close();
}

}
[/java]

Example usage using a byte array as an example. Any InputStream or OutputStream will work.
[java]
ByteArrayOutputStream bOut = new ByteArrayOutputStream();
BitOutputStream out = new BitOutputStream(bOut);
int value1 = 1;
int value2 = 0;
int value3 = 12;
long value4 = 1234567890L;
out.writeBits(value1, 1); //write a single on bit
out.writeBits(value2, 1); //write a single off bit
out.writeBits(value3, 4); //write a 3 bit number
out.writeLongBits(value4, 48) // write a 48 bit number
out.close(); // make sure everything is flushed

// This byte array should be 7 bytes long
// (1 + 1 + 4 + 48) / 8 with an extra byte for the remainder.
byte[] array = bOut.toArray();
ByteArrayInputStream bIn = new ByteArrayInputStream(array);
BitInputStream in = new BitInputStream(bIn);
int v1 = in.readBits(1);
int v2 = in.readBits(1);
int v3 = in.readBits(4);
long v4 = in.readLongBits(48);
[/java]

Because of the padding of the last byte, you can’t properly detect the end of bit streams so that is one difference between this and a regular Java stream. You can’t just keep reading until there is no more data, you have to know what you are grabbing and how many there are.

These stream are exceptionally useful for tightly packing network data. If you know some value can fit in 4 bits (0-15) or 2 bits (0-3) then you don’t have to write a whole byte for those. It’s especially useful for booleans where you get an 8 to 1 savings.

Run Length Encoding is another good use-case where can easily implement some 9-bit protocol to store run lengths and so on.

wezrule · December 20, 2012, 4:07pm

I can see this be very useful for people who send a lot of boolean data, as there is often a lot of wasted bits with that

pspeed · December 20, 2012, 4:28pm

Yeah, waste adds up in other cases, too. I send my Quaternions as 4 12 bit values (4 fixed point decimals). So I can send a quaternion in 48 bits… versus 64 bits if those were each 16 bit values (and why would I even bother using 12 bits at that point.)

There are many cases where the numbers we send over the wire have a known range much smaller than their bitness would imply. This lets those get squished together.

And because the single bit savings (as you mentioned) is so nice, you can make more adaptive protocols. Storing a byte to say whether the next value exists or not is 8 times more expensive than storing the bit. So, for example, a protocol that can send the 48 bit Quaternion optionally could use a bit to signify whether it was included. At 1/49th the total size, it is an easy win… where as a byte to do the same thing is 1/7th the total size (8 out of 56 bits). You have to start looking at how often its unset, how many times you send it, etc. before deciding if it’s worth that extra 8 bits just to save the occasional 48.

“1 bit or 49 bits” is a much easier decision than “8 bits or 56 bits”.

(I use that as an example since I do that exact thing in the Mythruna protocol for a variety of values that are intermittently set.)

entrusc · December 21, 2012, 12:57pm

Thanks for sharing these two classes!

Just a suggestion: if you check for count == 0 you might also wanna check for count < 0

pspeed · December 21, 2012, 2:06pm

Yeah, anyone wanting to do that in their own code can. I know I don’t pass negative values ever. I added the checks because I had some single bit writes that had the parameters inverted:
writeBits(1,0) instead of writeBits(0,1) and couldn’t figure out why I was screwing up my whole protocol.

shirkit · December 21, 2012, 5:22pm

Checks for weird values like that are normally worth it because avoid dumb mistakes (I make them a lot aawhooaaa)

destroflyer · December 23, 2012, 1:16pm

I’m facing some problems when writing bit counts, that are not dividable by 8. Testcase:
[java]ByteArrayOutputStream byteArrayOutputStream = new ByteArrayOutputStream();
BitOutputStream bitOutputStream = new BitOutputStream(byteArrayOutputStream);
bitOutputStream.writeBits(3, 32);
bitOutputStream.writeBits(1, 1); //This line should write one bit (Value: 1)
byte[] bytes = byteArrayOutputStream.toByteArray();

for(int i=0;i<bytes.length;i++){
System.out.println(bytes[i]);
}[/java]

The marked line has no influence on the byte[]-array. Maybe this is a “bug” in the “toByteArray()”-method - The method “toArray” as mentioned in your example doesn’t seem to exist. The output is always:
0
0
0
3

EDIT: Ah, in your comment it says, that the array length is always bitsCount / 8 which explains why the last part is missing. So this is a typical behaviour of the method^^ So if I understand right, I have to manually add bits to the stream until a length dividable by 8 is reached?

pspeed · December 23, 2012, 1:50pm

No. Java does not have a bit type so I must store bits in bytes. If you store only 1 bit then there is one byte in the array. If you store 8 bits then there is one byte in the array. If you store 9 bits then there is two bytes in the array.

In the above example you didn’t close the stream so the last byte didn’t get flushed.

By the way, do not ever call flush() manually unless you are really done with the stream because it writes that last unfinished byte. Subsequent writes will cause strange behavior when you read it back (since now there will be a gap).

pspeed · December 23, 2012, 2:20pm

Note: went back and edited the original post so that close() is included in the example.