Gzip encoded data from HTTP

**Michael Colvin** · 01-16-2012

Hi. I'm having a problem with a web site sending data gzip compressed even when told not to (Transfer-encoding: identity). So I looked up how gzip is compressed (RFC 1951 & 1952) and wrote an inflate function. It compiles and runs, and the first 1000-2000 bytes seem to decompress fine, then errors start to occur. It's taken 3 weeks to bash this into my head and turn it into C code. I'm hoping someone with some experience can see a logic error I may have made.

gzip.h
gzip.cpp

Sorry this is in the networking forum; it's part of an RSS downloading program.

**Salem** · 01-16-2012

Do you have a particular reason not to use zlib (zlib Home Site)?

> I'm hoping some one with some experience can see a logic error I may have made.
A test wrapper "main()" and an example data file which causes the problem would help.
Then we could just run the code in a debugger and start looking (have you run the code in the debugger?)

**rags_to_riches** · 01-16-2012

I'm concerned you've spent a long time re-inventing the wheel. Based on your use of MessageBoxA, you seem to be using Windows, which allows you to use WinInet for HTTP transport which in turn will do the decoding for you. You might consider investigating that.

**memcpy** · 01-16-2012

> which allows you to use WinInet for HTTP transport which in turn will do the decoding for you.

Am I the only one concerned that
a) it's not portable,
b) you have no idea how much overhead this API imposes, or how much memory it may waste as a result,
c) it doesn't leave any of the low-level understanding and fine-tuning up to the user, and finally,
d) its features are limited to what MS wants to limit them to, and not what the language can do?

**rags_to_riches** · 01-16-2012

If you're using Windows (and he is based on the use of MessageBoxA, which to the best of my knowledge is a Windows API call and therefore non-portable), why not avail yourself of the facilities made available to you by the Win32 SDK?

**Michael Colvin** · 01-17-2012

Salem:
I've actually used the gzip.org source to try and figure this thing out, and I could have changed the input from files to a buffer, but I got interested in how it works, and the best way to learn is to do it.

As to running it in a debugger, I've spent plenty of time watching the data decompress. That's how I know the first <1000 bytes are decompressing correctly. Once the data starts to get over that size, errors start appearing.

wrapper
wrapper.cpp

I did have to change gzip.c a little to compile
change #include<stdafx.h> to #include<stdio.h>
and remove the call to MessageBoxA(); it's only there for error checking

data
Site won't let me upload a data file.
https://docs.google.com/open?id=0Bxj...IxMWY0YzY4YWIx

Rags:
Same thing, doing it to learn not just to get it done.

Memcpy:
Nice points, but Rags is kinda right. It's already a Win32 app, so there is a bunch of overhead anyway.

**Michael Colvin** · 01-19-2012

Howdy guys. If you're still following, I solved my problem. In the GetBit function v is declared short (2 bytes) and BitBuffer is declared int (4 bytes). So "G_BitBuffer|=v;" is converting G_BitBuffer to a 2 byte element. There is a case where the bit count exceeds 16 and bits are lost. You never need more than 13 bits at one time, but if you're at 12 and need 13, you pull the next char (8 bits), running over 16 and losing your top bits.
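
For anyone who hits the same thing, here is a minimal sketch of that failure mode. The original gzip.cpp is not reproduced in this thread, so the function names and the exact shape of the code below are assumptions, not the real GetBit. Note that in C the `|=` itself widens the short to int rather than narrowing the buffer, so the sketch assumes the bits are lost earlier, when the shifted byte is stored in the 16-bit temporary.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for the buggy path: the new byte is staged in a
 * 16-bit temporary, so any bit shifted to position 16 or above is dropped. */
uint32_t append_byte_narrow(uint32_t bitbuf, unsigned count, uint8_t byte)
{
    assert(count <= 24);                      /* keep the shift inside 32 bits */
    uint16_t v = (uint16_t)((uint32_t)byte << count);
    return bitbuf | v;                        /* v is widened here, but the bits are already gone */
}

/* Same operation with a 32-bit temporary: nothing is lost. */
uint32_t append_byte_wide(uint32_t bitbuf, unsigned count, uint8_t byte)
{
    assert(count <= 24);
    uint32_t v = (uint32_t)byte << count;
    return bitbuf | v;
}
```

With 12 bits pending (0xABC) and a new byte 0xFF, `append_byte_narrow(0x0ABC, 12, 0xFF)` gives 0xFABC, while `append_byte_wide` gives 0xFFABC: the top four bits of the new byte are gone in the narrow version.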

So this is one thing I've wondered for a long time: why wasn't C designed with fixed-size data types? A short is only defined as being longer than or equal to a char and shorter than or equal to an int in size. Visual C allows for __int8, __int16, __int32, __int64, but I don't think that is portable. Do you guys know, or have a link to find out, why?

**memcpy** · 01-19-2012

C is designed with fixed-size data types, or at least the potential to implement them. The best way is to use <stdint.h>, which deals with the byte-size problem (types follow the standard naming "uint8_t, int8_t, uint16_t, ..."). Or you could implement them yourself with #ifdef checks for the architecture and system, defining the types if they are not already defined.
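
A minimal sketch of what the <stdint.h> types look like in use, assuming a C99 compiler. `read_le32` is a hypothetical helper name, not something from the posted gzip.cpp. It reads a 32-bit little-endian value such as the CRC32 or ISIZE field that RFC 1952 puts at the end of a gzip member, and the width of every intermediate value is spelled out.

```c
#include <stdint.h>

/* Read a 32-bit little-endian value from four consecutive bytes.
 * Each byte is widened to uint32_t before shifting, so no bits are
 * lost regardless of how wide int or short are on the platform. */
uint32_t read_le32(const uint8_t *p)
{
    return (uint32_t)p[0]
         | ((uint32_t)p[1] << 8)
         | ((uint32_t)p[2] << 16)
         | ((uint32_t)p[3] << 24);
}
```

For the bytes 0x78 0x56 0x34 0x12 this returns 0x12345678.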

The automatic conversion thing really shouldn't be an issue, but if it is, typecast.
