CCS C Software and Maintenance Offers
FAQFAQ   FAQForum Help   FAQOfficial CCS Support   SearchSearch  RegisterRegister 

ProfileProfile   Log in to check your private messagesLog in to check your private messages   Log inLog in 

CCS does not monitor this forum on a regular basis.

Please do not post bug reports on this forum. Send them to support@ccsinfo.com

String Compression

 
Post new topic   Reply to topic    CCS Forum Index -> General CCS C Discussion
View previous topic :: View next topic  
Author Message
chingB



Joined: 29 Dec 2003
Posts: 81

View user's profile Send private message

String Compression
PostPosted: Sat Dec 04, 2004 8:36 am     Reply with quote

Hello,

Anybody in the community have a wild ideas on how to implement a string compression?

Any info, help, comments and/or suggesstions is appreciated.

It will be great if a code snipet /psuedo code is provided.

Thanx
Ttelmah
Guest







Re: String Compression
PostPosted: Sun Dec 05, 2004 3:52 am     Reply with quote

chingB wrote:
Hello,

Anybody in the community have a wild ideas on how to implement a string compression?

Any info, help, comments and/or suggesstions is appreciated.

It will be great if a code snipet /psuedo code is provided.

Thanx

Unfortunately, 'generic' compression, requires quite a lot of code, and is unlikely to save any space on the small strings involved in most PIC applications. Basically, the commonest 'style' of algorithm, relies on finding 'common' sections of the text, and then storing these just once, and replacing the character code at this point, with a 'marker' to say that the stored version should be used. So (for instance), if you have two strings:
The rain in Spain.
The quick brown fox.
Then you could potentially use an otherwise unused character (one of the control characters), as a marker. Replace the only 'common' word (The), with the marker, and a 'key number' to say which of the library strings to use. You then have to have a library containing the common words. The problem is that compression of this sort, only works efficiently when there are a _lot_ of common words, since the library has to be included in the stored data. If (for instance), you run PKZIP, on text, you will find that under about 10K characters, the algorithm will not compress at all, knowing that the relatively low number of comon strings, and the overhead of adding the library, will outweigh any savings.
Probably the best way of saving space, if you are needing to handle a number of messages, is to take full advantage of the 8bit range for an integer. Either saving two bits per character, by coding your own 'subset' of the ASCII set involving only 64 characters, and then storing these 'nose to tail' inside the available space (a potential saving of 25% on the size needed), or to generate your own library of commonly used words, and use values over 127, as 'keys' to these.
Doing generic 'on the fly' compression, uses a lot of code space, and RAM. I did an algorithm some time ago, on a PIC18F252 (proprietary so I can't post it), and had to use 80% of the ROM space, and 90% of the RAM, just to implement the compression...

Best Wishes
treitmey



Joined: 23 Jan 2004
Posts: 1094
Location: Appleton,WI USA

View user's profile Send private message Visit poster's website

PostPosted: Mon Dec 06, 2004 9:04 am     Reply with quote

Ching, I don't know what type of compression your looking for,... but
if your just trying to store strings more effieient, then there is someone on the forum that wrote a PC program to locate the strings for you.

http://www.ccsinfo.com/forum/viewtopic.php?t=20782&highlight=strings
Display posts from previous:   
Post new topic   Reply to topic    CCS Forum Index -> General CCS C Discussion All times are GMT - 6 Hours
Page 1 of 1

 
Jump to:  
You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum
You cannot vote in polls in this forum


Powered by phpBB © 2001, 2005 phpBB Group