Data parsing help

maria100 · Joined: 01 Feb 2012 Posts: 63

Hello Forum. I'm working on a project that receives data from UART1 and save it to an 24LC1025 ext eeprom. The project run fine, but now the next step for it is to parse the data that i receive from the serial port. I have been scratching my head for 2-3 days and tried some basic if, or routines but I get stuck every time. Maybe I'm just choosing a bad way.
The goal that I want to achieve:

-I have three phrases (with known start format) that I need to write to eeprom only, say:
STARTF1= A126
STARTF2= CD1E
STARTF3= FF20

When a start format is detected, I need to keep ONLY the start format+the following 30 chars that comes after that start format, then write them both in the EEPROM in a single line as:
A1260000000000AAAAAAAAAAFFFFFFFFFF
If for example within that 30 chars after one start format is detected another start format occurs, the 30 char counter will be reset so i will save:
A126(startformat)+AAAA(1-29chars)+FF20||CD1E||A126 (another start format is detected in the 30 char string)+30 next following chars.

or

Another possible way when a fresh start format is detected within that 30 chars, the old buffer will be write and a new buffer will be created with the new string. Thank you very much. Hope somebody can give me a hand. I promise that I'm not lazy or such, and I ask questions only when I go out of solutions. Good Luck forum!

Ttelmah · Joined: 11 Mar 2010 Posts: 19469

You don't give any real description of the 'start format'. How many bytes is this?. How many possible varieties?. Is the data in text, or direct binary?. Is it just the three byte pairs you show, or three sets of four characters?. Do these change?.
Sounds a perfect excuse for a state machine.

Best Wishes

maria100 · Joined: 01 Feb 2012 Posts: 63

The start formats are 3 and they never change and they are represented in hexadecimal.The data after the start string ( 30 hexa chars) changes. Start strings are 3 hexa-chars long ( FFFFFF)

John P · Joined: 17 Sep 2003 Posts: 331

I thought I understood this until you explained it.

You say "Start strings are 3 hexa-chars long ( FFFFFF)" but your first posting showed 4 characters per string-- A126, CD1E or FF20. And it's still not clear how much data each of them contains.

How many bytes come in over the serial line to create a single start sequence, and how many bytes do you want to store to the EEPROM at a time? Bytes. Forget hexa-chars.

maria100 · Joined: 01 Feb 2012 Posts: 63

Sorry for the confusions. I said in the first time two bytes and the second time three bytes because two of the start-sequences consist of 2 bytes(FFFF) and one consist of 3 bytes ( FFFFFF).
I have said hexadacimal because the bytes must be read AS HEX to make a correct interpretation of the data.
I wanna store in the eeprom a record per/ page.
RECORD EXAMPLE:
F1F2F3(start sequence)+next 30 bytes after the start sequence is detected.
Hope you understand now.

jeremiah · Joined: 20 Jul 2010 Posts: 1342

Why not something like:

Ttelmah · Joined: 11 Mar 2010 Posts: 19469

So, what are the start sequences?. Are they fixed values that you will always be looking for, or something coming from somewhere else?. You showed originally 3*2 byte sequences, now you are talking about 2*2byte, and 1*3byte.
Values don't _ever_ 'have to be read as HEX'. Hex is just a representation of a binary value. 0x41='A'=0b01000001. All are representations of exactly the same number. The value is the value, nothing else.

Best Wishes

maria100 · Joined: 01 Feb 2012 Posts: 63

Jeremiah , i would definitely try your example , looks very promising.

Ttelmah , what I`m trying to say ( and Jeremiah got the ideea already )
is that the start bytes that I am searching for , they have the values that i said in hexadecimal , not in other format(but this , as you said it doesnt matter, numbers are numbers, ex. 0xAAAA=1010101010101010). And about the sequences what I`m trying to say is that are 3 possible start ways ( two of them has the following format : 0xFFFFFF and the other one just 0xFFFF).

jeremiah · Joined: 20 Jul 2010 Posts: 1342

mind you that example is incomplete. It just provides a skeleton. It doesn't save which start sequence you got or do anything special for each one. It just provides a means for differentiating between them. You would still need to fill in some code if you plan on doing something different based on each start sequence (or at least save which one you got to a variable to work with later).

Ttelmah · Joined: 11 Mar 2010 Posts: 19469

As I said, it looks like a perfect case for a state machine, and this is what Jeremiah is showing.
The only complexity would come, since you also talk about the possibility of start sequences inside the data itself. If this can happen, potentially you might need multiple buffers, if you get a sequence that carries repeated starts all the way through.
Big problem/question though is timing. Remember it takes a few mSec to write to an external EEPROM (basically to write either a byte, or a page if using multi byte writes). Are you happy that you have the time intervals needed to complete the writes. This is where devices like FRAM 'win' hand's down.

Best Wishes

maria100 · Joined: 01 Feb 2012 Posts: 63

Yup , timing could be a problem , well..I will post after i get some result. Thank you all for your attention!!!