rearrangement of columns in data file

**paliphy** · 07-15-2017

HI i am new to c programming and my area is research mainly. I am attaching here two files 1. forum_new.txt 2. forum_old.txt
Forum_old.txt is the file i am getting from software package and forum_new.txt is the file i require to analyze the data. This time i done it manually as size of fiel is small. If there is any program that can convert automatically Forum_old.txt to forum_new.txt.
basically this is re-arrangement of columns
the first column is repeated after 10 values.

It will be very halpful to me if i get any kind of help.

**Salem** · 07-15-2017

Normally we ask you produce some kind of effort to solving your own problems.
More than "I'm new, here's my homework".

So here's a meta answer.

Code:

$ cat foo.pl
#!/usr/bin/perl
while ( <> ) {
  @f = split;
  push @{$lines{$f[0]}},$f[1] if ( $#f == 1 );
}
foreach $l ( sort keys %lines ) {
  print $l . "\t" . join("\t",@{$lines{$l}}) . "\n";
}
$ perl foo.pl FORUMS_OLD.txt 
0.0000	-5.6701	6.2418	6.2418	6.2418	8.8145
0.0333	-5.6653	6.2093	6.2206	6.2206	8.8023
0.0667	-5.6507	6.1159	6.1601	6.1601	8.7671
0.1000	-5.6263	5.9704	6.0666	6.0666	8.7103
0.1333	-5.5923	5.7837	5.9484	5.9484	8.6347
0.1667	-5.5486	5.5662	5.8131	5.8131	8.5436
0.2000	-5.4952	5.3259	5.6669	5.6669	8.4402
0.2333	-5.4322	5.0687	5.5145	5.5145	8.3267
0.2667	-5.3596	4.8000	5.3596	5.3596	8.2083
0.3000	-5.2775	4.5226	5.2046	5.2046	8.0865

**paliphy** · 07-15-2017

thanks for your reply but could you please answer in C language

**Salem** · 07-15-2017

Like I said, what do you know so far?

Can you open files using fopen?

Can you read lines using fscanf or fgets?

Do you know what structs are?

Can you use strcmp()?

Prove that you're not just being lazy and hoping to score free homework.

**paliphy** · 07-17-2017

yes i am familiar to fopen, scanf, printf, fscanf, frpintf, for loop whie loop, data types declaration , mathematical expressions in C but not efficient in file handling

fopen(file pointer, "mode")
scanf("abcd %f gggg %d ", &a, &b)

**whiteflags** · 07-17-2017

Since you can do it manually, maybe try to imagine a new 2D array in your head, try to notice how the notepad page could match up to the array in your head, and work out the steps.

**Salem** · 07-17-2017

> but not efficient in file handling
Worry about getting it to work before you worry about making it efficient.

**algorism** · 07-17-2017

This is not a good problem for C. Something like perl or awk is better suited. Here's some awk:

Code:

awk 'NF==2 {a[$1]=a[$1]"\t"$2}; END {for(x in a) print x,a[x]}' FORUMS_OLD.txt  | sort -n

(Beware anyone playing with the given data: it has the windows line endings. (dos2unix can fix that.))

In C, the problem is matching up the first column numbers. A linear search is good enough for the data you've shown, but if the real data has more 1st-column values, then a binary search may be better. Both the perl and awk solutions use hash tables for the purpose. Incidentally, they also treat all the numbers as strings, which is a good way to perfectly preserve the input values (instead of converting them to an internal floating-point format and back to an output string again).

A C solution is very ugly compared to the perl and awk solutions. And to write it in full generality (making it equivalent to the perl and awk solutions) would be somewhat painful.

**Salem** · 07-17-2017

Just an idea....

Read all the lines into memory, then do this
qsort(3): sort array - Linux man page

Then you're looking at data arranged as follows

Code:

0.0000  -5.6701
0.0000  6.2418
0.0000  6.2418
0.0000  6.2418
0.0000  8.8145
0.0333  -5.6653
0.0333  6.2093
0.0333  6.2206
0.0333  6.2206
0.0333  8.8023

It would be a lot easier to pick off the data you want in the order you want.

**algorism** · 07-18-2017

That's a good idea since the key fields are spread out in the original data. Bringing them together really simplifies things.

There's another possibility if we're allowed to assume that the same key fields appear in the same order in each group, and each group is separated by a blank line (which is the case in the given data).

Code:

char table[MAX_ROWS][MAX_LINE]

row = 0
loop:
  read a line
  if the line is blank
    break
  else
    strcpy(table[row++], line)

row = 0
loop:
  read a line
  if EOF
    break
  else if line is blank
    row = 0
    continue
  else
    strcat(table[row], "\t")
    strcat(table[row++], second field of line)

print out the table

You could add a check to ensure that (for all groups after the first) the first field of the input line matched the first field in the current table element (fatal error, otherwise).

That's probably the simplest solution, but only because it makes the most assumptions.

Thread: rearrangement of columns in data file

Thread Tools

Search Thread

Display

rearrangement of columns in data file

Similar Threads

copying columns from a text file.

Storing columns of numerical data from csv files to a vector

how to read specfic columns of data in a file.

reading a columns input data file

write to .txt file in columns

Tags for this Thread