GaussianSort

A command line utility for creating a list of file names sorted by standard deviation

First, to clarify: the usage statement above is written in docopt syntax and is simpler than it appears. The first word, gausssort, can be ignored; only the other four words need to be considered. The first of these is a subcommand specifying which column of data (i.e. #A_bacteria, #B_bacteria, or #_bacteria, stored in each file as shown in the Description section) to calculate the standard deviation on:

bact.a
bact.b
bact.total

The next word, ‘<read_file_path>’, is the path to the directory containing the files you want to sort. For example, to launch the utility and sort on the ‘#A_bacteria’ data in each of the files located in the ‘/home/gauss/data’ directory:

./gausssort.py bact.a /home/gauss/data

This prints the file names to stdout, sorted by their ‘gaussian widths’ (i.e. standard deviations), with the width shown next to each name:

1:   test-A_0_0.2_0.5_0.5_0.5_0.5.dat        2.62489899361
2:   test-A_0_0.4_0.5_0.5_0.5_0.5.dat        2.62489899361
3:   test-A_0_0.6_0.5_0.5_0.5_0.5.dat        2.62489899361
4:   test-A_0_0.8_0.5_0.5_0.5_0.5.dat        2.62489899361
5:   test-A_0_0_0.5_0.5_0.5_0.5.dat          2.62489899361
6:   test-A_0_1_0.5_0.5_0.5_0.5.dat          2.62489899361
7:   test-A_0.2_0.2_0.5_0.5_0.5_0.5.dat      3.03493910298
8:   test-A_0.2_0.4_0.5_0.5_0.5_0.5.dat      3.03493910298
9:   test-A_0.2_0.6_0.5_0.5_0.5_0.5.dat      3.03493910298
10:  test-A_0.2_0.8_0.5_0.5_0.5_0.5.dat      3.03493910298
...
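
A listing like this could be produced along the following lines. This is a minimal sketch, not the utility's actual implementation. It assumes that each data file holds whitespace-separated numeric columns, that blank lines and lines starting with `#` are headers, and that the index of the column of interest is known. The helper names `column_stddev`, `sort_by_stddev`, and `format_listing` are hypothetical. The sketch uses the population standard deviation from Python's `statistics` module, which may differ from what the utility computes.

```python
import os
import statistics


def column_stddev(path, column):
    """Return the population standard deviation of one numeric column.

    Assumes whitespace-separated columns; blank lines and lines starting
    with '#' are treated as headers and skipped (an assumption).
    """
    values = []
    with open(path) as handle:
        for line in handle:
            line = line.strip()
            if not line or line.startswith("#"):
                continue
            values.append(float(line.split()[column]))
    return statistics.pstdev(values)


def sort_by_stddev(directory, column):
    """Return (file name, deviation) pairs, smallest deviation first."""
    results = []
    for name in os.listdir(directory):
        path = os.path.join(directory, name)
        if os.path.isfile(path):
            results.append((name, column_stddev(path, column)))
    # Sort by deviation, then by name so ties come out in a stable order.
    results.sort(key=lambda pair: (pair[1], pair[0]))
    return results


def format_listing(results):
    """Format results as numbered lines: rank, file name, deviation."""
    return [
        "{}:\t{}\t{}".format(rank, name, width)
        for rank, (name, width) in enumerate(results, start=1)
    ]
```

Calling `format_listing(sort_by_stddev("/home/gauss/data", 1))` would then give one line per file, ranked from the narrowest distribution to the widest.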

Advanced Usage: Saving to Output File

A more advanced usage involves the save command:

./gausssort.py bact.a /home/gauss/data save

This runs the utility and writes the output to a file in the GaussSort directory. The output file is named after the current date and time, so each run produces a unique file that is not in danger of being overwritten.
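
One way to build such a name is to embed a timestamp in it. The sketch below is an assumption about how this might be done, not the utility's actual naming scheme: the `gausssort_` prefix, the timestamp layout, and the function name `default_output_name` are all hypothetical.

```python
from datetime import datetime


def default_output_name(now=None):
    """Build an output file name from a date and time.

    The prefix and timestamp layout are illustrative choices only.
    If `now` is omitted, the current local time is used.
    """
    if now is None:
        now = datetime.now()
    # Year-month-day, then hour-minute-second, all zero-padded.
    return now.strftime("gausssort_%Y-%m-%d_%H-%M-%S.txt")
```

With one-second resolution, a name built this way is unique only as long as the utility is not run twice within the same second.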

Optionally, you can give a name to the output file:

./gausssort.py bact.a /home/gauss/data save list_of_gausssorted_filenames.txt

This saves the output to the file ‘list_of_gausssorted_filenames.txt’.
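
The four words of the command line described above can be summarized in a short sketch. The usage statement is written in docopt syntax; the hand-rolled function below is a stand-in that uses only the standard library, not the utility's own parsing code. The function name `parse_args`, the placeholder `<output_file>`, and the keys of the returned dictionary are hypothetical.

```python
# Subcommand -> data column, as listed in the usage description.
COLUMNS = {
    "bact.a": "#A_bacteria",
    "bact.b": "#B_bacteria",
    "bact.total": "#_bacteria",
}


def parse_args(argv):
    """Parse: <subcommand> <read_file_path> [save [<output_file>]].

    `argv` excludes the program name. Raises ValueError on bad input.
    """
    if not 2 <= len(argv) <= 4:
        raise ValueError("expected 2 to 4 arguments")
    subcommand, read_file_path = argv[0], argv[1]
    if subcommand not in COLUMNS:
        raise ValueError("unknown subcommand: " + subcommand)
    save = len(argv) >= 3
    if save and argv[2] != "save":
        raise ValueError("third argument must be 'save'")
    # A file name is optional; None means "pick a default name".
    output_file = argv[3] if len(argv) == 4 else None
    return {
        "column": COLUMNS[subcommand],
        "read_file_path": read_file_path,
        "save": save,
        "output_file": output_file,
    }
```

For example, `parse_args(["bact.a", "/home/gauss/data", "save"])` selects the `#A_bacteria` column, sets `save` to `True`, and leaves `output_file` as `None`.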
