craw_conf.py Documentation

Overview

Crawdad consists of a sequentially applied set of modules, as shown in the flowchart below

Briefly, the steps consist of: All these steps are controlled through the craw_conf.py application
.

craw_conf.py is a control script for CRAWDAD which uses an XML configuration file to define the steps and parameters for pre-processing, aligning, discovering signal differences, and mapping those differences to peptide identifications. Most interactions with craw_conf.py are set in the align_config.xml file, which is outlined below

align_config.xml

A sample align_config.xml file is shown here

the steps, or 'actions' as listed above are given a label in align_config.xml, and output for those steps are saved to the directories named after the labels. If the computer crawdad is running is able to produce images w/ python, then base peak chromatograms will be produced as images from the output of each sample group for each step. these are.

Generally speaking, if you edit the config file to run jobs on the queue, (using the ) as shown above, craw_conf.py will submit one single-processor job per file per step as outlined above. You can process more than two 'sample groups' at once, but at the moment comparisons to find differences are limited to comparing two.

Loading SQT into the MySQL database

Running Sample Data

run visualization programs

We have two classes of run visualization programs:

these programs are useful for viewing individual steps of the output, i.e. for assessing alignment quality, or determining if the appropriate smoothing has been used

CRAWDAD File

MSMAT format files Store the initial binned MS1 data and intermediate steps of individual runs

End Products from CRAWDAD

brief notes on XML