Summarizes values over regions using either a sliding window or group ids (same as SQL GROUP BY).
When a sliding window is used, the input file must be sorted by locationCol.
Variation.jar is in the microarray bundle.
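
The group-id mode can be pictured with a minimal Python sketch. This is an illustration only, not the component's implementation (which ships in Variation.jar). The function name `summarize_by_keys`, the `METHODS` mapping and the sample rows are hypothetical; only the method names and the keyCols/valueCols idea come from this page.

```python
from collections import defaultdict
from math import prod
from statistics import median

# Hypothetical mapping of the documented method names to Python functions.
METHODS = {
    "AVERAGE": lambda xs: sum(xs) / len(xs),
    "MEDIAN": median,
    "MAX": max,
    "MIN": min,
    "SUM": sum,
    "COUNT": len,
    "MULTIPLY": prod,
}


def summarize_by_keys(rows, key_cols, value_cols, method="SUM"):
    """Group rows (dicts) by key_cols and summarize each value column independently."""
    groups = defaultdict(list)
    for row in rows:
        # Rows sharing the same key tuple land in the same group (like GROUP BY).
        groups[tuple(row[k] for k in key_cols)].append(row)

    fn = METHODS[method]
    result = []
    for key, members in groups.items():
        out = dict(zip(key_cols, key))
        for col in value_cols:
            out[col] = fn([float(m[col]) for m in members])
        result.append(out)
    return result


# Made-up sample data for illustration.
rows = [
    {"gene": "A", "score": 1.0},
    {"gene": "A", "score": 3.0},
    {"gene": "B", "score": 5.0},
]
print(summarize_by_keys(rows, ["gene"], ["score"], method="AVERAGE"))
# prints [{'gene': 'A', 'score': 2.0}, {'gene': 'B', 'score': 5.0}]
```

Groups appear in the order their keys are first seen; whether the component orders its output the same way is not stated here.
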
Version | 1.0 |
---|---|
Bundle | sequencing |
Categories | VariationAnalysis |
Authors | Sirkku Karinen (sirkku.karinen@helsinki.fi) |
Requires | Variation.jar (jar) |
Source files | component.xml |
Name | Type | Mandatory | Description |
---|---|---|---|
data | CSV | Mandatory | Input table containing the values to summarize and the positions or ids that the values are summarized by. |
Name | Type | Description |
---|---|---|
summarization | CSV | Summarized data. |
Name | Type | Default | Description |
---|---|---|---|
keyCols | string | "" | Columns containing the ids used to group rows for summarization (same functionality as SQL GROUP BY). Accepts a comma-separated list. |
locationCol | string | "" | Column containing the positions used for the window. If not specified, row indices are used. |
method | string | "SUM" | Method of summarization. Possible values are AVERAGE, MEDIAN, MAX, MIN, SUM, COUNT, MULTIPLY. |
resultCol | string | "" | Names of the result columns as a comma-separated list. If not given, same as valueCols. |
resultType | string | "float" | Type of the result values: int or float. |
valueCols | string | (no default) | Columns containing the values to summarize (comma-separated list of column names). Summarization is calculated for each column independently. |
window | float | 0 | Window size for summarization, measured in the units of locationCol or, if locationCol is not specified, in row indices. If the window is not specified, the ids in keyCols are used instead. |
windowStart | int | 0 | Position where the first window starts. -1 = the first location in the input file. |
windowStep | float | 0 | Distance the window slides between consecutive summarizations. |
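
How window, windowStart and windowStep might interact can be sketched in Python. This is one plausible reading of the parameter descriptions, not the component's actual behavior. In particular, the half-open interval [start, start + window), the stopping rule (no window starts past the last location) and the function name `sliding_window_sum` are assumptions made for illustration.

```python
def sliding_window_sum(locations, values, window, window_step, window_start=0):
    """Sum values per window over input sorted by location.

    Assumption: each window covers the half-open range [start, start + window).
    Returns a list of (start, end, total) tuples.
    """
    if window <= 0 or window_step <= 0:
        raise ValueError("window and window_step must be positive in this sketch")
    if not locations:
        return []

    # -1 means: start at the first location present in the input.
    start = locations[0] if window_start == -1 else window_start
    last = locations[-1]  # valid because the input is sorted by location

    out = []
    while start <= last:
        end = start + window
        # Simple O(n) scan per window; fine for a sketch, not for large inputs.
        total = sum(v for loc, v in zip(locations, values) if start <= loc < end)
        out.append((start, end, total))
        start += window_step
    return out


# Made-up sample data: windows of size 10 sliding in steps of 5.
locations = [0, 5, 10, 15, 20]
values = [1, 2, 3, 4, 5]
for row in sliding_window_sum(locations, values, window=10, window_step=5, window_start=-1):
    print(row)
# prints (0, 10, 3), (5, 15, 5), (10, 20, 7), (15, 25, 9), (20, 30, 5), one per line
```

Because the windows overlap when the step is smaller than the window size, each value can contribute to more than one output row.
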
Test case | Parameters | IN data | OUT summarization |
---|---|---|---|
case1 | method=AVERAGE, | data | summarization |
case2 | method=SUM, | data | summarization |
case3 | method=MEDIAN, | data | summarization |
case4 | method=MEDIAN, | data | summarization |